4,810 Matching Annotations
  1. Jun 2023
    1. My children live with an unconscious fear that they may not live out their natural lives. I am not saying that fear is good. I am trying to find a way to deal with that anxiety. An architecture that puts its head in the sand and goes back to neoclassicism, and Schinkel, Lutyens, and Ledoux, does not seem to be a way of dealing with the present anxiety. Most of what my colleagues are doing today does not seem to be the way to go. Equally, I do not believe that the way to go, as you suggest, is to put up structures to make people feel comfortable, to preclude that anxiety. What is a person to do if he cannot react against anxiety or see it pictured in his life? After all, that is what all those evil Struwwel Peter characters are for in German fairy tales. CA: Don't you think there is enough anxiety at present? Do you really think we need to manufacture more anxiety in the form of buildings?

      to manufacture more anxiety in the form of buildings

    1. Author Response:

      The following is the authors' response to the current reviews.

      Reviewer #1 (Public Review):

      This revised manuscript by Walker et. al. addresses some of the editorial points and conceptual discussion, but in general, most of my suggestions (as the previous reviewer #1) for additional experimentation or addition were not addressed as discussed below. Therefore, my overall review has not changed.

      In our previous response, we included i) extra experimental data illustrating the reproducibility of our results and ii) added transcription start site data at the request of this reviewer. We included the information because we agreed with the reviewer that these were important points to address. For the points raised again below, we explained why the additional analysis was unlikely to add much in terms of insight or rigour. We have elaborated further below.   

      1) For example, in point 1, the suggested analysis was not performed because it is not trivial. My reason for making this suggestion is that the original manuscript was limited to Vibrio cholerae, and the impact of the manuscript would increase if the findings here were demonstrated to be more broadly applicable. I expect papers published in eLife to have such broad applicability. But no changes were made to the manuscript in this regard. The revised version is still limited to only Vibrio cholerae.

      Our paper is focused on the unexpected co-operative interactions between HapR and CRP. Such co-binding of two transcription factors to the same DNA site is unexpected. Consequently, it is this mode of DNA binding that is likely to be of broad interest. With this in mind, we did provide experimental, and bioinformatic, analyses for other regulatory regions and other vibrio species (Figures S3 and S6). This, in our view, is where the “broad applicability” for papers published in eLife comes from.

      The analysis the reviewer suggests is not related to the main message of our paper. Instead, the reviewer is asking how many HapR binding sites seen here by ChIP-seq are also seen in other vibrio species by ChIP-seq. This is only likely to be of interest to readers with an extremely specific interest in both vibrio species and HapR. The reviewer states above that we did not make the change “because it is not trivial”. This is an oversimplification of the rationale we presented in our response. The analysis is indeed not straightforward. However, much more importantly, the outcome is unlikely to be of interest to many readers, and has no bearing on the rigour of work. With this in mind, we do not think our position is unreasonable. We also stress that, should a reader with this very specific interest want to explore further, all of our data are freely available for them to do so.

      2) For point 2, the activity of FLAG-tag luxO could have been simply validated in a complementation assay. Yes, they demonstrated DNA binding, but that is not the only activity of LuxO.

      DNA binding by LuxO is the only activity of the protein with which we are concerned in our paper. Furthermore, LuxO is very much a side issue; we found binding to only the known targets and potentially, at very low levels, one additional target. No further LuxO experiments were done for this reason. Indeed, even if these data were removed completely, our conclusions would not change or be supported any less vigorously. We are happy to remove the LuxO data if the reviewer would prefer but this would seem like overkill.

      3) For point 7, the transcriptional fusions were not explored at different times or different media, which is also something that was hinted at by other reviewers. In regard to exploring expression at different time points, this seems particularly relevant for QS regulated genes.

      In their previous review, the reviewer did not request that such experiments were done. Similarly, no other reviewer requested these experiments. Instead, this reviewer i) commented that lacZ fusions were not as sensitive as luciferase fusions ii) asked if we had done any time point experiments. We agreed with the first point, whilst also noting that lacZ is not unusual to use as a reporter. For the second point, we responded that we had not done such experiments (which by the reviewer’s own logic would have been complicated using lacZ as a reporter). This seems like a perfectly reasonable way to respond.   

      We should stress that these comments all refer to Figure 2a, which was our initial screening of 23 promoter::lacZ fusions, supported by separate in vitro transcription assays. Only one of these fusions was followed up as the main story in the paper. Given that the other 22 fusions were not investigated further, and do not form part of the main story, there would seem little value in now going back to assay them at different time points.

      4) For point 13, the authors express that doing an additional CHIP-Seq is outside of the scope of this manuscript. Perhaps that is the case, but the point of the comment is to validate the in vitro binding results with an in vivo binding assay. A targeted CHIP-Seq approach specifically analyzing the promoters where cooperative binding was observed in vitro could have addressed this point.

      We did appreciate the original comment, and responded as such, but we do think additional ChIP-seq assays are outside the scope of this paper.

      Reviewer #2 (Public Review):

      This manuscript by Walker et al describes an elegant study that synergizes our knowledge of virulence gene regulation of Vibrio cholerae. The work brings a new element of regulation for CRP, notably that CRP and the high density regulator HapR co-occupy the same site on the DNA but modeling predicts they occupy different faces of the DNA. The DNA binding and structural modeling work is nicely conducted and data of co-occupation are convincing. The work seeks to integrate the findings into our current state of knowledge of HapR and CRP regulated genes at the transition from the environment and infection. The strength of the paper is the nice ChIP-seq analysis and the structural modeling and the integration of their work with other studies.

      We thank the reviewer for the positive comments.

      The weakness is that it is not clear how representative these data are of multiple hapR/CRP binding sites

      This comment does not consider all data in our paper. We did test our model experimentally at multiple HapR and CRP binding sites. These data are shown in Figure S6 and confirm the co-operative interaction between HapR and CRP at 4 of a further 5 shared binding sites tested. We also used bioinformatics to show the same juxtaposition of CRP and HapR sites in other vibrio species (Figure S3). Hence, the model seems representative of most sites shared by HapR and CRP.

      or how the work integrates as a whole with the entire transcriptome that would include genes discovered by others.

      At the request of the reviewers, our revision integrated our ChIP-seq data with dRNA-seq data. No other suggestions to ingrate transcriptome data were made by the reviewers. 

      Overall this is a solid work that provides an understanding of integrated gene regulation in response to multiple environmental cues.

      We thank the reviewer for the positive comment.

      —————

      The following is the authors' response to the original reviews.

      Reviewer #1 (Public Review):

      This manuscript by Walker et. al. explores the interplay between the global regulators HapR (the QS master high cell density (HDC) regulator) and CRP. Using ChIP-Seq, the authors find that at several sites, the HapR and CRP binding sites overlap. A detailed exploration of the murPQ promoter finds that CRP binding promotes HapR binding, which leads to repression of murPQ. The authors have a comprehensive set of experiments that paints a nice story providing a mechanistic explanation for converging global regulation.

      We thank the reviewer for their positive evaluation.

      I did feel there are some weak points though, in particular the lack of integration of previously identified transcription start sites

      For completeness, we have now added the position and orientation or the nearest TSSs to each HapR or LuxO binding peak in Table 1 (based on Papenfort et al.).

      the lack of replication (at least replication presented in the manuscript) for many figures,

      We assume that the reviewer is referring to gel images rather than any other type of assay output (were error bars, derived from replicates, are shown). As is standard, we show representative gel images. All associated DNA binding and in vitro transcription experiments have been done multiple times. Indeed, comparison between figures reveals several instances of such replication (e.g. Figures 4b & 5d, Figures 4d & 5e). We have added details of repeats done to the methods section.

      some oddities in the growth curve

      We do not know why cells lacking hapR have a growth curve that appears biphasic. We can only assume that this is due to some regulatory effect of HapR, distinct from the murQP locus. Despite the unusual shape of the growth curve, the data are consistent with our conclusions.

      and not reexamining their HapR/CRP cooperative binding model in vivo using ChIP-Seq.

      We agree that these would be interesting experiments and, in the future, we may well do such work. Even without these data, our current model is well supported by the data presented (and the reviewer seems to agree with this above).

      Reviewer #2 (Public Review):

      This manuscript by Walker et al describes an elegant study that synergizes our knowledge of virulence gene regulation of Vibrio cholerae. The work brings a new element of regulation for CRP, notably that CRP and the high density regulator HapR co-occupy the same site on the DNA but modeling predicts they occupy different faces of the DNA. The DNA binding and structural modeling work is nicely conducted and data of co-occupation are convincing. The work could benefit from doing a better job in the manuscript preparation to integrate the findings into our current state of knowledge of HapR and CRP regulated genes and to elevate the impact of the work to address how bacteria are responding to the nutritional environment. Importantly, the focus of the work is heavily based on the impact of use of GlcNAc as a carbon source when bacteria bind to chitin in the environment, but absent the impact during infection when CRP and HapR have known roles. Further, the impact on biological events controlled by HapR integration with the utilization of carbon sources (including biofilm formation) is not explored.

      We thank the reviewer for their overall positive evaluation.

      The rigor and reproducibility of the work needs to be better conveyed.

      Reviewer 1 made a similar comment (see above) and we have modified the manuscript accordingly.

      Specific comments to address:

      1)  Abstract. A comment on the impact of this work should be included in the last sentence. Specifically, how the integration of CRP with QS for gene expression under specific environments impacts the lifestyle of Vc is needed. The discussion includes comments regarding the impact of CRP regulation as a sensor of carbon source and nutrition and these could be quickly summarized as part of the abstract.

      We have added an extra sentence. However, we have used cautious language as we do not show impacts on lifestyle (beyond MurNAc utilisation) directly. These can only be inferred.

      2)  Line 74. This paper examines the overlap of HapR with CRP, but ignores entirely AphA. HapR is repressed by Qrrs (downstream of LuxO-P) while AphA is activated by Qrrs. With LuxO activating AphA, it has a significant sized "regulon" of genes turned on at low density. It seems reasonable that there is a possibility of overlap also between CRP and AphA. While doing an AphA CHIP-seq is likely outside the scope of this work, some bioinformatic or simply a visual analysis of the promoters known AphA regulated genes would be interest to comment on with speculation in the discussion and/or supplement.

      In short, everything that the reviewer suggests here has already been done and was covered in our original submission (see text towards the end of the Discussion). Also, we would like to point the referee to our earlier publication (Haycocks et al. 2019. The quorum sensing transcription factor AphA directly regulates natural competence in Vibrio cholerae. PLoS Genet. 15:e1008362).

      3)  Line 100. Accordingly with the above statement, the focus here on HapR indicates that the focus is on gene expression via LuxO and HapR, at high density. Thus the sentence should read "we sought to map the binding of LuxO and HapR of V. cholerae genome at high density".

      Note that expression of LuxO and HapR is ectopic in these experiments (i.e. uncoupled from culture density).

      4)  Line 109. The identification of minor LuxO binding site in the intergenic region between VC1142 and VC1143 raises whether there may be a previously unrecognized sRNA here. As another panel in figure S1, can you provide a map of the intergenic region showing the start codons and putative -10 to -35 sites. Is there room here for an sRNA? Is there one known from the many sRNA predictions / identifications previously done? Some additional analysis would be helpful.

      We have added an extra panel to Figure S1 showing the position of TSSs relative to the location of LuxO binding. We have altered the main text to accommodate this addition..

      5)  Line 117. This sentence states that the CHIP seq analysis in this study includes previously identified HapR regulated genes, but does not reveal that many known HapR regulated genes are absent from Table 1 and thus were missed in this study. Of 24 HapR regulated investigated by Tsou et al, only 1 is found in Table 1 of this study. A few are commented in the discussion and Figure S7. It might be useful to add a Venn Diagram to Figure 1 (and list table in supplement) for results of Tsou et al, Waters et al, Lin et al, and Nielson et al and any others). A major question is whether the trend found here for genes identified by CHIP-seq in this study hold up across the entire HapR regulon. There should also be comments in the discussion on perhaps how different methods (including growth state and carbon sources of media) may have impacted the complexity of the regulon identified by the different authors and different methods.

      We have added a list of known sites to the supplementary material (new Table S1). We were unsure what was meant by the comment “A major question is whether the trend found here for genes identified by CHIP-seq in this study hold up across the entire HapR regulon”. We have added the extra comment to the discussion re growth conditions, also noting that most previous studies relied on in vitro, rather than in vivo, DNA binding assays.

      6)  The transcription data are generally well performed. In all figures, add comments to the figure legends that the experiments are representative gels from n=# (the number of replicate experiments for the gel based assays). Statements to the rigor of the work are currently missing.

      See responses above. We have added a comment on numbers of repeats to the methods section.

      7)  Line 357-360. The demonstration of lack of growth on MurNAc is a nice for the impact of the work. However, more detailed comments are needed for M9 plus glucose for the uninformed reader to be reminded that growth in glucose is also impaired due to lack of cAMP in glucose replete conditions and thus minimal CRP is active. But why is this now dependent of hapR? A reminder also that in LB oligopeptides from tryptone are the main carbon source and thus CRP would be active.

      We find this point a little confusing and, maybe, two issues (murQP regulation, and growth in general) are being conflated. In particular, we do not understand the comment “growth in glucose is also impaired due to lack of cAMP in glucose replete conditions and thus minimal CRP is active”.

      Growth in glucose should indeed result in lower cAMP levels*, and hence less active CRP, but this does not impair growth. This is simply the cell’s strategy for using its preferred carbon source. If the reviewer were instead referring to some aspect of P_murQP_ regulation then yes, we would expect promoter activity to be lower because less active CRP would be available in the presence of glucose. The reviewer also comments “why is this now dependent of hapR?”. We assume that they are referring to some aspect of growth in minimal media with glucose. If so, the only hapR effect is the change in growth rate as cells enter mid-late log-phase (i.e. the growth curve looks somewhat biphasic). A similar effect is seen in all conditions. We do not know why this happens and can only conclude this is due to some unknown regulatory activity of HapR. Overall, the key point from these experiments is that loss if luxO, which results in constitutive hapR expression, lengthens lag phase only for growth with MurNAc as the sole carbon source.

      *Although in V. fischeri (PMID: 26062003) cAMP levels increase in the presence of glucose.

      8)  A great final experiment to demonstrate the model would have been to show co-localization of the promoter by CRP and HapR from bacteria grown in LB media but not in LB+glucose or in M9+glycerol and M9+MurNAc but not M9+glucose. This would enhance the model by linking more directly to the carbon sources (currently only indirect via growth curves)

      This is unlikely to be as straightforward as suggested. The sensitivity of CRP binding to growth conditions is not uniform across different binding sites. For instance, the CRP dependence of the E. coli melAB promoter is only evident in minimal media (PMID: 11742992) whilst the role of CRP at the acs promoter is evident in tryptone broth (PMID: 14651625). Similarly, as noted above, in Vibrio fischeri glucose causes and increase in cAMP levels. (PMID: 26062003).

      9) Discussion. Comments and model focus heavily on GlcNAc-6P but HapR has a regulator role also during late infection (high density). How does CRP co-operativity impact during the in vivo conditions?

      We really can’t answer this question with any certainty; we have not done any infection experiments in this work.

      Does the Biphasic role of CRP play a role here (PMID: 20862321)?

      Again, we cannot answer this question with any confidence as experimentation would be required. However, the suggestion is certainly plausible.

      Reviewer #3 (Public Review):

      Bacteria sense and respond to multiple signals and cues to regulate gene expression. To define the complex network of signaling that ultimately controls transcription of many genes in cells requires an understanding of how multiple signaling systems can converge to effect gene expression and ensuing bacterial behaviors. The global transcription factor CRP has been studied for decades as a regulator of genes in response to glucose availability. It's direct and indirect effects on gene expression have been documented in E. coli and other bacteria including pathogens including Vibrio cholerae. Likewise, the master regulator of quorum sensing (QS), HapR), is a well-studied transcription factor that directly controls many genes in Vibrio cholerae and other Vibrios in response to autoinducer molecules that accumulate at high cell density. By contrast, low cell density gene expression is governed by another regulator AphA. It has not yet been described how HapR and CRP may together work to directly control transcription and what genes are under such direct dual control.

      We thank the reviewer for their assessment of our work.

      Using both in vivo methods with gene fusions to lacZ and in vitro transcription assays, the authors proceed to identify the smaller subset of genes whose transcription is directly repressed (7) and activated (2) by HapR. Prior work from this group identified the direct CRP binding sites in the V. cholerae genome as well as promoters with overlapping binding sites for AphA and CRP, thus it appears a logical extension of these prior studies is to explore here promoters for potential integration of HapR and CRP. Inclusion of this rationale was not included in the introduction of CRP protein to the in vitro experiments.

      We understand the reviewer’s comment. However, the rationale for adding CRP was not that we had previously seen interplay between AphA and CRP (although this is a relevant discussion point, which we did make). Rather, we had noticed that there was an almost perfect CRP site perfectly positioned to activate PmurQP. Hence, CRP was added.

      Seven genes are found to be repressed by HapR in vivo, the promoter regions of only six are repressed in vitro with purified HapR protein alone. The authors propose and then present evidence that the seventh promoter, which controls murPQ, requires CRP to be repressed by HapR both using in vivo and vitro methods. This is a critical insight that drives the rest of the manuscripts focus. The DNase protection assay conducted supports the emerging model that both CRP and HapR bind at the same region of the murPQ promoter, but interpret is difficult due to the poor quality of the blot.

      There are areas of apparent protection at positions +1 to +15 that are not discussed, and the areas highlighted are difficult to observe with the blot provided.

      We disagree on this point. The region between +1 and +15 is inherently resistant to attack by DNAseI and there are only ever very weak bands in this region (lane 1). Other than seeing small fluctuations in overall lane intensity (e.g. lanes 7-12 have a slightly lower signal throughout) the +1 to +15 banding pattern does not change. Conversely, there are dramatic changes in the banding pattern between around -30 and -60 (again, compare lane 1 to all other lanes). That CRP and HapR bind the same region is extremely clear. Also note that this is backed up by mutagenesis of the shared binding site (Figure 4c).

      The model proposed at the end of the manuscript proposes physiological changes in cells that occur at transitions from the low to high cell density. Experiments in the paper that could strengthen this argument are incomplete. For example, in Fig. 4e it is unclear at what cell density the experiment is conducted.

      Such details have been added to the figure legends and methods section.

      The results with the wild type strain are intermediate relative to the other strains tested.

      This is correct, and exactly what we would expect to see based on our model.

      Cell density should affect the result here since HapR is produced at high density but not low density. This experiment would provide important additional insights supporting their model, by measuring activity at both cell densities and also in a luxO mutant locked at the high cell density. Conducting this experiment in conditions lacking and containing glucose would also reveal whether high glucose conditions mimicking the crp results.

      We agree with this idea in principle but note that the output from our reporter gene, β- galactosidase, is stable within cells and tends to accumulate. This is likely to obscure the reduction in expression as cells transition from low to high cell density. Since we have demonstrated the regulatory effects of HapR and CRP both in vivo using gene knockouts, and in vitro with purified proteins, we think that our overall model is very well supported. Further experimental additions may provide an incremental advance but will not alter our overall story. Also note the unexpected increase in intracellular cAMP due to addition of glucose, in Vibrio fischeri (PMID: 26062003).

      Throughout the paper it was challenging to account for the number of genes selected, the rationale for their selection, and how they were prioritized. For example, the authors acknowledged toward the end of the Results section that in their prior work, CRP and HapR binding sites were identified (line 321-22).

      This is not quite what we say, and maybe the reviewer misunderstood, which is our fault. The prior work identified CRP sites whilst the current work identified HapR sites. We have made a slight alteration to the text to avoid confusion.

      It is unclear whether the loci indicated in Table 1 all from this prior study. It would be useful to denote in this table the seven genes characterized in Figure 2 and to provide the locus tag for murPQ.

      Again, we are unsure if we have confused the reviewer. The results in Table 1 are all HapR sites from the current work, not a prior study. However, some of these also correspond to CRP binding regions found in prior work.

      The reviewer mentions “the seven genes characterised in Figure 2” but 23 targets were characterised in Figure 2a and 9 in Figure 2b. The “VC” numbers used in Figure 2 are the same as used in Table 1 so it is easy to cross reference between the two. We have added a footnote to Table 1, also referred to in the Figure 2 legend, to allow cross referencing between gene names and locus tags (including for murQP and hapR).

      Of the 32 loci shown in Table 1, five were selected for further study using EMSA (line 322), but no rationale is given for studying these five and not others in the table.

      This is not quite correct, we did not select 5 from the 32 targets listed in Table 1. We selected 5 targets from Table 1 that were also targets for CRP in our prior paper. This was the rationale.

      Since prior work identified a consensus CRP binding motif, the authors identify the DNA sequence to which HapR binds overlaps with a sequence also predicted to bind CRP. Genome analysis identified a total of seven sites where the CRP and HapR binding sites were offset by one nucleotide as see with murPQ. Lines 327-8 describe EMSA results with several of these DNA sequences but provides no data to support this statement. Are these loci in Table 1?

      This comment is a little difficult to follow, and we may have misunderstood, but we think that the reviewer is asking where the EMSA data referred to on lines 327-328 resides. We can see that the text could be confusing in this regard. We had referred to the relevant figure (Figure S6) on line 324 but did not again include this information further down in the description of the result. We have changed the text accordingly.

      Using structural models, the authors predict that HapR repression requires protein-protein interactions with CRP. Electromobility shift assays (EMSA) with purified promoter DNA, CRP and HapR (Fig 5d) and in vitro transcription using purified RNAP with these factors (Figure 5e) support this hypothesis. However, the model proports that HapR "bound tightly" and that it also had a "lower affinity" when CRP protein was used that had mutations in a putative interaction interface. These claims can be bolstered if the authors calculate the dissociation constant (Kd) value of each protein to the DNA. This provides a quantitative assessment of the binding properties of the proteins.

      The reviewer is correct that we do not explicitly provide a Kd. However, in both Figures 5d and 5e, we do provide very similar quantification. In 5d, our quantification is the % of the CRP-DNA complex bound by HapR (using either wild type or E55A CRP). Since the % of DNA bound is shown, and the protein concentrations are provided in the figure legend, information regarding Kd is essentially already present. In 5e, we show the % of maximal promoter activity. This is a reasonable way of quantifying the result. Furthermore, Kd is not a metric we can measure directly in this experiment that is not a DNA binding assay.

      The concentrations of each protein are not indicated in panels of the in vitro analysis, but only the geometric shapes denoting increasing protein levels.

      The protein concentrations are all provided in the figure legend. It is usual to indicate relative concentrations in the body of the figure using geometric shapes.

      Panel 5e appears to indicate that an intermediate level of CRP was used in the presence of HapR, which presumably coincides with levels used in lane 4, but rationale is not provided.

      There was no particular rationale for this, it was simply a reasonable way to do the experiment.

      How well the levels of protein used in vitro compare to levels observed in vivo is not mentioned.

      The protein concentrations that we use (in the nM to low μM range) are very typical for this type of work and consistent with hundreds of prior studies of protein-DNA interactions. The general rule of thumb is that 1000 molecules of a protein per bacterial cell equates to a concentration of around 1 μM. However, molecular crowding is likely to increase the effective concentration. Additionally, in vitro, where the DNA concentration is higher.

      The authors are commended for seeking to connect the in vitro and vivo results obtained under lab conditions with conditions experienced by V. cholerae in niches it may occupy, such as aquatic systems. The authors briefly review the role of MurPQ in recycling of the cell wall of V. cholerae by degrading MurNAc into GlcNAc, although no references are provided (lines 146-50). Based on this physiology and results reported, the authors propose that murPQ gene expression by these two signal transduction pathways has relevance in the environment, where Vibrios, including V. cholerae, forms biofilms on exoskeleton composed of GlcNAc.

      We have added a citation to the section mentioned.

      The conclusions of that work are supported by the Results presented but additional details in the text regarding the characteristics of the proteins used (Kd, concentrations) would strengthen the conclusions drawn. This work provides a roadmap for the methods and analysis required to develop the regulatory networks that converge to control gene expression in microbes. The study has the potential to inform beyond the sub-filed of Vibrios, QS and CRP regulation.

      As noted above, quantification essentially equivalent to Kd is already shown (% of bound substrate is indicated in figures and all protein concentrations are given in the figure legends).

      Reviewer #1 (Recommendations For The Authors):

      1.  As similar experiments have been performed in other Vibrios, it would be interesting to do a more detailed analysis of the similarities and differences between the species. Perhaps a Venn diagram showing how many targets were found in all studies versus how many are species specific.

      We appreciate this suggestion but would prefer not to make this change. A cross-species analysis would be very time consuming and is not trivial. The presence and absence of each target gene, for all combinations of organisms, would first need to be determined. Then, the presence and absence of binding signals for HapR, or its equivalent, would need to be determined taking this into account. For most readers, we feel that this analysis is unlikely to add much to the overall story. Given the amount of effort involved, this seems a “non-essential” change to make.

      2.  Line 101-Are the FLAG tagged versions of LuxO and HapR completely functional? Can they complement a luxO or hapR deletion mutant?

      The activity of FLAG tagged HapR (LuxR in other Vibrio species) has been shown previously (e.g. PMIDs 33693882 and 23839217). Similarly, N-terminal HapR tags are routinely used for affinity purification of the protein without ill effect. We have not tested LuxO-3xFLAG for “full” activity, though this fusion is clearly active for DNA binding, the only activity that we have measured here, since all know targets are pulled down.

      3.  Line 106-As the authors state later that there are additional smaller peaks for HapR that could be other direct targets, I think a brief mention of the methodology used to determine the cutoff for the 5 and 32 peaks for LuxO and HapR, respectively, would be informative here.

      We have added a little more text to the methods section. The added text states “Note that our cut- off was selected to identify only completely unambiguous binding peaks. Hence, weak or less reproducible binding signals, even if representing known targets, were excluded (see Discussion for further details)”.

      4.  Line 118-Need a reference here to the prior HapR binding site.

      This has been added.

      5.  Figs. 1e-What do the numbers on the x-axis refer to? Why not just present these data as bases? The authors also refer to distance to the nearest start codon, but this is irrelevant for 4/5 of the luxO targets as they are sRNAs. They should really refer to the distance to the transcription start site. Likewise, for HapR, distance to the nearest start codon is not as informative as distance to the nearest transcription start site. A recent paper used transcriptomics to map all the transcription start sites of V. cholerae, and these results should be integrated into the author's study rather than just using the nearest start codon (PMID: 25646441).

      The numbers are kilo base pairs, this has been added to the axis label. We have also changed “start codon” to “gene start” (since “gene start” is also suitable for genes that encode untranslated RNAs).

      Re comparing binding peak positions to transcription start sites (TSSs) rather than gene starts, this analysis would be useful if TSSs could be detected for all genes. However, some genes are not expressed under the conditions tested by PMID: 25646441, so no TSS is found. Consequently, for HapR or LuxO bound at such locations, we would not be able to calculate a meaningful position relative to the TSS. We stress that the point of the analysis is to determine how peaks are positioned with respect to genes (i.e. that sites cluster near gene 5’ ends). Also note that nearest TSSs are now shown in the revised Table 1. In some cases, these are unlikely to be the TSS actually subject to regulation (e.g. because the regulated gene is switched off).

      6.  Fig. 1e-Is there directionality to the site? In other words, if a HapR binding site is located between two genes that are transcribed in opposite directions, is there a way to predict which gene is regulated? It looks like this might be the case with the list presented in Table 1, but how such determination is made and what the various symbol in Table 1 mean are not clear to me. This also has ramifications for Fig. 2a as the direction to construct the fusion is critical for the experiment.

      The site is a palindrome so lacks directionality. The best prediction re regulation is likely to be positioning with respect to the nearest TSS (which is now included in Table 1). However, this would remain just a prediction and, where TSSs are in odd locations with respect to binding sites (taking into account the caveats above) predictions would be unreliable.

      We are unsure which symbol the reviewer refers to in Table 1, a full explanation of any symbols used is provided in the table footnotes.

      With respect to Figure 2a, if sites were between divergent genes, and met our other criteria, we tested for regulation in both directions. For example, see the divergent genes VCA0662 (classified inactive) and VCA0663 (classified repressed).

      7.  Fig. 2a-It is a little disappointing that the authors use LacZ fusions to measure transcription as this is not the most sensitive reporter gene. Luciferase gene fusions would have been much more sensitive. Also, did the authors examine multiple time points. The methods only describe "mid-log phase" but some of the inactive promoters could be expressed at other time points. Also, it would be simple to repeat this experiment in different media, such as minimal with glucose or another non- CRP carbon source, to expand which promoters are expressed.

      The reviewer is correct regarding the sensitivity of β-galactosidase, which is very stable and so accumulates as cells grow. Even so, this reporter has been used very successfully, across thousands of studies, for decades. We did not examine multiple timepoints. We appreciate that the 23 promoter::lacZ fusions could be re-examined using varying growth conditions but this is unlikely to impact the overall conclusions, though it could generate some new leads for future work.

      8.  Fig. 2a legend-typos

      This has been corrected.

      9.  Line 138-I think you mean Fig. 2a here.

      This has been corrected.

      10.  Fig. 2b and many additional figures quantify band intensity but do not show any replication or error. Therefore, it is impossible to gauge reproducibility of these experiments.

      We have added a reproducibility statement (all experiments were done multiple times with similar results) as is standard throughout the literature. Also note that there is a lot of internal replication between figures. Figure 4d and Figure 5e lanes 1-9 show essentially the same experiment (albeit with slightly different protein concentrations) and very similar results. To the same effect, Figure 5e lanes 10-18 and lanes 19-27 show the same experiment for two different mutations of the same CRP residue. Again, the results are very similar. Also see the response to your comment 15 below.

      11.  Fig. 4a-lanes 2-4-the footprint does not change with additional CRP. In other words, it looks the same at the lowest concentration of CRP versus the highest concentration of CRP. The footprints for HapR look similar. This is somewhat troubling as in these types of experiments one would like to observe a dose dependent change in the footprint correlating with more DNA occupancy.

      For CRP we agree but are not concerned at all by this. The site is simply full occupied at the lowest protein concentration tested. Given that the footprint exactly coincides with a near consensus CRP site (which, when mutated, abolishes CRP binding in EMSAs, and regulation by CRP in vivo) all our results are perfectly consistent. Note that i) our only aim in this experiment was to determine the positions of CRP and HapR binding ii) our conclusions are independently backed up using gel shifts and by making promoter mutations. With respect to HapR, there are changes at the periphery of the main footprint.

      12.  Fig. 4e-Why does the transcriptional activation of murQP decrease with increasing concentrations of CRP? This is also seen in Fig. 5e.

      In our experience, this often does happen when doing in vitro transcription assays (with CRP and many other activators). The anecdotal explanation is that, at higher concentrations, the regulator can start to bind the DNA non-specifically and so interfere with transcription.

      13. The authors demonstrate in vitro that HapR requires binding of CRP to bind the murQP promoter. It would strengthen their model if they demonstrated this in vivo. To do this, the authors only need to repeat their ChIP-Seq experiment in a delta CRP mutant and the HapR signal at murQP would be lost. In fact, such an experiment would experimentally confirm which of the in vivo HapR binding sites are CRP dependent.

      We agree, appreciate the comment, and do plan to do such experiments in the future as a wider assessment of interactions between transcription factors. However, doing this does have substantial time and resource implications that we cannot devote to the project at present. We feel that our overall conclusions, regarding co-operative interactions between HapR and CRP at PmurQP, are well supported by the data already provided. This also seems the overall opinion of the reviewers.

      14.  Fig. 5b-I am confused by the Venn diagram. The text states that "In all cases, the CRP and HapR targets were offset by 1 bp", but the diagram only shows 7 overlapping sites. The authors need to better describe these data.

      We mean that, in all cases where sites overlap, sites are offset by 1 bp (i.e. we didn’t find any sites

      overlapping but offset by 2, 3 4 bp etc).

      15. Line 287-288 and Fig. 5d-The authors state that HapR binds with less affinity to the CRP E55A mutant protein bound to DNA. There does seem to be a difference in the amount of shifted bands at the equivalent concentrations of HapR, but the difference is subtle. In order to make such a conclusion, the authors should show replication of the data and calculate the variability in the results. The authors should also use these data to determine the actual binding affinities of HapR to WT and the E55A mutant CRP, along with error or confidence intervals.

      All of these experiments have been run multiple times and we are absolutely confident of the result. With respect to Figure 5d, this was done many times. We note that not all experiments were exact repeats. E.g. some of the first attempts had fewer HapR concentrations. Even so, the defect in HapR binding to the CRP E55A complex was always evident. The two gels to the left show the final two iterations of this experiment (these are exact repeats). The top image is that shown in Figure 5d. The lower image is an equivalent experiment run a day or so previously. Both clearly show a defect in HapR binding to the CRP E55A complex. We appreciate that our conclusion re these experiments is somewhat qualitative (i.e. that HapR binds the CRP E55A complex less readily) but this is not out of kilter with the vast majority of similar literature and our results are clearly reproducible.

      16.  Fig. 6a-It is odd that the locked low cell density mutants have such a growth defect in MurNAc, minimal glucose, and LB. To my knowledge, such a growth defect is not common with these strains. Perhaps this has to do with the specific growth conditions used here, but I can't find that information in the manuscript (it should be there). Furthermore, the growth rate of the luxO and hapR mutants appears to be similar up to the branch point (i.e. slope of the curve), but the lag phage of the luxO mutant is much longer. The authors need to address these issues in relationship to previous published literature and specify their growth conditions because the results are not consistent with their simple model described in Fig 6b.

      This comment is a little difficult to pick apart as it covers several different issues. We’ll try and

      answer these individually.

      a)     The unusual “biphasic growth curve with hapR and hapRluxO cells: We do not know why cells lacking hapR have a growth curve that appears biphasic. We can only assume that this is due to some regulatory effect of HapR, distinct from the murQP locus. Despite the unusual shape of the growth curve, the data are consistent with our conclusions.

      b)     The extended lag phase of the luxO mutant in minimal media + MurNAc: We appreciate this comment and had considered possible explanations prior to submission. In the end, we left out this speculation but are happy to include it as part of our response. The extended lag phase might be expected if CRP/HapR regulation is largely critical for controlling the basal transcription of murQP. The locus is likely also regulated by the upstream repressor MurR (VC0204) as in E. coli. So, if deprepression of MurR overwhelms the effect of HapR on murQP, we think you would expect that once the cells start growing on MurNAc, the growth rates are unchanged. But the extended lag is due to the fact that it took longer for those cells to achieve the critical threshold of intracellular MurNAc-6-P necessary to drive murR derepression. Obviously, we can not provide a definitive answer.

      c)     We have added further details regarding growth conditions to the methods section and the Figure 6a legend.

      17.  Fig. S6-The data to this point with murPQ suggested a model in which CRP binding then enabled HapR binding. But these EMSA suggest that both situations occur as in some cases, such as VCA0691, HapR binding promotes CRP binding. How does such a result fit with the structural model presented in Fig. 5?

      This is to be expected and is fully consistent with the model. Cooperativity is a two-way street, and each protein will stabilise binding of the other. Clearly, it will not always be the case that the shared DNA site will have a higher affinity for CRP than HapR (as at PmurQP). Depending on the shared site sequence, expected that sometimes HapR will bind “first” and then stabilise binding of CRP.

      18. Line 354-356-The HCD state of V. cholerae occurs in mid-exponential phase and several cell divisions occur before stationary phase and the cessation of growth, at least in normal laboratory conditions. Therefore, there is not support for the argument that QS is a mechanism to redirect cell wall components at HCD because cell wall synthesis is no longer needed.

      We did not intent to suggest cell wall synthesis is not needed at all, rather that there is a reduced need. We made a slight change to the discussion to reflect this.

      19. Line 357-360-Again, as stated in point 16, the statement that cells locked in the HCD are "defective for growth" is an oversimplification. The luxO mutants have a longer lag phage, but they actually outgrow the hapR mutants at higher cell densities and reach the maximum yield much faster.

      In fairness, we do go on to specify that the defect is an extended lag phase. Also see our response above.

      Reviewer #2 (Recommendations For The Authors):

      Comments to improve the text

      1)  Line 103-106, line 130, line 136, etc. Details of the methods and the text directing to presentations of figures should be in the methods and/or figure legends with (Figure x) in citation after the statement. The sentences in lines indicated can be deleted from the results. Although several lines are noted specifically here, this comment should be applied throughout the entire results section.

      We appreciate this comment but would prefer not to make this change (it seems mainly an issue of personal stylistic choice). It is sometimes helpful for the reader to include such information as it avoids them having to cross reference between different parts of the manuscript.

      2)  Line 115. Recommend a paragraph between content on LuxO and HapR (before "Of the 32 peaks for HapR binding")

      We agree and have made this change.

      3)  Line 138 and Figure 1a. I am not convinced this gel shows that VC1375 is activated by HapR. Is the arrow pointing to the wrong band? There does seem to be an induced band lower down.

      We understand this comment as it is a little difficult to see the induced band. This is because this is a compressed area of the gel and the transcript is near to an additional band.

      4)  Line 147. Add the VC0206-VC0207 next to murQP (and the gene name murQP into Table 1).

      We have added the gene name to the figure foot note. The text has been changed as requested.

      5) Methods. It is essential for this paper to have detailed methods on the bacterial growth conditions. Referring to prior paper, bacteria were grown in LB (add composition...is this high salt LB often used for vibrios or low salt LB often used for E. coli). Growth is to "mid log". Please provide the OD at collection. Is mid log really considered "high density". Provide a reference regarding HapR activity at mid log to support the method. Could the earlier collection of bacteria account for missing known HapR regulated genes? In preparing the requested ç, include growth conditions for other experiments in the legends.

      Note that we have included a new supplementary table, rather than a Venn diagram. We have also added further details of growth conditions as mentioned above. Also not that, for the ChIP-seq, HapR and LuxO were expressed ectopically and so uncoupled from the switch between low and high cell density.

      6)  Content of Table 1, HapR Chip-seq peaks, needs to be closely double checked to the collected data as there seems to be some errors. Specifically, VC0880 and VC0882 listed under Chromosome I are most likely VCA0880 (MakD) and VCA0882 (MakB), both known HapR induced genes on Chromosome II with VCA0880 previously validated by EMSA. This notable error suggests the table may have other errors and thus requires a very detailed check to assure its accuracy.

      We appreciate the attention to detail! We have double checked, thankfully this is not an error, the table is correct (even so, we have also checked all other entries in the table). As an aside, VCA0880 is one of the locations for which we see a weak HapR binding signal below our cut-off (included in the new Table S1). In cross checking between Table 1 and all other data in the paper we noticed that we had erroneously included assay data for VC0620 in Figure 2A. This was not one of our ChIP-seq targets but had been assayed at the same time several years ago. This datapoint, which wasn’t related to any other part of the manuscript, has been removed.

      If VCA0880 and VCA0882 are correctly placed on Chr. I, then add comment to text that the Mak toxin genomic island found on Chromosome II in N16961 is on Chr. I in E7946. (See recent references PMID: 30271941, 35435721, 36194176, 34799450).

      See above, this is not an error.

      7)  Alternatively for both comments 8 & 9, are these problems of present/missing genes or misannotations the result of the annotation of E7946 gene names not aligning with gene names of N16961? (if so, in Table 1, please give the gene name as in E7946 but include a separate column with the N16961 name for cross study comparison)

      See above and below, this is not an issue.

      8)  Line 126-127. Also regarding Table 1, please add a column with function gene annotation. For example, VC0916 needs to be identified as vpsU. If function is unknown, type unknown in the column. This will help validate the approach of selecting "HapR target promoters where adjacent coding sequence could be used to predict protein function."

      We added an extra column to Table 1 in response to a separate reviewer request (TSS locations). This leaves no space for any additional columns. Instead, to accommodate the reviewer’s request, we have added alternative gene names to the footnote.

      Not following up on VCA0880 (promoter for the mak operon) is a sad missed opportunity here as it is one of the most strongly upregulated genes by HapR (PMC2677876)

      As noted above, this was not an error and VCA0880 was not one of our 32 HapR targets. As such, we would not have followed this up.

      9)  Figure Legends. Add a unit to the bar graphs in Figure 1e (should be kb??) This has been corrected.

      10) The yellow color text labels in figures 3c, 4a, 4c are difficult to read. Can you use an alternative darker color for CRP.

      We have made this slightly darker (although to our eye it is easily reliable). We haven’t changed the colour too much, for consistency with colour coding elsewhere.

      11) Figure S3. Binding is misspelled. Add units to the x-axis

      This has been corrected.

      12) Figure S7. The text in this figure is too small to read. Figure could be enlarged to full page or text enlarged. Are these 4 the only other known regulated promoters? Could all the known alternative promoters linked to HapR be similarly probed?

      We have increased the font size and included a new Table S1 for all previously proposed HapR sites.

      13) Figure S8. Original images..are any of these the replicate gels (see public comment 6)

      We have added a statement regarding reproducibility, and also note the internal reproducibility between different figures in our reviewer response. The gels in Figure S8 are full uncropped versions of those shown in the main figures.

      Reviewer #3 (Recommendations For The Authors):

      None

      Whilst there were no specific recommendations from this reviewer, we have still responded to the public review and made changes if required.

  2. May 2023
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Dear Editor and reviewers,

      Thank you very much for the thorough assessment of our manuscript. We have carefully considered the comments and reflected most of them in the new version. We recognized the need to shorten and clarify the manuscript. Therefore, we have omitted particularly the less important passages concerning metabolism and the loss of genes encoding mitochondrial proteins, which cut the text by six pages in the current layout. We have also removed the text relating this model to eukaryogenesis. Finally, we have slightly changed the structure and linked the different sections to improve the flow of the story and to emphasize the key messages, which are the absence of mitochondria in a large proportion of oxymonads and the impact of this loss, loss of Golgi stacking and transformation to endobiotic lifestyle on selected gene inventories. We hope the manuscript is now clear and more concise and will be of interest to a broad readership interested in the evolution of eukaryotes, mitochondria and protists.

      1. Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity):

      This is a very interesting paper that investigates through detailed comparative genomics the tempo and mode of the evolution of microbial eukaryotes/protists members of the Metamonada with a focus on Preaxostyla, currently the only known lineage among eukaryotes to have species that have lost, by all accounts, the mitochondria organelle all together. Notably, it includes a free-living representative of the lineage allowing potential interesting comparison between lifestyles among the Preaxostyla. This is a generally nicely crafted manuscript that presents well supported conclusions based on good quality genome sequence assemblies and careful annotations. The manuscript presents in particular (i) additional evidence for the common role of LGT from various bacterial sources into eukaryotic lineages and (ii) more details on the transition from a free-living lifestyle to an endobiotic one and (iii) the related evolution of MROs and associated metabolism.

      Thank you very much for the positive assessment.

      I have some comments to improve a few details:

      In the introduction, lines 42-43, the last sentence should be more conservative by replacing "whole Oxymonadida" with "...all known/investigated Oxymonadida".

      The sentence has been changed to: "Our results provide insights into the metabolic and endomembrane evolution, but most strikingly the data confirm the complete loss of mitochondria and every protein that has ever participated in the mitochondrion function for all three oxymonad species (M. exilis, B. nauphoetae, and Streblomastix strix) extending the amitochondriate status to all investigated Oxymonadida."

      Similarly on line 62, the sentence could state "... contain 140 described...".

      The sentence has been changed to: "Oxymonadida contain approximately 140 described species of morphologically divergent and diverse flagellates exclusively inhabiting digestive tracts of metazoans, of which none has been shown to possess a mitochondrion by cytological investigations (Hampl 2017)."

      When discussing the estimated completeness of the genome are discussed (lines 117-120) and contrasted with the values for Trypanosoma brucei and other genomes, the author should explicitly state that these genomes are considered complete, which seems is what they imply, is that the case? If so, please provide more details to support this idea.

      We have elaborated on this part also in reaction to comments of other reviewers. The text now reads: "It should be noted that, despite their wide usage, BUSCO values are not expected to reach 100% in lineages distant from model eukaryotes simply due to the true absence (or high sequence divergence) of some of the assessed marker genes. For example, various Euglenozoa representatives with highly complete genome sequences, including Trypanosoma brucei, have BUSCO completeness estimates in the range of 71-88% (Butenko et al. 2020), and representatives of Metamonada fall within the range of 60-91% (Salas-Leiva et al. 2021). Specifically in the case of oxymonad M. exilis, the improvement of the genome assembly using long-read resequencing from 2092 scaffolds to 101 contigs led to only a marginal increase of BUSCO value from 75.3 to 77.5 (Treitli et al. 2021). "

      Also please see the detailed table prepared in response to reviewers 2 and 3 summarizing the presence/absence of genes from BUSCO set in the selected representatives of Metamonada and Trypanosoma brucei. The table is commented in the answer to Reviewer 3 comment (page 18)

      The supplementary file named "132671_0_supp_2540708_rmsn23" is listed as a Table SX? (note: I found it rather difficult to establish exactly what file corresponds to what document referred in the main text)

      We apologize for this mistake. We have checked and corrected references to tables, figures and supplementary material throughout the manuscript and hope it now does not contain any errors.

      Lines 243-245, where 46 LGTs are discussed, it is relevant that the authors investigate their functional annotations. Indeed, it is suggested that these could have adaptive values, hence investigating their functional annotation will allow the authors to comment on this possibility in more details and precision. When discussing LGTs it would also be very useful to cite relevant reviews on the topic - covering their origins, functional relevance when known, distribution among eukaryotes. This is done when discussing the evolution and characteristics of MROs but not when discussing LGTs, with several reviews cited and integrated in the discussion of the data and their interpretation.

      Available annotations of all putative LGT genes are provided in Supplementary_file_3 and also in the Supplementary_file_6 if the gene belongs to a manually annotated cellular system. Although we agree with the reviewer that the discussion of 46 species-specific LGTs might be interesting, for the sake of conciseness and brevity of the manuscript, we have decided not to expand the discussion further. However, note that we discuss selected cases of P. pyriformis-specific LGTs in the part “P. pyriformis possesses unexpected metabolic capacities” which follows right after the lines reviewer is referring to.

      The sentence, lines 263-265, where the distribution of some LGTs are discussed, needs to be made more precise. When using the work "close" the authors presumably refer to shared/similar habitat,s or else? Entamoeba is not a close relative to the other listed taxa.

      The “close relatives” mentioned in the text were meant as close relatives of all p-cresol-synthesizing taxa discussed in the paragraph, including Mastigamoeba, i.e. a specific relative of Entamoeba. We have modified the text such as to make the intended meaning easier to follow.

      Lines 346-348, that sentence needs to end with a citation (e.g. Carlton et al. 2007).

      The citation proposed by the reviewer has been added. The sentence was changed to: " The most gene-rich group of membrane transporters identified in Preaxostyla is the ATP-binding cassette (ABC) superfamily represented by MRP and pATPase families, just like in T. vaginalis (Carlton et al. 2007). "

      In the paragraph (line 580-585) discussing ATP transporters, note that Major et al. (2017) did not describes NTTs but distantly related members of MSF transporter, shared across a broader range of organisms then the NTTs. Did the authors check if the genome of interest encoded homologues of these transporters too?

      The citation has been removed; we admit that it was not the most appropriate one in the given

      context. Concerning the NTT-like transporters, encouraged by the reviewer we searched for them in the Preaxostyla genome and transcriptome assemblies and found no candidates. This is not explicitly stated in the revised manuscript. The paragraph now reads: “MROs export or import ATP and other metabolites typically using transporters from the mitochondrial carrier family (MCF) or sporadically by the bacterial-type (NTT-like) nucleotide transporters (Tsaousis et al. 2008). We did not identify any homolog of genes encoding proteins from these two families in any of the three oxymonads investigated. In contrast, MCF carriers, but not NTT-like nucleotide transporters, were recovered in the number of four for each P. pyriformis and T. marina (Supplementary file 6).

      Line 920-921, I don't understand how the number 30 relates to "guarantee" inferring the directionality of LGTs events. This will be very much dataset dependent, 100 sequences might still not allow to infer directionality of LGT events. The authors probably meant to "increase the possibility to infer directionality".

      We agree the original wording has not been particularly fortunate, so the sentence has changed to: "Files with 30 sequences or fewer were discarded, as the chance directionality of the transfer can be determined with any confidence is low when the gene family is represented by a small number of representatives."

      Reviewer #2 (Evidence, reproducibility and clarity):

      Using draft genome sequencing of the free-living Paratrimastix pyriformis and the sister lineage oxymonad Blattamonas nauphoetae, Novack et al. infer the metabolic potential of the two protists using comparative genomics. The authors conclude that the common oxymonad ancestor lost the mitochondrion/mitosome and discuss general strategies for adapting to commensal/symbiotic life-style employed by this taxon. Some elaborations on pathways go on for several paragraphs and feel unnecessarily stretched, which made those sections of the paper rather difficult to digest.

      Having seen reflections on the manuscript by three reviewers we carefully reconsidered its content and attempted to make it shorter and more compact by removing some of the less substantial material. Namely, we have dispensed completely with the original last section of Results and Discussion (“No evidence for subcellular retargeting of ancestral mitochondrial proteins in oxymonads”) and made various cuts throughout other sections. We hope that the revised version makes a substantially better job of delivering the key messages of our study to the readers compared to the original submission.

      This might be also be because the work, and all conclusions drawn, depend entirely on incomplete (ca. 70-80%) genome data and simple similarity searches, and e.g. no kind of biochemistry or imaging is presented to underpin the manuscripts discussion.

      This is a very crude and superficial assessment of our data. We have actually good reasons to believe that the genome assemblies are close to complete. Please see the discussion on this topic below and an answer to a particular comment from reviewer 3 (page 18).

      This is noteworthy in light of other protist genome reports published in the last few years that differ in this respect, including previous work by this group. And for sequencing-only data, this paper - https://doi.org/10.1016/j.dib.2023.108990 - might offer an example of where we are at in 2023.

      Frankly, we do not think it is fair or relevant to compare our study to the paper pointed to by the reviewer, as that paper reports on a metagenomic study that delivers a set of metagenomically assembled genomes (MAGs) of varying quality retrieved from environmental DNA samples without providing any in-depth analysis of the gene content. Our study is very different in its scope and aims, and we are not certain what lesson we should take from this reviewer’s point. We have good reasons to believe that the datasets are close to complete. Please see the discussion on this topic below and answer to comment of reviewer 3 (page 18).

      With respect to previous work of the group (Karnkowska et al. 2016 and 2019), this submission is very similar (analysis pattern, even some figures and more or less the conclusion), i.e. to say, the overall progress for the broader audience is rather incremental. Then there are also some incidents, where the data presented conflicts with the author‘s own interpretation.

      It was our intention to use the previous analytical experiences and approaches, which at the same time makes the new results comparable with those published before. Although the format is intentionally similar, this work is a substantial step forward because only with our present study the amitochondrial status of the large part of Oxymonadida group can be considered solidly established. This in turn allows us to estimate the timing of the loss of mitochondrion (more than 100 MYA) demonstrating that the absence of mitochondrion in this group is not an episodic transient state but a long-established status. We do not understand what exactly the reviewer had in mind when pointing to “incidents, where the data presented conflicts with the author‘s own interpretation” – we are not aware of such cases.

      The text (including spelling and grammar) needs some attention and the choice of words is sometimes awkward. The overuse of quotation marks ("classical", "simple", "fused", "hits", "candidate") is confusing (e.g. was the BLAST result a hit or a "hit").

      The whole text has been carefully checked and the language corrected whenever necessary by a one of the co-authors, who is a native English speaker. The use of quotation marks has been restricted as per the reviewer’s recommendation.

      In its current formn the manuscript is, unfortunately, very difficult to review. This reviewer had to make considerable efforts to go through this very large manuscript, mainly because of issues affecting to the presentation and the lack of clarity and conciseness of the text. It would be greatly appreciated if the authors would make more efforts upfront, before submission, to make their work more easily accessible both to readers and facilitate the task of the reviewers.

      We admit that the story we are trying to tell is a complex one, consisting of multiple pieces whose integration into a coherent whole is a challenging task. As stated above, the reports provided by the reviewers provided us with an important stimulus, leading us to substantially modify the manuscript to make it more concise, less ambiguous when it comes to particular claims, and easier to read. We hope this intention has been fulfilled to a larger degree.

      About a fifth of the two genome is missing according the authors prediction (table 1). Early on they explain the (estimated) incompleteness of the genomes to be a result from core genes being highly divergent. In light of this already suspected high divergence, using (the simplest NCBI) sequence similarity approach to call out the absence of proteins (for any given lineage) may need lineage-specific optimization. The use of more structural motif-guided approaches such as hidden Markov models could help, but it is not clear whether it was used throughout or only for the search for (missing) mitochondrial import and maturation machinery. The authors state that the low completeness numbers are common among protists, which, if true, raises several questions: how useful are then such tools/estimates to begin with and does this then not render some core conclusions problematic? The reader is just left with this speculation in the absence of any plausible explanation except for some references on other species for which, again, no context is provided. Do they have similar issues such as GC-content, same core genes missing, phylogenetic relevance?, etc.. No info is provided, the reader is expected to simply accept this as a fact and then also accept the fact that despite this flaw, all conclusions of the paper that rests on the presence/absence of genes are fine. This is all odd and further skews the interpretations and the comparative nature of the paper.

      The question of the completeness of the data sets was raised also by reviewer 3 and we would like to provide an explanation at this point. First, it should be stated that there is no ideal and objective way how to measure the completeness of the eukaryotic genomic assembly. In the manuscript, we have used the best established method, adopted by the community at large, which is based on the search for a set of „core eukaryotic genes“ using a standardized pipeline BUSCO or previously popular CEGMA. The pipeline uses its own tools to identify the homologues of genes/proteins which ensures standardization of the procedure. This answers the question of reviewer 2, why we have not used more sensitive tools for these searches. We did not use them, because we followed the procedure that is the gold standard for such assessments, for comparability with other genomes and to make this as clear to the reader as possible. Although the result of the pipeline is usually interpreted as the completeness of the assembly, this is a simplification. Strictly speaking, the result is a percentage of the genes from the set of 303 core eukaryotic genes (in our case) which were detected in the assembly by the pipeline. Even in complete assemblies, the value is usually below 100% because some of the genes are not present in the organism and some diverged beyond recognition. We do not see any other way how to deal with this drawback than to compare with related complete genome assemblies acting as standards. This we have done in Supplementary file 11, where we list the presence/absence of each gene for Preaxostyla species and three highly complete assemblies of Trypanosoma brucei, Giardia intestinalis and Trichomonas vaginalis. T. brucei and G. intestinalis are assembled into chromosomes. As you can see, in these three „standards“ 63, 148 and 77 genes from the core were not detected resulting in BUSCO completeness values of 79%, 51% and 75%, respectively. 18 of the non-detected genes function in mitochondria (shown in red), which are highly reduced in some of these species, so the absence of the respective genes is therefore expected. Simply not considering these genes would increase the “completeness measure” for oxymonads by 6%. The values for our standards are not higher than the values for Preaxostyla (69-82%). In summary, the BUSCO incompleteness measure is far from ideal, particularly in these obscure groups of eukaryotes. The values received for Preaxostyla give no reason for concern about their incompleteness. See also our answer to reviewer 3 (page 18).

      At the same time, we admit that the BUSCO values do not confirm the high completeness of our assemblies. So, why do we think they are highly complete? One reason is that we do not see suspicious gaps in any of the many pathways which we annotated but the main reason is the high contiguity of the assemblies. Thanks to Nanopore long read sequencing, the assembly of P. pyriformis and B. nauphoetae compose of 633 and 879 scaffolds, suggesting that there are “only” hundreds of gaps. Although this may still sound too much, it is a relatively good achievement for genomes of this size and the experience shows that a further decrease in the number of scaffolds would allow the detection of additional genes but not in huge numbers. As we have shown for M. exilis (Treitli et al. 2021, doi:10.1099/mgen.0.000745) the decrease from 2 092 scaffolds to 101 contigs, i.e., filling almost 2 000 gaps, allowed the prediction of additional 1 829 complete gene models, of which 1 714 were already present in the previous assembly but only partially and just 115 were completely new. None of these newly predicted genes was functionally related to the mitochondrion. Thus, we infer the chance that all mitochondrion-related genes are hidden in the gaps of assemblies is very low.

      We have provided these arguments in a condensed form in the text following the description of genome assemblies: “It should be noted that, despite their wide usage, BUSCO values are not expected to reach 100% in lineages distant from model eukaryotes simply due to the true absence (or high sequence divergence) of some of the assessed marker genes. For example, various Euglenozoa representatives with highly complete genome sequences, including Trypanosoma brucei, have BUSCO completeness estimates in the range of 71-88% (Butenko et al. 2020), and representatives of Metamonada fall within the range of 60-91% (Salas-Leiva et al. 2021). Specifically in the case of oxymonad M. exilis, the improvement of the genome assembly using long-read resequencing from 2092 scaffolds to 101 contigs led to only a marginal increase of BUSCO value from 75.3 to 77.5 (Treitli et al. 2021).

      As a side note, this will also influence the number of proteins absent in other lineages and as such has consequences on LGT calls versus de novo invention. For the cases with LGT as an explanation, it would help to briefly discuss the candidate donors and some details of the proteins in the eco-physiological context (e.g. lines 263-268 suggest that HPAD may have been acquired by EGT which was facilitated by a shared anaerobic habitat and also comment on adaptive values for acquiring this gene). Exchanging metabolic genes via LGT (Line 163) blurs the differences between roles and extent of LGT in prokaryote vs eukaryote, and therefore is exciting and could use support/arguments other than phylogenies. I guess the number of reported LGTs among protists (whatever the source) over the last decade has by now deflated the novelty of the issue in more general; a report of the numbers is expected but they alone won't get you far anymore in the absence of a good story (such as e.g. work on plant cell wall degrading enzymes in beetles).

      We agree with the reviewer that the cases of LGT involving Preaxostyla would deserve more discussion in the manuscript. On the other hand, we also agree that none of them provides such a “cool” story that would deserve a special chapter or even a separate paper. Therefore, we have decided, also with regard to keeping the text in a reasonable dimension, not to expand the discussion of LGTs with the exception of HgcAB, where some new information has been included and the phylogeny of the genes updated. Please note that we had discussed in the original manuscript the donor lineages and ecological/biochemical context in the cases of GCS-L2, HPAD, UbiE, and NAD+ synthesis and this material has been kept also in the revised version.

      It would help to clarify which parts of the mitochondrial ancestor were reduced during the process of reductive evolution at what time in their hypothesized trajectory. For instance, loosing enzymes of anaerobic metabolism conflicts with the argued case of an aerobic (as opposed to facultative anaerobic) mitochondrial ancestor followed by gains of anaerobic metabolism in the rest of the eukaryotes via LGT, and some papers the authors themselves cite (e.g. the series by Stairs et al.). There is no coherent picture on LGT and anaerobic metabolism, although a reader is right to expect one.

      These are very interesting questions, that would fill a separate article. In the manuscript, we focus on the Preaxostyla lineage only and there the trajectory seems relatively simple: replacement of the mitochondrial ISC by cytosolic SUF in the common ancestor of Preaxostyla, loss of methionine cycle and in in consequence mitochondrial GCS and the mitochondrion itself. We have modified the first conclusion paragraph in this sense and it now reads the following:

      The switch to the SUF pathway in these species has apparently not affected the number of Fe-S-containing proteins but led to a decrease in the usage of 2Fe-2S clusters. The loss of MRO impacted particularly the pathways of amino acid metabolism and might relate also to the loss of large hydrogenases in oxymonads.

      It is not clear to us how to understand the reviewer’s remark concerning the conflict between loss of enzymes of anaerobic metabolism and the (presumed) aerobic nature of the mitochondrial ancestor. Provided that we read the reviewer’s rationale correctly, is it really so implausible that the anaerobic metabolism gained laterally by a particular lineage was then secondarily lost in specific descendant lineages? As a clear example demonstrating the feasibility of such an evolutionary pattern consider the evolution of plastids. There is no doubt these organelles move across eukaryotes by secondary or higher-order endosymbiosis or kletoplastidy, establishing themselves in lineages where there was no plastid before. Secondary simplification of such plastids, e.g. by the loss of photosynthesis, in its extreme form culminating in the complete loss of the organelle, has been robustly documented from several lineages, such as Myzozoa (e.g., https://pubmed.ncbi.nlm.nih.gov/36610734/). Hence, we see absolutely no reason to rule out the possibility that the ancestral mitochondrion was obligately aerobic and enzymes of anaerobic metabolism spread secondarily by eukaryote-to-eukaryote LGT, with their secondary loss in particular lineages. We really do not see any conflict here and we do not agree with the interpretation provided by the reviewer. That said, we admit that the discussion on the earliest stages of mitochondrial evolution is not an essential ingredient of the story we try to tell in our manuscript, so to avoid any unnecessary misunderstanding we have removed the original last sentence of Conclusions (“Thorough searches revealed …”) from the revised manuscript.

      In light of their data the authors also discuss the importance of the mitochondrion with respect to the origin of eukaryotes:

      First, the mitochondrion brought thousands of genes into the marriage with an archaeon, surely hundreds of which provided the material to invent novel gene families through fusions and exon shuffling and some of which likely went back and forth over the >billion years of evolution with respect to localizations. The authors look at a minor subset of proteins (pretty much only those of protein import, Fig. 6) to conclude, in the abstract no less: „most strikingly the data confirm the complete loss of mitochondria and every protein that has ever participated in the mitochondrion function for all three oxymonad species." I do not question the lack of a mitochondrion here, but this abstract sentence is theatrical in nature, nothing that data on an extant species could ever proof in the absence of a time machine, and is evolutionary pretty much impossible. A puzzling sentence to read in an abstract and endosymbiont-associated evolution.

      We feel that the reviewer is putting too much emphasis on an aspect of our original manuscript that is rather peripheral to its major message. Indeed, the manuscript is not, and has never been thought to be, primarily about eukaryogenesis and the exact role the mitochondrion played in it. We are, therefore, somewhat reluctant to react in full to the very long and complex argument the reviewer has raised in his/her report, so we keep our reaction at the necessary minimum. Concerning the criticized sentence in the original version of the abstract, it alluded to a section of the manuscript (“No evidence for subcellular retargeting of ancestral mitochondrial proteins in oxymonads”) that we have removed from the revised version, and hence we have modified also the abstract accordingly by removing the sentence. We still think our original arguments were valid, but apparently, much more space and more detailed analyses are required to deliver a truly convincing case, for which there is no space in the manuscript.

      Second, using oxymonads as an example that a lineage can present eukaryotic complexity in the absence of mitochondria and conflating it with eukaryogenesis is a logical fallacy. This issue already affected the 2019 study by Hampl et al.. We have known that a eukaryote can survive without an ATP-synthesizing electron transport chain ever since Giardia and other similar examples and the loss of Fe-S biosynthesis and the last bit of mitosome (secondary loss) doesn't make a difference how to think about eukaryogenesis. It confuses the need and cost to invent XYZ with the need and cost of maintenance. How can the authors write "... and undergo pronounced morphological evolution", when they evidently observe the opposite and show so in their Fig. 1? The authors only present evidence for reductive evolution of cellular complexity with the loss of a stacked Golgi. What morphological complexity did oxymonads evolve that is absent in other protists? A cytosolic metabolic pathway doesn't count in this respect, because it is neither morphological, nor was it invented but likely gained through LGT according to the authors. This is quite confusing to say the least. A recent paper (https://doi.org/10.7554/eLife.81033) that refers to Hampl et al. 2019 has picked this up already, and I quote: "Such parasites or commensals have engaged an evolutionary path characterized by energetic dependency. Their complexity might diminish over evolutionary timescale, should they not go extinct with their hosts first." Here the authors raise a red flag with respect to using only parasites and commensals that rely on other eukaryotes with canonical mitochondria as examples. If we now look at Fig. 1 of this submission, Novak et al. underpin this point perfectly, as the origin of oxymonads is apparently connected to the strict dependency on another eukaryote (or am I wrong?), and they support the prediction with respect to complexity reducing after the loss of mitochondria - mitosome gone, Golgi almost gone. What's next? This is a good time to remember that extant oxymonads are only a single picture frame in the movie that is evolution, and their evolution might be a dead-end or result in a prokaryote-like state should they survive 100.000s to millions of years to come.

      It seems that in this point the reviewer is particularly concerned with the following sentence that is part of the Introduction and which relates to the existence of amitochondrial eukaryotes we are studying: “The existence of such an organism implies that mitochondria are not necessary for the thriving of complex eukaryotic organisms, which also has important bearings to our thinking about the origin of eukaryotes (Hampl et al. 2018). Even after re-reading the sentence we confess we stay with it and find it perfectly logical. Nevertheless, we decided to omit it from the text so as not to distract from the main topic of the study.

      Next, when mentioning “… pronounced morphological evolution” we mean the evolution of four oxymonad families (Streblomastigidae, Oxymonadidae, Pyrsonymphidae and Saccinobaculidae) comprising almost a hundred described species with often giant and morphologically elaborated cells that evolved from a simple Trimastix-like ancestor (Hampl 2017, Handbook of Protists, 0.1007/978-3-319-32669-6_8-1). This is a fact that can hardly be dismissed. Also, given the current oxymonad phylogenies (Treitli et al. 2018, doi.org/10.1016/j.protis.2018.06.005) and the reported absence of a mitochondrion in M. exilis, B. nauphoetae, and S. strix we can infer that the mitochondrion was lost in the common ancestor of the three species at latest. This organism must have lived more than 100 MYA, as at that time oxymonads were clearly diversified into the families (Poinar 2009, 10.1186/1756-3305-2-12). So, these organisms indeed have lived without mitochondria for at least 100 MY. We think that these facts and our inferences based on them are solid enough to keep in the conclusion the following statement: “This fact moves this unique loss to at least 100 MYA deep past, when oxymonads had been already diversified (Poinar 2009), and shows that a eukaryotic lineage without mitochondria can thrive for eons and undergo pronounced morphological evolution, as is apparent from the range of shapes and specialized cellular structures exhibited by extant oxymonads (Hampl 2017).” Furthermore, as documented in Karnkowska et al. 2019 (https://pubmed.ncbi.nlm.nih.gov/31387118/), apart the loss of the mitochondrion oxymonads are surprisingly “normal” and complex eukaryotes, in fact much less reduced than, e.g., Giardia, Microsporidia, or even S. cerevisiae (in terms of the number of genes, introns, etc.). We strongly disagree with the claim that “Golgi is almost gone” in oxymonads, and our manuscript shows exactly the opposite. Viewing oxymonads as a lineage heading towards a prokaryote-like simplicity is dogmatic and ignores the known biology of these organisms.

      Some more thoughts: Line 47-52: Hydrogenosome or mitosome is a biological and established label as (m)any other and I find the use of the word "artificial" in this context strange. While the authors are correct to note that there is a (evolutionary) continuum in the reduction - obviously it is step by step - they exaggerate by referring to the existing labels as "artificial". You make Fe-S clusters but produce no ATP? Well, then you're a mitosome. It's a nomenclature that was defined decades ago and has proven correct and works. If the authors think they have a better scheme and definition, then please present one. Using the authors logic, terms such as amyloplast or the TxSS nomenclature for bacterial secretions systems are just as artificial. As is, this comes across as grumble for no good reason.

      We agree that the original wording sounded like unwarranted grumbling and we have changed the sentence in the following way: "However, exploration of a broader diversity of MRO-containing lineages makes it clear that MROs of various organisms form a functional continuum (Stairs et al. 2015; Klinger et al. 2016; Leger et al. 2017; Brännström et al. 2022)."

      Line 158: A duplication-divergence may also explain this since sequence similarity-based searches will miss the ancestral homologues.

      We do not disagree about this, in fact, the gene the reviewer’s point is concerned with for sure is a result of duplication and divergence, as it belongs to a broader gene family (major facilitator superfamily, as stated in the manuscript) together with other distant homologs. Nevertheless, this is not in conflict with our conclusion that it “may represent an innovation arising in the common ancestor of Metamonada”.

      Lines 201-202: Presence of GCS-L in amitochondriate should be explained in light of this group once having a mitochondrion, which then makes ancestral derivation and differential loss (as invoked for Rsg1) also a likely explanation along with eukaryote-to-eukaryote LGT.

      Yes, this most likely holds for the standard paralogue GCS-L1 (in P. pyriformis PAPYR_5544), which has the expected distribution and phylogenetic relationships and is absent in oxymonads. The discussion is, however, mainly about the rare, divergent and until now overlooked paralogue GCS-L2 (in P. pyriformis PAPYR_1328), which we found only in three distantly related eukaryote groups, Preaxostyla, Breviatea, and Archamoebae, which strongly suggests inter-eukaryotic LGT.

      Lines 356-392: Describes plenty of genomic signal for Golgi bodies but simultaneously cites literature suggesting the absence of a morphologically an identifiable Golgi in oxymonads. An explicit prediction regarding what to observe in TEM for the mentioned species might be nice to stimulate further work.

      We thank the reviewer for their suggestion and are glad that they are enthusiastic about this aspect of the manuscript. Unfortunately, the morphology of unstacked Golgi ranges from single cisternae (yeast, Entamoeba), vesicles (Mastigamoeba), and a “tubular membranous structure” in Naegleria. Therefore, no strong prediction is possible of what the oxymonad Golgi might look like under light or TEM. However, the data that we have provided should lead to molecular cell biological analyses aimed at identifying the organelle, giving target proteins to tag or against which to create antibodies as Golgi markers. An additional sentence to this effect has been added to the manuscript, “They also set the stage for molecular cell biological investigations of Golgi morphological variation, once robust tools for tagging in this lineage are developed.”

      Lines 414: The preceding paragraphs in this result section describes only the distribution, without mentioning origins - a sweeping one-line summary that proclaims different origin needs some context and support. Furthermore, the distribution of glycolytic enzymes might indeed be patchy, but to suggest it represents an 'evolutionary mosaic composed of enzymes of different origins' without discussing the alternative of a singular origin and different evolutionary paths (including a stringer divergence in one vs. another species) discredits existing literature and the authors own claim with respect to why BUSCO might fail in protists.

      The part of the text about glycolysis the reviewer alluded to has been removed while shortening the manuscript.

      Line 486: How uncommon are ADI and OTC in lineages sister to metamonada?

      This is an interesting but difficult question. Firstly, we are uncertain what is the sister lineage to Metamonada. Discoba, maybe, but a recent unpublished rooting of the eukaryotic tree does not support it (https://pubmed.ncbi.nlm.nih.gov/37115919/). Generally, the individual genes of the pathway (ADI, OTC and CK) are quite common in eukaryotes, but the combination of all three is rare (Metamonada, the heterolobosean Harpagon, the green algae Coccomyxa and Chlorella, the amoebozoan Mastigamoeba, and the breviate Pygsuia), see figure 1 in Novak et al 2016, doi: 10.1186/s12862-016-0771-4.

      Line 504: It might help an outside reader to include a few lines on consequences and importance of having 2Fe-S vs 4Fe-S clusters and set an expectation (if any) in Oxymonads.

      We apologize for omitting this explanation. The 2Fe-2S proteins are more common in mitochondria where 2Fe-2S clusters are synthesized in the early pathway of FeS cluster assembly, while the cytosolic CIA pathways produce 4Fe-4S clusters (https://pubmed.ncbi.nlm.nih.gov/33007329/). The original expectation therefore is that species without mitochondria should not have 2Fe-2S cluster proteins. Obviously, the switch to the SUF pathway affects this expectation as we do not know, what type of cluster this pathway produces in oxymonads (https://www.biorxiv.org/content/10.1101/2023.03.30.534840v1). For the sake of brevity, we have included a short statement as the beginning of the sentence in question, which now reads as follows: “As 2Fe-2S clusters are more frequent in mitochondrial proteins, the higher number of 2Fe-2S proteins in P. pyriformis compared to the oxymonads may reflect the presence of the MRO in this organism.

      Any explanations on what unique selection pressures and gene acquisition mechanisms may be operating in P. pyriformis which might allow for the unique metabolic potential?

      Every species exhibits a unique combination of traits that results from changing selection pressures imposed on historical contingency (including neutral evolutionary processes such as genetic drift). We lack real understanding of these factors for a majority of taxa including the familiar ones, so we should not expect to have a good answer to the reviewer’s question. In fact, we do not know how unique is the particular combination of P. pyriformis traits discussed in our manuscript, as there has been no comprehensive comparative analysis that would include ecologically and evolutionarily comparable taxa. We note that Paratrimastix represents only a third free-living metamonad with a sequenced genome (together with Kipferlia and Carpediemonas), so more data and additional analyses are needed to be in a position when we may start hoping answers to questions like the one posed by the reviewer are in reach.

      ** Referees cross-commenting** To R3: Hampl et al. 2019, to which Novak et al. refer, is about eukaryogensis and that is exactly the context in which this is discussed again and what Raval et al. 2022 had decided to touch upon. If the authors do not bring this up in light of the ability to evolve (novel) eukaryote complexity, then what else? Maybe they can elaborate, especially with respect to energetics to which they explicitly refer to in 2019 (and here). And with respect to text-book eukaryotic traits (and the evolution of new morphological ones), I do not see any new ones evolving in any oxymonad, but reduction as Novak et al. themselves picture it in this submission. Is a change in the number of flagella pronounced morphological evolution? Maybe for some, but I believe this needs to be seen in light of the context of how they discuss it. I see a reduction of eukaryotic complexity and not a gain. They have an elaborate section on the loss of Golgi characteristics (and a figure), but I fail to read something along the same lines with respect to the gain of new morphological traits. Again, novel LGT-based biochemistry does not equal the invention of a new morphology such as a new compartment. Oxymonads depend on mitochondria-bearing eukaryotes for their survival or don't they? This is the main point, and if evidence show that I am wrong, then I will be the first to adapt my view to the data presented.

      While we do see the logic of the reviewer’s point, a good reply would have to be too elaborate and certainly beyond the scope of the current manuscript. As the reviewers’ reports led us to reconsider the structure of the manuscript and to make it more focused and concise, we decided to simplify the matter by removing the allusions to eukaryogenesis, realizing that it is perhaps more suitable for a different type of paper (opinion, review). The comment on the evolution of complex morphology has been answered previously (see above).

      I have concerns with the presentation of a narrative that in my opinion is too one-sided and that has been has been publicly questioned in the community (in press, at meetings, personally). For the benefit of science and of the young authors on this study, this reviewer feels strongly that these issues should be taken very seriously and discussed openly in a more balanced way. . We only truly move forward on such complex topics, if we allow an open and transparent discussion.

      We agree that opinions on specific details of eukaryogenesis are divided in the community and that the topic requires a nuanced discussion for which there is perhaps no place in the current manuscript. As stated in the reply to the previous point, we have removed the discussion of the implications of our current study to eukaryogenesis from the revised manuscript.

      Having said that, I am happy that R3 has picked up exactly the same major concerns as I did with respect to e.g. the phrasing on mito (gene) loss and the BUSCO controversy.

      We appreciate these comments and hopefully have resolved the concern in the previous answers.

      Reviewer #2 (Significance):

      Using draft genome sequencing of the free-living Paratrimastix pyriformis and the sister lineage oxymonad Blattamonas nauphoetae, Novack et al. infer the metabolic potential of the two protists using comparative genomics. The authors conclude that the common oxymonad ancestor lost the mitochondrion/mitosome and discuss general strategies for adapting to commensal/symbiotic life-style employed by this taxon. Some elaborations on pathways go on for several paragraphs and feel unnecessarily stretched, which made those sections of the paper rather difficult to digest. This might be also be because the work, and all conclusions drawn, depend entirely on incomplete (ca. 70-80%) genome data and simple similarity searches, and e.g. no kind of biochemistry or imaging is presented to underpin the manuscripts discussion.

      We have addressed the concern about the possible incompleteness of our genome data above, demonstrating it is not substantiated ad stems from an inadequate interpretation of quality measures we provide in the manuscript. We hope that the revised manuscript, which is streamlined and more concise compared to the initial submission, conveys the key messages in a substantially more persuasive way and will be appreciated by a broad community of readers.

      Reviewer #3 (Evidence, reproducibility and clarity):

      Summary: The genome sequences of two members of the protist group Preaxostyla are presented in this manuscript: Paratrimastix pyriformis and Blattamonas nauphoetae. The authors use a comparative genomics and phylogenetic approaches and compare the new genome datasets with three previously available genomes and transcriptomes from the group. The availability of genome-scale data from five Preaxostyla species is powerful to address interesting basic evolutionary questions. A substantial part of the manuscript is spent on testing the hypothesis of mitochondrial loss in the oxymonad lineage, which turns out to be supported. The datasets are also explored regarding the role of lateral gene transfer in the group, metabolic diversification and the evolution of Golgi.

      Major comments: I find the manuscript very interesting with many different fascinating results presented. However, the manuscript is very long. Two genome sequences are presented and it is not clear to me what the main question was when this project was initiated and why these two species was selected to answer this question. I do not see an obvious reason for sequencing the P. pyriformis genome if the mitochondrial loss was the main question (given that a transcriptome was already available). Why not spend the time and resources on a member of Preoxystyla, which lacked previous data? The authors should more clearly state why these organisms were chosen to answer the main question or questions of the study.

      We are sorry for having done a poor job when explaining the choice of the taxa for the comparison. The idea was to sample an outgroup of oxymonads (P. pyriformis) and a representative of other clades of oxymonads than M. exilis (B. nauphoetae and S. strix) for which it was feasible to obtain the data, or the data were already available. Obviously, more representatives of morphologically a probably also genetically diverse oxymonads should be investigated (e.g. Pyrsonympha, Oxymonas, Saccinobacullus) and we have such a plan but these organisms are difficult to work with. We considered it necessary to sequence the genome of P. pyriformis, and not rely on the transcriptome only, to avoid the issue of data set incompleteness (raised also by R2). Transcriptomes by nature provide an incomplete coverage of the full gene complement of the species, while our genome assemblies are close to complete, as we explain elsewhere.

      The evolution of MROs have received substantial attention from the protist research community since the 1990's. During this period the mitochondrial organelle have been considered essential for eukaryotes. Therefore, the result presented in the manuscript has a high significance. However, I am not convinced that it is appropriate to use the term "evolutionary transition" for the mitochondrial loss. The loss of MRO is the endpoint of a gradual change of the internal organisation of the cell that probably started when the ancestor of these organism adapted to an anaerobic lifestyle. The last step described in the manuscript probably had little impact on how these organisms interacted with their environment. The presence or absence of biosynthesis of p-cresol by some, but not all, Preaxystyla probably is much more significant from an ecological point of view. My point is that the authors need to consider how they use the term evolutionary transition and be explicit about that.

      We appreciate the comment concerning the use of the term “evolutionary transition”. Nevertheless, we believe there is no real consensus in the literature on what is and what is not an “evolutionary transition”, and the application of the term to specific cases is more or less arbitrary. For a lack of a standardized or better terminology, we have kept the term to refer to three evolutionary changes in the evolution of the Preaxostyla lineage that are particularly important from the cytological or ecological perspective, i.e. dispensing with the mitochondrion, reorganizing the Golgi apparatus by losing the stacked arrangement of the cisternae, and gaining the endobiotic life style.

      In the abstract the main finding is describes as "the data confirm the complete loss of mitochondria and every protein that has ever participated in the mitochondrion function for all three oxymonad species (M. exilis, B. nauphoetae, and Streblomastix strix) extending the amitochondriate status to the whole Oxymonadida.". I find this a really interesting observation, but I do find the wording a bit too bold for several reasons: • Not every protein that has participated in the mitochondrial function is known. • Mitochondrial proteins could be present in oxymonads, but divergent beyond the detection limit for existing methods. • Genes for one or several mitochondrial proteins could be present in one or more oxymonad genomes, but remain undetected due to the incomplete nature of the datasets.

      Although I do think that the authors' claim very well could be true, I don't think their data fully support it. Therefore, it needs to be rephrased.

      As a result of our decision to streamline the manuscript by removing the final part of Results and Discussion (“No evidence for subcellular retargeting of ancestral mitochondrial proteins in oxymonads”, the revised manuscript no longer support the statement “the data confirm the complete loss of … every protein that has ever participated in the mitochondrion function for all three oxymonad species” that is criticized by the reviewer, and hence the statement has been removed from the abstract. This addresses bullet point 1. As for bullet points 2 and 3, the proof of absence is in principle impossible to deliver, and we have been fighting with this already in the Karnkowska et al. 2016 paper. Although our certainty will never reach 100% (this is in fact impossible for a scientific, i.e., falsifiable, hypothesis), the mounting of evidence through studies gives the hypothesis on the amitochodriate status of oxymonads more and more credit. The genes for mitochondrial marker proteins have not been detected by the most sensitive methods available neither in the first genome assembly of M. exilis (Karnkowska et al. 2016), nor in the improved M. exilis genome assembly composed of only 101 contigs (Treitli et al. 2021), nor in either of the other two oxymonad species investigated here. On the other hand, they were readily detected in the data sets of P. pyriformis and T. marina. What is the probability that these genes always hide in the assembly gaps, or that they have all escaped recognition? Obviously, this probability is not zero, but we believe it is approaching so low values that it is reasonably safe to make the conclusion on the amitochondriate status of these species.

      The sentence was changed to: "Our results provide insights into the metabolic and endomembrane evolution, but most strikingly the data confirm the complete loss of mitochondria for all three oxymonad species investigated (M. exilis, B. nauphoetae, and Streblomastix strix), suggesting the amitochondriate status may be common to Oxymonadida."

      The third point maybe could be analysed further. BUSCO scores are reported, but also argued not being reliable for this group of organisms (which is true). Would it, for example, be useful to analyse how large fraction of the BUSCO proteins found in all non-Preoxystyla metamonada genomes that are present in the various Preoxystyla datasets?

      We provide a comprehensive answer to a similar comment of reviewer 2 above (page 6-8). We performed the requested analysis and provide the result in Supplementary file 11. In this table, we record presence/absence of each gene from the BUSCO set for our data sets and the highly complete “standard” datasets of Trypanosoma brucei, Giardia intestinalis and Trichomonas vaginalis. Of the 303 genes, 117 were present in all data sets and 17 in none (see column I). 20 were present only in Trypanosoma and not in metamonads. 6 were present in all Preaxostyla and absent in other metamonads (Trichomonas and Giardia), 44 were present in all Preaxostyla and Trichomonas and absent in Giardia, suggesting high divergence of this species. Only 23 (marked by *) were present in the three “standard” genomes and absent in one or more Preaxostyla species. Of those 8 and 8 were absent specifically in S. strix and P. pyriformis, respectively, but only 1 was absent specifically in M. exilis and no such case was observed in B. nauphoetae. We conclude that this non-random pattern argues for lineage-specific divergence rather than incomplete data sets, particularly in the case of M. exilis and B. nauphoetae.

      Line 160-161: 15 LGT events specific for the Preaxostyla+Fornicata clade is reported. This is an exciting finding because it supports a phylogenetic relationship between these two groups. But such an argument is only valid if the observed pattern is more common than the alternative hypotheses (Preaxostyla+Parabasalids and Fornicata+Parabasalids). How many LGT events support each of these groupings? How are these observation affected by the current taxon sampling with the highest number of datasets from Fornicata? How were putative metamonada-to-metamonada LGTs treated in this context?

      19 LGT are uniquely shared between Preaxostyla+Parabasalids, which is more than the number of shared LGTs between Preaxostyla and Fornicata. No common LGT was unique to Fornicata+Parabasalids. However, the latter is a direct consequence of our investigation method, which involved reconstruction phylogenies of genes present in Preaxostyla, and not across all metamonads. So, we do not have a way to investigate LGT gene families uniquely shared between Fornicata and parabasalids.

      When it comes to the effect of taxon sampling, we agree that it is possible that the number of genes of horizontal origin shared between parabasalids and Preaxostyla is underestimated because of the lower taxon sampling in parabasalids. However, it is still larger (19) than the number of LGTs shared uniquely between fornicate and Preaxostyla (15). In addition, while the taxon sampling is larger in fornicates, it also contains some representatives of closely related lineages (e.g., Chilomastix caulleryi and Chilomastix cuspidate) which, while they increase the number of fornicate representatives, does not increase the detection of shared genes between fornicates and Preaxostyla. Altogether, it's difficult to estimate how the current taxon sampling is biasing the detection of LGTs one way or another.

      Regarding metamonad-to-metamonad putative LGTs: we did not consider this possibility for the sake of not overestimating the number of gene transfers for two main reasons. First of all, our LGT detection relies on the incongruence between species tree and gene tree. The closer the lineages are in the species tree, the more difficult it is to interpret any incongruence in the gene tree as single protein phylogenies are notoriously poorly resolved because they rely on the little phylogenetic signal contained in few amino-acid positions. Because of this, small incongruences with the species tree could either reflect recent LGT events between metamonads, or simply blurry phylogenetic signal. Second, we can certainly use the argument that a limited taxonomic distribution among metamonads favors an LGT event between them. However, here again, the closer the lineages involved are, the more difficult it is to distinguish a scenario where one lineage was the recipient of an LGT from prokaryote before donating it to another metamonad, from a scenario involving a single ancestral LGT from prokaryotes to metamonads, followed by differential loss, leading to a patchy taxonomic distribution. Finally, we are working with both limited taxon sampling and incomplete genomic/transcriptomic data, which makes it more difficult to identify true absences. For all these reasons, we chose to be conservative and invoke the smallest number of LGT events.

      The authors have used a large-scale approach to make single-gene trees for inferences of LGT. In other parts of the manuscript inferences of evolutionary origins of single genes are made without support of phylogenetic trees. I find this inconsistent and argue that the hypothesis of the origin of a specific protein should be tested with the same rigor whether it is a putative LGT, gene duplication, gene loss or an ancestral member of LECA. Specific cases where I think a phylogenetic analysis is needed includes: • Line 222-223: It is concluded that Rsg1 is a component of LECA. • Line 307: HgcAB are argued to be acquired by LGT of a whole opeon. • Lines 350-355: It is unclear how the different numbers of transporters are interpreted (loss or expansion by duplication). This could be address with phylogenetics. • Lines 407-408: A tree should support the claim of LGT origin. (PFP) • Lines 414-415: The different origins of glycolytic enzymes should be supported by data or references. • Line 486: Trees or a reference (if available) should support the claim for LGT.

      As requested, trees were constructed for HgcA, HgcB, PFP and the transporters AAAP, CTL, ENT, pATPase, and SP. Citations were added for the glycolytic enzymes and the ADI pathway. No tree for Rsg1 is needed, as this is a eukaryote-specific protein lacking any close prokaryotic relatives. The inference on its presence in the LECA is based on the phylogenetically wide, however patchy, distribution across the eukaryote phylogeny. Testing possible eukaryote-eukaryote LGTs is hampered by a limited phylogenetic signal in the short and rapidly evolving Rsg1 sequences, resulting in very poorly resolved relationships among Rgs1 sequence in a tree we attempted to make (data not shown). For this reason, we opt for not presenting any phylogenetic analysis for Rsg1.

      Lines 530-531 and 773-774: "The switch to the SUF pathway in these species has apparently not affected the number of Fe-S-containing proteins but led to a decrease in the usage of 2Fe-2S clusters." I find it difficult to evaluate if the data support this because no exact numbers or identities are given for 2Fe-2S and 4Fe-4S proteins in the various genomes in Suppl. Fig. S4 or Supplementary file 4.

      The functional annotation of all detected FeS clusters containing proteins is provided in Supplementary Table S8 including the types of predicted clusters (columns G or F). Basically, the only putative 2Fe2S cluster containing proteins in species of oxymonad is xanthine dehydrogenase, while Paratrimastix and Trimastix contain also 2Fe2S cluster-containing ferredoxins and hydrogenases.

      The method used in the paper varies between the different parts of the paper. One example is single gene phylogenies, which are described three times in the method section [Lines 959-973, lines 1011-1034, lines 1093-1101], in addition to the automated approach within the LGT detection pipeline lines 923-926]. The approaches are slightly different with, for example, different procedures for trimming. This makes it difficult to know how the different presented analyses were done in detail. No rationale for using different approaches is given. At the least, it should be clear in the method section which approach was used for which analysis.

      The reviewer is correct, and we apologize for the inconsistency. The reason is only historical –the analyses were performed by different laboratories in different periods of time. We believe this fact does not make our results less robust, although it does not “look” nice and makes the description of the methods employed longer. We have double-checked the description and introduced slight changes as to make it maximally clear which method has been used for particular analyses presented in the Results and Discussion.

      Specific comments on single gene phylogenies:

      • Line 966-967: Why max 10 target sequences?

      The limit of 10 was applied in order to keep the datasets in manageable dimensions. The sentence has been changed to: " In order to detect potential LGT from prokaryotes while keeping the number of included sequences manageable, prokaryotic homologues were gathered by a BLASTp search with each eukaryotic sequence against the NCBI nr database with an e-value cutoff of 10-10 and max. 10 target sequences.

      • Lines 996-998: Is it a problem that these are rather old datasets?

      Although the publications are slightly older the set of queries is absolutely sufficient for the purpose.

      Minor comments: I appreciate that many data is included as supplementary material. However, the organisation of the data could be improved. The numbering of the files is not included in their names or within the files, as far as I could find. Descriptions of the files are often missing and information on the annotation such as colour coding is not always included. These aspects of the supplementary material needs to be strengthened in order to make it more useful. Specific comments: • Supplementary file 1, Table 1: accession numbers are missing. Kipferlia bialta appears to have a much smaller number of sequences than reported in the publication. The file consists of three tables and it would be very helpful if the reference in the main manuscript indicate the table number. • Supplementary file 4: The trees lack proper species names and a documented colour coding. There are multiple trees in the file, which make it difficult to find the correct tree. I would appreciate if the different trees were labelled A, B, C, etc., and if these were used in the main text.

      Supplementary file 1: Accession numbers were added.

      Supplementary file 4: Species names and alphabetical labelling were added. Colour coding was explained in the text at the first mention of the file: "(Supplementary file 4 H; Preaxostyla sequences in red)."

      o There is no HPAD-AE tree (as indicated on line 258), but a HPAD tree. Which part of the tree contain the described fusion protein?

      Thank you for spotting the mistake. There should have been “HPAD” instead of “HPAD-AE” indicated in the text. The sentence has been changed to:" The P. pyriformis HPAD sequence is closely related to its homolog in the free-living archamoebid M. balamuthi (Supplementary file 4 K), the only eukaryote reported so far to be able to produce p-cresol (Nývltová et al. 2017)."

      o Line 280-281: "UbiE homologs occur also in some additional metamonads, including the oxymonad B. nauphoetae and certain fornicates." These sequences should be clearly highlighted in the tree.

      We discovered these additional UbiE homologs only after the tree presented in the supplement had been constructed, so these sequences are missing from it. To ensure consistency we have decided to remove the remark on the presence of UbiE homologs metamonads other than P. pyriformis, so it is no longer part of the revised manuscript.

      o Lines 538-544: A three-gene system is mentioned, but only two AmmoMemoRadiSam trees are found.

      This part has been removed while streamlining the manuscript.

      • Supplementary file 6: I find it difficult to find the proteins discussed in the text, for example "the biosynthesis of p-cresol from tyrosine (line 254-255)".

      Abbreviations identifying the different enzymes have now been added to all mentions in the text, facilitating their localization in the supplementary file: "P. pyriformis encodes a complete pathway required for the biosynthesis of p-cresol from tyrosine (Supplementary file 6), only the second reported eukaryote with such capability. This pathway consists of three steps of the Ehrlich pathway (Hazelwood et al. 2008) converting tyrosine to 4-hydroxyphenyl-acetate (AAT, HPPD, ALDH) and the final step catalyzed by a fusion protein comprised of 4-hydroxyphenylacetate decarboxylase (HPAD) and its activating enzyme (HPAD-AE)."

      • Supplementary file 11: Which group of species are highlighted in red? How do I know from which species these sequences are (I can make educated guesses, but prefer full species names). I do not find any reference to this file in the main manuscript.

      We apologise for this inconvenience. The taxon labels in the treed in this supplementary file have been corrected to contain full species names.

      Line 227-228: "630 OGs seem to be oxymonad-specific or divergent, without close BLAST hits". It is unclear if BLAST searches includes only a representative of each 630 OGs, or every single protein in these OGs.

      The BLAST searches include every single protein in the investigated OGs. We clarified it in the text: “Of these, 630 OGs seem to be oxymonad novelties or divergent ancestral genes, without close BLAST hits (e-value -15) to any of these sequences.

      Line 243: I think it is five LGT mapped to internal nodes of Preoxystyla in Figure 1 (1+3+1).

      You are correct, we apologize for the mistake. The sentence has been changed to: "Also, 46 LGT events were mapped to the terminal branches and 5 to internal nodes of Preaxostyla, suggesting that the acquisition of genes is an ongoing phenomenon, and it might be adaptive to particular lifestyles of the species."

      Lines 325-331: The argument would be stronger with a figure showing the fusion and the alignment indicating the conserved amino acids mentioned in the text.

      We agree with the reviewer but for the sake of space, we finally decided not to include a new figure.

      Lines 425: "none of the species encoded" should be replaced by something like "none of the enzyme could be detected in any of the species" (the datasets are incomplete).

      The sentence has been changed to: "None of the alternative enzymes mediating the conversion of pyruvate to acetyl-CoA, pyruvate:NADP+ oxidoreductase (PNO) and pyruvate formate lyase (PFL), could be detected in any of the studied species."

      Line 455: "suggesting a cytosolic localization of these enzymes in Preaxostyla." The absence of a phylogenetic affiliation with the S. salmonicida homolog does not preclude a MRO localisation.

      The sentence was changed to: "Phylogenetic analysis of Preaxostyla ACSs (Supplementary file 4 B) shows four unrelated clades, none in close relationship to the S. salmonicida MRO homolog, consistent with our assumption that these enzymes are cytosolic in Preaxostyla."

      Lines 570-571: "Manual verification indicated that all the candidates recovered in oxymonad data sets are false positives" Using which criteria?

      The manual verification was based on the annotation of predicted proteins by BLAST and InterProScan. If the annotations did not correspond to the suggested function, they were considered false positives. For example, the protein BLNAU_15573 of Blattamonas nauphoetae was detected by Sam50 HMM profile and thus was considered a candidate for Sam50 proteins. Its functional annotation from BLAST was, however, unrelated to Sam50 (“putative phospholipase B”). Therefore, this candidate was concluded as a false positive hit of the HMM search resulting from the very high sensitivity of this method.

      We clarified this in the Results

      Reciprocal BLASTs indicated that all the candidates recovered in oxymonad data sets are very likely to be false positives based on the annotations of their top BLAST hits (mainly vaguely annotated kinases, peptidases and chaperones) (Fig. 6, Supplementary file 9).”.

      And Material and Methods

      Any hits received by the methods described above were considered candidates and were furter inspected as follows. All candidates were BLAST-searched against NCBI-nr and the best hits with the descriptions not including the terms 'low quality protein', 'hypothetical', 'unknown', etc. were kept. For each hit, the Gene Ontology categories were assigned using InterProScan-5.36-75.0. If the annotations received from BLAST or InterProScan corresponded to the originally suggested function, the candidates were considered as verified. Otherwise, they were considered as false positives.

      Lines 743-755: "Similar observations were made in other protists with highly reduced mitochondria, such as G. intestinalis or E. histolytica,..." References are needed.

      This part of the manuscript has been removed while streamlining the text.

      Line 849: How was the manually curation done for the gene models in the training set?

      The sentence has been changed to: "For de novo prediction of genes, Augustus was first re-trained using a set of gene models manually curated with regard to mapped transcriptomic sequences and homology with known protein-coding genes."

      Lines 853-856: It is a bit unclear which dataset was used for BUSCO and downstream analysis. Was it the Augustus-predicted proteins, or the EVM polished?

      The sentence has been changed to: "The genome completeness for each genome was estimated using BUSCO v3 with the Eukaryota odb9 dataset and the genome completeness was estimated on the sets of EVM-polished protein sequences as the input."

      Lines 858: What is it meant that KEGG and similarity searches was used in parallel (what if both gave a functional annotation?)?

      A sentence has been added for clarity: "KEGG annotations were given priority in cases of conflict."

      Lines 861-862 and 1007-1008: Which genes or sub-projects does this apply to? How many genes were detected in this procedure?

      The sentence has been changed to make this clear: "Targeted analyses of genes and gene families of specific interest were performed by manual searches of the predicted proteomes using BLASTp and HMMER (Eddy 2011), and complemented by tBLASTn searches of the genome and transcriptome assemblies to check for the presence of individual genes of interest that were potentially missed in the predicted protein sets (single digits of cases per set). Gene models were manually refined for genes of interest when necessary and possible."

      Lines 878-879: It is not clear to me why the sum of the two described numbers should be as high as possible and would appreciate an argument or a reference.

      When optimizing the inflation parameter of OrthoMCL, we reasoned that the optimal level of grouping/splitting for our purpose should result in the highest number of orthogroups containing all representatives of the groups of interest (i.e. Preaxostyla) but no other species – pan-Preaxostyla orthogroups. When going down with the values, you observe more and more groupings of pan-Preaxostyla OGs with others (indication of overgrouping) in the opposite direction you observe splitting of pan Preaxostyla OGs which indicates oversplitting. Because we were optimizing the inflation parameter for Preaxostyla and Oxymonadida at the same time, we maximized the sum of pan-Preaxostyla and pan-Oxymonadida groups.

      Lines 879-881: "Proteins belonging to the thus defined OGs were automatically annotated using BLASTp searches against the NCBI nr protein database (Supplementary file 1)." Why were these annotated in a different way (compare lines 857-859).

      This little inconsistency resulted from the fact that these parts of the analyses were performed by different researchers who did not cross-standardize the procedures. This inconsistency has no effect on the downstream analyses and conclusions as the annotations from Supplementary file 1 were not used in any further analyses.

      Lines 894-957: "Detection of lateral gene transfer candidates": • It is not clear which sequences were tested in the procedure. All Preaxostyla, or all metamonada? I think I am confused because in the result sections you only report numbers for Preaxostyla, but in the method section metamonada is mentioned repeatedly.

      Thank you for noticing. There was indeed some inconsistency in our writing.

      We did an all-against-all search using all metamonads. However, we filtered out all homologous families in which Preaxostyla were not present or that had no hit against GTDB. So in the end, the LGT search was restrained to protein families containing Preaxostyla homologues. We corrected the wording in our method section.

      • It would be easier to follow the procedure if numbers are provided for the different steps.

      We are not sure what numbers the reviewer refers to here.

      • Why was only small oxymonad proteins discarded (line 900)?

      This is indeed a mistake. We meant “Preaxostyla proteins”. This is because we only considered Preaxostyla sequences with significant hits against GTDB as a starting point, so we aimed to first remove those that might be too short to yield reliable phylogenies.

      • Line 911: How many sequences were collected?

      Up to 10,000 hits were retained. We have added that information to the text.

      • Lines 916-919: What is the difference between the protein superfamilies (line 916) and the OGs (line 919)? Are the OGs the same orthogroups that is described earlier in the method section? How are the redundancy of NCBI nr entries retrieved in different searches dealt with?

      We understand the confusion here. It primarily stemmed from two different ways to establish homologous families across the manuscript because of different researchers being responsible for different parts. Protein superfamilies that were used for reconstructing the single protein trees used for the LGT analyses were assembled based on the procedure describe line 916-919 (“Protein superfamilies were assembled by first running DIAMOND searches of all metamonad sequences against all (-e 1e-20 --id 25 --query-cover 50 --subject-cover 50). Reciprocal hits were gathered into a single FASTA file, as well as their NCBI nr homologues.”). However, this was a somewhat stricter procedure than the one used to establish the OGs that are discussed in the rest of the manuscript (because of the e-value and identity cut-off used), so we eventually enriched the datasets with the putatively missing metamonad sequences that were present in the OGs but not in the initial superfamily assembly. However, since these were often more divergent sequences, we did not use these as queries for our BLAST searches against prokaryotes.

      Line 987-989: "...was facilitated by Rsg1 being rather divergent from other Ras superfamily members" This statement is vague. What does it mean in practise?

      The sentence has been changed to: " The discrimination was facilitated by Rsg1 having low sequence similarity to other Ras superfamily members (such as Rab GTPases)."

      Lines 1037-1038: Why were these proteins re-annotated?

      They were not. We are sorry for this mistake, which has been fixed in the revised manuscript.

      Figures: The figures would be easier to follow if the colour coding for the five different species were consistent between the figures.

      This is a good point, the colour coding has been unified across all figures.

      Figure 1: It appears that the Venn diagram in C only shows the Preaxostyla-specific protein in B, not all OGs for which contain Preaxostyla proteins. This is not clear from legend or from the figure itself. The same comment applies to D.

      The interpretation of the figure by the reviewer is correct; we have modified the legend to make the meaning of the figure easier to understand.

      Figures 2 and 6: It would be clearer with panel labels A, B, etc, instead of "upper" and "lower" panel, as in the other figures.

      This is a fair point, we have added the alphabetical labels proposed by the reviewer to the figures.

      Figure 6: What is the colour code in the figure? The numbers within the boxes are not aligned.

      We have added an explanation of the color code to the legend and edited the figure to make it aesthetically more pleasing.

      Supplementary figures 1-3: What do green and magenta indicate in the figure?

      As with the previous figure, the color code is now explained in the revised legend.

      ** Referees cross-commenting** I agree with the other reviewers that the discussion of the functional and ecological implications of the LGTs could be developed.

      We understand the reviewers but as already explained in response to Reviewer 1, we have decided not to extend the already rather long manuscript further. We believe that the several exemplar LGT cases that we do discuss in detail provide a good impression of the significance of LGT in the evolution of Preaxostyla.

      In contrast to reviewer 2, I do not see that the authors discuss their result in the context of eukaryogenesis in this manuscript. Maybe the reference reviewer 2 mention could be cited in the introduction together with Hampl et al. 2018 to acknowledge that there are different views about the importance of secondarily amitochondrial eukaryotes on our thinking about the origin of eukaryotes. I disagree with reviewer 2's objection against the wording "... and undergo pronounced morphological evolution" because I think Fig. 4 in Hampl 2017 shows a large morphological diversity among oxymonads.

      We are glad to see that our perspective is not shared by other colleagues in the field. Nevertheless, having carefully considered the case we have decided to remove any mentions of eukaryogenesis from the revised manuscript, as we admit this topic is peripheral to the key message of our present study. On the other hand, we appreciate very much the note by the reviewer on the large morphological diversity among oxymonads – we have now added a similar remark to the revised manuscript (the last sentence of Conclusions).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Using draft genome sequencing of the free-living Paratrimastix pyriformis and the sister lineage oxymonad Blattamonas nauphoetae, Novack et al. infer the metabolic potential of the two protists using comparative genomics. The authors conclude that the common oxymonad ancestor lost the mitochondrion/mitosome and discuss general strategies for adapting to commensal/symbiotic life-style employed by this taxon. Some elaborations on pathways go on for several paragraphs and feel unnecessarily stretched, which made those sections of the paper rather difficult to digest. This might be also be because the work, and all conclusions drawn, depend entirely on incomplete (ca. 70-80%) genome data and simple similarity searches, and e.g. no kind of biochemistry or imaging is presented to underpin the manuscripts discussion. This is noteworthy in light of other protist genome reports published in the last few years that differ in this respect, including previous work by this group. And for sequencing-only data, this paper - https://doi.org/10.1016/j.dib.2023.108990 - might offer an example of where we are at in 2023. With respect to previous work of the group (Karnkowska et al. 2016 and 2019), this submission is very similar (analysis pattern, even some figures and more or less the conclusion), i.e. to say, the overall progress for the broader audience is rather incremental. Then there are also some incidents, where the data presented conflicts with the authors own interpretation. The text (including spelling and grammar) needs some attention and the choice of words is sometimes awkward. The overuse of quotation marks ("classical", "simple", "fused", "hits", "candidate") is confusing (e.g. was the BLAST result a hit or a "hit").

      In its current form the manuscript is, unfortunately, very difficult to review. This reviewer had to make considerable efforts to go through this very large manuscript, mainly because of issues affecting to the presentation and the lack of clarity and conciseness of the text. It would be greatly appreciated if the authors would make more efforts upfront, before submission, to make their work more easily accessible both to readers and facilitate the task of the reviewers.

      About a fifth of the two genome is missing according the authors prediction (table 1). Early on they explain the (estimated) incompleteness of the genomes to be a result from core genes being highly divergent. In light of this already suspected high divergence, using (the simplest NCBI) sequence similarity approach to call out the absence of proteins (for any given lineage) may need lineage-specific optimization. The use of more structural motif-guided approaches such as hidden Markov models could help, but it is not clear whether it was used throughout or only for the search for (missing) mitochondrial import and maturation machinery. The authors state that the low completeness numbers are common among protists, which, if true, raises several questions: how useful are then such tools/estimates to begin with and does this then not render some core conclusions problematic? The reader is just left with this speculation in the absence of any plausible explanation except for some references on other species for which, again, no context is provided. Do they have similar issues such as GC-content, same core genes missing, phylogenetic relevance?, etc.. No info is provided, the reader is expected to simply accept this as a fact and then also accept the fact that despite this flaw, all conclusions of the paper that rests on the presence/absence of genes are fine. This is all odd and further skews the interpretations and the comparative nature of the paper.

      As a side note, this will also influence the number of proteins absent in other lineages and as such has consequences on LGT calls versus de novo invention. For the cases with LGT as an explanation, it would help to briefly discuss the candidate donors and some details of the proteins in the eco-physiological context (e.g. lines 263-268 suggest that HPAD may have been acquired by EGT which was facilitated by a shared anaerobic habitat and also comment on adaptive values for acquiring this gene). Exchanging metabolic genes via LGT (Line 163) blurs the differences between roles and extent of LGT in prokaryote vs eukaryote, and therefore is exciting and could use support/arguments other than phylogenies. I guess the number of reported LGTs among protists (whatever the source) over the last decade has by now deflated the novelty of the issue in more general; a report of the numbers is expected but they alone won't get you far anymore in the absence of a good story (such as e.g. work on plant cell wall degrading enzymes in beetles). It would help to clarify which parts of the mitochondrial ancestor were reduced during the process of reductive evolution at what time in their hypothesized trajectory. For instance, loosing enzymes of anaerobic metabolism conflicts with the argued case of an aerobic (as opposed to facultative anaerobic) mitochondrial ancestor followed by gains of anaerobic metabolism in the rest of the eukaryotes via LGT, and some papers the authors themselves cite (e.g. the series by Stairs et al.). There is no coherent picture on LGT and anaerobic metabolism, although a reader is right to expect one.

      In light of their data the authors also discuss the importance of the mitochondrion with respect to the origin of eukaryotes:

      First, the mitochondrion brought thousands of genes into the marriage with an archaeon, surely hundreds of which provided the material to invent novel gene families through fusions and exon shuffling and some of which likely went back and forth over the >billion years of evolution with respect to localizations. The authors look at a minor subset of proteins (pretty much only those of protein import, Fig. 6) to conclude, in the abstract no less: „most strikingly the data confirm the complete loss of mitochondria and every protein that has ever participated in the mitochondrion function for all three oxymonad species." I do not question the lack of a mitochondrion here, but this abstract sentence is theatrical in nature, nothing that data on an extant species could ever proof in the absence of a time machine, and is evolutionary pretty much impossible. A puzzling sentence to read in an abstract and endosymbiont-associated evolution.

      Second, using oxymonads as an example that a lineage can present eukaryotic complexity in the absence of mitochondria and conflating it with eukaryogenesis is a logical fallacy. This issue already affected the 2019 study by Hampl et al.. We have known that a eukaryote can survive without an ATP-synthesizing electron transport chain ever since Giardia and other similar examples and the loss of Fe-S biosynthesis and the last bit of mitosome (secondary loss) doesn't make a difference how to think about eukaryogenesis. It confuses the need and cost to invent XYZ with the need and cost of maintenance. How can the authors write "... and undergo pronounced morphological evolution", when they evidently observe the opposite and show so in their Fig. 1? The authors only present evidence for reductive evolution of cellular complexity with the loss of a stacked Golgi. What morphological complexity did oxymonads evolve that is absent in other protists? A cytosolic metabolic pathway doesn't count in this respect, because it is neither morphological, nor was it invented but likely gained through LGT according to the authors. This is quite confusing to say the least. A recent paper (https://doi.org/10.7554/eLife.81033) that refers to Hampl et al. 2019 has picked this up already, and I quote: "Such parasites or commensals have engaged an evolutionary path characterized by energetic dependency. Their complexity might diminish over evolutionary timescale, should they not go extinct with their hosts first." Here the authors raise a red flag with respect to using only parasites and commensals that rely on other eukaryotes with canonical mitochondria as examples. If we now look at Fig. 1 of this submission, Novak et al. underpin this point perfectly, as the origin of oxymonads is apparently connected to the strict dependency on another eukaryote (or am I wrong?), and they support the prediction with respect to complexity reducing after the loss of mitochondria - mitosome gone, Golgi almost gone. What's next? This is a good time to remember that extant oxymonads are only a single picture frame in the movie that is evolution, and their evolution might be a dead-end or result in a prokaryote-like state should they survive 100.000s to millions of years to come.

      Some more thoughts:

      Line 47-52: Hydrogenosome or mitosome is a biological and established label as (m)any other and I find the use of the word "artificial" in this context strange. While the authors are correct to note that there is a (evolutionary) continuum in the reduction - obviously it is step by step - they exaggerate by referring to the existing labels as "artificial". You make Fe-S clusters but produce no ATP? Well, then you're a mitosome. It's a nomenclature that was defined decades ago and has proven correct and works. If the authors think they have a better scheme and definition, then please present one. Using the authors logic, terms such as amyloplast or the TxSS nomenclature for bacterial secretions systems are just as artificial. As is, this comes across as grumble for no good reason.

      Line 158: A duplication-divergence may also explain this since sequence similarity-based searches will miss the ancestral homologues.

      Lines 201-202: Presence of GCS-L in amitochondriate should be explained in light of this group once having a mitochondrion, which then makes ancestral derivation and differential loss (as invoked for Rsg1) also a likely explanation along with eukaryote-to-eukaryote LGT.

      Lines 356-392: Describes plenty of genomic signal for Golgi bodies but simultaneously cites literature suggesting the absence of a morphologically an identifiable Golgi in oxymonads. An explicit prediction regarding what to observe in TEM for the mentioned species might be nice to stimulate further work.

      Lines 414: The preceding paragraphs in this result section describes only the distribution, without mentioning origins - a sweeping one-line summary that proclaims different origin needs some context and support. Furthermore, the distribution of glycolytic enzymes might indeed be patchy, but to suggest it represents an 'evolutionary mosaic composed of enzymes of different origins' without discussing the alternative of a singular origin and different evolutionary paths (including a stringer divergence in one vs. another species) discredits existing literature and the authors own claim with respect to why BUSCO might fail in protists.

      Line 486: How uncommon are ADI and OTC in lineages sister to metamonada?

      Line 504: It might help an outside reader to include a few lines on consequences and importance of having 2Fe-S vs 4Fe-S clusters and set an expectation (if any) in Oxymonads

      Any explanations on what unique selection pressures and gene acquisition mechanisms may be operating in P. pyriformis which might allow for the unique metabolic potential?

      ** Referees cross-commenting**

      To R3: Hampl et al. 2019, to which Novak et al. refer, is about eukaryogensis and that is exactly the context in which this is discussed again and what Raval et al. 2022 had decided to touch upon. If the authors do not bring this up in light of the ability to evolve (novel) eukaryote complexity, then what else? Maybe they can elaborate, especially with respect to energetics to which they explicitly refer to in 2019 (and here). And with respect to text-book eukaryotic traits (and the evolution of new morphological ones), I do not see any new ones evolving in any oxymonad, but reduction as Novak et al. themselves picture it in this submission. Is a change in the number of flagella pronounced morphological evolution? Maybe for some, but I believe this needs to be seen in light of the context of how they discuss it. I see a reduction of eukaryotic complexity and not a gain. They have an elaborate section on the loss of Golgi characteristics (and a figure), but I fail to read something along the same lines with respect to the gain of new morphological traits. Again, novel LGT-based biochemistry does not equal the invention of a new morphology such as a new compartment. Oxymonads depend on mitochondria-bearing eukaryotes for their survival or don't they? This is the main point, and if evidence show that I am wrong, then I will be the first to adapt my view to the data presented.

      I have concerns with the presentation of a narrative that in my opinion is too one-sided and that has been has been publicly questioned in the community (in press, at meetings, personally). For the benefit of science and of the young authors on this study, this reviewer feels strongly that these issues should be taken very seriously and discussed openly in a more balanced way. . We only truly move forward on such complex topics, if we allow an open and transparent discussion.

      Having said that, I am happy that R3 has picked up exactly the same major concerns as I did with respect to e.g. the phrasing on mito (gene) loss and the BUSCO controversy.

      Significance

      Using draft genome sequencing of the free-living Paratrimastix pyriformis and the sister lineage oxymonad Blattamonas nauphoetae, Novack et al. infer the metabolic potential of the two protists using comparative genomics. The authors conclude that the common oxymonad ancestor lost the mitochondrion/mitosome and discuss general strategies for adapting to commensal/symbiotic life-style employed by this taxon. Some elaborations on pathways go on for several paragraphs and feel unnecessarily stretched, which made those sections of the paper rather difficult to digest. This might be also be because the work, and all conclusions drawn, depend entirely on incomplete (ca. 70-80%) genome data and simple similarity searches, and e.g. no kind of biochemistry or imaging is presented to underpin the manuscripts discussion.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)): **

      This study evaluates the effect of fungal toxin candidalysin on neutrophils. The authors show that candidalysin induces NETosis when secreted by hyphae, but when candidalysin is added on its own, NLS are formed instead which are distinct from NETs. The authors have done lots of carefully controlled experiments, and delineated key components of the pathway inducing NLS, including the role of ROS and histone modifications. The data provided is high quality and well presented in figures.

      Reviewer #1 (Significance (Required)):

      Strengths are the depth of analysis - many different aspects of NETosis is assessed and robustly tested.

      Comments: 1. I was a bit confused by what should be the main message of the paper - is it that candidalysin on its own doesn't induce NETosis but only NLS? The answer to this question wasn't well addressed in my opinion, but the paper switches between using live fungi and purified candidalysin so it became confusing at times.

      *

      Responses:

      Thank you for this important comment. We have clarified our narrative on candidalysin throughout the manuscript to provide a red thread for the readership. Our message is that candidalysin alone has not the capacity to induce a full cycle of signalling events which result in canonical NET formation. Our data show that candidalysin alone falls short and can only produce NLS. On the other hand, our data show that in the context of growing C. albicans cells candidalysin is able to promote the release of NETs. This is important, since previously the hyphal form of C. albicans has been reported to be a formidable inducer of NETs, whereas the yeast form was not. Our data put candidalysin in the centre of this observation, showing that it indeed is a major contributor to NET formation when present with growing C. albicans cells. Since candidalysin expression and release is strictly connected to hyphal growth our new data agrees with previous assessment and provides new insight in how this hyphae-specific inductive effect is accomplished.

      In the revised manuscript, at first, we describe the difference of neutrophil stimulation when using strains expressing and lacking candidalysin as compared to candidalysin stimulation alone. We added or modified the following phrases:

      • Line (163) “As candidalysin-expressing C. albicans strains induced more NETs than candidalysin-deficient strains, we investigated the role of the toxin alone in stimulating neutrophil extracellular trap release”.
      • Line (173) “In order to ensure consistency in NET/NLS quantification, NLS were quantified with the same criteria as previous described for NETs.”
      • Line (182) “In summary, candidalysin alone triggers morphologically distinct NLS in a time- and dose-dependent manner, whereas candidalysin-producing C. albicans hyphae induce canonical NETs (Fig. 1a).” Next, we describe the different morphology of NLS triggered by candidalysin alone in comparison to canonical NETs triggered by C. albicans strains expressing candidalysin. We added or modified the following phrases:

      • Line (198) “To investigate candidalysin-triggered NLS in further detail, we used scanning electron microscopy (SEM) that allows a more detailed view of the neutrophil-derived structures (Fig 3a).” Furthermore, to prevent switching between experiments using candidalysin alone and experiments with different Candida strains, we have moved the next paragraph “Candidalysin-expressing strains induce more NETs and higher citrullination levels than candidalysin-deficient strains” to the end of the result section (old Fig. 4 is now new Fig. 8). In doing so, we focus on the direct morphological, signalling and functional effects of candidalysin alone on neutrophils and towards the end, we analyse how the strains are affected by different neutrophil killing mechanisms (phagocytosis and NETs). Subsequently, we synthetize our findings by showing that candidalysin is the main driver of histone citrullination by quantifying this histone modification in the context with NET induction comparing candidalysin-expressing and -deficient strains. We conclude that citrullination-induced chromatin decondensation in combination with candidalysin-induced ROS production are most probably the main contributors of increased NET formation stimulated by C. albicans hyphae expressing candidalysin. This is the also a good conclusion of the manuscript showing that candidalysin alone is not enough but together with growing C. albicans cells it contributes to NET induction and increased NETs in turn inhibit growth and limit spreading of C. albicans.

      • We modified and added the following sentences to the discussion section: (line 457) “The data suggests that candidalysin is the key driver of histone citrullination in neutrophils infected with C. albicans and that addition of evenly distributed, external candidalysin in high concentration (15 µM) drives neutrophils towards NLS despite the presence of C. albicans cells. We conclude that, during infection, candidalysin-triggered Ca2+ influx and histone hypercitrullination amplify processes in neutrophils which are induced by C. albicans hyphae. These amplified processes culminate in a strongly increased release of NETs that in turn are formidable weapons to control hyphal filaments.”

        *2. If candidalysin on its own only induces NLS - what is the relevance of this for disease? A lot of work has been provided on the pathway driving NLS formation, but it wasn't clear to me why this is important. More in discussion needed or evidence of disease relevance. *

      Responses:

      Thank you for giving us the opportunity to clarify this issue. Candidalysin expression strongly increases with and is restricted to hyphal growth, which is the adhesive growth form of C. albicans. Given that epithelial cells expunge candidalysin for their own protection while hyphae remain attached, it could be possible that neutrophils get exposed to candidalysin before they encounter C. albicans cells. Therefore, it is relevant to understand how candidalysin per se shapes neutrophil responses. We have added the following sentences to the discussion section: (line 527) ”As epithelial cells are able to expunge candidalysin for protection while C. albicans hyphae remain adeherent 46 recruited neutrophils may encounter candidalysin before direct contact with hyphae.”

      With regard to relevance for candidiasis the observation that candidalysin-deficient strains are poor inducers of NETs is most important. Since candidalysin expression is entirely restricted to hyphal growth. this finding gives crucial, new insight into the previous observation that hyphae are better NET inducers than yeast from C. albicans. In this context, we wanted to make it very clear to the reader that this effect only works when C. albicans cells and candidalysin are combined and that candidalysin alone does not lead to full-blown NET formation. Therefore, we have included a thorough investigation of the effects of candidalysin on neutrophils to be able to better contextualize our findings comparing candidalysin-expressing and candidalysin-deficient strains.

      To make this point clearer. we added the following sentence to the summary at the end of the discussion section: (line 565) “Neutrophils encountering candidalysin-expressing hyphae are able to adequately respond by releasing increased amounts of NETs whereas secretion of candidalysin does not allow hyphae to evade from neutrophil attack.”

      In addition, we are convinced that the move of former Fig.4 to the end of the result section (now Fig. 8) additionally helps the reader to better understand the importance to first delineate the effect of candidalysin on neutrophils alone and then to conclude the manuscript with experiments using different C. albicans strains to put the findings into context.

      To add more substance to our conclusions we wrap up the new version of the manuscript with data comparing wild-type and candidalysin-deficient strains in neutrophil antimicrobial assays and quantification of histone citrullination. With the newly added antimicrobial assays we demonstrate that candidalysin expression does not affect phagocytic killing (Fig. 7d and 7e) as assessed by plating assays and that candidalysin does not affect inhibition by PMA-induced NETs (Fig. 7f and 7g). Thus, as stated above, during the interaction of hyphae and neutrophils candidalysin promotes the release of more NETs, but does otherwise not affect anti-Candida activity by neutrophils. Increased NETs in turn, however, inhibit growth and limit spreading of C. albicans. The manuscript now ends with the data on differences in histone citrullination when using wild-type and candidalysin-deficient strains indicating that citrullination-induced chromatin decondensation in combination with C. albicans cells ultimately leads to increased NET release.

      We added the following text to the manuscript: (Line 416) “To corroborate, whether candidalysin deficiency affects C. albicans’ susceptibility to neutrophil attack we performed two antimicrobial assays. In the first assay we determined NET-mediated anti-Candida activity by preformed NETs comparing wild-type and candidalysin-deficient strains. We used the same imaged-based analysis with calcofluor white staining. To be able to better observe differences in susceptibility of the different strains we used a slightly higher MOI than for the previous NET inhibition assays which explains higher survival percentage (Fig. 7c, black bars on the right side). As expected, candidalysin did not affect the inhibitory effect on C. albicans imposed by NETs (Fig. 7c). In the second assay, we determined short-term anti-Candida activity of intact neutrophils, which is predominantly phagocytic elimination, by serial dilution and plating for colony counts. Candidalysin-deficient and wild-type strains are killed similarly over the time of 1 to 4 h, both at MOI 1 and 3 (Fig. 7d and 7e). This indicates that candidalysin expression does not enable evasion from neutrophil phagocytic attack and this result agrees well with our previous finding that wild-type C. albicans engulfed by human neutrophils are unable to escape by hyphal outgrowth 16. In conclusion, while candidalysin strongly increases the NET-inductive capacity of C. albicans hyphae, the toxin does neither affect the anti-Candida effect of intact neutrophils nor of NETs.”

      Notably, it is not informative to use C. albicans as inducer of NETs and as target of anti-Candida activity by NETs in the same assay, since both induction and anti-Candida activity are dependent on the amount of C. albicans cells. We therefore chose to show two separate assays where we (i) quantify short-term killing by plating (mainly phagocytosis) and (ii) quantify growth inhibition of C. albicans by pre-stimulated NETs.

      *3. In Figure 2, it would be helpful to include images of ionomycin-stim neutrophils for comparison of the NLS structures across different stim conditions. *

      Response:

      This is a very good point. We supply a structural comparison between NLS and NETs induced by PMA, ionomycin and candidalysin in Figure 3. Additionally, the time-dependent changes for ionomycin are now included in the supplementary Figure S1.

      4. Few places where reference manager has failed (see bottom on page 10, line 190 for example)

      Response:

      We have fixed this issue, thank you for pointing it out.

      *5. Lines 191-198 - I was confused here by the text. I thought the point was that candidalysin induced NLS similar to ionomycin, but here the point is being made that the two are different? This led me to being confused as to the point of all the comparisons made between ionomycin NLS and candidalysin NLS... this could be made clearer. *

      Responses:

      Thank you for highlighting this. According to previous literature ionomycin, a bacterial peptide toxin, was the most prominent example for induction of leukotoxic hypercitrullination. Therefore, we used ionomycin to put our findings with candidalysin, a fungal peptide toxin, into context. We find that candidalysin share similarities but also some striking differences to ionomycin. While we could not investigate the nature of these differences in more detail, this could be the basis of a follow-up study, we think it is important to give the reader the comparison in order to better understand how candidalysin shapes neutrophil responses. One clear difference which we show in the manuscript is that candidalysin induces some ROS whereas ionomycin does not at all (Fig. 4).

      We changed the text in the result section accordingly to make our point clearer: (line 203) “PMA exposure generated widespread chromatin fibers in the extracellular space (Fig. 3a, left panels) whereas ionomycin exposure resulted in more compact, patchy areas occasionally dispersed with long, thin chromatin fibres (Fig. 3b, middle panels). With regard to morphological changes, candidalysin treatment resulted in compact, fibrous structures resembling those stemming from ionomycin treatment, however long, thread-like structures were absent in candidalysin-treated neutrophil samples (Fig. 3a right panels, for 7 h treatment see Fig. S1c).”

      And did so as well in the discussion section: (Line 513) ”While ionomycin- and candidalysin-induced NLS shared similar key features, such as increased histone citrullination, our study revealed striking differences between the two toxins. In contrast to ionomycin, candidalysin stimulation led to ROS production in neutrophils.”

      *6. Could the authors include some unstim neutrophil control images in Fig 3 for the SEM? Can the SEM sample processing affect neutrophil structure in anyway? Feels like an important control although I don't have much experience with SEM personally *

      Response:

      This is of course a relevant control image. We have included an image showing unstimulated neutrophils from similar time points, but without exposure to candidalysin (Fig. 3). The unstimulated neutrophils are spherical and morphologically distinctly different from candidalysin-treated neutrophils.

      *7. I was very intrigued by the experiments where the authors added candidalysin in to neutrophils infected with ece1-null strain. Those experiments showed that candidalysin addition still drove NLS instead of NETosis. Can the authors investigate why this is? Is membrane intercalation different when candidalysin is delivered by hyphae vs added on its own? Could that explain some of the differences they have seen? *

      Responses:

      Thank you for this comment. Yes, there is a clear difference, since we add candidalysin to the medium such that the peptide is evenly distributed and reaches membranes rather evenly from the extracellular space. When released from growing C. albicans hyphae candidalysin is then predominantly released on hyphal tips as demonstrated in the referenced article (doi.org/10.1111/cmi.13378). Hyphal tips in turn are readily attacked by human neutrophils (doi.org/10.1189/jlb.0213063). Hence, we can safely assume according to these previous publications that there will be a more uneven distribution of candidalysin concentrations over neutrophil membranes, when the sole source of the toxin stems from growing hyphae interacting with neutrophils. It would of course be very interesting to know how the toxin exactly intercalates into membranes and which morphologies potential pores may have. These questions are currently under investigation in the laboratories of Profs Hube and Naglik. To include these findings here would certainly be far beyond the scope of this study.

      We include and modify the following sentences to the discussion of this manuscript to clarify the issue: (Line 541). ”One of the main goals of the study was to delineate contribution of candidalysin to neutrophil responses either as factor released by C. albicans hyphae or as singular peptide toxin. Our data demonstrates that candidalysin is the main driver of histone citrullination in neutrophils infected with C. albicans (Fig. 8). Lack of candidalysin production in C. albicans results in significantly reduced histone citrullination, accompanied with decreased NET formation. However, citrullination is not required for NET release, but rather governs the formation of NLS, which is dominant when candidalysin is added exogenously with even distribution throughout the cell suspension. With regard to C. albicans hyphae secreting candidalysin, local concentrations of the toxin are likely to vary to a large degree, particularly when the candidalysin-secreting hypha is engulfed by a neutrophil. Therefore, it may be difficult to discriminate NLS form NETs during the interaction of neutrophils and C. albicans, as both structures may be induced concurrently 10. It seems logical that the pore-forming activity of candidalysin augments the release of NET fibres during C. albicans infection, where PRRs will additionally be triggered on neutrophils, resulting in combinatorial activation of downstream pathways. In line with this notion, candidalysin drives histone citrullination, which contributes to chromatin decondensation.”

      *8. Is phagocytosis needed for NETosis induction by candidalysin? What happens if you add beads or beta-glucan particles with candidalysin stimulation? Do you get NLS or NETs? *

      Responses:

      This is an interesting question. Physical contact is required for the induction of NET formation (10.1111/j.1462-5822.2005.00659.x, 10.1371/journal.ppat.1000639) and physical contact leads to pattern recognition unequivocally followed by phagocytic events in neutrophils. Hence, at the least indirectly, phagocytosis and NET formation are connected, but may not be so causally.

      While glucan-covered particles have been shown to induce NETs (10.1159/000365249), we show that C. albicans cells devoid of candidalysin induce NETs, but to a much lesser extent than wild-type C. albicans. In addition, the experiment shown in Fig. 8 shows exactly that. Instead of glucan-covered beats we used C. albicans cells (Fig. 8f) which by virtue are glucan covered.

      *9. Please confirm what the n numbers refer to in the figure legends - are these biological or technical replicates? How many experiments are the representative images representing? *

      Response:

      Thank you very much for pointing this out. We adapted our figure legends accordingly and added the number of biological and technical replicates (n=x(y), x=biological replicates, y=technical replicates). Each experiment has been performed with at least three biological replicates which includes the use of different neutrophil donors.


      *Reviewer #2 (Significance (Required)):

      *

      *The advantage of this work is the presentation of the mechanism associated with NLS formation in contact with candidalysin, where activation of NADPH oxidase and calcium influx have been documented to be important. This toxin can trigger ROS production and activate downstream signaling that is important for morphological changes and NLS formation. The important finding is also that NLS are resistant to nuclease treatment and increase the ability of neutrophils to control C. albicans hyphae formation and fungal cell growth. These findings provide a better understanding of the role of neutrophils in the treatment of infections caused by these microorganisms. Below I present are minor suggestions that, in my opinion, will improve the text and correct the presentation of the results, making this set of results a valuable source for explaining such a complex problem.

      *

      Response:

      Thank you for this assessment. In cases which we have identified as crucial for our message we have decided to include additional experiments to better convey our message (Fig. 6e-f and Fig. 7d-g). We also included a time course for ionomycin stimulation of neutrophils in Fig. S1. We appreciate that the overall assessment was that no additional experiments were required.

      1/ The authors should decide what thesis about NLS they want to prove: 100 NLS are less fibrous and ....... than canonical NETs and are triggered in an NADPH oxidase-independent fashion.

      * 121 NLS were dependent on NADPH oxidase-mediated reactive oxygen species (ROS) production

      *

      Response:

      This was indeed imprecisely formulated from our side. NLS were previously described as NADPH-independent processes stimulated by toxins (see ionomycin). Candidalysin seems to trigger NADPH-dependent and NADPH-independent pathways. However, the main differentiation criteria were described through the hypercitrullination which we could observe for candidalysin. To clarify, we have modified the following sentence: (line 121) ”In contrast to previously described stimuli of NLS, candidalysin induced NLS in partial dependence on NADPH oxidase-mediated reactive oxygen species (ROS) production, wheras PAD4-mediated histone citrullination could be observed as well. Notably, candidalysin alone failed to induce NETs as indicated by a lack of cell cycle activation determined via lamin A/C phosphorylation assays.”

      *2/ for the experiment described in the line below, MOI 2 was chosen; did the authors conduct an analysis of the response/eventual change in it, depending on the MOI?

      *

      Response:

      Yes, from our experience in in vitro experiments with human neutrophils MOI3 C. albicans overgrows too quickly. This is why an MOI 1-3 is the best option to analyse NET induction capacities.

      131 we infected neutrophils with wild-type C. albicans, ECE1-deficient (ece1ΔΔ), and corresponding revertant (ece1ΔΔ*+ECE1) strains,

      3/ Has the effect of deletion of ECE1 on other aspects of virulence, such as adhesion, virulence factor production, or biofilm formation, been analyzed? *

      Response:

      Yes indeed, the effect of candidalysin on other aspects has been studied. Candidalysin has no effect on adhesion and is expressed during biofilm formation. It has a broad effect on virulence in general and promotes neutrophil recruitment indirectly by a robust induction of damages responses. To clarify the amount of studies investigating these other aspects and to pinpoint the knowledge gap for direct interaction of neutrophils and candidalysin we include the following sentence: (line 132) “C. albicans hyphae release candidalysin and while the effects of the toxin for instance on virulence in general and on adhesion to host cells have been widely studied 17,18,23,28,30, the direct impact of candidalysin on the neutrophil immune response towards C. albicans, remains poorly understood. To investigate the role of candidalysin, we infected neutrophils with wild-type C. albicans,…”

      *137 the ECE1- and candidalysin-deficient strains triggered reduced levels

      4/ Fig.1 - How were C. albicans cells stained? Does 100%NET mean the number of cells netting after PMA treatment? This information should be given.

      *

      Response:

      Thank you for pointing this out. We were a bit unclear here. We added details in the respective figure legend and method section. C. albicans cells were visualised with anti-Candida antibody (1 µg/mL, ProSci, Cat#35-645). Furthermore, C. albicans nuclei are stained by DAPI, too. 100% NETs would mean that every single neutrophil (an image event which stains for neutrophil markers) in the analysed microscopic picture shows NET or NET-like morphology. We did not normalize to PMA treated cells.

      5/ 168 dependent effect with increased NLS formation from 3 μM to 15 μ*M. However, the reduced NLS

      How was determined the limiting concentration value of the toxin, for which an increase in NLS was observed? Was a wide range of concentrations used in the analysis or was the determination made only for these three selected values? A complete concentration analysis should be performed. *

      Response:

      This is of course a valid point. We showed data on these concentrations as established from previous studies of our collaborators (10.1111/cmi.13378; 10.1038/nature17625; 10.1038/s41467-019-09915-2). Under 3 µM we did not observe much measurable results and therefore omitted these. Concentrations above 70 µM did not change the outcome anymore than at 70 µM, so higher concentrations were omitted. We, thus, show 3µM at which we see mild effects, show 15 µM (a 5-fold increase compared to 3µM) at which we see profound effects and show 70 µM (again approximately a 5-fold increase compared to 15 µM) at which we see an overwhelming effect. Additional concentrations in between the applied concentration values would not add much new information.

      6/ 169 formation was observed at 70 μ*M (Fig. 2b), which can be explained by neutrophil cell death induced by the toxin as determined by a DNA Sytox Green assay (Fig. S1a).

      Was another viability test conducted? AnnexinV? Caspase 3/7? Sytox is not a specific staining in this regard. Furthermore, in Fig. S1a you state the kinetics of cell death, also after PMA treatment. On the one hand, you say that the production of candidalysin of NLS above 70 uM is reduced due to cell death, but at the same time you define as cell death the changes under PMA, which induce netosis. Please explain this reasoning better. *

      Responses:

      Thank you for pointing this out. We have no indication that candidalysin stimulates apoptosis in neutrophils. Therefore, no AnnexinV/Caspase 3/7 stain was performed. What we wanted to emphasize is that at 70 µM candidalysin the cytotoxic character of candidalysin is overwhelming leading to rather quick cell death, as assessed by the Sytox assay. Sytox is specific in the regard that it determines whether the plasma membrane is permeable and gives the stain access to the nuclear DNA to result in a positive signal. We use this assay to quantify NET formation, since it is a quantitative assay and less laborious than microscopy. However, we always back up NET assays with microscopic, image-based analyses and do not use the Sytox assay as standalone experiment for NET quantification, since the Sytox assay is not specifically staining netting cells, but it also stains other types of cell death.

      We clarify this in the text as follows: (line 659) “Neutrophil cell death or the presence of extracellular DNA was quantified using a Sytox Green-based (Invitrogen) fluorescence assay similar to previous descriptions 2,35. To ultimately quantify NETs or NLS we always used image-based assys, the cell death assay was only used as complementation.”

      *7/ 175 mixing of granular and nuclear components at ~120 min after stimulation (Fig. 2d and Fig. S2).

      Figure S2 does not show mixing with the content of the granules. You are not labeling any granule component, only histones. You cannot draw that conclusion from these results. *

      Response:

      We respectfully disagree. As indicated in the figure legend for Figure 2d we were labelling for neutrophil elastase (red) which is located in azurophilic granules and thereby presents a marker for granular content. Since we wrongfully referred to Figure S2 here, we removed this from the text. The latter reference probably remained erroneously from a previous version.

      *8/ Fig. 2. What concentration of PMA was used? What does 100% NLS mean? How is it different from 100% NET, since you are using PMA in both cases. Please explain. *

      Responses:

      We have now defined PMA concentration in the respective figure legend (100nM). The criteria for image-based assessment of NLS and NET quantification are the same for reason of comparison. PMA is included in each of the experiments as a positive control to show that the used neutrophils react upon stimulation. To clarify, we now specify at the y-axis %NETs or NLS. As stated above, 100% NLS means that each cell event in the image has increased in diameter such that it is considered as a NET or NLS. Hence, we use a common coordinate system to quantify extracellular events (NETs and NLS) based on size.

      We have adjusted the figure legend as follows: (line 186) “Fig 2. Candidalysin induces ____NLS ____in human neutrophils. Candidalysin, but not scrambled candidalysin or pep2, another Ece1p-derived peptide (all 15 µM), induce (a) DNA decondensation in human neutrophils after 4 h (n = 4(10-14)) in a (b) dose-dependent manner (n = 3(10-14)). To allow comparability, NLS were quantified with the same criteria as previously described for NETs. Data shown as mean ± SEM. Confocal images (c) of immunostained cells display morphological changes involving nuclear and granular proteins after 4 h compared to unstimulated cells or 100 nM PMA, or cells exposed to scrambled candidalysin and pep2. The morphological changes evoked by PMA considerably deviate from morphological changes evoked by candidalysin and, hence, are defined as NETs (for PMA) and NLS (for candidalysin). Time-dependent progression of morphological changes (d) in neutrophils induced by candidalysin over the course of 5 h (all images are with 60X magnification).”

      *9/ 181 NLS were quantified with the same criteria as previous described for NETs.

      The criterion for NETs was an area above 100um2, so what is the criterion for NLS? If we assume that this is the same as for NETs, then what is the difference between NLS and NETs? The criteria adopted do not differentiate between the two forms and appear to be subjective. *

      Responses:

      As stated above, for us it was very important to find a common coordinate system to quantify NETs and NLS, since we wanted to deliver comparable and solid quantitative data. Hence, the quantification method does not discriminate between NETs and NLS. The notable morphological differences of NETs and NLS are thoroughly described with Figure 2 and Figure 3 and defined by differences in their structure. In addition, we present differences and similarities of induced pathways leading to canonical NETs or candidalysin-induced NLS in Figure 6 and Figure 7. We are convinced that, since NETs and NLS vary in size (DNA area covered), it will not be accurate for quantification purposes to include an additional size cut-off in the attempt to discriminate NLS and NETs. Instead we have established that candidalysin alone induces morphologically distinct NLS, whereas Candida albicans hyphae induce morphologically distinct NETs. By combination of quantitative data and image-based assessment, both structures can be discriminated from each other. In addition, we have established that during neutrophil and C. albicans interaction, citrullination of histone mainly stems from candidalysin. We show here and others have shown previously (10.3389/fimmu.2018.01573) that citrullination of histone occurs during but is not required for NET formation. But histone citrullination is promoted mainly by candidalysin and is also required for formation of NLS. Thus, histone citrullination constitutes another important discriminatory factor between NETs and NLS.

      We added modified and added text to the respective figure legend: (line 188) ”To allow comparability, NLS were quantified with the same criteria as previously described for NETs. Data shown as mean ± SEM. Confocal images (c) of immunostained cells display morphological changes involving nuclear and granular proteins after 4 h compared to unstimulated cells or 100 nM PMA, or cells exposed to scrambled candidalysin and pep2. The morphological changes evoked by PMA considerably deviate from morphological changes evoked by candidalysin and, hence, are defined as NETs (for PMA) and NLS (for candidalysin).”

      *10/ 190 allows a more detailed view of the neutrophil-derived structures (Error! Reference source not Please, eliminate this error. *

      Response:

      Thank you for pointing this out to us. We have fixed this error.

      *11/ 193 Ionomycin has been previously reported to induce NLS, also... 194 Both, PMA and ionomycin generated widespread chromatin fibers in the extracellular space 197 In addition, C. albicans hyphae induced NETs with observable fibers and 198 threads similar to PMA- and ionomycin-stimulated neutrophils (Fig. 3b). 199 Image-based quantification of NLS events (candidalysin and ionomycin)

      In a sentence earlier (193) you mentioned that the action of PMA leads to classical netosis and ionomycin leads to NLS. You pointed out earlier that NLS are poorly developed NETs (line 100), and here you write that PMA and ionomycin generate the same developed structures. You again differentiate between these structures depending on the stimulating factors. Pointing out the differences between the two forms, you should be more precise and consistent in your descriptions. This comment applies to the entire manuscript. *

      Responses:

      Thank you, we agree that consistency and clarity is required to describe the observed phenomena. We therefore modified or included the following sentences to the manuscript:

      • (line 203) ”PMA exposure generated widespread chromatin fibres in the extracellular space (Fig. 3a, left panels) whereas ionomycin exposure resulted in more compact, patchy areas occasionally dispersed with long, thin chromatin fibres (Fig. 3b, middle panels). With regard to morphological changes, candidalysin treatment resulted in compact, fibrous structures resembling those stemming from ionomycin treatment, however long, thread-like structures were absent in candidalysin-treated neutrophil samples (Fig. 3a right panels, for 7 h treatment see Fig. S1c)”
      • (Line 513) ”While ionomycin- and candidalysin-induced NLS shared similar key features, such as increased histone citrullination, our study revealed striking differences between the two toxins. In contrast to ionomycin, candidalysin stimulation led to ROS production in neutrophils.”

        12/ 203 NLS after 3 h and 5 h, respectively, and led to overall fewer NLS events. This was confirmed by observation. 204 area-based analysis of the events (Fig. 3d). The average area per event that exceeded 100 μ*m2 was 205 determined using the images from the DNA stain. What is the accepted criterion for distinguishing between NLS and NETs? *

      Response:

      The main criteria distinguishing canonical NETs from NLS is a higher compactness for NLS and an increased citrullination of histones, the latter being absent in canonical NETs (10.3389/fimmu.2016.00461; 10.1016/j.mib.2020.09.011). Please see our comment above (regarding reviewer comment 9). Comparing candidalysin and ionomycin as stimuli for NLS they share key similarities, such as increased citrullination of histone (Fig. 3) and more compact structures than NETs (Fig. 3) with an average size of 151 µm2 for candidalysin-induced and 149 µm2 for ionomycin-induced NLS compared to 262 µm2 for PMA-induced and 231 µm2 for C. albicans-induced NETs (for clarification these average sizes are stated in the text). However, the NLS triggered by candidalysin and ionomycin also show differences. Ionomycin occasionally results in extended chromatin threads, whereas candidalysin does not. Ionomycin induces no ROS at all, whereas candidalysin does to some extent. By consistent usage of the definitions for NETs and NLS and by pinpointing the differences between ionomycin and candidalysin in terms of NLS induction (which are previously unknown) we hope we have sufficiently addressed this comment.

      *13/ line 218, 243 - reference error *

      Response:

      Thank you, we have fixed this error

      14/ What form are we actually talking about? Are we focusing on the effect of a natural agent or a synthetic one in relation to NLS/NET? Perhaps it is more important to focus on the citrullination process.

      • 247 synthetic candidalysin only induces NLS, we concluded that candidalysin augments NET release when the toxin is secreted by C. albicans hyphae. 256 This confirmed that candidalysin promotes C. albicans-triggered NET release. 262 Interestingly, the addition of synthetic candidalysin resulted in a shift to NLS, 274 External addition of synthetic candidalysin resulted in a shift to NLS structures rather than NETs as visualized by microscopy after 5 h incubation (20X).*

      Response:

      We used the adjective “synthetic” here to make clear that this is a synthetized peptide and not candidalysin isolated from growing C. albicans. Having said that, we fully agree that the synthetized peptide and the one released by C. albicans cells are essentially identical on the molecular level and thus it is irrelevant and confusing to state in this context here. Therefore, we removed the adjective “synthetic” throughout the study and refer the reader to the method section for information on the origin of candidalysin used in the study. At times, we state “candidalysin alone” when we want to emphasize that candidalysin was the sole trigger used for the respective assay.

      15/ Has there been any method to track candidalysin production during contact of C. albicans with neutrophils?

      Responses:

      Thank you for this comment. Yes, there is a QVQ nanobody that can be used which is currently not to our disposal (doi.org/10.1111/cmi.13378). However, we already know from this publication that candidalysin concentrations vary when released naturally. The concentrations are particularly high in invasion pockets or dense biofilms. We also know that if we add candidalysin to the medium we have even distribution throughout and this is by definition different from concertation spikes at host cell-fungal interaction sites. As we have stated above, hyphal tips in turn are readily attacked by human neutrophils (doi.org/10.1189/jlb.0213063). Hence, we can safely assume, according to these previous publications, that there will be a more uneven distribution of candidalysin concentrations over neutrophil membranes, when the sole source of the toxin stems from growing hyphae interacting with neutrophils. It would of course be very interesting to know how the toxin exactly intercalates into membranes and which morphologies potential pores may have. These questions are currently under investigation in the laboratories of B. Hube and J. Naglik. To incorporate these findings here would certainly be far beyond the scope of this study.

      We include and modify the following sentences to the discussion of this manuscript to clarify the issue: (Line 544). ” Lack of candidalysin production in C. albicans results in significantly reduced histone citrullination, accompanied with decreased NET formation. However, citrullination is not required for NET release, but rather governs the formation of NLS, which is dominant when candidalysin is added exogenously with even distribution throughout the cell suspension. With regard to C. albicans hyphae secreting candidalysin, local concentrations of the toxin are likely to vary to a large degree, particularly when the candidalysin-secreting hypha is engulfed by a neutrophil. Therefore, it may be difficult to discriminate NLS form NETs during the interaction of neutrophils and C. albicans, as both structures may be induced concurrently 10. It seems logical that the pore-forming activity of candidalysin augments the release of NET fibres during C. albicans infection, where PRRs will additionally be triggered on neutrophils, resulting in combinatorial activation of downstream pathways. In line with this notion, candidalysin drives histone citrullination, which contributes to chromatin decondensation.”

      *16/ In Figure 4f-the given information indicates 1,2 hour incubation, in the caption of the figure there is information about 5 hour incubation - please clarify. The description of the stains used is lacking. *

      Response:

      Microscopic analysis performed after 5h incubation time, whereas candidalysin has been added to different time points indicated in the Figure (in the new version this is now Figure 8f). We clarified in the legend as follows: (line 472) “(f) Neutrophils were infected with C. albicans and 15 µM candidalysin was added 0 h, 1 h or 2 h after the infection. Addition of candidalysin at the different time points after C. albicans infection resulted in a shift to NLS structures rather than NETs as visualized by microscopy after 5 h total incubation (20X).” The description of the strains is depicted directly in the Figure, next to the microscopic images.

      *17/ Fig. 5 - result for 15 uM MitoTEMPO - adds nothing to the results and introduces image information noise - should be removed. No information on the concentration of the peptide used. *

      Responses:

      We would like to keep the 15 µM MitoTEMPO concentration, since it is the more reasonable concentration at which we do not observe an effect. This argues that ROS is more-likely derived from NADPH oxidase and not mitochondrial ROS. We show TEMPOL effects at 15 µM and at 100 µM to document the dose dependency and for the sake of comparability, we would like to keep both concentrations also for MitoTEMPO.

      The indicated peptide concentration was added to the figure legend. Thank you for pointing this out.

      *18 / Fig. 5, line 309: and cell-permeable Sytox Green DNA dye (250310 nM) to determine the total number of cells".

      Please correct the information on the use of both dyes, according to the manufacturer's description: "SYTOX® Green nucleic acid stain is an excellent green-fluorescent nuclear and chromosome counterstain that is impermeant to live cells, making it a useful indicator of dead cells within a population." *

      Response:

      Thank you for highlighting this error. Indeed, we used Syto Green for this particular staining, a dye which stains both live and dead cells since the dye is cell-permeable. We corrected the error at this section of the text.

      *19/ 324 At later time points, BAPTA-AM led to an increase in NLS, probably due to toxic effects as indicated by higher background levels of NLS formation in non-stimulated, BAPTA-AM-treated neutrophils (Fig. 6d).

      If such an assumption is made, the toxic effect should also be observed for the control. *

      Response:

      The toxic effect was observed while conducting the experiments, but cannot be seen in the size-base quantification which is the read out for this particular experiment. We have performed a cytotoxicity assay using flow cytometry and PI staining to confirm the effect. The results are added as supplemental Figure (Fig. S3b).

      *20/Fig. 6C PAD inhibitor should affect PMA-induced netosis, but the figure presents NLS existence - how was this change found? *

      Responses:

      We are grateful for the opportunity to explain this more thoroughly. PMA does not trigger histone citrullination (10.3389/fimmu.2016.00461) and thereby there is no effect of the PAD inhibitor on PMA-induced NETs. Notably, some level of histone citrullination can also be observed in unstimulated neutrophils (see Fig. 3, 5 and 8), since histone modification is not exclusively dependent on stimulation. However, upon PMA stimulation we observe a decrease (Fig. S1b), not an increase, of histone citrullination consistent with previous reports.

      We adjusted the text as follows: (line 235) “. Expectedly, citH3 levels upon PMA stimulation did not increase, but rather decreased which is consistent with previous reports 10 (Fig. 3d and Suppl. Fig. S1b). While citrullination levels in unstimulated neutrophils decreased over time, ionomycin stimulation sustained high levels over 5 h.

      *21/ line320 "This indicates that candidalysin most probably causes Ca2+ influx via pore formation and not via direct receptor stimulation" And: line 358. As C.albicans hyphae bind to pathogen recognition receptors (PRRs), activate neutrophils and ultimately promote the release of NETs, we aimed to elucidate whether candidalysin alone leads to the activation of similar pathways in neutrophils. Hence, we stimulated neutrophils with candidalysin in the presence or absence of specific inhibitors for SYK, PI3K, and Akt.

      Lack of consistency in conclusion. *

      Response:

      Thank you for pointing this out. We adjusted the paragraph (line 331) as follows: “As C. albicans hyphae bind to pathogen recognition receptors (PRRs), activate neutrophils and ultimately promote the release of NETs, we aimed to elucidate whether candidalysin alone can trigger similar pathways in neutrophils via signalling cross talk induced by Ca2+ influx. Hence, we stimulated neutrophils with candidalysin in the presence or absence of specific inhibitors for SYK, PI3K, and Akt (Fig. 6b).”

      *22/ Fig. 7 It would be good to verify these results with experiments using mutants. Figures 7b, 7c, and 7d can be combined to make the whole drawing clearer. *

      Response:

      We thought this is very relevant and included additional experiments showing that the mutant strains also induce phosphorylation of lamin A/C independent of the expression of candidalysin (new Fig. 6e and 6f).

      *23/ line 603 'The percentage of dead cells was calculated using TritonX-100 lysed neutrophils as 100% control' - maybe use " treated or permeabilized" *

      Response:

      Thank you, we changed the phrasing accordingly.

    1. Reviewer #1 (Public Review):

      The authors aimed to contrast the effects of pharmacologically enhanced catecholamine and acetylcholine levels versus the effects of voluntary spatial attention on decision making in a standard spatial cueing paradigm. Meticulously reported, the authors show that atomoxetine, a norepinephrine reuptake inhibitor, and cue validity both enhance model-based evidence accumulation rate, but have several distinct effects on EEG signatures of pre-stimulus cortical excitability, evoked sensory EEG potentials and perceptual evidence accumulation. The results are based on a reasonable sample size (N=28) and state-of-the art modeling and EEG methods.

      Although the authors draw a few partial conclusions that are not fully supported by the data (see below), I think that the authors' EEG findings provide sufficient support for the overall conclusion that "selective attention and neuromodulatory systems shape perception largely independently and in qualitatively different ways". This is an important conclusion because neuromodulatory systems and selective spatial attention are both known to regulate the neural gain of task-relevant single neurons and neural networks. Apparently, these effects on neural gain affect decision making in dissociable ways.

      The effects of donepezil, a cholinesterase inhibitor, were generally less strong than those of atomoxetine, and in various analyses went in the opposite direction. The authors fairly conclude that more work is necessary to determine the effects of cholinergic neuromodulation on perceptual decision making.

      1) I believe that the following partial conclusions are not fully supported by the data:

      a) In the results section on page 6, the authors conclude that "Attention and ATX both enhanced the rate of evidence accumulation towards a decision threshold, whereas cholinergic effects were negligible." I believe "negligible" is wrong here: the corresponding effects of donepezil had p-values of .09 (effect of donepezil on drift rate), .07 (effect of donepezil on the cue validity effect on drift rate) and .09 (effect of donepezil on non-decision time), and were all in the same direction as the effects of atomoxetine, and would presumably have been significant with a somewhat larger sample size. I would say the effects of donepezil were "in the same direction but less robust" (or at the very least "less robust") instead of "negligible".

      b) "In the results section on page 8, the authors conclude that "Summarizing, we show that drug condition and cue validity both affect the CPP, but they do so by affecting different features of this component (i.e. peak amplitude and slope, respectively)."<br /> This conclusion is a bit problematic for two reasons. First, drug condition had a significant effect not only on peak amplitude but also on slope. Second, cue validity had a significant effect not only on slope but also on peak amplitude. It may well be that some effects were more significant than others, but I think this does not warrant the authors' conclusion.

      c) In the discussion section on page 11, the authors conclude that "First, although both attention and catecholaminergic enhancement affected centro-parietal decision signals in the EEG related to evidence accumulation (O'Connell et al., 2012; Twomey et al., 2015), attention mainly affected the build-up rate (slope) whereas ATX increased the amplitude of the CPP component (Figure 3D-F)."<br /> As I wrote above, I believe it is not correct that "attention mainly affected the build-up rate or slope", given that the effect of cue-validity on CPP slope was also significant. Also, while the authors' data do support the conclusion that ATX increased the amplitude and not the slope of the CPP component, a previous study in humans found the opposite: ATX increased the slope but did not affect the peak amplitude of the CPP (Loughnane et al 2019, JoCN, https://pubmed.ncbi.nlm.nih.gov/30883291/). Although the authors cite this study (as from 2018 instead of 2019), they do not draw attention to this important discrepancy between the two studies. I encourage the authors to dedicate some discussion to these conflicting findings.

      2) On page 12 and page 14 the authors suggest a selective effect of ATX on *tonic* catecholamine activity, but to my knowledge the exact effects of ATX on phasic vs. tonic catecholamine activity are unknown. Although microdialysis studies have shown that a single dose of atomoxetine increases catecholamine concentrations in rodents, it is unknown whether this reflects an increase in tonic and/or phasic activity, due to the limited temporal resolution of microanalysis. Thus, atomoxetine may affect tonic and/or phasic catecholamine activity, and which of these two effects dominates is still unknown, I think.

    1. Inventors ignoring the ethical consequences of their creations is nothing new as well, and gets critiqued regularly:

      I think that this is true. We are so busy with creating things that are technologically advanced and never thinking about whether or not the invention is good and if this invention may lead to something negative. For example, ChatGPT has been viewed with many different opinions, it can be very helpful to us, but in some ways or in further applications, it may actually be really harmful to us.

    1. winnicott once said you know there's no such thing as a baby there's only a baby and someone
      • "gestation rewires your brain in fundamental ways um you it rewire it primes you for caretaking as a as a mother in a way which is far more visceral and far it's it's pre-rational it's it's immensely transformative experience and it's permanent you know once you've been rewired for mummy brain you'd never really go back um and that from the point of view of raising a child that matters um because when after a baby is born it's you know as winnicott once said you know there's no such thing as a baby there's only a baby and someone there's a a baby doesn't exist as an independent entity until it's some years some years into its life arguably quite a few years into its life um and what I would say about artificial wounds is that you may be you may think that what you're doing is creating a baby without the misery of gestation but what you're doing in practice is creating a baby without creating a mother because a pregnancy doesn't just create a baby it also creates a mother"

      • Comment

    2. when people do die it is almost like I think a colleague of mine under Sandberg 00:31:27 says that when somebody died the library Burns because all of that wisdom that they're carrying around in their minds that it took decades and decades to build up inside of them gets extinguished
      • comment
        • I think many of us have had this thought!
          • that when we die, vast amounts of wisdom is extinguished along with that person
          • As our digital tools become more sophisticated, however,
        • we are uploading our libraries to the digital collective intelligence network
          • the internet may well evolve to become the epitome and master repository of human cumulative cultural evolution.
          • even AI could not exist if it did not mine a training set of billions of human and their shared ideas
          • Perhaps it is the internet which is the vehicle for collective hybridized human-cyborg immortality?
          • If knowledge is preserved this way, then this flavor of immortality is only meaningful for our species
    1. @chrisaldrich I think the is an underated idea more broadly. I would love to see this done with other authors books that use an index card system, like Robert Greene. I think it would be a useful illustration to help people better understand the research and writing process. I've been wanting to and created a few experimental vaults where I do a similar thing except for a podcast (all of Sean Carroll's Mindscape transcripts are free) or a textbook (Introduction to Psychology). But I never followed through on the projects just because of how much work it takes to due it right. This also makes me wish for a social media type zettelkasten, where a community can keep a shared vault, creating a social cognition of sorts. I know this was kind of happening with the shared vaults Dan Alloso was experimenting with but his seemed more focused than random/chaotic. I'm also not sure if he continued it for later books.

      Reply to Nick at https://forum.zettelkasten.de/discussion/comment/17926/#Comment_17926

      Some pieces of social media come close to the sort of sense making and cognition you're talking about, but none does it in a pointed or necessarily collaborative way. The Hypothes.is social annotation tool comes about as close to it as I've seen or experienced beyond Wikipedia and variations which are usually a much slower boil process. As an example of Hypothes.is, here's a link to some public notes I've been taking on the "zettekasten output problem" which I made a call for examples for a while back. The comments on the call for examples post have some rich fodder some may appreciate. Some of the best examples there include videos by Victor Margolin, Ryan Holiday (Robert Greene's protoge), and Dustin Lance Black along with a few other useful examples that are primarily text-based and require some work to "see".

      For those interested, I've collected a handful of fascinating examples of published note collections, published zettelkasten, and some digitized examples (that go beyond just Luhmann) which one can view and read to look into others' practices, but it takes some serious and painstaking work. Note taking archaeology could be an intriguing field.

      Dan Allosso's Obsidian book club has kept up with additional books (they're just finishing Rayworth's Doughnut Economics and about to start Simon Winchester's new book Knowing What We Know, which just came out this month.) Their group Obsidian vault isn't as dense as it was when they started out, but it's still an intriguing shared space. For those interested in ZK and knowledge development, this upcoming Winchester book looks pretty promising. I'd invite everyone to join if they'd like to.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We would like to thank the reviewers for their professional comments and constructive suggestions. Our current plan is to revise the manuscript and supplemental materials in response to the reviewers’ requests and suggestions. Toward this goal we began experiments to obtain new data requested by the reviewers and anticipate the outlined experiments can be completed within the next three months.

      2. Description of the planned revisions:

      __Reviewer #1: __

      The results of experiments where Arp2/3 is blocked (Fig.2) should be confirmed by Arp2/3 knock-down and with an independent Arp2/3 inhibitor. Several are available (CK-869, Benproperine, Pimozide). For Fig.3 and 4, that would not be necessary, but to establish the specificity of the effect in fig.2 this is absolutely required.

      __Response: __As requested, we will include new data with CK-869, the indicated Arp2/3 complex inhibitor. We purchased the inhibitor and are currently confirming its efficacy before testing whether it inhibits transition to the hESC naïve state. However, we respectfully disagree with generating a Arp2/3 knock-down hESC line. Arp2/3 complex genes are known to be essential genes in both mouse and human embryonic stem cells (PMID: 29662178 and PMID: 31649057). Furthermore, reports on successful knockout of complex subunits indicate that additional genetic manipulations are needed to maintain cell survival, including knockout of INK4A/ARF to bypass apoptosis associated with Arp2 shRNA knockdown (PMID: 22385962) and genetic manipulations in mouse models (PMID: 22492726. Thus, knock-down of Arp2/3 complex members in our cells is beyond the scope of this manuscript.

      Yilmaz A, Peretz M, Aharony A, Sagi I, Benvenisty N. Defining essential genes for human pluripotent stem cells by CRISPR-Cas9 screening in haploid cells. Nat Cell Biol. 2018 May;20(5):610-619. doi: 10.1038/s41556-018-0088-1. Epub 2018 Apr 16. PMID: 29662178.

      Shohat S, Shifman S. Genes essential for embryonic stem cells are associated with neurodevelopmental disorders. Genome Res. 2019 Nov;29(11):1910-1918. doi: 10.1101/gr.250019.119. Epub 2019 Oct 24. PMID: 31649057; PMCID: PMC6836742.

      Wu C, Asokan SB, Berginski ME, Haynes EM, Sharpless NE, Griffith JD, Gomez SM, Bear JE. Arp2/3 is critical for lamellipodia and response to extracellular matrix cues but is dispensable for chemotaxis. Cell. 2012 Mar 2;148(5):973-87. doi: 10.1016/j.cell.2011.12.034. PMID: 22385962; PMCID: PMC3707508.

      I believe that the status of the actin cytoskeleton in both states is not well enough characterized. This is especially obvious for branched actin networks themselves that depend on the Arp2/3. To this end, the authors may localize Arp2/3 or cortactin, a useful surrogate that often gives a better staining. This point is particularly important since contractile fibers are not made of branched actin. Myosin cannot walk or pull along branched actin networks because of steric hindrance. It might well be that branched actin networks are debranched after Arp2/3 polymerization. I suggest staining tropomyosins that would indicate where the transition between branched and unbranched actin would be. Along this line, phosphoERMs should be localized and revealed by Western blots (we expect an increase from primed to naive state) because they cannot perform the proposed function of linker between the membrane and actin filaments if they are not phosphorylated.

      Response: As requested we will include new data with cortactin immunolabeling, which we already completed. These new data, shown below, confirm that cortactin, which binds to branched actin filaments, co-localizes with the F-actin fence around hESC naïve colonies, suggesting that the fence includes branched F-actin. Also as requested, we are currently immunoblotting for phosphorylated ERMs to more thoroughly assess if they may serve as a linker between the membrane and actin filaments.

      Branched actin is required for cell cycle progression and cell proliferation in normal cells. This requirement is lost in most cancer cells (Wu et al., Cell 2012; Molinie et al., Cell Res 2019). This would be really important to know whether ESCs stop proliferating upon CK-666 treatment. In other words, do they behave like normal cells or transformed cells. Proliferation is a major function that depends on the YAP pathway. Cell counts and EdU incorporation can easily provide answers to this important question.

      __Response: __As requested, we will include new data on proliferation. We anticipate that these new data will complement data we already have showing that CK-666 does not impair proliferation compared with hESC controls. We also note that the role of the actin cytoskeleton in proliferation is well established and an increase in proliferation is a hallmark of acquisition of the naïve state of pluripotency (PMID: 35005567).

      Chen C, Zhang X, Wang Y, Chen X, Chen W, Dan S, She S, Hu W, Dai J, Hu J, Cao Q, Liu Q, Huang Y, Qin B, Kang B, Wang YJ. Translational and post-translational control of human naïve versus primed pluripotency. iScience. 2021 Dec 17;25(1):103645. doi: 10.1016/j.isci.2021.103645. PMID: 35005567; PMCID: PMC8718978.

      Minor Comments:

      What about the rescue of cell morphology? Does active YAP restore the intercellular contractile bundle?

      __Response: __As requested, we obtained these data, as shown below. Expression of the YAP-S127A mutant does rescue the formation of the actin ring architecture in the presence of CK666. We are currently performing additional dedifferentiation assays to immunolabel for pMLC and address the question of if expression of YAP-S127A restores the contractile bundle.

      __Reviewer #2: __

      The authors found that a ring of actin filaments at the colony periphery was characteristic of the naive hESCs. However, because all the data are presented as an image of a single confocal section, the 3D organization of the actin filaments is not clear. Although the authors drew a scheme for this actin ring being located in the apical domain of polarized cells, such data have not been provided in the manuscript. Since naive hESCs form dome-like colonies, it is important to show the 3D organization of actin filaments in the colony. 3D reconstruction of confocal microscopy images of the naive hESC colonies is required to show the relationship between actin filaments, adherens junctions, and the nuclei (as a reference for the Z axis). If 3D reconstruction is not technically possible, confocal images at different Z levels and maximum projection images should be obtained and provided.

      __Response: __As requested, we are currently generating 3D images of the actin fence by using Imaris software, which we previously used to show 3D images of mitochondrial morphology (PMID: 34038242)

      Manoli SS, Kisor K, Webb BA, Barber DL. Ethyl isopropyl amiloride decreases oxidative phosphorylation and increases mitochondrial fusion in clonal untransformed and cancer cells. Am J Physiol Cell Physiol. 2021 Jul 1;321(1):C147-C157. doi: 10.1152/ajpcell.00001.2021. Epub 2021 May 26. PMID: 34038242; PMCID: PMC8321791.

      Some of the statistical analyses were inappropriate. The authors have used Student's t-test for all analyses; however, one-way ANOVA and post-hoc analysis must be used to compare three or more groups (Figs. 2B, D, E, 3G, 4B, D, E).

      __Response: __As requested, we will re-evaluate our statistical analysis. We note that our submission reports comparisons between two groups, and hence, Student’s t-test is appropriate. For example, we compared primed and naïve to demonstrate successful acquisition of naïve pluripotency, and then we compared the naïve condition to the CK666-treated conditions to demonstrate the impact of CK666-treatment. As Reviewer 2 suggests we will reanalyze all quantifications using one-way ANOVA with post-hoc analysis in the full revision and we will also discuss with Stuart Gansky, a statistician at UCSF whom we previously consulted for most appropriate statistical analysis of our studies.

      Minor Comments:

      Page 9, second paragraph. In the discussion section, authors have written that "Cells within the ICM of mouse blastocysts exclude YAP from the nucleus whereas cells within the ICM of human blastocysts maintain nuclear YAP." However, a recent study has reported that the ICM/epiblast of mouse late blastocysts also express nuclear YAP. Epiblast Formation by TEAD-YAP-Dependent Expression of Pluripotency Factors and Competitive Elimination of Unspecified Cells. Hashimoto M, Sasaki H. Dev Cell. 2019, 50:139-154.e5. doi: 10.1016/j.devcel.2019.05.024.

      __Response: __As requested, we will revise our Discussion section to include findings from the indicated new publication.

      Reviewer #3:

      Many of their conclusions seem to be based on the qualitative analysis of a single image (e.g. Figures 1D-G, Fig 2G, Supplementary Figure 2). The authors should provide quantitative information regarding these analyses and indicate the number of cells/replicas collected for each experiment.

      __Response: __As requested, our revision will have added quantitative data when feasible. We note that in the field, traction force microscopy isn’t commonly quantified beyond including scale bars, which our original manuscript shows. Moreover, pluripotency is standardly not quantified because it is a binary switch - cells are either double positive or they are not. We show 100% double positive, and rtPCR data with known stage-specific markers.

      The actin ring surrounding hESCs colonies was previously described by Närvä et al. Although the authors cited this previous work, they do not discuss in deep the differences and similarities with their observations.

      __Response: __As requested, our revised manuscript with include additional detail comparing our results with those from Närvä et al. In brief, we observe the formation of this actin ring only in the naïve state of pluripotency, whereas Närvä et al. observe an actin architecture in the primed state. One possible source of difference between their study and ours are the cells used for analysis. Närvä et al. utilize induced pluripotent stem cells, long since proposed to be closer to naïve pluripotency than primed stem cells as conventionally isolated and maintained (see PMID: 27424783 and PMID: 19497275). Additionally, we observe that the contractile actin ring in naïve pluripotent stem cells is in a higher z-plane than reported by Närvä et al., although a direct comparison is difficult to make.

      Theunissen TW, Friedli M, He Y, Planet E, O'Neil RC, Markoulaki S, Pontis J, Wang H, Iouranova A, Imbeault M, Duc J, Cohen MA, Wert KJ, Castanon R, Zhang Z, Huang Y, Nery JR, Drotar J, Lungjangwa T, Trono D, Ecker JR, Jaenisch R. Molecular Criteria for Defining the Naive Human Pluripotent State. Cell Stem Cell. 2016 Oct 6;19(4):502-515. doi: 10.1016/j.stem.2016.06.011. Epub 2016 Jul 14. PMID: 27424783; PMCID: PMC5065525.

      Nichols J, Smith A. Naive and primed pluripotent states. Cell Stem Cell. 2009 Jun 5;4(6):487-92. doi: 10.1016/j.stem.2009.05.015. PMID: 19497275.

      The qualitative observation of Figure 3F suggests a lower overall YAP levels in primed and +CK666 cells in comparison to naive cells. Could the authors check if this is correct and, if this is the case, explain the observation?

      __Response: __As requested, our revision will include new data on YAP protein expression by immunoblotting.

      The authors should discuss deeper the rationale of the pan-ERM immunostaining experiments (since they used the individual antibodies afterwards) and provide a brief discussion of their results and, in particular, the colocalization with moesin but not with ezrin or radixin.

      __Response: __As requested, our revised manuscript will include a more detailed discussion of our results with ERM immunolabeling.

      2. Description of the revisions that have already been incorporated in the transferred manuscript:

      __Reviewer #1: __

      Minor Comments:

      Fig2F: non-representative pictures or wrong quantification of the CK666 condition.

      __Response: __We thank the review for alerting us to this error. The CK666 Primed and Naïve condition images were swapped. We have edited the figure to correct this.

      Fig3A: Y-axis? What is it? How is it adjusted? -Log P?

      __Response: __Please see the methods section. Differential expression analysis was performed using DESeq2 R package. The resulting P values were adjusted (padj) using the Benjamini and Hochberg’s approach for controlling the False Discovery Rate (FDR). Genes with a padj

      Colors of dots not really visible (in reference to Figure 3A).

      __Response: __We thank the reviewer for this comment and have updated the figure to use more standard, colorblind-friendly color choices (see the above figure). Additionally, we fixed a drawing error in the figures when creating the volcano plots.

      Typos: Apr2/3 in the abstract, Hoeschst in Fig.S1B.

      __Response: __We thank the review for alerting us to these errors. We have edited the manuscript to correct them.

      __Reviewer #3: __

      There are many experimental details missing that are extremely relevant to fully understand the experiments and evaluate the robustness of the analyses (e.g., microscopy setup, fluorescent probes used for immunostaining, incubation conditions with the inhibitors SMIFH2 and CK666).

      __Response: __As requested we have updated the Materials and Methods section with more detailed information on procedures and reagents.

      Minor Comments:

      The Introduction makes the reader think that actin is the only cytoskeletal network involved in embryo development and stem cell properties. They should also include a brief discussion on the relevance of the other cytoskeletal networks in mechanotransduction and cell fate decisions.

      __Response: __As requested, we will revise our Introduction. We note, however, that in the field additional cytoskeleton components, including intermediate filaments and microtubules have mostly been shown for interacting with the nucleus with limited evidence for roles in differentiation.

      Many of the images seem to require a flat-field correction. Could the authors check that the illumination is homogeneous? This artifact could affect the data analysis.

      __Response: __As we indicate in the Methods section, the spinning disc confocal microscopes used in our study are equipped with a Borealis to mitigate uneven illumination across the field of view. Additionally, quantification in Figures 2C-E, Figures 3F-G, and Figures 4A-D are comparing measurements to a local background (i.e. cytoplasm nearby) in order to normalize for any uneven illumination effects.

      There are many abbreviations that are not defined in the text and are extremely specific to the field.

      __Response: __As requested, we have expanded the definition of many abbreviations in the text and any additional abbreviations changes will be clearly defined in our revised manuscript.

      Could the authors explain the selection of the pluripotency markers studied by qPCR? Specifically, why they studied DNMT3L, DPPA3, KLF2, and KLF4 (Fig. 1B) and the different set PECAM1, ESRRB, KLF4, and DNMT3L in Fig. 2B.

      __Response: __Defining the exact molecular and cell behavioral characteristics of naïve pluripotency remains an evolving point of development within the field. The pluripotency markers used in both original panels are known and established markers of naïve pluripotency. The original panel of DNMT3L, DPPA3, KLF2, and KLF4 was established based upon RNAseq datasets publicly available, whereas the secondary panel of PECAM1, ESRRB, KLF4, and DNMT3L was a more targeted analysis of genes found in the literature which have been interrogated in more detail for potential roles in naïve pluripotency. To facilitate clarity within the manuscript, we have updated Fig. 1B to match Fig. 2B for the purposes of defining a transcriptional hallmark of naïve pluripotency for the purposes of this manuscript.

      Figures 1G and 2G, please include the images of the colonies.

      __Response: __As requested, our revised manuscript will include phase contrast images, which we already have, as shown below. These images will be included Supplemental Figure 1 and Supplemental Figure 2 for the colonies used to show representative tractions in Figure 1G and 2G, respectively.

      3. Description of analyses that authors prefer not to carry out

      Reviewer #1:

      The results of experiments where Arp2/3 is blocked (Fig.2) should be confirmed by Arp2/3 knock-down and with an independent Arp2/3 inhibitor. Several are available (CK-869, Benproperine, Pimozide). For Fig.3 and 4, that would not be necessary, but to establish the specificity of the effect in fig.2 this is absolutely required.

      __Response: __As requested, we will include new data with CK-869, the indicated Arp2/3 complex inhibitor. We purchased the inhibitor and are currently confirming its efficacy before testing whether it inhibits transition to the hESC naïve state. However, we respectfully disagree with generating a Arp2/3 knock-down hESC line. Arp2/3 complex genes are known to be essential genes in both mouse and human embryonic stem cells (PMID: 29662178 and PMID: 31649057). Furthermore, reports on successful knockout of complex subunits indicate that additional genetic manipulations are needed to maintain cell survival, including knockout of INK4A/ARF to bypass apoptosis associated with Arp2 shRNA knockdown (PMID: 22385962) and genetic manipulations in mouse models (PMID: 22492726. Thus, knock-down of Arp2/3 complex members in our cells is beyond the scope of this manuscript.

      Yilmaz A, Peretz M, Aharony A, Sagi I, Benvenisty N. Defining essential genes for human pluripotent stem cells by CRISPR-Cas9 screening in haploid cells. Nat Cell Biol. 2018 May;20(5):610-619. doi: 10.1038/s41556-018-0088-1. Epub 2018 Apr 16. PMID: 29662178.

      Shohat S, Shifman S. Genes essential for embryonic stem cells are associated with neurodevelopmental disorders. Genome Res. 2019 Nov;29(11):1910-1918. doi: 10.1101/gr.250019.119. Epub 2019 Oct 24. PMID: 31649057; PMCID: PMC6836742.

      Wu C, Asokan SB, Berginski ME, Haynes EM, Sharpless NE, Griffith JD, Gomez SM, Bear JE. Arp2/3 is critical for lamellipodia and response to extracellular matrix cues but is dispensable for chemotaxis. Cell. 2012 Mar 2;148(5):973-87. doi: 10.1016/j.cell.2011.12.034. PMID: 22385962; PMCID: PMC3707508.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We thank the reviewers for their constructive feedback, which has helped us improve the manuscript considerably (no comment on whether the improvements are “significant”). Below are our point-by-point responses. We have also highlighted all changes in the manuscript.

      2. Point-by-point description of the revisions

      Reviewer 1

      Summary

      In this study, Obodo et al. present a new iteration of their popular rhythm analysis tool LimoRhyde. The conceptual advancement in this new iteration is the focus on effect sizes (in the form of point estimates of amplitude and their prediction intervals) rather than the p-values, which has been the predominant form of statistical testing for rhythm analysis. Therefore, compared to a well-established non-parametric method for rhythm testing, LimoRhyde2 selects genomic features with larger amplitudes (effect-sizes) as it is designed to do.

      Major Comments

      1. (LimoRhyde2 algorithm, Page 2-) It is unclear what exactly the contributions/advancements of the authors are? Is it a novel statistical method, the combination of well-established tools in a novel workflow, or is it a novel application to a new field (rhythms)? I am afraid the sentence "LimoRhyde2 builds on previous work by our group and others to rigorously analyze data from genomic experiments [9,16,17], capture non-sinusoidal rhythms [18], and accurately estimate effect sizes [14,19]." is rather ambiguous.

      We have revised this sentence in the last paragraph of the Introduction to clarify LimoRhyde2’s contributions.

      1. (Moderate model coefficients, Page 3-) The authors implement empirical Bayes shrinkage on the coefficients. But the state-of-the-art methods used in LimoRhyde2 for linear model fitting, such as DESeq2/limma-voom/limma-trend, already implement shrinkage for the coefficients. Does algorithm implement a second round of Bayes shrinkage on the rhythm effect-sizes? How or why is this a statistically valid procedure? If not, how does Limorhyde2 add to shrinkage already implemented in DESeq2/limma-voom/limma-trend? Please elaborate.

      To our understanding, the two shrinkage procedures work at different levels and serve different purposes. Limma applies shrinkage on residual variances to account for any technical variation and to give a higher power to detect effects for data with smaller sample sizes within each condition; it does not shrink coefficients. In practice, limma’s shrinkage has little effect given the relatively large sample sizes of most circadian experiments. LimoRhyde2, on the other hand, uses mashr to apply shrinkage to the coefficients themselves to account for shared patterns of effects and variation across both features and conditions. We see no reason this approach is invalid, and in our conversations with Matthew Stephens, the author of ashr and mashr, he felt the same. We elaborate on each method’s contributions in the Discussion (paragraph 2).

      1. I think the goal to move to effect-sizes which lead to more reproducible results and better biological significance is sound and highly appreciated. However, to make the community switch to a completely different way of viewing their genomic analysis requires more convincing examples(s)/use-cases on why they should abandon the old method that they are used to. Now, results section merely shows that this algorithm performs as designed (to find large amplitude rhythms).

      We appreciate the comment and acknowledge that some readers may be particularly attached to p-values and our current analysis may not wholly convince them of the value of effect sizes. We believe the manuscript stands on its own, however, and are using LimoRhyde2 to guide experiments whose conclusions we hope to describe in future work. Nonetheless, we have revised the Discussion (paragraph 4) to clarify that some known relevant genes highly ranked by LimoRhyde2 were underappreciated by BooteJTK.

      1. Related to point 3, others have previously proposed using amplitude (effect-size) thresholds in addition to the p-value cutoffs (Lück & Westermark, 2016, Pelikan et al, 2022), how would the results of Limorhyde2 compare in a fairer contrast where both p-value and amplitude thresholds are implemented? Does the proposed sound method outperform the two-step approach. The authors may perform this analysis on their chosen datasets as well.

      Thank you for raising this point. Indeed, one way to view LimoRhyde2 is as a data-driven balancing of raw effect size and p-value. However, the approach of considering both raw amplitude and p-value is uncommon and requires yet another arbitrary cutoff, which complicates any genewise ranking and side-by-side comparison with other methods. Thus, we have decided to not perform this analysis, and instead mention what we see as the advantage of LimoRhyde2 in the Discussion (paragraph 2).

      1. I am also not completely convinced of the author's approach to compare their tool against BooteJTK. P-values only show ordering when the alternative hypothesis is true. P-values under the null hypothesis are uniformly distributed in [0,1] so would be meaningless for the purpose of ordering. Without knowing the ground-truth, ordering by p-values is rather risky. I understand the authors' difficulty. But maybe point 4 above yields a better evaluation strategy for LimoRhyde2.

      If one accepts that these datasets have a non-zero number of “true” rhythmic genes, which to us seems more than reasonable, then we don’t see this is a large issue. Ranking by (adjusted) p-value is also the standard in differential expression analyses.

      1. (OPTIONAL) LimoRhyde2 orders results by the point estimates of the effect-sizes (amplitudes). Is this biologically the most meaningful? Should the effect-size CIs be ordered at all? Maybe we only care about whether the lower limit of the CI is greater than a chosen threshold without any ordering. A discussion of this would be valuable to a user.

      We discussed this issue amongst ourselves as well, and ultimately elected for simplicity in ranking by only the point estimate and not the credible interval. We have now mentioned this issue in the penultimate paragraph of the Discussion.

      1. (OPTIONAL) If indeed the authors want to move away from p-values, one could argue that most of the insights from p-value analysis are or could be biased. So why compare against ordering by p-values at all in the results?

      We are not arguing that results from p-value-based analyses are biased. We seek to show the differences on real data between an analysis based on p-values, the dominant approach in the field, and one based on estimated effect sizes. We believe this has greater potential to promote thoughtful progress than does outright rejection of p-values based on a purely theoretical argument.

      Minor Comments

      1. In page 3, it is unclear why averaging the three fits is the best thing to do? How bad would the performance be if m = 1 was chosen compared to m=3.

      We have elaborated the relevant section of the Methods. For most genes in most datasets, the difference between m=1 and m=3 wasn’t much. However, m=1 tended to go noticeably sideways for some of the most rhythmic genes, depending on the relative locations of timepoints and spline knots, whereas m=3 did not.

      1. In page 4, "To account for this uncertainty, LimoRhyde2 constructs..." was difficult to understand and sounded arbitrary. Please explain further.

      We have revised this sentence.

      1. Lachmann et al. (2021) also use bootstrap confidence intervals rather than p-values to quantify rhythmicity that ought to be mentioned.

      We have now cited this paper in the Introduction.

      Significance Comments

      1. General assessment: The authors present an exciting new way of viewing results of high-throughput data analysis in the context of biological rhythms using a Bayesian-like approach. Previously work has revealed the flaws in focusing on p-values and how focusing of effect-sizes (in this context amplitudes) can yield more robust, reproducible results. Although this promises to also yield more biological meaningful results, it is unclear from this study how this might be.

      See reply to Major Comment 3 above.

      1. Advance: This study presents the first tool in the context of the rhythm analysis to provide prediction intervals for different rhythm parameters to facilitate a move away from the hypothesis testing framework of p-values. This is a technical advance in the field of rhythm analysis, but it is unclear what insights this could yield.

      See reply to Major Comment 6 above.

      Reviewer 2

      Major Comments

      1. The manuscript introduces a new tool to select rhythmic genes and to quantify amplitudes and phases. The authors combine splines, linear regression, Bayes sampling, and Mash. They focus on amplitudes instead p-values as in other packages. The performance and independence of JTK methods are illustrated using selected circadian expression profiles from different mammalian tissues. The paper is clearly written and provides a valuable extension of existing tools. I miss, however, an intuitive explanation of Mash.

      Thank you.

      1. I agree with their claim that amplitudes are quite important for physiological regulations. However, p-values are also helpful to explore, e.g., transcription factor binding sites. Moreover, amplitudes are taken into account in many studies (see e.g. papers of Naef, Korencic, Westermark, Ananthasubramaniam...). Since JTK or RAIN are non-parametric methods amplitudes are not in focus. The authors should discuss the biological relevance of amplitudes more clearly.

      Thanks for raising this point. We are careful to limit our claims to bulk transcriptome data, and have tried to cite the relevant prior work. We have revised the Discussion to clarify what we see as the potential value of amplitudes, as illustrated by our analysis.

      1. The selection of the 3 data sets and of specific genes seems reasonable since a range of technologies (microarrays versus RNS-seq), of durations (1 day versus 2 days), and of gene amplitudes are represented. Still the authors should comments their selections of data sets and genes.

      We have added justification for our choices.

      1. I find also the tissue-dependent phase distributions of clock-controlled genes of interest. However, a comparison with other studies (Zhang, GTEx from Talamanca et al.) and a discussion how amplitude thresholds such as 10%, 25%, 50% affect the phase distributions would be valuable.

      Thank you for the suggestion. We initially explored several values of the amplitude threshold for those histograms (Figure S4C) before selecting the top 25%, all led to the same conclusion. We consider this a minor issue and tangential to the main point of the paper, so we have left the figure as is. We invite any interested reader to explore the publicly available results.

      Reviewer 3

      Summary

      The authors developed LimoRhyde2, a method for quantifying rhythmicity in genomic data, and applied it to mouse transcriptome data from liver, lung, and suprachiasmatic nucleus (SCN) tissues. The method uses periodic spline-based linear models and an Empirical Bayes procedure (Mash) to produce posterior fits and rhythm statistics. LimoRhyde2 prioritizes high-amplitude rhythms of various shapes rather than monotonic rhythms with high signal-to-noise ratios, which contrasts with previous methods like BooteJTK. The authors demonstrated the value of LimoRhyde2 in quantifying rhythmicity and highlighted some of its advantages over traditional methods. However, they also acknowledged limitations, such as the inability to compare rhythmicity between conditions and the assumption of fixed rhythms.

      Major Comments

      1. The key conclusions are convincing, as the authors demonstrated LimoRhyde2's ability to fit non-sinusoidal rhythms and prioritize high-amplitude rhythms over monotonic rhythms with high signal-to-noise ratios. This is shown by the comparison with BooteJTK, a popular method in the field, and by the analysis of real circadian transcriptome data from mouse tissues. However, the authors acknowledged some limitations that could impact the method's broader applicability.

      Thank you.

      1. Data and methods are presented in a reproducible manner, with detailed descriptions of the periodic spline-based linear models, the use of Mash for moderating raw fits, and the calculation of rhythm statistics. This information is sufficient for other researchers to replicate the study and apply the LimoRhyde2 method to their own datasets. The code is available already.

      Thank you.

      1. Adequate replication and statistical analysis are provided, with the authors analyzing the same datasets using both LimoRhyde2 and BooteJTK to compare their performance. The use of Spearman correlation to assess the relationship between the adjusted p-values from BooteJTK and the amplitudes from LimoRhyde2 further supports the statistical rigor of the study.

      Thank you.

      Minor Comments

      1. Addressing LimoRhyde2's limitations would help improve the study.

      We have extensively addressed the method’s limitations to the best of our knowledge in Discussion paragraphs 6 and 7.

      1. Authors could provide more details on how LimoRhyde2 could be applied to single-cell RNA-seq data to improve the presentation. Single-cell quantification over time would be a challenging task, so some insight into this would be appreciated, rather than a brief comment at the end of the paper.

      Thank you for your interest in this topic. To do it justice, however, requires its own project and paper, so scRNA-seq is beyond the scope of the current paper.

      Significance Comments

      1. This study represents a technical advance in the field of genomic analysis of biological rhythms by introducing LimoRhyde2, a method that prioritizes high-amplitude rhythms and directly estimates biological rhythms and their uncertainty. The method's ability to capture non-monotonic rhythms and account for uncertainty makes it a valuable tool for researchers interested in understanding circadian systems and their physiological impact.

      1. The work is placed in the context of existing literature, as the authors compare LimoRhyde2 with BooteJTK, a refinement of the popular JTK_CYCLE method. The comparison highlights the differences in output, prioritization, and runtime, demonstrating LimoRhyde2's potential advantages over traditional methods in the field.

      2. However, BooteJTK is relatively underused compared to many other methods, partly because of the difficulty and time required to run the analysis. The paper would be improved by comparing LimoRhyde2 to JTK_Cycle itself, as well as RAIN and ARSER. The latter are the most commonly used methods for rhythm detection, and thus the value of the paper's findings would be far greater by comparing to these methods. Like LimoRhyde2, they are also not resource-intensive to run.

      Thanks for your feedback on this point, which is one we discussed at length amongst ourselves. In the end, we decided on BooteJTK because it seems to be the best performing version of the most common method. ARSER and RAIN are simply not the standard, and based on our interpretation of the evidence, not generally superior to JTK. If we had selected the vanilla JTK_Cycle, we felt a reviewer could discard our results by saying "well, they're comparing their method to a version of a method known to be flawed". Given our objective to highlight the differences between prioritization based on estimated effect size and prioritization based on p-value, we do not see the value of including additional methods in the analysis.

      1. LimoRhyde2's ability to efficiently prioritize large effects with functional significance in the circadian system can provide valuable insights for these researchers and advance the understanding of biological rhythms. The LimoRhyde2 approach is different to conventional reliance on arbitrary p- or q-values, which are taken as almost sacrosanct in the field as a measure of a dataset's worth. LimoRhyde2 could thus help to change this false perception of how to rate a circadian rhythm, which has particularly been ushered in by a reliance on JTK_Cycle p- and q-values as the method of choice for assigning meaningfulness to rhythms. Unfortunately, JTK_Cycle is very conservative and is limited to detecting sinusoidal-type rhythms. LimoRhyde2 could overcome these limitations (as RAIN does too) if widely adopted. However, to do this, it must be compared to things like JTK_Cycle directly.

      See reply to Significance Comment 3 above.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this study, Obodo et al. present a new iteration of their popular rhythm analysis tool LimoRhyde. The conceptual advancement in this new iteration is the focus on effect sizes (in the form of point estimates of amplitude and their prediction intervals) rather than the p-values, which has been the predominant form of statistical testing for rhythm analysis. Therefore, compared to a well-established non-parametric method for rhythm testing, LimoRhyde2 selects genomic features with larger amplitudes (effect-sizes) as it is designed to do.

      Major Comments:

      1. (LimoRhyde2 algorithm, Page 2-) It is unclear what exactly the contributions/advancements of the authors are? Is it a novel statistical method, the combination of well-established tools in a novel workflow, or is it a novel application to a new field (rhythms)? I am afraid the sentence "LimoRhyde2 builds on previous work by our group and others to rigorously analyze data from genomic experiments [9,16,17], capture non-sinusoidal rhythms [18], and accurately estimate effect sizes [14,19]." is rather ambiguous.
      2. (Moderate model coefficients, Page 3-) The authors implement empirical Bayes shrinkage on the coefficients. But the state-of-the-art methods used in LimoRhyde2 for linear model fitting, such as DESeq2/limma-voom/limma-trend, already implement shrinkage for the coefficients. Does algorithm implement a second round of Bayes shrinkage on the rhythm effect-sizes? How or why is this a statistically valid procedure? If not, how does Limorhyde2 add to shrinkage already implemented in DESeq2/limma-voom/limma-trend? Please elaborate.
      3. I think the goal to move to effect-sizes which lead to more reproducible results and better biological significance is sound and highly appreciated. However, to make the community switch to a completely different way of viewing their genomic analysis requires more convincing examples(s)/use-cases on why they should abandon the old method that they are used to. Now, results section merely shows that this algorithm performs as designed (to find large amplitude rhythms).
      4. Related to point 3, others have previously proposed using amplitude (effect-size) thresholds in addition to the p-value cutoffs (Lück & Westermark, 2016, Pelikan et al, 2022), how would the results of Limorhyde2 compare in a fairer contrast where both p-value and amplitude thresholds are implemented? Does the proposed sound method outperform the two-step approach. The authors may perform this analysis on their chosen datasets as well.
      5. I am also not completely convinced of the author's approach to compare their tool against BooteJTK. P-values only show ordering when the alternative hypothesis is true. P-values under the null hypothesis are uniformly distributed in [0,1] so would be meaningless for the purpose of ordering. Without knowing the ground-truth, ordering by p-values is rather risky. I understand the authors' difficulty. But maybe point 4 above yields a better evaluation strategy for LimoRhyde2.
      6. (OPTIONAL) LimoRhyde2 orders results by the point estimates of the effect-sizes (amplitudes). Is this biologically the most meaningful? Should the effect-size CIs be ordered at all? Maybe we only care about what whether the lower limit of the CI is greater than a chosen threshold without any ordering. A discussion of this would be valuable to a user.
      7. (OPTIONAL) If indeed the authors want to move away from p-values, one could argue that most of the insights from p-value analysis are or could be biased. So why compare against ordering by p-values at all in the results?

      Minor Comments:

      1. In page 3, it is unclear why averaging the three fits is the best thing to do? How bad would the performance be if m = 1 was chosen compared to m=3.
      2. In page 4, "To account for this uncertainty, LimoRhyde2 constructs..." was difficult to understand and sounded arbitrary. Please explain further.
      3. Lachmann et al. (2021) also use bootstrap confidence intervals rather than p-values to quantify rhythmicity that ought to be mentioned.

      Significance

      General assessment:

      The authors present an exciting new way of viewing results of high-throughput data analysis in the context of biological rhythms using a Bayesian-like approach. Previously work has revealed the flaws in focusing on p-values and how focusing of effect-sizes (in this context amplitudes) can yield more robust, reproducible results. Although this promises to also yield more biological meaningful results, it is unclear from this study how this might be.

      Advance:

      This study presents the first tool in the context of the rhythm analysis to provide prediction intervals for different rhythm parameters to facilitate a move away from the hypothesis testing framework of p-values. This is a technical advance in the field of rhythm analysis, but it is unclear what insights this could yield.

      Audience:

      This will be useful to all chronobiologists (clinical and basic research) who use high-throughput genomic assays. Since this is an open R-package, I suspect most of those who want to will be able to easily use it. My expertise is in chronobiology, data science and systems biology.

    1. Author Response

      Reviewer #2 (Public Review):

      The paper by Arribas et al. examines the coding properties of adult-born granule cells in the hippocampus at both single cell and network level. To address this question, the authors combine electrophysiology and modeling. The main findings are:

      Noisy stimulus patterns produce unreliable spiking in adult-born granule cells, but more reliable responses in mature granule cells.

      Analysis of spike patterns with a spike response model (SRM) demonstrates that adult-born and mature GCs show different coding properties.

      Whereas mature GCs are better decoders on the single cell level, heterogeneous networks comprised of both mature and adult-born cells are better encoders at the network level.

      Based on these results, the authors conclude that granule cell heterogeneity confers enhanced encoding capabilities to the dentate gyrus network.

      Although the manuscript contains interesting ideas and initial data, several major points need to be addressed.

      Major points:

      1) The authors use and noisy stimulation paradigm to activate granule cells at a relatively high frequency. However, in the intact network in vivo, granule cells fire much more sparsely. Furthermore, granule cells often fire in bursts. How these properties affect the coding properties of granule cells proposed in the present paper remains unclear. At the very least, this point needs to be better discussed.

      In vivo whole cell recordings of granule cells are very scarce. In our study, we based the design of our stimulus on recordings from the intact network in vivo (PerniaAndrade and Jonas 2014), which show that granule cells receive a wide range of frequencies, with a power spectrum that exhibits a power law decay. These properties are built in our noisy stimuli. These in vivo recordings have also reported the presence of theta oscillations, showing a peak in the spectrum. However, in our approach we deliberately removed these oscillations from our stimuli because it is best to fit GLMs using white noise or noise with an exponentially decaying autocorrelation (Paninski et al. 2004).

      Thus, our choice of the stimuli is far from arbitrary, but rooted on experimental evidence from intact network in vivo recordings, together with previous knowledge about GLM/SRM fitting. This comment reveals to us that we did not clarify this enough in the manuscript. We are grateful to the reviewer for revealing this omission, since this is in fact an important aspect of the study strategy. In the revised manuscript, we brought these points up front in the results section when we introduce the stimulus for the first time, and more thoroughly discussed it in the Methods section that describes the stimulus.

      Still, the bursts observed in granule cells are an important feature and they have been observed to be phase locked to the theta-gamma oscillations in vivo (Pernia-Andrade and Jonas 2014). In the revised version of the manuscript we included new experiments and simulations with stimuli that include a peak in theta frequency. We found that immature neurons also improve decoding performance with these theta modulated stimuli.

      2) The authors induce spiking in granule cells by injection of current waveforms. However, in the intact network, neurons are activated by synaptic conductances. As current and conductance have been shown to affect spike output differently, controls with conductance stimuli need to be provided. Dynamic clamp is not a miracle anymore these days.

      The use of dynamic clamp sounds in principle like a good suggestion. However, in the manuscript we have taken a different approach to enable the use of a single neuron GLM that uses currents as inputs. To control for the differences between mature and immature neurons we used currents with amplitude normalized by the input resistance, and both types of neurons were measured with the same technique to allow for the comparison.

      Importantly, the GLM type model that we use assumes that the membrane potential is a linear convolution of the input, which permits a straightforward and robust fitting approach. We argue that this is not a minor issue, since using dynamic clamp would require a drastic modification of the model. Furthermore, the use of conductance stimuli would not allow for the straightforward model fitting we perform with our approach. The key point here is that the membrane potential would not be correctly approximated as a linear function of the conductance stimulus, precluding the fitting strategy.

      Finally, at the moment we do not have the equipment to perform the suggested experiment, so this suggestion would require a big amount of time to acquire the equipment and set up the experiments in mature and immature neurons. In addition, we would have to change the model and develop a different fitting strategy. With the controls that we already have in the manuscript, we do not think dynamic clamp experiments would fundamentally change the conclusions of the manuscript. Thus, we argue that this is beyond a reasonable timeframe for this revision, but could be something to further explore in future. We now mention this possibility in the discussion.

      3) The greedy procedure is a good idea, but there are several issues with its implementation. First, it is unclear how the results depend on the starting value. What we end up with the same mixed network if we would start with adult-born cells? Second, the size of the greedy network is very small. It is unclear whether the main conclusion holds in larger networks, up to the level of biological network size (1 million). Finally, the fraction of adult-born granule cells in the optimal network comes out very large. This is different from the biological network, where clearly four or five-week-old granule cells cannot represent the majority. Much more work is needed to address these issues.

      The reviewer approves the greedy procedure that we apply in our manuscript and poses three issues for consideration.

      First, the reviewer queries what would be the result of starting the procedure with a different pool of simulated neurons, and whether we would obtain “the same mixed network if we would start with adult-born cells”. Let us remark that the outcome of the greedy procedure is not always the same mixed population of neurons. For each different mature neuron that we use to start the procedure, the trajectory (see Fig. 4A) of selected neurons will be different. Thus, the final population (network) will be different, and this is reflected in the error bars that we obtain in Fig. 4. Presumably, starting with adult-born cells will change the outcome of the greedy procedure. However, note that this is not the point of the approach. The motivation to start with mature neurons is to ask whether adult-born cells can contribute something to decoding, given that mature cells on their own perform better.

      Second, the reviewer questions the size of the population that we reach with the greedy procedure. Note that for the population sizes that we show in the manuscript the decoding performance already begins to saturate, Fig. 4F-H. Furthermore, it is unfeasible to construct a 1M neurons population due to the computational cost –the time it takes to run the algorithm. These two facts motivated us to stop at 12 neurons as it strikes a good balance between computational time and saturation. Importantly, as we expand below, the aim of the greedy procedure simulation is not reconstructing the actual network of the dentate gyrus. Rather, we seek to understand whether immature neurons could improve coding in a population.

      Third, the reviewer observes that the fraction of adult born cells in the reconstructed populations using the greedy procedure are large as compared to the biological network. Again, here note that the aim of the whole in-silico experiment is not to recover the biological network, where other aspects are at play. More simply, we query the possible contribution of adult born cells to coding. In fact, if we obtained the same proportion it would be by chance, since we do not think that adult-born cells in the dentate gyrus are chosen according to the greedy algorithm.

      Still, this comment from the reviewer motivated us to include further simulations of the greedy procedure with constraints. In the revised manuscript we show new results using the greedy procedure, but constraining the fraction of immature neurons in the resulting populations, see Figure 4-figure supplement 2.

      More generally, we think that these comments reveal a possible misunderstanding about the approach, its purpose and the interpretation of the results. The point of the greedy procedure is to show that immature neurons do in fact contribute to improve the decoding, despite being generally worse individually. We do not claim that the population obtained with the greedy procedure faithfully reflects the actual shape of the in vivo network. We are aware that it does not. We see that this may have not been clear in the original version. In the revised version, we now explain the purpose of the greedy procedure when we introduced it. Additionally, we comment on the proportion of immature neurons in the same paragraph.

      4) Likewise, the idea of dynamic pattern separation seems quite nice. However, the authors focus on the differences between mixed and pure networks, which are extremely small. Furthermore, the correlation coefficients of "low", "medium", and "high" correlation groups are chosen completely arbitrarily. A correlation coefficient of 0.99, considered low here, would seem extremely high in other contexts. Whether dynamic pattern separation is possible over a wider range of input correlation coefficients is unclear (see O'Reilly and McClelland, 1995, Hippocampus, for a possible relationship). Finally, aren't code expansion and lateral inhibition the key mechanisms underlying pattern separation? None of these potential mechanisms are incorporated here.

      The reviewer positively appreciates the idea of the pattern separation task that we propose in the manuscript, and poses some questions concerning the extent of the contribution of adult-born neurons.

      We agree that code expansion and lateral inhibition are key mechanisms for pattern separation in the DG, and we do not claim that adult-born neurogenesis is the key mechanism behind pattern separation. Rather, in our work we explore the role of adultborn immature neurons in coding in general, and in pattern separation in particular, given that it’s a commonly attributed function to the DG.

      We note that the correlation in O'Reilly and McClelland 1994 (actually, what they call pattern overlap) is of a very different nature than the one we compute in our work. They compute the overlap between different patterns of activation in a population of neurons, that is the probability that a single neuron is active in two different patterns of activation. In our manuscript we compute the correlation between different continuous time-varying stimuli that stimulate single neurons.

      Importantly, previous work has shown that ablating neurogenesis particularly affects fine spatial discrimination, that is when the separation between patterns is small, but not when it is large (Clelland 2009, Science). Hence, we were actually expecting the impact of adult-born neurons to be important only for relatively large correlation coefficient values.

      In the revised manuscript, we now explain the rationale for the choice of correlation values, both in the main text when we introduce the task, and in the Methods when we set the values for the low, medium and high correlation classes. We also added a sentence to the discussion on pattern separation, bringing in the importance of the ideas of lateral inhibition, code expansion, and the work of O’Reilly 1994.

      5) A main conclusion of the paper is that while mature GCs are better decoders on the single cell level, heterogeneity in mixtures improves coding in neuronal networks. However, this seems to be true only for r^2 as a readout criterion (Fig. 4F). For information, the result is less clear (Fig. 4G). The results must be discussed in a more objective way. Furthermore, intuitive explanations for this paradoxical observation are not provided. Saying that "this is an interesting open question for future work" is not enough.

      This is an interesting point raised by the Reviewer. While r^2 is quantified by comparing the decoded stimuli with the true stimuli, mutual information is related to the uncertainty about the decoding. That is, it quantifies the correspondence between decoded and true stimuli, but does not tell us whether it is a good approximation to it. For example, a decoder could achieve perfect mutual information but result in a poor reconstruction by performing a perfectly scrambled one-to-one mapping of the true stimulus [Schneidman et al. 2003], see also our reply to point [5] by Reviewer #1 above.

      We agree that this is an important point and we realize that it was not clear in the original version of the manuscript. In the revised manuscript we added some sentences to clarify this point.

      6) The authors ignore possible differences in the output of mature and adult-born granule cells in their thinking. If mature and adult-born granule cells had different outputs, this could affect their contributions to the code (either positively or negatively). At the very least, this possibility should be discussed.

      Newborn neurons contact the same targets as mature neurons, born during development: pyramidal cells in CA3, and interneurons in CA3 and the DG. During the maturation, there is a sequence of connectivity with CA3 and within the DG (Toni et a. 2008). At 4 weeks, newborn cells are already contacting their postsynaptic targets. Still, there may be subtle differences in the strength of these connections compared to mature neurons.

      So, although the targets are the same, there may be quantitative differences in the way they contribute to the code. Thus the point raised by the reviewer is interesting, so we decided to discuss it further in the revision.

    1. Author Response

      Reviewer #1 (Public Review):

      This study used intersectional genetic approaches to stimulate a specific brainstem region while recording swallow/laryngeal motor responses. These results, coupled with histology, demonstrate that the PiCo region of the IRt mediates swallow/laryngeal behaviors, and their coordination with breathing. The data were gathered using solid methods and difficult electrophysiological techniques. This study and its findings are interesting and relevant. The analysis (and/or the presentation of the analysis) is incomplete, as there are analyses that need to be added to the manuscript. The interpretation of the data is mostly valid, but there are claims that are too speculative and are not well-supported by the results. The introduction and discussion would benefit from more citations and a deeper exploration of how this study relates to other work - especially a thorough accounting of and comparison to other studies concerning putative swallow gates.

      General/major concerns:

      The field of respiratory control is far from unified regarding the role of PiCo in breathing or any other laryngeal behaviors. If anything, the current consensus does not support the triple-oscillator hypothesis (in which PiCo is one of 3 essential respiratory oscillators). The name "PiCo", short for "post-inspiratory complex", suggests a function that has not been well-supported by data - it is a putative post-inspiratory complex, at best. I suggest putting this area in context with other discussions i.e. IRt (such as in Toor et al., 2019) or Dhingra et al. 2020 showed broad activation of many brainstem sites at the post-I period (including pons, BotC, NTS)

      The reviewer’s comment refers to our previous publication and not the present one. With all due respect to the reviewer, the submitted study investigates PiCo’s involvement in swallow and laryngeal activation and its coordination with breathing.

      We did not feel that it is appropriate for us to critique the Dhingra paper in the present study. However, since this seems to be important to this reviewer, we would like to clarify: Because of filter characteristics, and the low temporal and spatial resolution of these field recordings, the approach used by Dhingra is inappropriate for providing insights into the presence or absence of PiCo. We therefore developed an alternative approach, which provides more detailed insights into population activity, the Neuropixel approach. This Neuropixel recording from PiCo (black trace) exemplifies how field recordings (yellow) fail to pick up post-I activity. We could provide many more examples, but as stated above, addressing the study by Dhingra is tangential to the present study.

      We would also emphasize that the study by Dhingra was never designed to provide negative evidence, and Dhingra et al. never claimed that their study demonstrates the absence of PiCo. Unfortunately, the data by Dhingra were misinterpreted by Swen Hülsmann in his Journal of Physiology editorial which created considerable confusion, but also sensation in the field. Objectively, Toor et al reproduced the Anderson study in rats as we will elaborate below. Unfortunately, Toor et al added to the confusion, by renaming the PiCo area into IRt. The field of respiration would have also been confused if the first study reproducing the Smith et al. 1991 study in a different rodent species would have refused to call this area preBötC and instead would have called it e.g. ventrolateral reticular field.

      Did you perform control experiments in which the opto stimulations were done on animals without the genetic channels (for example, WT or uncrossed ChAT-ires-cre, etc.), or in mice with the genetic channels that weren't crossed (uncrossed Ai32 mice)? If so, please include. If not, why?

      Yes, we performed many control experiments. Aside of many recordings in which viral injections were targeted outside PiCo, we also performed optogenetic stimulations in mice lacking channelrhodopsin. We have now added the following statements and supplemental figure.

      Optogenetic stimulation in mice lacking channelrhodopsin

      Stimulation of PiCo, across all stimulation durations, in 3 Ai32+/+ mice and 4 ChATcre:Vglut2FlpO:ChR2 mice where the ChR2 did not transfect ChATcre:Vglut2FlpO, as confirmed by a post-hoc histological analysis, resulted in no response (Fig. S3).

      How do you know that your opto activations simulate physiological activation? First, the intensive optical activation at the stim site does not occur in those neurons naturally.

      This seems like a generic critique of the optogenetic approach. In none of the 10,000+ published optogenetic studies is it known to what extent optogenetic activation stimulates exactly the same neurons and the same degree of activity as during a natural behavior. What we know is that PiCo neurons are activated during postinspiration (Anderson et al. 2016) and that optogenetic activation stimulates these neurons and that this activation evokes the same muscles in the same temporal sequence as a water-evoked swallow. We assume that the reviewer’s comment does not intend to imply that “swallows” evoked by nonspecifically stimulating the SLN is more physiological than the optogenetically-evoked swallows of a specific neuron population? From the reviewer’s other comments, it is obvious that the reviewer has no problems with the results of the Toor study that used exclusively SLN stimulations, an approach which is known to be very non-specific.

      Doing a natural (water) stim for comparison is good, but it cannot necessarily be directly compared to the opto stim. The water stim would activate many other brainstem regions in addition to PiCo.

      Can the reviewer provide any hard evidence that “many other brainstem regions” are activated by water stimulation in comparison to optogenetic stimulation?

      A caveat is that opto PiCo stim =/= water stim (in terms of underlying mechanisms) should be included. Second, in looking at the differences between water vs opto swallows in Table S2: it appears that the ChAT animals (S2A) have something weaker than a swallow with opto stim. For the Vglut2 and ChAT/Vglut2 (S2B&C), the opto swallows also aren't as "strong" as the water swallows (the X and EMG amplitudes are smaller). The interpretation/discussion attributes this to the lack of sensory input during opto stim, but does not mention the strong possibility that there is a difference in central mechanisms occurring. It also seems to be dismissed with the characterization of the swallow as "all-or-none" (see note on Fig 3 results).

      With all due respect, we are somewhat surprised that the reviewer dismisses the entire paragraph in the discussion that specifically addresses the comparison between water-swallows and PiCo-stimulated swallows. We discussed the possibility that PiCo stimulated swallows may not activate the full pathway/mechanism as does the water swallow. We carefully compared and confirmed that PiCo-stimulated swallows have the same temporal motor sequence of the same muscles as those activated in water swallows. As already stated, it is surprising that the reviewer has no problem with accepting the validity of previously published methods like electrical non-specific stimulations of the cNTS or SLN, a frequently used and accepted model to produce and study swallow.

      The writing needs extensive copy editing to improve clarity and precision, and to fix errors.

      Thank you for this comment, we have revised and reviewed the writing.

      Results/Fig 1: What proportion had no/other motor response (non-swallow, non-laryngeal) to the opto stim? I can extrapolate by subtraction, but it would be nice to see the "no/other response" on the plot.

      With all due respect to this reviewer, but it is not possible to address this question. Specifically, it is not possible to know if a “No response” (meaning “no behavioral output” occurred in response to PiCo stimulation), would have resulted in a swallow or laryngeal activation. However, figure 2 contains responses other than swallows, i.e. “non swallows”, which includes both laryngeal activation as well as “no responses” meaning “no behavioral response” in response to PiCo stimulation. This was determined to assess how the respiratory rhythm is affected when a swallow is not produced by PiCo stimulation.

      The explanation of genetics is too spread out and confusing. There needs to be more detail about all the genetic tools used, using the standard language for such tools, in one spot. Please also provide a clear explanation of what those tools accomplish. Include a figure if necessary.

      We apologize for creating confusion. We added more explanations to the text.

      Pick a conventional designator/abbreviation for the different strains, define them in the methods and in the first paragraph of the results section, and use those abbreviations throughout. I think that using ChAT as an abbreviation for your ChAT-ires-cre x Ai32 mice is confusing because it makes it sound like you're talking about the enzyme rather than the specific strain/neurons. Saying "ChAT stimulated swallows... swallows evoked by water or ChAT" makes it sound like the enzyme choline acetyltransferase itself is stimulating swallow. As is convention, pick a more precise abbreviation like ChAT-cre/Ai32 or ChAT:Ai32 or ChAT-ChR2 or ChAT/EYFP. This goes for the other strains as well.

      Thank you for pointing this out. To avoid confusion the strains/neurons are now referred to as: ChATcre:Ai32, Vglut2cre:Ai32, and ChATcre:Vglut2FlpO:ChR2

      For Fig S2C&D, why does it say mCherry? Isn't it tdTomato? Is it just an anti-ChAT antibody and then the tdTomato Ai65 is only labeling Vglut2? I don't see this in the methods section.

      Thank you for pointing this out. We apologize for our mistake, and we have corrected the manuscript to say tdTomato.

      I also don't see methods for all the staining in Fig S3. The photomicrograph says Vglut2-cre Ai6, but there's no mention of Ai6 anywhere else. Which mice are these? Did you cross Vglut2-cre with an Ai6 reporter mouse? How can you image an Ai6 mouse (which I assume expresses ZsGreen? and that you excited at 488?) and a 488 anti-goat in the same section (that's the only secondary listed in the methods that would work with your goat anti-ChAT)? Is there an error in listing the fluorophores in the methods? Please give more details on the microscopy including which filters were used for the triple staining.

      We have decided to remove the CTb data from the manuscript.

      Regarding the staining: I would expect the staining/maps in for the 2 different ChAT/Vglut2 intersectional strains to be similar (Fig 5A/B and S2C/D). The photomicrographs look very different to me, while the heat maps (this goes for all the heat maps in the paper) have barely distinguishable differences. In Fig 5, the staining looks much stronger than in Fig S2C. Why does it look like there are so many more transfected neurons in Fig 5A2 than there are red neurons in the corresponding panel Fig S2C2? And for Fig 5A4 and Fig S2C44? The plot and results text for Fig 5 says the avg number of neurons was 123+¬11. The plot for Fig S2D says 112+¬15, but the results text says 242+¬12 (not sure which is the correct number).

      Thank you for your comments. Previously the heat maps had different scale bars if you compare Fig 5A/B and S2C/D (now figure S4C/D). We changed the heat maps keeping the same scale for all of them. Discussing the representative photomicrography, even figure Fig 5A/B and S4C/D represents the same cluster of cells (PiCo Chat/Vglut+). Figure S4D states 242 ± 12 neurons (also stated in the results section).

      However, we want to point out that there are several technical differences between both, 1) figure 5A represents the transfection promoted by the virus injection, impacting the number of cells stained/transfected (133 ± 16 neurons), 2) figure S4C/D represents a intersectional mouse ChATcre: Vglut2FlpO: Ai65; (242 ± 12 neurons). In this case, we have more tdTomato positive cells because this genetic approach is able to detect most of the Chat and Vglut2 cells. The difference between figures is considered normal for anatomical studies, in some studies the same bregma can show different number of cells. Thus, the differences are due to the differences in the type of approaches (viral expressions vs. intersectional approach).

      We have also added additional experiments to figure 5 (now N=7) which has been reflected in the text and figures.

      The results text for Fig S2C also says the staining is "similar to the previous ChAT staining...", which I assume refers to S2A/B. The plot and results text for Fig S2B reports 403+¬39 neurons, while S2D is either 112 or 242 (not sure?). The plots have different Y scales, which should be changed to be the same. But why do the photomicrographs and the heat maps look so similar? I would expect far fewer neurons to be stained in the intersectional mice (Fig 5 and Fig S2C/D) than in the ChAT staining (Fig S2A/B). I am having trouble reconciling the different presentations/quantifications and making sense of the data in these histology figures.

      We removed “similar to the previous ChAT staining” and we have reviewed the heat maps. Since the original submission, we performed more experiments and now added more animals to the analysis (now N=7), each heat map represents the correct number of neurons in PiCo, respectively to each experiment.

      The Y scales has been adjust to better demonstrate the Chat staining vs. the intersectional mice triple conditioned.

      How can you distinguish PiCo from non-PiCo in the histology, especially in the ChAT-only staining? It seems that you have arbitrarily defined the PiCo region, and only counted neurons within that very constrained area.

      Even in ChAT-only staining, the N.ambiguus is very distinct from the cholinergic neurons located more medial to the N.ambiguus. This can be unambiguously be confirmed by combining ChAT with glutamatergic in situ staining as done in the Anderson et al. study, or unambiguously be demonstrated with the viral approach as done in the present study. Thus, we don’t see why it is arbitrary to define the distribution of PiCo neurons. What is arbitrary is the definition of the preBötC, yet the field of respiration seems to have no problem with this. We assume that the reviewer knows that Dbx1 neurons are spread along the entire ventral respiratory column and dorsal portion of the PreBötzinger Complex up to the level of the XII nucleus. Yet it is commonly accepted for authors to refer to the PreBötzinger Complex by counting dbx1 neurons within a constrained area of what is believed to be the PreBötzinger Complex, even though the borders are arbitrary. It is e.g. known that some of the ventrally located preBötC neurons are presumed rhythmogenic while the more dorsally located Dbx1 neurons are premotor. The transition from rhythmogenic to premotor is gradual. Similarly, NK1 staining, or SST staining is not restricted to the preBötC and it is arbitrary to define where preBötC begins and what to include. Indeed, our PNAS paper indicates that inspiratory bursts can be generated by optogenetically stimulating Dbx1 neurons along the entire VRC column – so it is not clear where the rhythmogenic portion of the preBötC begins rostrocaudally and dorsoventrally and where the rhythmogenic portion and preBötC itself ends. Thus, we want to re-iterate and emphasize, that for the present study, we developed a method using the cre/FlpO approach to unambiguously define the PiCo region. It is surprising that this reviewer does not acknowledge this technical advance that added significantly more specificity to the anatomical and physiological characterization of PiCo, than the Toor et al. study, and also the Anderson et al. study.

      I can see stained neurons in the area immediately outside of PiCo, and I'd like to see lower-magnification images that show the staining distribution in a broader region surrounding PiCo as well, especially in the rest of the reticular formation.

      We characterized the PiCo area based on the histological phenotype and in vitro and in vivo experiments performed by Anderson et al., 2016. PiCo is an area located close to the NAmb, presenting the same ChATcre phenotype. As stated above, the distribution and agglomeration of the NAmb is clearly very compact, and different then the observed ChATcre: Vglut2FlpO: Ai65 neurons located outside of NAmb. It is also important to emphasize, that like is the case for the preBötC, other transmitter phenotypes of neurons are also present in the PiCo region (i.e. GABA or Dbx1). However, the study performed by Anderson et al, 2016 paper, described only the functions of cholinergic neurons located in PiCo, and we always planned to publish a paper of the other neurons within PiCo – this area e.g. contains pacemaker neurons etc. But, I hope that the reviewer acknowledges that many investigators have studied the preBötC for the past 30 years. Hence, much more information has been accumulated on this region (which btw was at least as controversial at the beginning), and it will likely take at least another 30 years to fully identify and characterize PiCo.

      Similarly, how can you be sure you're stereotaxically targeting PiCo precisely (600um in diameter?) with your opto fiber (200um in diameter). Wouldn't small variations in anatomy put the fiber outside the tiny PiCo area?

      We assume the reviewer means “stereotactically”. And yes, the reviewer is correct, it is necessary to position the laser at a consistent anatomical location. Placement of the optical fibers outside of this area does not result in activation of PiCo. We have added an additional supplemental figure (Figure S6) to address this.

      Please put N's and stats results in Table S1 for both swallow and laryngeal activity. I took what I assume to be the Ns (10, 11, and 4) and did some stats like the ones you presented for the laryngeal duration. The differences between vagus duration for 40 and 200 ms pulse durations are all significant for each strain, by my calculations. Also, I think there must be an error in the orange swallow plot in Fig 3A. The orange dots don't correspond to the table values. I plotted all the Table S1 values for each strain. Each line looks similar to the blue laryngeal activation plot in Fig 3A. The slopes of the Vglut2 were less than the other strains, and the slopes for the swallow behavior were less than the laryngeal behavior for all strains. Otherwise, they all look similar. Please double-check your values/stats to address these discrepancies. If it is indeed true that the stim pulse duration affects swallow duration, revise the interpretations and manuscript accordingly.

      We thank the reviewer for the diligence in reviewing our manuscript. But, with all due respect, the reviewer is incorrect and misunderstood the data. To clarify: Table S1 is only presenting data for laryngeal activation, swallow data is presented in Table S2. The orange data points in Fig 3A are not detailed in Table S1 or S2. Table S2 is the average of all swallows across all laser pulse durations since the laser pulse duration does not affect swallow behavior duration. All data will be publically available after publication of the manuscript.

      Figure 3A is only representing the ChATcre:Vglut2FlpO:ChR2 column of Table S1

      The N’s have been added to table S1

      Please add more details on stats in general, including the specific tests that were performed, F values and degrees of freedom, etc.

      Thank you, this has been added throughout the results section. Please refer to the results section for this addition. However below we have provided an example.

      An example: A two-way ANOVA revealed a significant interaction between time and behavior (p<0.0001, df= 4, F= 23.31) in ChATcre:Vglut2FlpO:ChR2 mice (N=7).

      How do you know that you're not just activating motoneurons in the NA when you stimulate your ChAT animals, especially given the results in Fig 1B? In this case, the phase-specific results could be explained by inhibitory inputs (during inspiration) to motoneurons in the region of the opto stim.

      As stated in this paper as well as the Anderson et al 2016 paper (and for that matter also the Toor et al study) this is a caveat. This major caveat motivated the development and use of the ChATcre:Vglut2FlpO:ChR2 (specifically targeting the PiCo neurons that co-express ChAT and Vglut2, not laryngeal motor neurons) experiments that have mostly the same response as the ChATcre:Ai32 mice. We cannot say this is due to inhibitory inputs to laryngeal motoneurons, since the cre/FlpO specific experiments are not directly activating laryngeal motoneurons. But we do not want to entirely exclude that some premotor mechanisms may also occur in PiCo. The reviewer may know that there is overlap of rhythmogenic and premotor functions for the Dbx1 neurons in the PreBötC, But, addressing this issue is beyond the scope of this study. In fact, we are working on a separate connectivity study using novel, still unpublished antegrade and retrograde vectors that do not reveal any direct connections to laryngeal motoneurons. Hence, we expect that the connectivity from PiCo to laryngeal motoneurons is more complex and addressing this question cannot be done as a simple add-on to an already complex study. Again, we would refer to the PreBötzinger complex, where nobody expects that one study can resolve all the physiological and anatomical characterizations that have been accumulated over 30 years in one study. We would argue that in some ways, our cre/FlpO approach is more specific than the Dbx1 stimulations which activates not only rhythmogenetic PreBötzinger complex neurons, but also pre motoneurons as well as glia cells, and many neurons rostral and caudal to the PreBötzinger complex. We are aware of these caveats, and we have discussed this in the original submission, and also in the revision.

      While the study from Toor et al is cited, there needs to be a much more thorough discussion of how their findings relate to the current study.

      Many thanks for asking for a more thorough discussion of Toor et al., which we are happy to provide here. Perhaps we were too polite in our original manuscript to emphasize all the problems in that study.

      They demonstrated that PiCo isn't necessary for the apneic portion of swallow. Inhibiting this region also didn't affect TI.

      Please note – the fact that Toor et al did not find an effect on TI confirms Anderson et al. 2016: In Figure 3G,3F of the Nature paper, the reviewer will find that injections of DAMGO and SST into PiCo inhibited post-I activity without affect inspiratory duration. This figure also shows that the inspiratory burst can terminate in the absence of postinspiratory activity.

      The reviewer states: “They demonstrated that PiCo isn't necessary for the apneic portion of swallow”. With all due respect to this reviewer, this is NOT correct. Toor et al showed that inhibiting PiCo did block SLN-evoked fictive-swallows but not the apnea caused by SLN stimulation. This is not the apnea caused by swallows (which was never studied by Toor), but by the SLN stimulation. The apnea evoked by SLN stimulation has most likely nothing to do with the apnea caused by swallows. Unfortunately, the Toor et al. makes the same misleading claim as the reviewer.

      PiCo cannot be the sole source of post-I timing, and the evidence overwhelmingly favors the major involvement of other regions such as the pons.

      This comment seems to be unrelated to the main thrust of this paper that studies PiCo’s involvement in swallow and laryngeal activation in coordination with breathing. However, since this comment seems to discredit the Ramirez lab in general, we would like to clarify that inhibiting PiCo with DAMGO and SST inhibits post-I activity (Anderson et al 2016, Fig.3G,3F). Thus, we don’t understand the rationale or actual data for the reviewer’s conclusion that PiCo cannot be the sole source of post-I timing? We also don’t understand the basis for the reviewer’s conclusion that “the evidence overwhelmingly favors the major involvement of other regions such as the pons”. We also want to add, that no-where in the Anderson et al. study did we state that the pons plays NO role. Indeed, we specifically stated: “In this context it will be interesting to resolve the role of the PiCo in specific postinspiratory behaviors and to identify how the PiCo interacts with other neural networks such as the Kolliker-Fuse nucleus, a pontine structure that has been hypothesized to gate postinspiratory activity and the periaqueductal grey a structure involved in vocalization and the control of postinspiration”.

      They also showed that inhibition of all neurons (not just ChAT/Vglut) in the PiCo region suppresses post-I activity in eupnea. This suppression was overcome by the increased respiratory drive during hypoxia.

      Before comparisons are made with Toor et al. it is important to note the species and methodological differences between Toor et al. rat anesthetized, vagotomized, paralyzed and artificially ventilated model which evaluated fictive swallows (deafferented and paralyzed). By contrast this study uses a mouse anesthetized, vagal intact, freely breathing model and evaluates natural physiologic swallow via water and central stimulation. It seems that the reviewers does not acknowledge one of the main innovations of this study. For this study we introduced a genetic approach to specifically target and activate ChATcre/Vglut2FlpO PiCo neurons. This has never been done before, and developing this approach took more than 4 years of breeding and crossing and testing different options.

      As for Toor et al., these authors pharmacologically, bilaterally inhibited neurons in the area of PiCo with isoguvacine, a specific GABA-A agonist. Even though this pharmacological intervention does not specifically inhibit cholinergic/glutamatergic neurons in PiCo, these authors essentially confirm the study by Anderson et al. We do not find this finding controversial. Perhaps the reviewer finds the definition of PiCo “controversial”, because Toor et al called the identical area IRt instead of PiCo, even though they exactly reproduce the finding by Anderson. Toor et al. not only arrive at the same conclusion as Anderson but they added more details – none of which is contradicting the results by Anderson et al.: Here are excerpts from the Toor study “We therefore conclude that the ongoing activity of neurons in the IRt contributes to eupneic respiratory and sympathetic post-I activities without exerting significant control on other respiratory or cardiovascular parameters” “IRt significantly inhibited the post-I components of VNA” “IRt inhibition was also associated with a reduction in PNA” “increase in respiratory cycle frequency” “due to a reduction in TE“ “with no effect on TI observed”. “Bilateral microinjection of isoguvacine selectively reduced the magnitudes of post-I VNA and rSNA, but not PNA responses to acute hypoxemia”.

      In this statement the reviewer probably refers to one particular aspect, i.e. the fact that Toor et al. did not significantly block some of the post-I activity – they state: “had no significant effect on the AUC of post-I rSNA (305+/- 24 vs 230+/- 28,p=0.16,n=6)”. Please note that there is a tendency, a reduction from 305-230. Perhaps the Toor study was not sufficiently powered to fully block the effect, perhaps the drug did not inhibit the entire PiCo. These are all open questions that a critical reader should know. The reviewer will agree that it is as difficult if not more difficult to demonstrate the absence of an effect. To arrive at a negative conclusion experiments should be done with the same scrutiny than to demonstrate a positive result. We also assume that the reviewer is familiar with animal experiments and will understand that pharmacological injections are often difficult to interpret, in particular in case of local in vivo injections. It is possible that Toor et al is inhibiting e.g. parts of the Bötzinger complex.

      We have added to the manuscript the following statement: It is important to note that SLN stimulation does not only trigger swallows, but also changes in the overall stiffness and tension of the vocal cords (Chhetri et al., 2013) as well as prolonged hypoglossal activation independent of swallowing (Jiang, Mitchell, & Lipski, 1991). It has been hypothesized that inhibition of the IRt blocks fictive swallow but not swallow-related apnea. Yet this apnea was generated by SLN stimulation and not by a natural swallow stimulation (Ain Summan Toor et al., 2019). It is known that SLN stimulation causes endogenous release of adenosine that activates 2A receptors on GABAergic neurons resulting in the release of GABA on inspiratory neurons and subsequent inspiratory inhibition (Abu-Shaweesh, 2007), suggesting that the SLN evoked apnea may not be the same as a swallow related apnea. Moreover, microinjections of isoguvacine into the Bötzinger complex attenuated the apneic response but not the ELM burst activity (Sun, Bautista, Berkowitz, Zhao, & Pilowsky, 2011), suggesting the Bötzinger complex, not PiCo, could be involved in modulating apnea.

      We would also like to add that our current study characterized swallow-related specific muscles and nerves in both water-triggered and PiCo-triggered swallows to better characterize the physiological properties of this swallow behavior. By contrast, Toor et al. only characterized nerve activities that are involved in multiple upper airway activities and breathing. It is somewhat surprising that the reviewer did not consider the fact that Toor et al. characterized putative swallows that were triggered by SLN stimulation and that Toor et al. were content with nerve-recordings and failed to confirm that the behavior that they evoked is actually a physiological swallow. Which, according to the comments from this reviewer (see above), indicates the possibility of differences in central mechanisms occurring between fictive swallow and physiological swallows.

      While we have cited Toor et al and their truly excellent work in the broad iRt we did not feel it is appropriate to critique them for the fact that they are confusing the field by using a different anatomical term for the area that was clearly defined by us as an area containing cholinergic-glutamatergic neurons. We also did not feel it is appropriate to discuss results that are similar to comparing Apples and Oranges. Toor et al. never specifically manipulated glutamatergic-cholinergic neurons, thus their entire results rest on indirect stimulation affecting this general area – which will unavoidably also include laryngeal motoneurons. We don’t want to criticize this approach, since PiCo is heterogenous, which is another misunderstanding that we find in the reviewers’ critique. We used cholinergic-glutamatergic neurons to define this area. However, like the preBötC, PiCo is also heterogenous. This region contains inhibitory neurons, it also contains glutamatergic neurons that are not cholinergic, and cholinergic neurons that are not glutamatergic. Because of this heterogeneity we compared the effects of stimulating glutamatergic neurons and cholinergic neurons as well as cholinergic-glutamatergic neurons. This is an approach that is generally accepted in the field. As already stated, there is not a single marker that uniquely characterizes the PreBötC. Thus, when stimulating Dbx1 neurons, glutamatergic neurons, or Somatostatin neurons it only captures subpopulations of this region. The recently published study by Menuet et al. in eLife, used even more indirect methods to inhibit preBötC. They used a pan-neuronal CBA promotor that targets neurons irrespective of phenotype. It is not our intention to discredit this very elegant study, but we object the statement that we “have arbitrarily defined the PiCo region”.

      This study has not demonstrated some of the things that are depicted in Fig 7 and included in the discussion. While swallow can inhibit inspiration, there are many mechanisms by which this can happen other than a direct inhibitory connection from the DGS to PreBotC. You cite Sun et al., 2011 findings of "a group of neurons that inhibits inspiration" during SLN stim, but don't mention that it is the BotC and that the paper shows that swallow apnea is dependent on BotC. That is also supported by the Toor study. I don't understand how post-I (aka E) can be discussed without discussion of the BotC - this is a glaring omission.

      We have removed figure 7, which was only meant as a hypothetical schematic.

      Why is it necessary for PiCo to innervate the cNTS?

      This was a hypothesis based on CTb data that we have now removed.

      That is true if the conjecture that PiCo gates swallowing is true, as the cNTS is the only known region for central swallow gating. However, PiCo could influence afferent input to the NTS less directly, and therefore not function as a gating hub per se. The experimental evidence does not warrant the claim that PiCo gates swallowing. The definition of a swallow gate(s) is a topic of much debate and no conclusive experimental evidence has emerged for swallow gating regions to exist anywhere except in the NTS. The current study's evidence also does not meet the criteria necessary to conclusively call PiCo a swallow gate. The authors should soften this claim and language throughout the manuscript.

      Although we do not know of any studies that has optogenetically gated swallow in the cNTS, it seems the reviewer objects our use of the word “gate”. We have revised the manuscript and removed any wording stating PiCo is a swallow “gate”. It would be interesting to know whether the reviewer has the same objections of the use of the word “relay” as done by Toor et al.?

      It is also unclear that PiCo acts directly on the swallow pattern generator to gate swallowing. It is not just "conceivable that the gating mechanism involves" the pons, but nearly certain. Swallow gating by respiratory activity may not be able to be ascribed to one particular location. At a minimum, it likely involves the NTS/DSG, pons, and possibly IRt (inclusive of PiCo). The authors are correct that "further studies are necessary to understand the interaction between PiCo and the pontine respiratory group on the gating swallow and other airway protective behaviors." This is why it shouldn't be stated that "this small brainstem microcircuits acts as a central gating mechanism for airway protective behaviors."

      We have removed all language stating PiCo is a swallow gate.

      PiCo is likely part of the VSG (and thus the swallow pattern generator). PiCo, as part of the IRt/VSG could indeed be surveilling afferent information and providing output that affects swallow or other laryngeal activation and the coordination of these behaviors with breathing. However, this is not the responsibility of PiCo alone. This role is likely shared by other parts of the SPG, and may require distributed SPG network participation to be functional. If one were to stim other regions of the distributed SPG, similar results might be expected. When Toor et al silenced the PiCo area (and locations that lie at least lightly beyond the borders of what the present study defines as PiCo), stim-evoked fictive swallows were greatly suppressed. However, swallow-related apnea was unaffected. This supports the role of PiCo as a premotor relay for swallow motor activation, but not as the site that terminates inspiration. Therefore, it cannot be called a gate.

      We already addressed the issue that Toor never demonstrated that the “swallow-related” apnea was unaffected. Toor et al only demonstrated that the SLN-evoked apneas were unaffected, and their conclusions were only based on nerve recordings under fictive conditions (deafferented and paralyzed). Also, to the best of our knowledge, many aspects of the putative swallow pattern generator that this reviewer mentions are purely hypothetical. However, to avoid further arguments, we have removed the word gate and Figure 7 from this manuscript.

      Similarly, Fig 7 does not accurately depict things that are already well-supported by evidence. PiCo should be included as part of the swallow pattern generator (VSG), not as a separate entity acting on it. The BotC and pons are glaring omissions. This study has not demonstrated the labeled inhibitory connection from DSG to PreBotC. The legend states speculations as fact and needs to be dialed way back to either include statements with solid experimental evidence or to clearly mark things as putative/speculative.

      We have removed figure 7.

      The discussion of expiratory laryngeal motoneurons needs to be expanded and integrated better into the discussion of swallow, post-I, and other laryngeal motor activation. Why can't PiCo just be premotor to ELMs?

      If PiCo would “only” or “just” be premotor to ELM then it would not be expected that it could trigger an all-or-none swallow response with a temporal activity pattern similar to the one of a water-evoked swallow. We would also not expect that the activation of the activity pattern is independent of the laser stimulation duration as demonstrated in Figure 3. This was our reasoning why we originally called PiCo a “gate” because at the correct phase it will gate/trigger a complex swallow sequence. But, as stated above, we avoid the word gate in the revised manuscript.

      Concerning the discussion of "PiCo's influence as a gate for airway protective behaviors is blurred...": The incomplete swallow motor sequence didn't seem super different in timing or duration compared to the fully transfected animals (comparing plots from Fig 6 to Fig S1, and Table S2 to Table S3. The values for swallow durations (XII and X) for each group for water and opto seem within similar ranges, as do the differences between water & opto-evoked swallows between strains. While the motor pattern is distinctive from the normal swallow, with laryngeal activity rather than submental activity leading, one might not even be able to call that a swallow. Is it evidence against a classic all-or-nothing swallow behavior any more than the graded swallow results from (fully transfected) Table S1?

      We fully agree that it is possible that this unidentified behavior may not be a swallow. We have changed the name of this behavior to “upper airway motor activity.” However we also cannot rule out the possibility of this being some portion of a graded swallow which would argue that a graded swallow response is exact evidence against the classic all or nothing swallow behavior.

      Please expand on this point and put it into context with others' results: "This brings into question whether this is the first evidence against the classic dogma of swallow as an "all or nothing" behavior, and/or whether this is an indication that activating the cholinergic/glutamatergic neurons in PiCo is not only gating the SPG, but is actually involved in assembling the swallow motor pattern itself."

      This has been expanded and included citation of other studies. The following paragraph can be found in the discussion

      Swallow has been thought of as an “all or nothing” response as early as 1883 (Meltzer, 1883). Whether modulating spinal or vagal feedback (Huff A, 2020b), central drive for swallow/breathing (Huff, Karlen-Amarante, Pitts, & Ramirez, 2022) or lesions in swallow related areas of the brainstem (Car, 1979; Robert W Doty, Richmond, & Storey, 1967; Wang & Bieger, 1991) swallow either occurred or did not. Swallows are thought to be a fixed action pattern, with duration of stimulation having no effect on behavior duration (Fig. 3) (Dick, Oku, Romaniuk, & Cherniack, 1993). Thus, it was particularly interesting that in instances when few PiCo neurons were transfected, either unilateral or bilateral, an unknown activation of upper airway activity occurred. Motor activity no longer outlasted laser stimulation rather was contained within, and the timing of the motor sequence was reversed in comparison to a water or PiCo evoked swallow (Fig. 6). Thus, if insufficient numbers of neurons are activated, PiCo’s influence to specifically activate swallow or laryngeal activation is blurred, resulting in the uncoordinated activation of muscles involved in both behaviors. This brings possible evidence against the classic dogma of swallow as an “all or nothing” behavior, or the presence of an entirely different behavior. We are not the first to bring possible evidence against the classic dogma, “small swallows” were described but failed to be discovered if this was in-fact a partial or incomplete swallow (Miller & Sherrington, 1915). The SPG is thought to consist of bilateral circuits (hemi-CPGs) that govern ipsilateral motor activities, but receive crossing inputs from contralateral swallow interneurons in the reticular formation, thought to coordinate synchrony of swallow movements (Kinoshita et al., 2021; Sugimoto, Umezaki, Takagi, Narikawa, & Shin, 1998; Sugiyama et al., 2011). Incomplete activation of PiCo activates the muscular components of a swallow, without establishing the coordinated timing and sequence of the pattern. It is possible that PiCo is involved in assembling the swallow motor pattern itself and unilateral activation of PiCo could either desynchronize swallow interneurons or activates only one side of the SPG. Since we did not record bilateral swallow related muscles and nerves this question needs to be further examined.

      Reviewer #3 (Public Review):

      Huff et.al further characterise the anatomy and function of a population of excitatory medullary neurons, the Post-inspiratory Complex (PiCo), which they first described in 2016 as the origin of the laryngeal adduction that occurs in the post-inspiratory phase of quiet breathing. They propose an additional role for the glutamatergic and cholinergic PiCo interneurons in coordinating swallowing and protective airway reflexes with breathing, a critical function of the central respiratory apparatus, the neural mechanics of which have remained enigmatic. Using single allelic and intersectional allelic recombinase transgenic approaches, Huff et al. selectively excited choline acetyltransferase (ChAT) and vesicular glutamate transporter-2 (VGluT2) expressing neurons in the intermediate reticular nucleus of anesthetised mice using an optogenetic approach, evoking a stereotyped swallowing motor pattern (indistinguishable from a water-induced swallow) during the early phase of the breathing cycle (within the first 10% of the cycle) or tonic laryngeal adduction (which tracked tetanically with stimulus length) during the later phase of the breathing cycle (after 70% of the cycle).

      They further refine the anatomical demarcation of the PiCo using a combination of ChAT immunohistochemistry and an intersectional transgenic strategy by which fluorescent reporter expression (tdTomato) is regulated by a combinatorial flippase and cre recombinase-dependent cassette in triple allelic mice (Vglut2-ires2-FLPO; ChAT-ires-cre; Ai65).

      Lastly, they demonstrate that the PiCo is anatomically positioned to influence the induction of swallowing through a series of neuroanatomical experiments in which the retrograde tracer Cholera Toxin B (CTB) was transported from the proposed location of the putative swallowing pattern generator within the caudal nucleus of the solitary tract (NTS) to glutamatergic ChAT neurons located within the PiCo. We would like to thank the reviewer for acknowledging the technical advances of the present study and for the positive statements in general.

      Methods and Results

      The experimental approach is appropriate and at the cutting edge for the field: advanced neuroscience techniques for neuronal stimulation (virally driven opsin expression within a genetically intersecting subset of neurons) applied within a sophisticated in vivo preparation in the anaesthetized mouse with electrophysiological recordings from functionally discrete respiratory and swallowing muscles. This approach permits selective stimulation of target cell types and simultaneous assessment of gain-of-function on multiple respiratory and swallowing outputs. This intersectional method ensures PiCo activation occurs in isolation from surrounding glutamatergic IRt interneurons, which serve a diverse range of homeostatic and locomotor functions, and immediately adjacent cholinergic laryngeal motor neurons within the nucleus ambiguous (seen by some as a limitation of the original study that first described the PiCo and its roll in post-I rhythm generation Anderson et al., 2016 Nature 536, 76-80). These experiments are technically demanding and have been expertly performed.

      Again, we would like to thank the reviewer for these positive comments acknowledging the advances of the present study.

      The supplemental tracing experiments are of a lower standard. CTB is a robust retrograde tracer with some inherent limitations, paramount of which is the inadvertent labelling of neurons whose axons pass through the site of tracer deposition, commonly leading to false positives. In the context of labelling promiscuity by CTB, the small number of PiCo neurons labelled from the NTS (maybe 5 or 6 at most in an optical plane that features 20 or more PiCo neurons) is a concern. Even assuming that only a small subset of PiCo neurons makes this connection with the presumed swallowing CPG within the cNTS, interpretation is not helped by the low contrast of the tracer labelling (relative to the background) and the poor quality of the image itself. The connection the authors are trying to demonstrate between PiCo and the cNTS could be solidified using anterograde tracing data the authors should already have at hand (i.e. EYFP labelling driven by the con-fon AAV vectors from PiCo neurons (shown in Fig5), which should robustly label any projections to the cNTS).

      We fully agree with the reviewer that the CTB staining is of a lower standard and have removed this approach.

      The retrograde labelling from laryngeal muscles seems unnecessary: the laryngeal motor pool is well established (within the nAmb and ventral medulla), and it would be unprecedented for a population of glutamatergic neurons to form direct connections with muscles (beyond the sensory pool).

      The authors support their claim that PiCo neurons gate laryngeal activity with breathing through the demonstration that selective activation of glutamatergic and cholinergic PiCo neurons is sufficient to drive oral/pharyngeal/laryngeal motor responses under anaesthesia and that such responses are qualitatively shaped by the phase of the respiratory cycle within which stimulation occurs. Optical stimulation within the first 10% of the respiratory cycle was sufficient to evoke a complete, stereotyped swallow that reset the breathing cycle, while stimuli within the later 70% of the cycle, evoked discharge of the laryngeal muscles in a stimulus length-dependent manner. Induced swallows were qualitatively indistinguishable from naturalistic swallow induced by the introduction of water into the oral cavity. The authors note that a detailed interpretation of induced laryngeal activity is probably beyond the technical limits of their recordings, but they speculate that this activity may represent the laryngeal adductor reflex. This seems like a reasonable conclusion.

      We thank the reviewer for this comment. Unfortunately, we felt compelled to remove the word “gating” based on the statements by reviewer 1.

      The authors propose a model whereby the PiCo impinges upon the swallowing CPG (itself a poorly resolved structure) to explain their physiological data. The anatomical data presented in this study (retrograde transport of CTB from cNTS to PiCo) are insufficient to support this claim. As suggested above, complementary, high-quality, anterograde tracing data demonstrating connectivity between these structures as well as other brain regions would help to support this claim and broaden the impact of the study.

      We fully agree with this reviewer. We have been working on a thorough anatomical characterization for more than 3 years using cutting edge anterograde and retrograde viruses in collaboration with vector experts at the University of Irvine. But these are partly novel, unpublished techniques that are in development, and require many careful controls and characterization. We feel that this is a separate study as it doesn’t relate to swallowing coordination and also includes partly different authors. We hope to submit this as a separate study later this year.

      This study proposes that the PiCo in addition to serving as the site of generation of the post-I rhythm also gates swallowing and respiration. The scope of the study is small, and limited to the subfields of swallowing and respiratory neuroscience, however, this is an important basic biological question within these fields. The basic biological mechanisms that link these two behaviors, breathing and swallowing, are elusive and are critical in understanding how the brain achieves robust regulation of motor patterning of the aerodigestive tract, a mechanism that prevents aspiration of food and drink during ingestion. This study pushes the PiCo as a key candidate and supports this claim with solid functional data. A more comprehensive study demonstrating the necessity of the PiCo for swallow/breathing coordination through loss of function experiments (inhibitory optogenetics applied in the same transgenic context) along with robust connectivity data would solidify this claim.

      Thanks again for the positive assessment of our study.

    1. Author Response

      Reviewer #1 (Public Review)

      Using in vitro assays that take advantage of thymic slices, with or without the ability to present pMHC antigens, the authors define an early period in which CCR4 expression is induced, which induces their migration to the medulla and likely encounter with cDC2 and other APCs. Notably, the timing for CCR4 expression precedes that of CCR7 and illustrates the potential role for this early expression to initiate the movement of post-positive selection thymocytes to the medulla. The evidence for supporting a role for CCR4, as well as CCR7, in sequential tolerance induction is provided using multiple approaches, and although the observed changes amount to small percent changes, the significance is clear and likely biologically relevant over the lifespan of a developing T cell repertoire. Overall, the model provides a holistic view of how tolerance to self-antigens is likely induced during T cell development, which makes this work highly topical and influential to the field.

      We thank the reviewer for their comments and for highlighting the significance of identifying distinct roles for CCR4 and CCR7 in promoting medullary localization and inducing self-tolerance of thymocytes at different stages of T-cell development.

      Reviewer #2: (Public Review )

      This manuscript describes that CCR4 and CCR7 differentially regulate thymocyte localization with distinct outcomes for central tolerance. Overall, the data are presented clearly. The distinct roles of CCR4 and CCR7 at different phases of thymocyte deletion (shown in Figure 6C) are novel and important. However, the conclusion that expression profiles of CCR4 and CCR7 are different during DP to SP thymocyte development was documented previously. More importantly, the data presented in this manuscript do not support the conclusion that CCR7 is uncoupled from medullary entry. Moreover, it is unclear how the short-term thymus slice culture experiments reflect thymocyte migration from the cortex to the medulla.

      We thank the reviewer for pointing out the significance of our finding that CCR4 and CCR7 regulate different phases of thymocyte deletion. We agree that prior reports, including our own (Cowan et al. 2014, Hu et al., 2015) have shown that CCR4 and CCR7 are expressed by different post-positive selection thymocytes. However, the expression data we present here provides a higher resolution perspective on the specific thymocyte subsets that express these two receptors, as well as the different timing with which the receptors are expressed after positive selection. These data, coupled with chemotaxis assays of the granular thymocyte subsets responding to CCR4 versus CCR7 ligands, and 2-photon imaging data showing that CCR4 and CCR7 are required for medullary accumulation of distinct thymocyte subsets, are critical for delineating the unexpectedly distinct roles of these two chemokine receptors in promoting medullary entry and central tolerance.

      The reviewer raises an important question about our conclusion that CCR7 is “uncoupled” from medullary entry. We think there was likely a misunderstanding of our intended meaning, as we did not mean to imply that CCR7 does not promote medullary entry of thymocyte subsets; we have modified the wording of the abstract to replace “uncoupled” to clarify. As we detail in the Introduction, the role of CCR7 in directing chemotaxis of single-positive thymocytes towards the medulla and inducing their medullary accumulation is well established (Ehrlich et al., 2009; Kurobe et al., 2006; Kwan & Killeen, 2004; Nitta et al., 2009; Ueno et al., 2004). Instead, our data demonstrate that 1) the most immediate post-positive selection thymocyte subset (DP CD3loCD69+) does not require CCR7 for medullary entry, and 2) the next stage of post-positive selection thymocytes (CD4SP SM) express CCR7, but CCR7 recruits these cells only modestly into medulla. In contrast, CCR7 promotes robust medullary accumulation of more mature thymocyte subsets (CD4SP M1+M2), in keeping with the well-known role of CCR7 in promoting thymocyte medullary localization. We think these findings are highly significant for the field because currently, there is a widely held assumption that post-positive selection thymocytes that do not express CCR7 are located in the cortex, while those that express CCR7 are located in the medulla. Our data show that neither of these assumptions is true: CCR4 drives medullary accumulation of cells that do not yet express CCR7, and the earliest post-positive selection cells that express CCR7 continue to migrate in both the cortex and medulla. These findings form the basis of our statement that CCR7 expression is “not synonymous with” medullary localization. The finding that thymocytes do not robustly accumulate in the medulla in a CCR7-dependent manner until more the mature SP stages has important implications for central tolerance, as localization of thymocytes in the cortex versus medulla will impact which APCs and self-antigens they encounter when testing their TCRs for self-reactivity.

      The reviewer also raised concerns about whether short-term thymus slice cultures reflect physiological thymocyte migration. Short-term live thymic slice cultures have been widely used to investigate the development, localization, migration, and positive and negative selection of thymocytes, as they have been shown to faithfully reflect these in vivo processes, including confirming the role of CCR7 in inducing chemotaxis of mature thymocytes from the cortex into the medulla (Au-Yeung et al., 2014; Dzhagalov et al., 2013; Ehrlich et al., 2009; Lancaster et al., 2019; Melichar et al., 2013; Ross et al., 2014). However, we acknowledge that thymic slices are not equivalent to intact thymuses and have now discussed limitations of this system in our revised Discussion.

      Comment 1: Differential profiles in the expression of chemokine receptors, including CCR4, CCR7, and CXCR4, during DP to SP thymocyte development were well documented. Previous papers reported an early and transient expression of CCR4, a subsequent and persistent expression of CCR7, and an inverse reduction of CXCR4 (Campbell, et al., 1999, Cowan, et al., 2014, and Kadakia, et al. 2019). The data shown in Figures 1, 2, and 3 are repetitive to previously published data.

      The expression profile of CCR4, CCR7 and CXCR4 on thymocytes has been documented previously in the studies cited above and in our prior publication (Hu et al., 2015). Campbell et al. (Campbell, Haraldsen, et al., 1999) investigated chemotactic effects of chemokines, but did not directly address expression of chemokine receptors by thymocyte subsets. Cowan et al. (Cowan et al., 2014) examined the expression of CCR4 versus CCR7 on DP and CD4SP thymocytes. However, our data provide a more detailed analysis of expression of these distinct chemokine receptors by subsets of DP, CD4SP, and CD8SP thymocyte subsets along the trajectory of differentiation after positive selection, using a gating scheme inspired by a study published after the above-cited papers (Breed et al., 2019). Our more nuanced evaluation of CCR4 versus CCR7 expression sets the stage for finding that they play distinct roles in promoting medullary entry and central tolerance of early- versus late-stage post-positive selection thymocytes. Without examining CCR4 and CCR7 expression patterns by distinct thymocyte subsets in detail, we would not have made the unexpected observation that although CCR7 is expressed at high levels by many CD4SP SM thymocytes, it does not induce strong chemotaxis or medullary accumulation of this subset, relative to its role in more mature SP thymocyte subsets. This finding has important implications for which APCs thymocytes encounter as they are tested for self-reactivity to enforce central tolerance. As we were working on these studies, Kadakia et al. reported that extinguishing CXCR4 expression was important for enabling medullary entry (Kadakia et al., 2019). Thus, we thought it was important to place CXCR4 in the context of CCR4 and CCR7 expression on thymocyte subsets in our study, and in doing so found another example of asynchronous chemokine receptor expression and function, further indicating that expression of a chemokine receptor alone is not a reliable marker of functional activity or thymocyte localization, as cells migrate dynamically between the cortex and medulla.

      Through more extensive gating and simultaneous investigation of chemokine receptor expression and function, our data have provided new insights into how thymocytes respond to chemokine cues at different time points during their post-positive selection development. Moreover, our refined gating scheme (Figure 1) can be used to distinguish thymocyte subsets at different development stages without relying on chemokine receptor expression, thus providing an unbiased way of investigating chemokine receptor expression at different developmental stages.

      Comment 2: The manuscript describes the lack of CCR7 at early stages during DP to SP thymocyte development (Figure 1-3). However, CCR7 expression is detected insensitively in this study. Unlike CCR4 detection with a wide fluorescence range between 0 and 2x104 on the horizontal axis, CCR7 detection has a narrow range between 0 and 2x103 on the vertical axis (Figure 1C, 1D, 4B, 4C, 6B, S2, S3), so that flow cytometric CCR7 detection in this study is 10-times less sensitive than CCR4 detection. It is therefore likely that the "CCR7-negative" cells described in this manuscript actually include "CCR7-low/intermediate" thymocytes described previously (for example, Figure S5A in Van Laethem, et al. Cell 2013 and Figure 6 in Kadakia, et al. J Exp Med 2019).

      We provide new data to address the possibility that we were failing to detect low levels of CCR7 expression on early post-positive selection DPs (CD3loCD69+). We agree that CCR7 immunostaining of mouse cells is known to be more challenging than immunostaining of other chemokine receptors, including CCR4 and CXCR4. CCR7 immunostaining needs to be carried out at 37°C, which we did throughout our studies. We provide new data comparing CCR7 expression by Ccr7+/+ versus Ccr7-/- thymocyte subsets (Figure 1—figure supplement 2A-B), which confirm that CCR7 is not expressed at detectable levels by CD3loCD69+ DP cells above the background seen in CCR7-deficient cells. As thymocytes transition to theCD4SP SM stage, low/intermediate to high expression of CCR7 can be detected (Figure 1—figure supplement 2A). To further test whether we were failing to detect low levels of CCR7 by post-positive selection DPs, we incubated thymocytes at 37°C for up to 2 hours prior to immunostaining for CCR4 and CCR7, as a prior study indicated in vitro culture would enable increased cell surface expression of CCR7 by alleviating ligand-mediated CCR7 internalization (Britschgi et al., 2008). However, we did not observe increased CCR7 (or CCR4) expression by any thymocyte subset incubated at 37°C (Figure 1—figure supplement 2C-D). Lack of expression of CCR7 by CD3loCD69+ DP cells is consistent with their failure to undergo chemotaxis to CCR7 ligands in vitro, and initial expression of CCR7 by CD4SP SM is consistent with their chemotaxis towards CCR7 ligands in vitro (now show in greater detail in Figure 2—figure supplement 1), albeit at a much lower migration index than subsequent thymocyte subsets.

      Comment 3: Low levels of CCR7 expression could be functionally evaluated by the chemotactic assay as shown in Figure 2. However, the data in Figure 2 are unequally interpreted for CCR4 and CCR7; CCR4 assays are sensitive where a migration index at less than 1.5 is described as positive (Figure 2A and 2B), whereas CCR7 assays are dismissal to such a small migration index and are only judged positive when the migration index exceeds 10 or 20 (Figure 2C and 2D). CCR7 chemotaxis assays should be carried out more sensitively, to equivalently evaluate the chemotactic function of CCR4 and CCR7 during thymocyte development.

      We thank the reviewer for his insight about the possibility that we could have overlooked CCR7-mediated chemotaxis at lower migration indexes. When data from the chemotaxis assays were evaluated separately for each thymocyte subset, CCR7-mediated chemotaxis of CD4SP SM and subsequent DP CD3+CD69+ co-receptor reversing thymocytes could be detected. However, DP CD3loCD69+ thymocytes still did not undergo CCR7-meidated chemotaxis, but were responsive to the CCR4 ligand CCL22 (Figure 2—figure supplement 1).

      We did not detect CCR7-mediated chemotaxis of CD4SP SM and DP CD3+CD69+ subsets in our previous analysis because their lower-level chemotactic index relative to mature thymocytes did not reach statistical significance when chemotaxis of all subsets were compared simultaneously (Figure 2D). We note that the magnitude of difference in the responsiveness of CD4SP SM cells compared to mature CD4SP and CD8SP M1 & M2 thymocytes (Figure 2D) is likely physiologically important as CCR7 deficiency results in severely reduced medullary accumulation of CD4SP M1+M2 cells, but only a very mild reduction in medullary accumulation of CD4SP SM cells, which is only detected with our new paired analyses in Figure 5C. We feel these new analyses provide important new insights and thank the reviewer for this suggestion.

      Comment 4: Together, this manuscript suffers from the poor sensitivity for CCR7 detection both in flow cytometric analysis and chemotactic functional analysis. Conclusions that CCR7 is absent at early stages of DP to SP thymocyte development and that CCR7 is uncoupled from medullary entry are the overinterpretation of those results with the poor sensitivity for CCR7. The oversimplified scheme in Figure 3D is misleading.

      We agree that the scheme in Figure 3D, as previously constructed, did not ideally display the difference in scale between thymocyte responses to CCR7 ligands versus CCR4 and CXCR4 ligands (as detected in vitro). Thus, we have now modified the schematic to include the mild response to CCR7 ligands that we observed in CD4SP SM thymocytes (comment 3) and to emphasize the higher chemotactic response of mature thymocytes to CCR7 ligands than of DPs and CD4SP SM to CCR4 ligands. Likewise, we have modified the manuscript to clarify the importance of CCR7 expression in the medullary entry and accumulation of mature thymocyte subsets.

      We respectfully disagree that the sensitivity of CCR7 detection was poor in our flow cytometry and chemotactic analyses. Our CCR7 stains identified a range of CCR7 expression levels, from no expression by pre- and post-positive positive selection DP cells to high expression by CD4SP M1 cells, and we now provide new data confirming our ability to detect CCR7 expression (Figure 1—figure supplement 2), as described in response to Comment 3. Our chemotaxis assays detected CCR7 responses over a range of migration indexes from ~ 2 up to 100, showing our sensitive ability to detect CCR7-mediated chemotaxis in vitro (Figure 2 and Figure 2—figure supplement 1). In live thymic slices, we were also able to capture a range of biologic activities of CCR7, from mediating modest medullary accumulation of CD4SP SM cells to robust medullary accumulation of CD4SP M1+M2 cells (Figure 5A-C). Importantly, our results demonstrate that CCR7 is not the only chemokine receptor responsible for medullary entry and accumulation of thymocytes. Complex spatiotemporal regulation of thymocytes at distinct stages of development is achieved through tight orchestration of expression and signaling through multiple chemokine receptors, including CCR4, as shown by our data. However, our study does not negate an important role for CCR7 in mediating medullary entry of thymocytes, which we have clarified in the text.

      Comment 5: The short-term thymus slice culture experiments should be described more carefully in terms of selection events during DP to SP thymocyte development, which takes at least 2 days for CD4 lineage T cells and approximately 4 days for CD8 lineage T cells (Saini, et al. Sci Signal 2010 and Kimura, et al. Nat Immunol 2016). The slice culture experiments in this manuscript examined cellular localization within 12 hours and chemokine receptor expression within 24 hours (Figures 4, 5) even for the development of CD8 lineage T cells (Figure S2), which are too short to examine entire events during DP to SP thymocyte development and are designed to only detect early phase events of thymocyte selection.

      Experiments in Figures 4 and 5 were indeed designed to capture behaviors of thymocytes relatively early after introduction onto thymic slices. Figure 4 (and Figure 4—figure supplement 1) shows that the timing of CCR4 versus CCR7 expression after positive selection is dramatically different: CCR4 is expressed within hours of positive selection, concomitant with medullary entry, while CCR7 expression takes several days in the slices (sufficient time for CD8SP development, Figure 4—figure supplement 1). Figure 5 shows that medullary accumulation of CD4SP M1+M2 cells occurs robustly in the medulla of thymic slices within a couple of hours after introduction into the slices, and this localization is CCR7 dependent, while CCR4 induces more mild medullary accumulation of post-positive selection DPs. As indicated by the reviewer, it has been shown that it takes days for DP thymocytes to develop into mature CD4SP and CD8SP cells (Kimura et al., 2016; Lutes et al., 2021; Saini et al., 2010), as recapitulated in the thymus slice system (Figure 4—figure supplement 1) (Lutes et al., 2021). The relatively short time frame of our time-course experiments (up to 12 hours after addition of pre-positive selection thymocytes to positively selecting thymic slices) allowed us to detect expression of CCR4 within a few hours after positive selection and to determine that this timing correlated with medullary entry. Thus, the 12-hour time-course was important for temporal resolution of chemokine receptor expression and medullary localization after initial stages of positive selection.

      Comment 6: It is unclear what the medullary density alteration measured in the thymus slice culture experiments represents. Although the manuscript describes that the increase in the medullary density reflects the entry of cortical thymocytes to the medulla (Figure 4E and S2E), this medullary density can be affected by other mechanisms, including different survival of the cells seeded on the top of different thymus microenvironments. Thymocytes seeded on the medulla may be more resistant to cell death than thymocytes seeded in the cortex, for example, because of the rich supply of cytokines by the medullary cells. So, the detected alterations in the medullary density may be affected by the differential survival of thymocytes seeded in the cortex and the medulla. Also, the medullary density is measured only within a short period of up to 12 hours. The use of MHC-II-negative slices and CCR4- or CCR7-deficient thymocytes in the thymus slice cultures may verify whether the detected alteration in the medullary density is dependent on TCR-initiated and chemokine-dependent cortex-to-medulla migration.

      We thank the reviewer for pointing out these possibilities. The purpose of the positive selection timing experiment (Figure 4) was to establish the early correlation between receiving a positive selection signal, upregulating CCR4, and migrating into the medulla. In this system, cells only enter only the cortex in the first hour after migration in the slice, consistent with prior studies of localization of pre-positive selection thymocytes to the cortex (Ehrlich et al., 2009; Ross et al., 2014); subsequently, they move into the medulla. Because CCR7 is widely accepted to be essential for medullary entry, we feel it is important to demonstrate the disconnect between the timing of medullary entry and CCR7 expression in multiple ways. The timing experiment design utilized MHCII-/- and β2m-/- slices to show that positive selection was necessary for expression of CCR4. To test whether CCR4 or CCR7 were required for medullary entry of early post-positive selection DPs, we evaluated medullary accumulation of this subset from WT, Ccr4-/-, Ccr7-/-, and Ccr4-/-Cc7-/- mice. This experiment provided a more robust means of determining the extent to which CCR4 deficiency impacted medullary localization of a large cohort of cells that had passed positive selection (Figure 5), and again showed that the post-positive selection thymocytes, which express CCR4 but not CCR7, accumulate in the medulla in a CCR4-dependent manner. We note that in Figure 5, we show that all Ccr4-/-Ccr7-/- thymocyte subsets imaged have medullary:cortical density ratios of ~1, indicating an even distribution across cortex and medulla, which is highly consistent with an essential role for these two chemokine receptors in cooperating to mediate medullary accumulation of different stages of developing T cells.

      The reviewer makes an interesting point that survival cues could differ in the cortex versus medulla. However, if thymocytes lacking one or both chemokine receptors had impaired survival because they didn’t enter a region of the thymus efficiently to receive survival cues, we would expect to detect increased apoptosis in Ccr4-/-, Ccr7-/-and Ccr4-/-Cc7-/- thymocytes. However, we found that chemokine receptor deficiencies resulted in diminished apoptosis of different thymocyte subsets (Figure 6). This finding is more consistent with reduced negative selection of these subsets due to reduced clonal deletion. We nonetheless discuss this possibility in our revised manuscript, as it important to consider that chemokine-mediated migration of thymocytes into different microenvironments could alter their access cytokines and other pro-survival cues.

      Reviewer #3 (Public Review)

      In this manuscript, Li et al. examine how the expression of the chemokine receptor CCR4 impacts the movement of thymocytes within the thymus. It is currently known that the chemokine receptor CCR7 is important for developing thymocytes to migrate from the cortical region into the medullary region and CCR7 expression is therefore often used to define medullary localization. This is important because key developmental outcomes, like enforcing tolerance to self-antigens amongst others, occur in the medullary environment. The authors demonstrate that the chemokine receptor CCR4 is induced on thymocytes prior to expression of CCR7 and thymocytes exhibit responsiveness to CCR4 ligands earlier in development. Using elegant live confocal microscopy experiments, the authors demonstrate that CCR4 expression is important for the entry and accumulation of specific thymocyte subsets while CCR7 expression is needed for the accumulation of more mature thymocyte subsets. The use of cells deficient in both CCR4 and CCR7 and competitive migration/accumulation experiments provide strong support for this conclusion. The elimination of CCR4 expression results in decreases in apoptosis of thymocyte subsets that have been signalled through their antigen receptor and are responsive to CCR4 ligands. As expected, more mature thymocyte subsets show decreased apoptosis when CCR7 is absent. Distinct antigen-presenting cells in the thymus express CCR4 ligands supporting a model where CCR4 expressing thymocytes can interact with thymic antigen-presenting cells for induction of apoptosis. The absence of CCR4 results in an increase in peripheral T cells that can respond to self-antigens presented by LPS-activated antigen-presenting cells providing further support for the model. Collectively, the manuscript convincingly demonstrates a previously unappreciated role for CCR4 in directing a subset of thymocytes to the medulla.

      We thank the reviewer for appreciating the novelty of the finding that CCR4 directs distinct subsets of thymocytes into the medulla relative to CCR7, as supported by multiple lines of evidence.

    1. Author Response

      Reviewer #1 (Public Review):

      The sustainability of vaccination programs is subject to multiple threats, from a pandemic like COVID-19 to political changes. The present study assesses different strategies, including gender-neutral vaccination, to better respond to threats in HPV national immunization programs. The authors showed that vaccinating boys against HPV (compared to vaccinating girls alone), would not only prevent more cases of cervical cancer but also limit the impact of disruptions in the program. Moreover, it would help attain the goal set by the World Health Organization of eliminating cervical cancer as a public health problem sooner, even in the case of disruptions.

      Strengths and weaknesses: I found the manuscript well-written and easy to read. Decision-makers may find the results helpful in policy development and other researchers may use the study as an example to investigate similar scenarios in their local contexts. Nevertheless, there are some limitations. First, it should be considered that the present study is only applicable to India and other countries with a similar HPV context. Second, because it is a study based on a mathematical model, errors might arise from the assumptions considered for its construction. It also relies on the quality of the data used to construct and calibrate the model.

      Models are important tools for decision-making, they allow us to assess different scenarios when obtaining real-world data is not feasible. They also allow to carried-out multiple sensitivity analyses to test the strengths of the results. The study carries out a necessary assessment of different vaccination strategies to minimize the impact on cervical cancer prevention due to disruptions in the HPV immunization program. By using a mathematical model, the authors are able to assess different scenarios regarding vaccination coverage rates, disruption time, and cervical cancer incidence. Therefore, decision-makers can consider the scenario which best represents their current situation.

      The present study is not only valuable for decision-making, but also from a methodological point of view as future research can be conducted exploring more in deep the impact of vaccination disruptions and prevention measures.

      The conclusions of this paper are mostly well supported by data, but some aspects of the methodology need clarification; furthermore, some aspects of the calculations can be improved. It would be more informative, and better for comparisons between the four scenarios, to have relative measures instead of the absolute numbers of cases prevented.

      We thank the reviewer for the kind acknowledgement of the merits of the paper. We have tried to address the suggestions and questions as much as possible in the revised manuscript.

      We agree to the points of weaknesses raised by the reviewer regarding the applicability of our study results is limited to other countries and the possible errors arising from a using a mathematical model. We have added more elaborate discussion of these points in the manuscript, as follows: - Page 15 lines 310-312: “Extrapolation of the results of this study to other populations will be limited to those sharing similar patterns of demography, social norms, and cervical cancer epidemiology as India.” - Page 17 lines 361-363: “…, within the limitations of our model, the modelbased estimates show that shifting from GO to GN vaccination may improve the resilience of the Indian HPV vaccination programme while also enhancing progress towards the elimination of cervical cancer.”

      Furthermore, we have tried to clarify the rationale, advantages, and limitations of the measure of resilience we have adopted.

      Reviewer #2 (Public Review):

      This study evaluated the effect of population-based HPV vaccination programs in India which is suffering from the disease burden of cervical cancer. The authors used model simulations for estimating the outcomes by adopting the latest available data in the literature. The findings provide evidence-based support for policymakers to devise efficient strategies to reduce the impacts of cervical cancer in the country.

      Strengths.

      The study investigated the potential impact of cervical cancer elimination when HPV vaccination was disrupted (e.g., during the COVID-19 pandemic) and for meeting the WHO's initiatives. The authors considered several settings from the low to high effects of vaccination disruption when concluding the findings. The natural history was calibrated to local-specific epidemiological data which helps highlight the validity of the estimation.

      Weaknesses.

      Despite the importance and strengths, the current study may likely be improved in several directions. First, the study considered the scenario of using a recently developed domestic HPV vaccine but assuming vaccine efficacy based on another foreign HPV vaccine that has been developed and used (overseas) for more than 10 years. More information should be provided to support this important setting.

      Second, the authors are advised to discuss the vaccine acceptability and particularly the feasibility to achieve high coverage scenarios in relatively conservative countries where HPV vaccines aim to prevent sexually transmitted infection. Third, as the authors highlighted, the health economics of gender-neutral strategies, which is currently missing in the manuscript, would be a substantial consideration for policymakers to implement a national, population-based vaccination program.

      We thank the reviewer for the kind acknowledgement of the merits and strengths of the paper.

      We have tried to address the reviewer’s three points of weaknesses as comprehensively as possible in the revised manuscript.

      Regarding the first two points of weaknesses, we have provided more background information about the current situation of HPV introduction and screening in India (see the more specific replies below for where changes have been made), and some data of observed coverage in India in the states where HPV vaccination has been introduced.

      Regarding the reviewer’s third point about the health economics of genderneutral strategies, we agree fully that it is an important aspect to consider for the local policymakers. However, a health economic assessment is out of the scope of the present paper. In the present paper, we are interested in highlighting the potential health benefits on GN HPV vaccination. Given the current context of HPV vaccination in India we think it is too early to provide a realistic assessment of the health-economic balance of GN vaccination. Please note that one manuscript (de Carvalho et al., MedRxiv, doi: https://doi.org/10.1101/2023.04.14.23288563) based on the same modelling exercise and reporting a health economic assessment of girls-only (routine and catch-up) HPV vaccination in India is currently submitted for peer-review.

      Reviewer #3 (Public Review):

      The authors put together a rigorous study to model the impact of HPV vaccine programme disruptions on cervical cancer incidence and meeting WHO elimination goals in a low-income country - using India as an example. The study explores possible scenarios by varying HPV vaccination strategies for 10-year-old children between a) increasing vaccine coverage in a girls-only vaccination programme and b) vaccinating boys in addition to girls (i.e a gender-neutral vaccination programme).

      The main strength of this study is the strength of the modelling methodology in helping to make predictions and in contingency planning. The study methodology is rigorous and uses models that have been validated in other settings. The study employs a high level of detail in calibrating and adapting the model to the Indian context despite poor data availability. The detailed methodology allows future studies to employ the model and techniques with locally-contextualised parameters to study the potential impact of HPV vaccine programme disruptions in other countries.

      The work in this field can begin to help lower-income countries explore varying HPV vaccination strategies to reduce cervical cancer incidence, keeping in mind the potential for future supply chains or other related disruptions. However, the scenarios could be better sculpted to model potentially realistic scenarios to guide policymakers to make decisions in situations with limited vaccine supplies - in other words comparing scenario alternatives based on a fixed number of vaccines being available. Using comparative alternatives will help policymakers grapple with the decisions that need to be made regarding planning national HPV vaccination programmes. The results could afford to provide readers with a clearer measure of vaccine strategy 'resilience'.

      In all, the authors are able to successfully explore the potential impact of varying HPV vaccination strategies on cervical cancer cases prevented in the context of vaccine disruptions, and make valid conclusions. The results produced are rich in information and are worthy of deeper discussion.

      We thank the reviewer for the kind acknowledgement of the merits and strengths of the paper.

    1. As previously discussed, deliberate practice, in this case through frequent and active homework, helps build expertise in a domain. Now we know that deliberate practice works to build expertise because it helps build synaptic plasticity. Think-pair-share also increases synaptic plasticity by engaging students’ brains in ways that recall semantic information but also may include the formation of skills and habits, depending on the questions posed. Concept maps rationally encode knowledge, which allows memories to build as synaptic networks. Problem-based learning encourages students in terms of motivation and attention, which in turn increase learning by increasing synaptic plasticity. Using culturally diverse examples in one’s pedagogy helps to alleviate or eliminate stereotype threat, which decreases stress.

      I LOVE scientific-based information, especially when it comes to study techniques/effective ways to learn, so this is really helpful for me! I'll definitely be implementing these into my study habits in the future.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2023-01910

      Corresponding author(s): Michael W. Sereda

      1. General Statements

      Reviewer #1:

      In this paper the authors report a direct correlation between PMP22 and PTEN expression levels in the nerve of CMT mutants. In CMT1A Pmp22tg rat nerves, PTEN levels are increased, whereas in Pmp22+/- mutants, a model of the HNPP neuropathy, PTEN levels decrease. Consistent with this, Pmp22tg nerves display lower Akt phosphorylation and, vice versa, Pmp22+/- nerves have higher Akt phosphorylation. The authors lowered PTEN in the transgenic and inhibited mTOR using Rapamycin in the Pmp22+/- to support the functional relevance of the PMP22-PTEN correlation. ... In conclusion, the correlation between PMP22 and PTEN is a potential interesting observation. However, in my opinion, experiments as shown don't support the conclusion that PMP22 controls PTEN expression level and activity, which is suggested at the basis of the pathogenesis of PMP22 dosage-related neuropathies.

      We thank Reviewer #1 for this detailed feedback. We appreciate the Reviewer’s assessment that our observation that PMP22 and PTEN are correlated in CMT1A and HNPP is of potential interest. In the revised manuscript we addressed this key point by adding additional quantifications (Figure 1a, d; Figure 5d) and novel Western Blot analyses (Figure 1a, d). Regarding the pathophysiological significance of the correlation, we point out that both the original as well as the partially revised manuscript contain multiple pieces of evidence demonstrating that altered PTEN activity is critical for both PMP22 gene-dosage related neuropathies:

      1. The inhibition of the PI3K/PTEN/AKT/mTOR axis upstream (LY294002) or downstream (Rapamycin) of decreased PTEN ameliorates myelin defects in an in vitro HNPP model (Figure 2b, c).
      2. Downstream of PTEN, Rapamycin treatment ameliorates myelin defects, motor behavior and electrophysiology in the HNPP mouse model in vivo (Figure 3c, d, e,____ g, i)
      3. Targeting of increased PTEN directly by inhibiting its activity pharmacologically (VO-OHpic) in a CMT1A rat model or by depleting it genetically in a CMT1A model leads to ameliorated myelination in vitro (Figure 4b, c; Figure 5f, g).
      4. The genetic depletion of PTEN in a CMT1A mouse model increases myelination in vivo, albeit not in the long term (Figure 6a, b, c, d). We therefore feel that any additional evidence to show that "PMP22 controls PTEN activity" is not vital for supporting the major claims of the manuscript, i.e. that the observed correlation of PTEN levels with PMP22 gene dosage has relevance for the etiology of PMP22 dosage diseases and and that targeting the PI3K-PTEN-AKT-mTOR axis downstream of PTEN provides a potential pharmacological therapy of HNPP (while directly targeting PTEN ultimately fails to rescue CMT1A). However, we agree that the activity of PTEN on the molecular level is interesting, and such evidence would further strengthen our conclusions. Therefore, in the final revised version, we plan to add further Western Blots and explore possible downstream effects of altered PTEN levels.

      Reviewer #2:

      This study investigates the modulation, both genetically and pharmacologically, of the PI3K/Akt/mTOR signaling in preclinical animal models for the inherited peripheral neuropathies HNPP and CMT1A. These conditions result from a gene dosage abnormality of the peripheral myelin protein gene PMP22. The exact biological molecular mechanisms remain enigmatic despite it having been over 30 years since the major genetic lesions, the CMT1A duplication and HNPP deletion, were described. With respect to myelin biology one observes focally slowed nerve conduction at pressure palsies and local/segmental hypermyelination in HNPP whereas hypomyelination occurs in CMT1A. The study is nicely conducted, data illustrations very informative, and writing clear and concise. This paper will likely be of great interest to your readers. The authors provide convincing evidence that the HNPP pathobiology is ameliorated by PI3K/Akt/mTOR inhibitors. Interestingly they found radial myelin growth was most affected by this approach and suggest an interesting transdermal approach in injured nerves in the acute prevention of pressure palsies.

      We thank Reviewer #2 for this positive evaluation.

      Reviewer #3:

      *In this paper Sareda and co-workers demonstrate that the PTEN/mTOR pathway is indirectly involved in regulating myelin thickness and wrapping in models of altered PMP22 gene dosage both in vitro and in vivo. Inhibition of this pathway decreases myelin thickness in models of HNPP, while increasing myelin thickness in models of CMT1A. The evidence for these conclusions is complex but reasonably presented, and the conclusions mainly supported by the data. The abstract for this paper, however, presents a somewhat oversimplified conclusion that the PTEN pathway mainly modifies models of HNPP, where the paper clearly demonstrates that models of CMT1A are also affected by this same pathway. This should be clarified. *

      We thank Reviewer #3 for the feedback on the manuscript. We agree with the Reviewer that the same pathway (PI3K/Akt/mTOR) also affects CMT1A, but it is of importance for us to highlight that the disease mechanisms are -at least partly- different between HNPP and CMT1A. This is supported by our observation that PTEN reduction in CMT1A only transiently improves myelination in vivo (Figure 6) and the persistent alteration of differentiation markers despite PTEN reduction, which is not observed in HNPP (Figure 7).

      2. Description of the planned revisions

      Reviewer #1

      Regarding the activity of PTEN

      Figure 1

      • Additional experiments are needed to support the conclusion of Figure 1 that, in the two mutants, Pten levels reversely correlate with PI3K-Akt-mTOR pathway activation, which represents the rationale of all further experiments. For example, it should be shown systematically in both mutants both Akt and ERK phosphorylation levels (Akt at both T308 and S473), and mTOR activity read outs. In the previously published paper (Fledrich et al.) only increased Akt phosphorylation in Pmp22+/- nerves was reported, whereas Pmp22tg analysis was focused on the interdependence between Akt and ERK without exploring mTOR activation, which is relevant here. 2) (Figure 4) A different model, the C61 mouse a Pmp22tg overexpressing PMP22 is used here (rather than the CMT1A rat). This should be explained in the results. Is also this model characterized by increased Pten levels in the nerve? And low Akt-mTOR activation for instance? 3) (Figure 5) How is Akt-mTOR signaling in the double mutant as compared to Pmp22tg? Is that increased at P18? * Response: We fully agree with the Reviewer that further exploration of PTEN downstream effects will add value to the manuscript. We already justified the usage of the C61 mouse model more clearly, added P-S6 staining of wildtype in addition to an improved representation in Figure 5e, and performed extra Western Blot analysis of PTEN expression (described in the next section “Incorporated *revisions”). Moreover, we will further evaluate the downstream signaling components of PTEN and will perform additional Western Blot analyses of peripheral nerves of HNPP mice, CMT1A rats as well as C61 and C61xPTENhKO mice.

      Figure S1

      • *Figure S1, page 4: what does it mean "in line with this finding we were unable to detect protein-protein...". May be the authors meant: since there is a direct correlation between Pmp22 and Pten expression levels in the mutants, the authors explored the possibility of an interaction between the two. Regarding the co-IPs, in panel a, the co-IP at the endogenous level, the immunoprecipitation efficiency of PMP22 is very low. May be a pull-down experiment using either exogenous purified PMP22 or PTEN and nerve lysates can help to rule out the possibility of an interaction. The experiments in b, c are performed in overexpression in a heterologous system (293 cells). * Response: We agree with the Reviewer that we might have missed a possible interaction between PMP22 and PTEN in the experiments performed so far. Indeed, pull-down experiments may prove helpful to rule out / reveal protein-protein interaction. Therefore, we will use purified PMP22 and perform pull-down experiments using nerve lysates of wildtype and CMT1A rats.

      Figure 5

      • *Pten Fl/+ Dhh-Cre cultures seem to have axonal fasciculation. * Response____: We thank the Reviewer for this observation. We will systematically inspected all recorded images for features of fasciculation. We will also assess whether fasciculation is a representative feature in cultures derived from any of the genotypes, and if so, whether the genotypes differ in this regard.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Changes in the text are highlighted in green in the revised manuscript

      Reviewer #1:

      Figure 1

      • *Panel a: the decrease of Pten expression should be quantified with at least n=3 taking into account the variability among different samples at the different time points indicated (the same applies in panel b, even if here the increase of Pten expression level in Pmp22tg nerves is more evident). * Response: We agree with the Reviewer that the timeline is not sufficient to demonstrate alteration in PTEN expression in PMP22 gene dosage diseases CMT1A and HNPP. Therefore, we performed new Western Blot experiments evaluating PTEN expression in (i) HNPP mice, (ii) CMT1A rat (iii) C61 mice and (iv) C61xPTENhKO mice with minimum n = 3 biological replicates and performed the respective quantification which is shown in Figure1 (i, ii) and Figure 5 (iii, iv). The results of the Western Blot analysis and quantification show an increase in PTEN abundance in CMT1A rat (Figure 1d) and C61 mice (Figure 5d) while a decrease is observed in HNPP mice (Figure 1a) and PTENhKOxC61 mice (Figure 5d) when compared to wildtype controls.

      • *Panel a and b: the statement that Pten is more expressed at P18 at the peak of myelination in wildtype nerves is not supported by the blots as shown. * Response: We agree that this observation is only partly supported by the Western Blot analysis, as seen in the HNPP mouse model, and deleted this part in the results section.

      • Figure S1, page 4: what does it mean "in line with this finding we were unable to detect protein-protein...". May be the authors meant: since there is a direct correlation between Pmp22 and Pten expression levels in the mutants, the authors explored the possibility of an interaction between the two. Response: We thank the Reviewer for pointing out the lack of clarity here. We changed the respective sentence accordingly:

      “Since there is a direct correlation between PMP22 and PTEN expression levels in the mutants, we explored the possibility of an interaction between the proteins. By immunoprecipitation experiments we were unable to detect protein-protein interaction between PMP22 and PTEN (Figure S1).” (Page 4)

      • *Page 4: "Taken together, Pmp22 dosage inversely correlates with the abundance of PTEN...": please revise this statement * Response: We thank the reviewer for spotting this mistake. We changed the sentence accordingly, which now reads:

      “Taken together, Pmp22 dosage directly correlates with the abundance of PTEN and presumably the activation level of the PI3K/Akt/mTOR pathway in myelinating Schwann cells (Figure 1i)." (Page 4, Line 23)

      Figure 2:

      • The aberrant myelin figures displayed are similar to myelin ovoids preceding degeneration rather than myelin outfoldings. It is also strange that these alterations are in the wildtype cultures treated with RAPA, that instead, in this system, has been reported to increase myelination as it improves protein homeostasis (autophagy, quality control, etc). Response: We thank the Reviewer for pointing this out. Indeed, in the way the images have been presented the aberrant myelin profiles can be mistaken for ovoids. However, a close inspection of the TUJ1 channel images revealed continuity of the axons below the aberrant myelin, thereby excluding ovoid formation. In the partially revised manuscript, we now also show the TUJ1 channel individually (Figure 2), so that it can be appreciated that the defects are confined to the myelin. Concerning the incidence of the myelin defects in RAPA treated wildtype cultures, our analysis can have missed a potential amelioration due to the rather high variability in the data.

      Figure 3

      *Panel c-e: aberrant fibers should be normalized on total number of fibers and on the area, particularly because RAPA is used. *

      Response: We agree with the Reviewer that number of tomacula and recurrent loops should be normalized to the total number of fibers on the area. We have quantified the total number of fibers in the whole sciatic nerve and normalized the tomacula and recurrent loops number accordingly. Results show a decrease in both tomacula and recurrent loops after Rapamycin treatment in the HNPP mice (Figure 3c, d, e, f).

      Figure 4

      The improvement in the number of myelin segments following PTEN inhibition in Pmp22tg co-cultures is very weak. The 500 nM has instead a consistent effect in reducing myelin segments in the wildtype and I think that these results overall don't support the conclusion that myelination is ameliorated by reducing PTEN activity in Pmp22tg co-cultures.

      Response: We thank the Reviewer for this important point. We like to emphasize that we treated whole cultures with the PTEN inhibitor and we cannot rule out a (probably) negative effect on axonal PTEN, resulting in only weak improvement of myelination in PMP22tg cultures and strong effects also on the wildtype co-cultures. Therefore, we decided against a treatment of CMT1A models in vivo and further explored the effects of PTEN reduction specifically in Schwann cells using the genetic model as described Figure 5. The Reviewer made clear to us that this is inappropriately explained in the results section and we therefore adapted this in the manuscript on page 6:

      “Similarly, the prolonged inhibition of PTEN with VO-OHpic (for 14 days) caused a dosage-dependent reduction in myelinated segments in wildtype co-cultures (Figure 4c, Figure S2). The mechanism is currently unexplained but cannot rule out a negative effect of PTEN inhibition on DRG neurons and myelination.”

      Figure 5:

      • *A different model, the C61 mouse a Pmp22tg overexpressing PMP22 is used here (rather than the CMT1A rat). This should be explained in the results. Is also this model characterized by increased Pten levels in the nerve? And low Akt-mTOR activation for instance? * Response: We agree with the Reviewer that it has not been clear in the text why we changed here to the C61 mouse model. We clarified this in the Results section which now reads on page 6:

      “To reduce Pten function in CMT1A models also in vivo, we applied a genetic approach (Figure 5a). As the genetic tools to specifically target Schwann cells were only available in the mouse and not the rat, we used the C61 mouse model of CMT1A. We reduced PTEN by about 50% selectively in CMT1A Schwann cells by crossbreeding Pmp22 transgenic mice with floxed Pten and Dhh-cre mice, yielding PTENfl/+Dhhcre/+PMP22tg experimental mutants (Figure 5b). Western blot analyses of sciatic nerve lysates confirmed the increase of PTEN in PMP22tg mice and the reduction of PTEN in the double mutants (Figure 5c, d).”

      Moreover, regarding the PTEN expression we added Western Blot analysis and quantification in Figure 5c, d showing increased PTEN expression in the C61 mouse model of CMT1A and decreased PTEN in the PTENhKOxC61 double mutants. Further analysis of the downstream signaling is planned (see “planned revision”).

      • *PTEN, Akt-mTOR expression/activation levels should be checked biochemically also in this model. And quantified (panel c). * Response: We added an explanation for the use of the C61 mouse model (see point Figure 5.1 above). Moreover, we quantified the Western Blot analysis and added it in Figure 5d. The expression of PTEN was included in the Western Blot analysis (Figure 5c) showing increased PTEN expression also in the C61 mouse model. Further biochemical analysis of the C61 mouse model is planned (see “planned revision”).

      • *In panel d overactivation of mTOR (PS6 staining) in Schwann cells is not evident. * Response: We agree with the Reviewer that the way the image was displayed is not sufficient to show P-S6 activation in the double mutants. We have now split the image (Figure 5e) to better visualize the P-S6 staining alone compared to the co-staining with P0 (marker for compact myelin) and DAPI (nuclei). Further, we added staining of wildtype nerve. We hope this way the differences in P-S6 activation can be easier appreciated.

      Figure 6:

      *G-ratio analysis: which are the mean values (numbers) with SEM in the three groups analyzed wildtype, Pmp22tg and Pmp22tg; Pten fl/+; Dhh-Cre? *

      Response: We thank the Reviewer for pointing this out. We added the quantification of the mean g-ratios in Figure 6d, f.

      Figure 7:

      • *If more fibers are committed to myelinate in the double mutant as compared to the single Pmp22tg at P18 ,particularly, it is unclear why there is no difference in differentiation marker expression in Figure 7 (Oct6 and Hmgcr). * Response: We thank the reviewer for this comment. We do not necessarily expect to see a strong difference in the expression of differentiation markers given the mild increase in myelination in the double mutants. Similarly, we do not observe alterations in the expression of differentiation markers in HNPP mice, while these fibers produce more myelin. Therefore, we concluded that alterations in PTEN-PI3K/Akt/mTOR signaling do not influence differentiation in the mouse models while in the PMP22 overexpressing situation of CMT1A other mechanisms alter differentiation of the Schwann cells. We also note that experiments were performed at postnatal day 18 and we cannot rule out possible alterations in differentiation marker expression at earlier time points in development in the double mutants.

      • In conclusion, the correlation between PMP22 and PTEN is a potential interesting observation. However, in my opinion, experiments as shown don't support the conclusion that PMP22 controls PTEN expression level and activity, which is suggested at the basis of the pathogenesis of PMP22 dosage-related neuropathies. Response: Please also see section 1. In order to avoid any overstatement that "PMP22 controls PTEN expression level and activity", in our revised version we have clarified this point and changed the wording in the main text:

      "The mechanisms that link the abundance of PMP22 to that of PTEN are still unclear and we here neither show direct nor indirect control of PTEN expression by PMP22." (Page 8)

      Reviewer #2:

      1. Regarding in the Introduction: "...the molecular mechanisms causative for the abnormal myelination remain largely unknown and still no therapy is available." Suggest consider modifying to perhaps: '...no small molecule or pharmacological therapeutic intervention exist.' To say "no therapy" exist is 'myopic' and untrue.

      *Suggest adding question mark to end of sentence or changing ‘asked’ to “investigated” for following thought: “Here, we asked whether PI3K/Akt/mTOR signaling provides therefore a therapeutic target to treat the consequences of altered Pmp22 gene-dosage.” *

      Rather than attempt to establish PRIORITY perhaps ‘softening’ the INTRODUCTION concluding statement “Our results thus identify a potential pharmacological target for this inherited neuropathy.

      [This makes thePI3K/Akt/mTOR pathway a promising target for a preventive treatment of affected nerves also in human patients.] *Does this belong in RESULTS? Or rather DISCUSSION? *

      Response: We thank the Reviewer for the suggestions. We changed the sentences accordingly in the manuscript (1.: Page 3, Line 23; 2.: Page 3, Line 26; highlighted in green). Regarding point 3, we are convinced that identifying pharmacological targets for peripheral neuropathies should be given priority. Indeed, the aspect concerning point 4 is already highlighted in the discussion therefore we removed the sentence from the result section.

      Reviewer #3:

      *The abstract for this paper, however, presents a somewhat oversimplified conclusion that the PTEN pathway mainly modifies models of HNPP, where the paper clearly demonstrates that models of CMT1A are also affected by this same pathway. This should be clarified. *

      We agree with the Reviewer that the same pathway (PI3K/Akt/mTOR) also affects CMT1A, but it is of importance for us to highlight that the disease mechanisms are -at least partly- different between HNPP and CMT1A. This is supported by our observation that PTEN reduction in CMT1A only transiently improves myelination in vivo (Figure 6) and the persistent alteration of differentiation markers despite PTEN reduction, which is not observed in HNPP (Figure 7). For clarification we have altered the wording in the abstract which now reads: "In contrast, we found that CMT1A pathogenesis was only transiently ameliorated by altered PI3K/Akt/mTOR signaling, which drives radial but not longitudinal growth of peripheral myelin sheaths".

      3. Description of analyses that authors prefer not to carry out

      Reviewer #1:

      Figure 1:

      *Figure 1, Panel e: may be with this experiment the authors aim to suggest that Pten and Pmp22 are unlikely to interact directly or indirectly since Pten is cytosolic and Pmp22 myelin-membrane enriched. However, this myelin purification shows that Pmp22 as P0 expression levels are also abundant in the cytosol, may be also because P18 has been chosen as time point. What about a different type of membrane-cytosol fractionation experiment and/or another time point? *

      Response: We want to clarify that in this experiment not myelin and cytosol fractions were separated but myelin and whole sciatic nerve lysate (which is the input before isolation of the myelin fraction, called “lysate”). Therefore, the analysis aimed at showing an enrichment of PMP22 and P0 in the myelin fraction while PTEN and TUJ (as a control) are not, which makes it more unlikely for PTEN and PMP22 to interact directly. This experiment, together with the immunohistochemical analysis in Figure 1h should highlight the location of PMP22 and PTEN in the Schwann cell. Together with the newly suggested experiments of the Reviewer for Figure S1 (see planned Revision point 1) we do not see the need for extra membrane-cytosol fractionations and/ or another timepoint as the more detailed as the improved experiment on protein-protein interaction using nerve lysate (not only cell culture) is the experiment of choice to clarify whether we have a direct interaction or not.

      Regarding in vitro Schwann cell- DRG co-culture experiments:

      (Figure 2, Figure 4 and Figure 5e)

      1. *(Figure 2) For this experiment, pulse treatment may be beneficial rather than in continuous. Is Akt-mTOR phosphorylation-signaling increased also in Pmp22+/- co-cultures as in mutant nerves? Is the treatment reducing the overactivation? *
      2. *(Figure 4) Similarly to Figure 2, is PTEN level increased in Pmp22tg cultures along with Akt-mTOR downregulation? *
      3. *(Figure 5) Panel e: co-cultures are established using ex vivo Dhh-Cre recombination. The downregulation of Pten in the cultures should be documented. * Response: We agree with Reviewer #1 that a deeper analysis of the co-culture system regarding the downstream signaling of PTEN would increase the value of the experiments. Unfortunately, the experiments were designed in a very small scale with the intention of only evaluating myelin alterations on a histological level and we did have enough tissue to collect cells for deeper protein expression analysis. Moreover, we tried to use the co-culture system as a proof-of-principle experiment in parallel to our in vivo studies which we value more important due to the still quite artificial co-culture setup. We hope that the Reviewer can understand our approach with the focus we set on the in vivo work.

      Figure 3:

      1. *The RAPA treatment seems to increase Pten level in the mutant even above wildtype levels (panel b), which can result in decreased myelin thickness due to downregulation of Akt-mTOR. A different method to normalize expression levels should be used. * Response: Comparing the mean, relative expression levels resulting from our quantification as plotted in the graph (panel b) revealed no increase above wildtype level after Rapamycin treatment in the HNPP mouse. Further, we decided for whole protein staining as the superior approach to loading control because we have observed alterations in the expression of other frequently used “housekeepers” such as GAPDH, Actin and Vinculin in the CMT1A rodent models.

      *Panel c-e: Can these data also be reproduced in quadriceps nerves as tomacula are more prominent in these Pmp22+/- nerves showing less variability due to the prevalence of large caliber axons? *

      Response: Unfortunately, quadriceps nerves were not collected for histology in the experiment and therefore we cannot redo the quantification. Nevertheless, we agree that the quadriceps nerves have less variability than the sciatic nerve and will definitely include the tissue in our future experiments.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this paper the authors report a direct correlation between PMP22 and PTEN expression levels in the nerve of CMT mutants. In CMT1A Pmp22tg rat nerves, PTEN levels are increased, whereas in Pmp22+/- mutants, a model of the HNPP neuropathy, PTEN levels decrease. Consistent with this, Pmp22tg nerves display lower Akt phosphorylation and, viceversa, Pmp22+/- nerves have higher Akt phosphorylation. The authors lowered PTEN in the transgenic and inhibited mTOR using Rapamycin in the Pmp22+/- to support the functional relevance of the PMP22-PTEN correlation.

      I have major concerns on the data as shown, which, in my opinion, don't support the main conclusion of this paper. In more detail:

      Figure 1 Panel a: the decrease of Pten expression should be quantified with at least n=3 taking into account the variability among different samples at the different time points indicated (the same applies in panel b, even if here the increase of Pten expression level in Pmp22tg nerves is more evident) Panel a and b: the statement that Pten is more expressed at P18 at the peak of myelination in wildtype nerves is not supported by the blots as shown

      Figure S1, page 4: what does it mean "in line with this finding we were unable to detect protein-protein...". May be the authors meant: since there is a direct correlation between Pmp22 and Pten expression levels in the mutants, the authors explored the possibility of an interaction between the two. Regarding the co-IPs, in panel a, the co-IP at the endogenous level, the immunoprecipitation efficiency of PMP22 is very low. May be a pull-down experiment using either exogenous purified PMP22 or PTEN and nerve lysates can help to rule out the possibility of an interaction. The experiments in b, c are performed in overexpression in a heterologous system (293 cells).

      Panel e: may be with this experiment the authors aim to suggest that Pten and Pmp22 are unlikely to interact directly or indirectly since Pten is cytosolic and Pmp22 myelin-membrane enriched. However, this myelin purification shows that Pmp22 as P0 expression levels are also abundant in the cytosol, may be also because P18 has been chosen as time point. What about a different type of membrane-cytosol fractionation experiment and/or another time point?

      Page 4: "Taken together, Pmp22 dosage inversely correlates with the abundance of PTEN...": please revise this statement

      Additional experiments are needed to support the conclusion of Figure 1 that, in the two mutants, Pten levels reversely correlate with PI3K-Akt-mTOR pathway activation, which represents the rationale of all further experiments. For example, it should be shown systematically in both mutants both Akt and ERK phosphorylation levels (Akt at both T308 and S473), and mTOR activity read outs. In the previously published paper (Fledrich et al.) only increased Akt phosphorylation in Pmp22+/- nerves was reported, whereas Pmp22tg analysis was focused on the interdependence between Akt and ERK without exploring mTOR activation, which is relevant here.

      Figure 2 The aberrant myelin figures displayed are similar to myelin ovoids preceding degeneration rather than myelin outfoldings. It is also strange that these alterations are in the wildtype cultures treated with RAPA, that instead, in this system, has been reported to increase myelination as it improves protein homeostasis (autophagy, quality control, etc). Also for this experiment, pulse treatment may be beneficial rather than in continuous. Is Akt-mTOR phosphorylation-signaling increased also in Pmp22+/- co-cultures as in mutant nerves? Is the treatment reducing the overactivation?

      Figure 3 The RAPA treatment seems to increase Pten level in the mutant even above wildtype levels (panel b), which can result in decreased myelin thickness due to downregulation of Akt-mTOR. A different method to normalize expression levels should be used. Panel c-e: aberrant fibers should be normalized on total number of fibers and on the area, particularly because RAPA is used. Can these data also be reproduced in quadriceps nerves as tomacula are more prominent in these Pmp22+/- nerves showing less variability due to the prevalence of large caliber axons?

      Figure 4 A different model, the C61 mouse a Pmp22tg overexpressing PMP22 is used here (rather than the CMT1A rat). This should be explained in the results. Is also this model characterized by increased Pten levels in the nerve? And low Akt-mTOR activation for instance?

      The improvement in the number of myelin segments following PTEN inhibition in Pmp22tg co-cultures is very weak.. The 500 nM has instead a consistent effect in reducing myelin segments in the wildtype and I think that these results overall don't support the conclusion that myelination is ameliorated by reducing PTEN activity in Pmp22tg co-cultures. Similarly to Figure 2, is PTEN level increased in Pmp22tg cultures along with Akt-mTOR downregulation?

      Figure 5 As for Figure 4, the use of the mouse transgenic instead of the CMT1A rat should be specified and PTEN, Akt-mTOR expression/activation levels should be checked biochemically also in this model. And quantified (panel c). In panel d overactivation of mTOR (PS6 staining) in Schwann cells is not evident. Panel e: co-cultures are established using ex vivo Dhh-Cre recombination. The downregulation of Pten in the cultures should be documented. Pten Fl/+ Dhh-Cre cultures seem to have axonal fasciculation.

      Figure 6 G-ratio analysis: which are the mean values (numbers) with SEM in the three groups analyzed wildtype, Pmp22tg and Pmp22tg; Pten fl/+; Dhh-Cre? How is Akt-mTOR signaling in the double mutant as compared to Pmp22tg? Is that increased at P18? If more fibers are committed to myelinate in the double mutant as compared to the single Pmp22tg at P18 ,particularly, it is unclear why there is no difference in differentiation marker expression in Figure 7 (Oct6 and Hmgcr).

      Significance

      In conclusion, the correlation between PMP22 and PTEN is a potential interesting observation. However, in my opinion, experiments as shown don't support the conclusion that PMP22 controls PTEN expression level and activity, which is suggested at the basis of the pathogenesis of PMP22 dosage-related neuropathies.

    1. Author Response:

      The following is the authors' response to the original reviews.

      We are very glad that the editor and reviewers found our paper of broad interest to the community of population, evolutionary, and ecological genetics. We thank them for their positive feedback and insightful comments and suggestions. We have revised our manuscript to address some of the issues raised by the review. The main change we made was providing a detailed discussion of limitations of simulated genomes, focusing on considerations one needs to make when selecting a demographic model. This can be found in a new section “Limitations of simulated genomes” (pages 9-10). We made a few additional adjustments in other parts of the text based on the reviewers’ suggestions. They are all listed in the detailed point-by-point response to reviewers comments and questions below.

      Editor:

      1) It was noted that demographic models (or genomic parameters) that are inferred based on certain aspects of the genomic data (eg., site frequency spectrum, haplotype structure) may not recapitulate other aspects of the data. In other words, any inferred demographic models are expected to reliably reproduce only some aspects of the genetic variation data but not necessarily all. It would be helpful to emphasize this limitation in the manuscript and to include a table summarizing the types of variation that the demographic models for the catalogued species were based on.

      This is a very important point, which we addressed in the revision by adding a section entitled “Limitations of simulated genomes”. This section discusses the considerations that one should make when selecting an inferred demographic model to implement in simulation. This includes the samples used in analysis, the method used for inference, as well as various filters. In this section we also point to the documentation page of the stdpopsim catalog, which provides information about each demographic model that can help users decide whether it is appropriate for their needs. We decided not to summarize this information in a succinct table in the manuscript because it is not straightforward to summarize the strengths and potential limitations of each model in a table. Instead, we will expand the summary provided for each demographic model in the documentation page to provide additional information. See response to the second reviewer’s comment on this topic for more details.

      2) It will make stdpopsim more user-friendly to include an automated module that can visualize a demographic model given the corresponding parameters (or simulation scripts).

      As mentioned in the response to the first reviewer’s comment on this subject, the documentation page of the stdpopsim catalog provides a brief summary for each demographic model, including a graphical representation. See response below for more details.

      Reviewer #1:

      In the introduction, the authors cite numerous efforts to generate high-quality reference genomes. That's not an issue in itself, but leading with this might send the message to some readers that it is these reference genome efforts that are driving the need for population genomics analysis and simulation tools, which is not really the case - why not instead give some citation attention to actual population genomics projects aiming to address the types of evolutionary questions this paper is concerned with? The reference genome citations would fit better in the section dealing with reference genomes, where they already appear.

      Indeed, the desire to answer complex evolutionary questions is the main motivation for sequencing these genomes and also for generating realistic genome simulations. The reason we chose to lead with the genome-sequencing efforts is that high quality genome data is an important prerequisite for obtaining parameters for chromosome-scale simulations. So, with that perspective, these efforts which we cite are the driving force behind expansion of stdpopsim in the near future. Thus, we decided to leave these citations in the introduction. To balance things out, we now start the introduction with a statement about board questions in population genetics. Moreover, after we list the genome sequencing efforts, we added a list of specific types of questions that can be addressed by these newly emerging genomes, with relevant citations. The beginning of the introduction now reads:

      “Population genetics allows us to answer questions across scales from deep evolutionary time to ongoing ecological dynamics, and dramatic reductions in sequencing costs enable the generation of unprecedented amounts of genomic data that can be used to address these questions (Ellegren, 2014). Ongoing efforts to systematically sequence life on Earth by initiatives such as the Earth Biogenome (Lewin et al., 2022) and its affiliated project networks, such as Vertebrate Genomes (Rhie et al., 2021), 10,000 Plants (Cheng et al., 2018) and others (Darwin Tree of Life Project Consortium, 2022), are providing the backbone for enormous increases in the amount of population-level genomic data available for model and non-model species. These data are being used, among other things, in inference of population history and demographic parameters (Beichman et al., 2018), studying adaptive introgression (Gower et al., 2021), distinguishing adaptation from drift (e.g. Hsieh et al., 2021), and understanding the implications of deleterious variation in populations of conservation concern (e.g. Robinson et al., 2023).”

      Something that would be useful for the stdpopsim resource in general, though not necessarily something for the paper, would be some kind of more human-friendly representation of the demographic models implemented in the curated library. Perhaps I'm not looking in the right place, but as far as I can tell, if I want to study the curated demographic models, I need to go into the Python scripts on the stdpopsim GitHub page (e.g.

      https://github.com/popsim-consortium/stdpopsim/tree/main/stdpopsim/catalog/BosTau). Here the various parameters and demographic events are hard-coded into the scripts. To understand the model being implemented, one thus needs to go dig into these scripts - something which is not necessarily very accessible to all researchers. Visual representations, such as the one for Anopheles gambiae in Fig 2. in the paper, are more widely accessible. I wonder if such figures could be produced for all the curated models and included in the GitHub folders alongside the scripts, perhaps aided by an existing model visualization software such as POPdemog. Again, I would not suggest that this is necessary for the paper, but if practically feasible I think it would be a useful addition to the resource in the longer term.

      This is a very good point. The stdpopsim catalog actually has a documentation page that provides a brief summary for each demographic model, including a graphical representation. This graphical representation is generated using demesdraw applied to the demographic model object implemented in the code. Thus, potential users do not have to dig through the Python code to figure out the details of the demographic model. We used a similar approach to generate the image of the demographic history of A. gambiae for Fig. 2 of the paper. The documentation page is an important part of the stdpopsim catalog, and we now added a link to it in section “Data availability”, and we mention it in key places in the manuscript, such as the caption of Fig 2.

      Reviewer #2:

      An important update to the stdpopsim software is the capacity for researchers to annotate coding regions of the genome, permitting distributions of fitness effects and linked selection to be modeled. However, though this novel feature expands the breadth of processes that can be evaluated as well as is applicable to all species within the stdpopsim framework, the authors do not provide significant detail regarding this feature, stating that they will provide more details about it in a forthcoming publication. Compared to this feature, the additions of extra species, finite-site substitution models, and non-crossover recombination are more specialized updates to the software.

      It would be helpful to provide additional information regarding the coding annotation (and associated distribution of fitness effects and linked selection) that is implemented in the current version of stdpopsim, but will be detailed in a forthcoming paper. This is not to take away from the forthcoming paper, but I believe this is the most important update to the software, and the current manuscript only brushes over it.

      We agree that implementation of selection in simulations is a significant addition to stdpopsim. However, our intention in this manuscript is to focus on the separate effort we made in the last two years to expand the utility of stdpopsim to a more diverse set of species. We think the manuscript stands firmly even without discussing in detail the new features that allow modeling selection. The main reason we briefly mention these features in sections “Additions to stdpopsim” and “Basic setup for chromosome-level simulations” is because the released version of stdpopsim contains implemented DFEs for a few species, and we did not want to completely ignore this. We thus added a brief comment at the end of the “Basic setup” section (page 8) mentioning the three model species for which the stdpopsim catalog currently has annotations and implemented DFE models. We think that a more detailed description of how these features and how they should be used is best left to the manuscript that the PopSim community is currently writing (preprint expected later this year).

      When it comes to simulating realistic genomic data, the authors clearly lay out that parameters obtained from the literature must be compatible, such as the same recombination and mutation rates used to infer a demographic history should also be used within stdpopsim if employing that demographic history for simulation. This is a highly important point, which is often overlooked. However, it is also important that readers understand that depending on the method used to estimate the demographic history, different demographic models within stdpopsim may not reproduce certain patterns of genetic variation well. The authors do touch on this a bit, providing the example that a constant size demographic history will be unable to capture variation expected from recent size changes (e.g., excess of low-frequency alleles). However, depending on the data used to estimate a demographic history, certain types of variation may be unreliably modeled (Biechman et al. 2017; G3, 7:3605-3620). For example, if a site frequency spectrum method was used to estimate a demographic history, then the simulations under this model from y stdpopsim may not recapitulate the haplotype structure well in the observed species. Similarly, if a method such as PSMC applied to a single diploid genome was used to estimate a demographic history, then the simulations under this model from stdpopsim may not recapitulate the site frequency spectrum well in the observed species. Though the authors indicate that citations are given to each demographic model and model parameter for each species, this may not be sufficient for a novice researcher in this field to understand what forms of genomic variation the models may be capable of reliably producing. A potential worry is that the inclusion of a species within stdpopsim may serve as an endorsement to users regarding the available simulation models (though I understand this is not the case by the authors), and it would be helpful if users and readers were guided on the type of variation the models should be able to reliably reproduce for each species and demographic history available for each species. It would be helpful to include a table with types of observed variation that the current set of 21 species (and associated demographic histories) are likely and unlikely to recapitulate well.

      This is a very important point, which we now address in the section “Limitations of simulated genomes”, which we added to the manuscript. In this section, we expand on this topic and discuss various things that will affect the way simulated genomes reflect true sequence variation. This includes the choice of demographic inference method, but also the analyzed samples, and various filters. The main message of this section is that one should consider various things when deciding to implement a demographic model in simulation (or selecting a model among those implemented in stdpopsim). We also cite studies (including Beichman, et al. 2017), which compared different approaches to demography inference. However, we note that the conclusions of these comparisons are not as straightforward as the reviewer suggests. In particular, methods that make use of the site frequency spectrum (such as dadi) should be able to capture some aspects of haplotype structure, because this information is encoded in the demographic history. Furthermore, a demographic history inferred from a single genome (e.g., using PSMC) should do a reasonable job approximating some aspects of the site frequency spectrum. In other words, the aspects of genetic variation not modeled well by a given demographic inference method are not always predicted in a straightforward way. This is why we avoid summarizing this information in a table in the manuscript. The 2nd paragraph of the “Limitations of simulated genomes” section addresses some of these subtle considerations. In particular, we suggest that considering a demographic model for simulation requires some familiarity with the inference method and the way it was applied to data. Regarding the demographic models currently implemented in stdpopsim, we provide some information about each model in the documentation page of the catalog. When selecting a demographic model from the catalog, users should make use of this documentation to guide their decision. This is mentioned in the 3rd paragraph of the “Limitations of simulated genomes” section. Following-up on this issue, we intend to review the documentation and make sure it provides sufficient information for each demographic model. See this GitHub issue.

      Reviewer #3:

      - p5, 2nd paragraph: I think many Biologists, myself included, will think of horizontal gene transfer mostly as plasmids being transferred among bacteria and adding extra genetic material, not as homologous bacterial recombination. This made me confused about modelling horizontal gene transfer in the same way as gene conversion. It may be helpful for some readers if you specify that you are modelling this particular type of horizontal gene transfer. Some explanation along the lines of what is in Cury et al (2022) would be enough.

      This is a good point. We modified the text in that sentence in the 2nd paragraph on page 5 to clarify that we are modeling non-crossover homologous recombination, and not incorporation of exogenous DNA (e.g., via plasmid transfer). The relevant part of the text now says:

      “In bacteria and archaea, genetic material can be exchanged through horizontal gene transfer, which can add new genetic material (e.g., via the transfer of plasmids) or replace homologous sequences through homologous recombination (Thomas and Nielsen, 2005; Didelot and Maiden, 2010; Gophna and Altman-Price, 2022). However, the initial version of stdpopsim used crossover recombination to stand in for these processes. Although we cannot currently simulate varying gene content (as would be required to simulate the addition of new genetic material by horizontal gene transfer), the msprime and SLiM simulation engines now allow gene conversion, which has the same effect as non-crossover homologous recombination.

      Following (Cury et al., 2022), we use this to include non-crossover homologous recombination in bacterial and archaeal species.”

      - p5, 3rd paragraph: When you say gene conversion is turned off by default, you could refer to table 1 and briefly mention the consequence of ignoring gene conversion.

      We agree that it is important to note that avoiding to model gene conversion may lead to faulty lengths of shared haplotypes across individuals. This is implied by the statement we make in the beginning of the 3rd paragraph on page 5, where we lay out the motivation for modeling gene conversion in simulation. Following the reviewer’s suggestion, we now added a statement about this in the end of that paragraph:

      “Note that ignoring gene conversion may result in a slightly skewed distribution of shared haplotypes between individuals (see Table 1)”

      -  p7, item 1 and p9, 1st paragraph: I am not sure what you mean by genetic map here, can you define this term? I am not sure if it is synonymous with gene annotations, a recombination map, or something else. The linkage map doesn't seem to make sense to me here.

      The term ‘genetic map’ referred to the recombination map whenever it was used in the manuscript. To avoid any confusion, we now removed all mentions of ‘genetic map’, and use ‘recombination map’ instead. The recombination map is relevant in item 1 of page 7 because in species with poor assemblies you will not be able to reliably estimate recombination maps, making chromosome-scale simulations less effective. In the 1st paragraph of page 9, we discuss the issue of lifting over coordinates from one assembly to another, and if you have a recombination map estimated in one assembly, you might need to lift it over to another assembly to apply it in your simulation.

      -  Table 1, last row, middle column: when you say "simulated population", I think it is a bit ambiguous. You mean "the true population that we are trying to simulate", but could be read as "the population data that was generated by simulation". I would delete the word simulated here.

      What we mean here is that the selected effective population size should reflect the observed genetic diversity in real genomic data. We realize that the previous wording was confusing, and changed this to the following:

      “Set the effective population size (Ne) to a value that reflects the observed genetic diversity”

      -  Figure 2, and other places when you refer to mutation and recombination rate (eg p11, last paragraph), can you include the units (e.g. per base pair, per generation)?

      Throughout the manuscript, rates are always specified per base per generation. In Figure 2, this is specified in the caption (3rd line). We added units in other places in section “Examples of added species” on pages 12-13, where they were indeed missing.

      -  p11, "default effective population size": can you use a more descriptive word instead of the default? Maybe the historical average? Also, what is this value used for in the simulations when there is a demographic model specified (as in the case of Anopheles)?

      We think that “default effective population size” is the most appropriate term to use here, since we are referring to the parameter in the species model in stdpopsim. It is correct that the value of this parameter should reflect the historical average size in some sense, but it is really unclear what this should be in the case of a species like Bos taurus, which experienced a very dramatic bottleneck in the recent past. We address this subtle, yet important, issue in the sentence preceding this one. If a demographic model is specified in simulation, it overrides the default effective population size, and its value is ignored (which is why we refer to it as ‘default’). We added a short sentence clarifying this in the 2nd paragraph of the “Bos Taurus” section (now page 12).

      “Note that the default Ne is only used in simulation if a demographic model is not specified.”

      -  p8, when you say "Such simulations are useful for a number of purposes, but they cannot be used to model the influence of natural selection on patterns of genetic variation.": You may want to bring up the discussion that many of these neutral parameters taken from the literature could have been estimated assuming genome-wide neutrality, and thus ignoring the effect of background selection. Therefore the parameter values might reflect some effect of background selection that was unaccounted for during their estimation.

      This is an important subtle point, which we now address in the section “Limitations of simulated genomes”, which we added to the revised manuscript. In that section, we discuss various limitations of simulations, focusing on inferred demographic models. We address the potential influence of the segments selected for analysis toward the end of 2nd paragraph in that section (page 9):

      “... all methods assume that the input sequences are neutrally evolving. This implies that technical choices, such as the specific genomic segments analyzed and various filters, may also influence the inferred model and its ability to model observed genetic variation.”

      Interestingly, background selection in itself typically does not have a strong effect on the inferred model. This is something that is examined in the forthcoming publication that presents simulations with natural selection in stdpopsim.

      -  Why are some concepts written in bold (eg effective population size, demographic model)? Were you planning to make a vocabulary box? I think this is a good idea given that you are aiming for a public that can include people who are not very familiar with some population genetics concepts.

      In the “Examples of added species” section, we use boldface fonts to highlight the model parameters that were determined for each species. We added a statement clarifying this in the beginning of this section (page 11), and made sure that all the relevant parameters were consistently highlighted throughout this section. In other sections, we use boldface fonts only for titles. A few cases that did not conform to this rule were removed in the current version. We did not intend on adding a vocabulary box, but considered this when revising the manuscript, due to the reviewer’s suggestion. However, we found it difficult to converge on a small (yet comprehensive) set of terms with accurate and succinct definitions. We think that the important terms are adequately defined within the text of the manuscript, providing sufficient information also for readers who are not expert population geneticists.

      - p4, 2nd paragraph: Are these automated scripts that are used to compare models publicly available? If you are suggesting that people use this approach generally when coming up with a simulation model (p8, penultimate paragraph), it would be helpful to have access to these automated scripts.

      The scripts are part of the public stdpopsim repository on GitHub, and may be used by anyone. Some components of these scripts are more easy to apply in general, such as comparing a demographic model with one implemented separately by the reviewer. This step, for example, is achieved by application of the Demography.is_equivalent method in msprime. Other parts of the comparison depend on the specific structure of python objects used by stdpopsim, so they are not likely to be useful when implementing simulations outside the framework of stdpopsim.

      -  p9, 1st paragraph, and p.12 2nd paragraph: instead of adjusting the mutation rate to fit the demographic model (and using an old estimate of the mutation rate), would it be ok to adjust the demographic model to fit the new mutation rate? E.g. with a new mutation rate that is the double of a previous estimate, would it be ok to just divide Ne by 2 such that Ne*mu is constant (in a constant population size model)? I imagine this could get complicated with population size changes.

      In principle, this could be done if you were simulating neutrally evolving sequences (without modeling natural selection). Since the coalescence is scale-free, then you can scale down all population sizes and divergence times by a multiplicative factor, and scale up migration rates and the mutation rate by the same factor, and you get the exact same distribution over the output sequences. However, making sure you get the scaling right is tricky and is quite error-prone. Especially considering the fact that you have to do this every time the mutation rate of a species is updated. Moreover, once you start modeling natural selection, this scale-free property no longer holds. Thus, the simple solution we came up with in stdpopsim is to attach to each demographic model the mutation rate used in its inference.

    1. Author Response:

      The following is the authors' response to the original reviews.

      We sincerely thank all the editors and reviewers for taking the time to evaluate this study. Here is our point-by-point response to the reviewers’ comments and concerns.

      Reviewer #1 (Public Review):The study by Oikawa and colleagues demonstrates for the first time that a descending inhibitory pathway for nociception exists in non-mammalian organisms, such as Drosophila. This descending inhibitory pathway is mediated by a Drosophila neuropeptide called Drosulfakinin (DSK), which is homologous to mammalian cholecystokinin (CCK). The study creates and uses several Drosophila mutants to convincingly show that DSK negatively regulates nociception. They then use several sophisticated transgenic manipulations to demonstrate that a descending inhibitory pathway for nociception exists in Drosophila.

      […]

      Weaknesses:

      A minor weakness in the study is that it is unclear how DSK negatively regulates nociception. An earlier study at the Drosophila nmj shows that loss of DSK signaling impairs neurotransmission and synaptic growth. In the current study, loss of CCKLR-17D1 in Goro neurons seems to increase intracellular calcium levels in the presence of noxious heat. An interesting future study would be the examination of the underlying mechanisms for this increase in intracellular calcium.

      We thank the reviewer for the kind and very positive evaluation of our manuscript. We agree that this study has not elucidated the intracellular molecular pathway(s) downstream of CCKLR-17D1 that are involved in the regulation of the activity of Goro neurons, and we think that it would definitely be an interesting topic for future research.       

      Reviewer #1 (Recommendations For The Authors):

      The response latencies for the control yw larvae seem large, with many larvae appearing to be insensitive to the thermal stimulus. Is this just an effect of the yw genetic background? A brief discussion of this might be helpful.

      We thank the reviewer for pointing this out. We have also noticed that the yw control larvae tend to show longer response latencies than the other control strains, and in the revised manuscript, we have added the following sentence in the Result section (Lines 91–94):

      “We have noticed that the yw control strain, which was used by us to generate the dsk and receptor deletion mutants, showed relatively longer response latencies to the 42 °C probe compared to the other control strains used in this study. This may be attributed to the effect of the genetic background, although, presently, the cause for this difference is unknown.”

      Reviewer #2 (Public Review):_

      This is an exceptional study that provides conclusive evidence for the existence of a descending pathway from the brain that inhibits nociceptive behavioral outputs in larvae of Drosophila melanogaster. […] The study raises many interesting questions for future study such as what behavioral contexts might depend on this pathway. Using the CAMPARI approach, the authors do not find that the DSK neurons are activated in response to nociceptive input but instead suggest that these cells may be tonically active in gating nociception. Future studies may find contexts in which the output of the DSK neurons is inhibited to facilitate nociception, or contexts in which the cells are more active to inhibit nociception._

      Reviewer #2 (Recommendations For The Authors):I have no recommendations for the authors as this is a very complete and thoroughly executed study. The writing is crystal clear.

      We thank the reviewer for the kind and very positive evaluation of our manuscript. We are happy to know that our current manuscript was deemed to be clear and convincing by the reviewer.

      Reviewer #3 (Public Review):[…] Overall the authors use clean logic to establish a role for DSK and its receptor in regulating nociception. I have made a few suggestions that I believe would strengthen the manuscript as this is an important discovery.

      Major comments:

      1. It's not completely clear why the authors are staining animals with an FLRFa antibody. Can the authors stain WT and DSK KO animals with a DSK antibody? Also, can the authors show in supplemental what antigen the FLRFa antibody was raised against, and what part of that peptide sequence is retained in the DSK sequence? This overall seems like a weakness in the study that could be improved on in some way by using DSK-specific tools.

      We thank the reviewer for this query. We would like to clarify that we first tried the FLRFa antibody to visualize an RFamide-type neuropeptide other than DSK in Drosophila and found that the staining pattern is quite similar to that of anti-DSK, as shown by Nichols et al. [1]. According to the original paper describing the anti-FLRFa antisera [2] (already cited in the reviewed manuscript), the antigen used to raise it was the Phe-Met-Arg-Phe-NH2 peptide conjugated with succinylated thyroglobulin, and the study experimentally shows that the antibody well binds to peptides containing Met-Arg-Phe-NH2 or Leu-Arg-Phe-NH2 sequence and has 100% cross-reactivity to FLRFa. As DSK contains Met-Arg-Phe-NH2 sequence [3], the cross-reaction of this antibody to DSK is consistent with the description of the original study.    

      Although we were unable to use an antibody specific to DSK, our staining data with dsk deletion mutants and the expression pattern of DSK-2A-GAL4 corroborate each other (Figure 2 and Figure 2-figure supplement 1), which we believe provides compelling evidence for the specific expression of DSK in MP1 and Sv neurons, and for that DSK-2A-GAL4 is a reasonably effective tool to specifically manipulate DSK-expressing neurons.

      2. What is the phenotype of DSK-Gal4 x UAS-TET animals? They should be hyper-reactive. If it's lethal maybe try an inducible approach.

      We thank the reviewer for this question. Unfortunately, we have not attempted this experiment, although we agree that this would be a nice addition to further strengthen the study if TET worked well in the DSKergic neurons.

      3. Figure 9. This was not totally clear, but I think the authors were evaluating spontaneous (i.e. TRPA1-driven) rolling at 35C. The critical question is "does activating DSK-expressing neurons suppress acute heat nociception" and this hasn't really been addressed. The inclusion of PPK Gal4 + DSK Gal4 in the same animal kind of clouds the overall conclusions the reader can draw. The essential experiment is to express UAS-dTRPA1 in DSK-Gal4 or GORO-Gal4 cells, heat the animals to ~29C, and then test latency to a thermal heat probe (over a range of sub and noxious temperatures). Basically prove the model in Figure 10 showing ectopic activation or inhibition for each major step, then test heat probe responses.

      We thank the reviewer for suggesting ideas for alternative experiments to potentially strengthen our conclusion. Regarding experiments using heat probes, previous studies have demonstrated that (i) Blocking ppk1.9-GAL4-positive C4da neurons almost completely abolishes the larval nociceptive response to local heat stimulations [4]; (ii) Local heat stimuli above 39 °C readily activate C4da neurons and larval nociceptive rolling [5-9]; and (iii) Thermogenetically or optogenetically activating these neurons is sufficient to trigger Goro neurons and larval rolling [4, 10-12]. Thus, it has now been made clear that heat probes induce larval nociceptive rolling via excitation of the C4da pathway, and we believe that our experiments using thermogenetic activation of C4da neurons can be safely interpreted as an alternative to experiments using heat probes. Using heat probes demands a more complicated experimental set-up to be combined with CaMPARI imaging experiments, and this is another reason why we preferred to take the thermogenetic approach.

      We have also considered the experiment using Goro-GAL4 instead of ppk-GAL4. However, if dTRPA1 artificially activates Goro neurons far downstream of the neuronal mechanism by which MP1 activation suppresses Goro neuron activity, the effect of MP1 activation may be bypassed and masked. As we currently do not know the epistasis between dTRPA function and the effect of MP1 activation in modulating the activity of Goro neurons, we rather chose to activate C4da neurons by using ppk-GAL4, which likely resulted in more natural activation of Goro neurons than dTRPA1-triggered direct activations.

      4. It would also then be interesting to see how strong the descending inhibition circuit is in the context of UV burn. If this is a real descending circuit, it should presumably be able to override sensitization after injury.

      Indeed, this is an interesting avenue to explore in future studies to understand the type of situation in which the DSKergic descending system functions to control nociception.

      Reviewer #3 (Recommendations For The Authors):Overall this is a good story and the claims are generally supported with experimental evidence. The way to really improve this study would be to use more precise and definitive tools, like specific antibodies, specifically targeted genes, and better temporal control of the descending circuit to prove this is inducible sufficient to suppress acute thermal nociception and this occurs only via a descending pathway, etc. However this would be exponentially more work, and so the authors I guess need to weigh the cost-benefit of definitive proof vs. strong evidence for their claims. Overall I think this study will be the beginning of a new line of inquiry in the field that has the potential to guide our understanding also of mammalian descending pathways, and as such, this study is of value to the community.

      We appreciate the reviewer’s multiple interesting ideas for experiments that could have been performed to further reinforce our findings. We agree that some experiments that the reviewer suggested would potentially strengthen this work if supplemented. However, as aforementioned, in our humble opinion, we think that the experiments that the reviewer suggested are either outside the scope of this paper or have no significant benefits over the experiments that were already conducted, and hence are not essential to the present study.

      References

      1. Nichols, R. and I.A. Lim, Spatial and temporal immunocytochemical analysis of drosulfakinin (Dsk) gene products in the Drosophila melanogaster central nervous system. Cell Tissue Res, 1996. 283(1): p. 107-16.

      2. Marder, E., et al., Distribution and partial characterization of FMRFamide-like peptides in the stomatogastric nervous systems of the rock crab, Cancer borealis, and the spiny lobster, Panulirus interruptus. J Comp Neurol, 1987. 259(1): p. 150-63.

      3. Nassel, D.R. and M.J. Williams, Cholecystokinin-like peptide (DSK) in Drosophila, not only for satiety signaling. Front Endocrinol, 2014. 5.

      4. Hwang, R.Y., et al., Nociceptive neurons protect Drosophila larvae from parasitoid wasps. Curr Biol, 2007. 17(24): p. 2105-2116.

      5. Tracey, W.D., Jr., et al., painless, a Drosophila gene essential for nociception. Cell, 2003. 113(2): p. 261-73.

      6. Xiang, Y., et al., Light-avoidance-mediating photoreceptors tile the Drosophila larval body wall. Nature, 2010. 468(7326): p. 921-6.

      7. Burgos, A., et al., Nociceptive interneurons control modular motor pathways to promote escape behavior in Drosophila. eLife, 2018. 7.

      8. Honjo, K. and W.D. Tracey, Jr., BMP signaling downstream of the Highwire E3 ligase sensitizes nociceptors. PLoS Genet, 2018. 14(7): p. e1007464.

      9. Im, S.H., et al., Tachykinin acts upstream of autocrine Hedgehog signaling during nociceptive sensitization in Drosophila. eLife, 2015. 4: p. e10735.

      10. Ohyama, T., et al., A multilevel multimodal circuit enhances action selection in Drosophila. Nature, 2015. 520(7549): p. 633-9.

      11. Honjo, K., R.Y. Hwang, and W.D. Tracey, Jr., Optogenetic manipulation of neural circuits and behavior in Drosophila larvae. Nat Protoc, 2012. 7(8): p. 1470-8.

      12. Zhong, L., et al., Thermosensory and non-thermosensory isoforms of Drosophila melanogaster TRPA1 reveal heat sensor domains of a thermoTRP channel. Cell Rep, 2012. 1(1): p. 43-55.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Major points:

      1. Although the role of mitofusin on mitochondrial morphology has been established by others and comprehensively assessed in the present study, the authors should determine the functional outcome from the genetic manipulations on Mfn2 and Mfn1. As observed by increased glucose uptake, one could hypothesize an impairment in mitochondrial oxidative phosphorylation, leading the cells to rely uniquely or heavily on glycolysis as a fuel. Also, as mentioned by the authors in the discussion, ROS play a fundamental role in adipogenesis, and, therefore, mitochondrial ROS emission and/or cellular redox balance should also be assessed. I believe these two experiments will add insightful information to the current dataset.

      __Thank you for these suggestions. Whilst we agree with the general premise of this point, unfortunately quantifying oxidative phosphorylation and ROS production with sufficient precision to detect relatively subtle changes remains very challenging. We have attempted these experiments but they require considerable optimisation (particularly using adipocytes). Preliminary studies done in MEFs (Cover letter figure 1) suggest that under some stimuli there may be higher ROS in Mfn1 and Mfn2 knock-out lines. However this preliminary data would require further optimisation and repetition in adipocytes, which is more challenging. __

      For now, we have amended the Discussion to specify that these experiments are of particular interest.

      Cover letter figure 1. Levels of reactive oxygen species (ROS) in mouse embryonic fibroblasts measured by flow cytometry for fluorometric dyes CellROX (total cellular ROS), D2-HDCFA (total cellular ROS), and MitoSOX (mitochondrial ROS). Levels are expressed relative to wild-type. MEFs were treated with antimycin A (or media only) for 20minutes prior to incubation with the ROS dyes, then washed three times before assayed. AntA, Antimycin; CR, CellROX; M1, Mfn1-/- MEFs; M2, Mfn2-/- MEFs; MS, MitoSOX; WT, wild-type.

      The insulin effect on glucose uptake does not allow to conclude any impairment in insulin responsivity. The fold change of glucose uptake mediated by insulin was roughly 1.2 in undifferentiated adipocytes, 2.3 in differentiated WT, and 2.5 in Mfn1KO differentiated adipocytes. The absolute increase in glucose uptake could be a compensatory mechanism due to impairment in mitochondrial bioenergetics (see point #1), given that the cells can still respond to insulin. Measuring Akt phosphorylation levels following insulin treatment would help solve this issue.

      __As requested, we have assessed the effect of insulin treatment on Ser 473 phosphorylation of Akt2 (Pkb) in wild-type and knock-out MEFs differentiated into adipocytes (Fig 2D). Mfn1_-/-_ MEFs show an increase in Akt phosphorylation relative to the other cell lines. They also have higher expression of insulin receptor and Glut4, consistent with their degree of adipogenic differentiation. __

      We agree that impaired mitochondrial bioenergetics could account for the observations in perturbed glucose uptake in the knockout cell lines (especially Mfn2-null) and have therefore amended the text throughout to reflect this.

      Usually, working with clonal transgenic cells lines has the limitations that the cells might behave differently in terms of adipogenic potential over passages. A transient loss of function in the same cells would solve this concern. Also, introducing the patient mutations might be closer to the human situation than working with KO mouse fibroblasts.

      __We agree with this potential concern, which is why we conducted knock-down studies in 3T3-L1 cells in addition to the work in knockout MEFs. These data were concordant with what we observed in the KO MEFs so we don’t think it is necessary to conduct repeat KD experiments in WT MEFs. __

      In our previous study we observed that human fibroblasts with biallelic MFN2-R707W mutations did not have any obvious phenotype (____https://elifesciences.org/articles/23813____). We have separate work studying these mutations in vivo where we provide further characterisation of murine adipocytes harbouring Mfn2-R707W; this work is now published here: https://elifesciences.org/articles/82283

      Minor points:

      1. Although the authors mention in the introduction that the differentiation of adipocytes is followed by an increase in mitochondrial mass, it would be interesting the determine the expression profile of mfn1 and mfn2 during the differentiation process.

      We have found that there is an increase in markers of mitochondrial fusion (Mfn1 & Mfn2) as well as fission (Fis1) throughout differentiation of 3T3-L1s. ____We have included this data in the manuscript (Supplementary Figure ____6A ).

      The authors should discuss other models, even though pre-clinical, of mitochondrial dysfunction that results in lipodystrophy but with different metabolic outcomes. To cite a few but not only PMID: 29588285; PMID: 21368114; PMID: 31925461.

      Thank you for this suggestion. We have added a section on this in the introduction.

      It would be interesting to discuss the role of Mfn1/2 in the context of cold-induced adipogenesis, given the prominent role of mitochondrial dynamics, as mentioned by the authors in the reference list, on cold-induced adaptative thermogenesis (Mahdaviane et al. 2017; Boutant et al. 2017).

      Thank you for this suggestion. We have added a section on this in the introduction.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      • In Fig.2A, the authors report "increased lipid accumulation in Mfn1-/- MEFs, but not in Mfn2-/- MEFs". While the overall content might be similar, the pattern of lipid accumulation seems to be different. Indeed, differences in lipid droplet morphology have been observed in Mfn2 KO MEFs upon oleate treatment (McFie et al., 2016). The manuscript would benefit from having quantifications of lipid droplet size and number.

      Thank you for highlighting this. We have quantified lipid droplet size and, consistent with McFie et al have found increased size in Mfn2 knock-down. This data is now included in Supplementary Figure 6B.

      • Following the above point, McFie et al. also reported that Mfn1/Mfn2 double KO MEFs could differentiate into adipocytes. The authors should discuss these opposing observations. The contrasting observation may be due to acquired clonal differences in MEF lines. We have attempted ‘double’ knock down (of both Mfn1 and Mfn2 concurrently) in 3T3-L1 cells however this was essentially lethal and also did not generate any cells capable of differentiation. We have added a section in the Discussion regarding this point.

      • In relation to the effects of Mitofusin deletions on glucose uptake, the authors mention that Mfn2 KO MEFs show impaired insulin stimulated glucose uptake. The interpretation of the result is not straight forward, as basal glucose uptake is highly increased in Mfn2 KO MEFs. Maybe there is simply a treshold for maximal glucose uptake capacity in MEF-derived adipocytes. In any of these cases, the authors might want to check GLUT1 levels, in line of their suggestion that the increased basal glucose uptake might be related to higher GLUT1. Alternatively, the authors might also want to check elements of the insulin signaling path, in case there are alterations that could explain the phenomenon.

      As mentioned above in response to reviewer 1, we have now ____performed immunoblots to quantify some components of the insulin signalling cascade (Fig 2D). We observed lower expression of both Glut1 and Glut4 in the Mfn2-/- cells. Mfn2-/- cells did demonstrate some Akt phosphorylation but considerably less than Mfn1-/- cells. These results are now included in the revised manuscript (Figure 2D).

      • In line with the above point, one would have wished that mitochondrial biology was better characterized in the different MEF models. While mitochondrial shape analyses are provided, some information on, at least, mitochondrial respiratory capacity, glucose oxidation and/or fatty acid oxidation rates, would be important. This would allow for a more solid discussion on why Mfn2 KO MEFs display such high basal glucose uptake rates.

      We have responded to a similar suggestion from Reviewer 1, above.

      • In relation to the experiments in MEFs, one should never forget that WT, Mfn1 and Mfn2 KO MEFs derive from different mice. Hence, the phenotypes could be related to trait variabilities in the origin mice themselves, and not just the gene deletion. To control for this aspect, the authors could simply re-introduced Mfn1 or Mfn2 in their respective MEFs and evaluate if their alterations are normalized.

      __Yes one could try this but we have addressed this general concern by replicating the impact of Mfn1/2 KD in 3T3L1 cells so are not inclined to pursue this at this time. __

      • Transcriptomic analyses reveals a decrease in adipogenic gene expression in Mfn2 KO MEFs. However, lipid accumulation is comparable to WT MEFs is normal. This could be due to defects in lipolytic capacity, leading to similar lipid accumulation despite lower adipogenic capacity. This could be tested by evaluating the adrenergic response of these cells (e.g.: glycerol release).

      Thank you for this suggestion. We have commented in the Discussion to explain that we have not fully characterised this mechanism.

      • The experiments in 3T3-L1 would also benefit from some gene expression analyses to evaluate if Mfn1 depletion leads to acceleration and/or magnification of the differentiation stages. In relation to this, 3T3-L1 cells could be used to monitor Mfn1 and Mfn2 through differentiation, which in itself would be valuable information.

      We have performed a protein-level time course for markers of mitochondrial fusion (Mfn1 & Mfn2) as well as fission (Fis1) throughout differentiation of 3T3-L1s. We have included this data in the manuscript (Supplementary Figure 6A). We think that changes in protein expression are more relevant than changes in mRNA so have not included gene expression changes at this time.

      CROSS-CONSULTATION COMMENTS The comments from the three independent reviewers are extremely well aligned and agree that improving the following aspects could largely benefit the manuscript:

      • A better metabolic characterisation of the models used
      • Provide measurements in relation to mitochondrial bioenergetics and ROS production – we have attempted this but the data is not very clear in our view and warrants further optimisation which we are not inclined to pursue currently. - Explorations of insulin signaling - done thank-you.
      • Improve the validation and significance of the cellular models used, following the different suggestions from the three reviewers. Most notably, considering the introduction of human Mfn2 mutation forms – we have published a separate manuscript on follow up work on the human MFN2 variant as mentioned above.

      A number of additional comments are raised, all of which are very reasonable and, in my opinion, should not be difficult to address. I think we can all agree that a mechanistic underpinning of the observations would give a larger degree of novelty to the work. Also, none of us would like the revision's quality to be constraint by a tight deadline. I would therefore be totally OK to extend the timeframe for the revision beyond the original 3 months proposed.

      Reviewer #2 (Significance (Required)):

      This is an interesting and well-crafted manuscript. Mice deficient for Mfn2 or Mfn1 have been reported by different laboratories, yet most of them fail to explore the effects on early adipogenesis. The study is limited to cultured cells, but this is well acknowledged by the authors Given the existence of human mutations in the mitofusin-2 gene that largely alter fat mass distribution, this work provides new clues on how these mutations might impact adipose tissue.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Mann et al. The objective of this study is to determine the extent to which mitofusins (Mfn1 and Mfn2) have redundant functions and assess their contributions to adipocyte differentiation. While a point mutation in the Mfn2 gene has been associated with severe adipose tissue dysfunction and lipodystrophy, no disease phenotypes have been linked to mutations in Mfn1. To address these objectives, the authors sought to characterize how adipocyte differentiation and function is affected in Mfn1, Mfn2 or double knockout adipocytes in two distinct in vitro models. Their findings indicate divergent effects of Mfn1 and Mfn2 on adipocyte differentiation and function despite similar alterations to mitochondrial morphology. Loss of Mfn1 promotes adipogenesis while Mfn2 decreases it. The authors conclude that these findings are indicative of non-redundant functions in Mfn1 and Mfn2.

      Major comments: The observation that Mfn1 KO/KD leads to increased adipogenesis in vitro is somehow novel and, perhaps, surprising, as the author say. However, the molecular understanding underlying this phenotype remains unexplored. The analyses performed are mainly descriptive and don't dig deeper into the identification of the molecular mechanism. They do hypothesize that ROS production may be responsible for the observed effects, but that's how far they go.

      The authors do highlight the limitations of this work, but these limitations need careful consideration, for not addressing them seriously limits the novelty of this study, especially not testing these conditions in human cells. The current version of this work seems too preliminary to suggest useful experiments that could strengthen the study, since future analyses could take many different directions.

      Yes, we accept that the findings are rather preliminary but our initial efforts suggest that precisely elucidating the underlying mechanism/s is likely to be more difficult and complicated than alluded to by the reviewers. We would therefor prefer to share our initial observations so that others can also attempt to clarify the underlying mechanisms.

      A few unanswered questions that the authors might consider are: What is the difference between the Arg707Trp mutation and the KO/KD? Mfn1 and 2 deletions lead to fragmented mitochondria, but opposite adipogenic potentials. What other mitochondrial defects can explain it? Are organelle contact site disrupted only with Mfn2? How does Mfn1 and 2 KO/KD affect mitochondrial proteome? What does mitochondrial bioenergetics look like? How is ROS production affected? Is the increased glucose uptake (basal) a compensatory mechanism for mitochondrial dysfunction? Thank you for these suggestions. We acknowledge that this work is largely descriptive in nature. These are all questions that should be addressed to improve mechanistic understanding of our observations.

      __The difference between p.Arg707Trp and KO/KD is challenging to address because in the non-adipose cell lines studied so far (human and mouse fibroblasts) there has been no evidence of perturbation of the mitochondrial network. __

      As discussed above, we have done preliminary studies into ROS production but are unable to provide a complete characterisation at this time. Similarly, we have not been able to perform bioenergetic studies (e.g. Seahorse, Oxyboros) that would provide more insight into differences between Mfn1 and Mfn2 KO cell lines.

      CROSS-CONSULTATION COMMENTS I agree the work is interesting, but is too preliminary and merely descriptive. the experiments suggested will significantly improve the manuscript. However, I don't think they will take only three months to be completed. This work needs a significant amount of work including the study of the mechanism, at least an idea of what the mechanism could be, to be considered novel.

      We accept this limitation and have responded to this general point above.

      Reviewer #3 (Significance (Required)):

      Understanding how mitochondrial dynamics affect adipogenic differentiation is critical to better understand how metabolism impact cell signaling, cell fate and function.

      Strengths: this work reveals an interesting phenotype for Mfn1 and Mfn2 mutant preadipocytes. Weaknesses: this work is merely descriptive and preliminary to provide a clear understanding of the observed phenotypes

      Advance: Although the performed experiments are accurate, well designed, and well controlled, the fact that Mfn1 and 2 have distinct functions and cannot compensate for one another was already clear based on the embryonic lethality of either Mfn1 and Mfn2 KO mice as well as the Mfn2 mutation in humans that leads to a pathological condition.In the current verison, this work minimally contributes to advancing the field.

      Audience: an extensively revised version of this work including deeper phenotyping of thier models and human cell work would be of interest for sceintists studying mitchondrial biology, adipose tissue, metabolic diseases, and human genetic diseases.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to reviewers.

      We deeply thank the reviewers for the time spent on evaluating our manuscript as well as providing comments and suggestions to improve our study.

      __Reviewer #1 (Evidence, reproducibility and clarity (Required)): __

      *In this manuscript Lebdy et al. describe a new role of GNL3 in DNA replication. They show that GNL3 controls replication fork stability in response to replication stress and they propose this is due to the regulation of ORC2 and the licensing of origins of replication. Their data suggest that GNL3 regulates the sub nuclear localization of ORC2 to limit the number of licensed origins of replication and to prevent resection of DNA at stalled forks in the presence of replication stress.

      While many of the points of the manuscript are proven and well supported by the results, there are some experiments that could improve the quality and impact of the manuscript. The main issue is that the connection between the role of GNL3 in controlling ORC2, the firing of new origins and the protection of replication forks is not clearly established. At the moment the model relies on mainly correlative data. In order to further substantiate the model, we propose to address some of the following issues:*

      1. *The authors indicate that RPA and RAD51 accumulation at stalled forks is not affected by GNL3 depletion. These data should be included and other proteins should be analysed. In addition, the role of helicases could be explored through the depletion of the main helicases involved in the remodelling of the forks. * Response: As asked by the reviewer we will add the fractionation experiments that show that the level of RAD51 and RPA on chromatin is not affected by GNL3 depletion. So far, the other proteins we checked (RIF1 and BRCA1), both involved in nascent strand protection, did not show clear differences. Therefore, we concluded that depletion of GNL3 does not seem to affect the recruitment of major proteins required for protection of nascent DNA. Of course, we cannot exclude that other proteins may be affected by GNL3 depletion, but testing all the possible candidates would be time consuming with a very low chance of success. In addition, fractionation experiments are possibly not quantitative enough to uncover small differences and may be not that informative. Thus it remains possible that RPA exhaustion may be the cause of resection in absence of GNL3 as suggested by the work conducted in Lukas’ lab (Toledo et al. 2013. https://pubmed.ncbi.nlm.nih.gov/24267891/). To test this hypothesis, we will analyze if resection in absence of GNL3 is still occurring in a well-characterized cell line that overexpress the three RPA subunits that we obtained from Lukas’ lab.

      To our knowledge not many helicases have been shown to be involved in remodeling of stalled forks. The best example is RECQ1, however we feel that testing RECQ1 involvement in resection upon GNL3 depletion will complicate our story without adding much regarding the mechanism. We hope the reviewer understands our concern.

      • The proposed model implies that GNL3 depletion leads to increased origin licensing. FThe authors should address if the primary effect of GNL3 depletion is on origin firing by using CDC7 inhibition in the absence of stress (Rodríguez-Acebes et al., JBC 2018). *

      __Response: __This is an excellent point raised by the reviewer. To test if the primary effect of GNL3 depletion in on origin firing we will test if the defect in replication fork progression is dependent on CDC7 using DNA fibers experiments and CDC7 inhibitor.

      • A way to prove that origin firing mediates the effect of GNL3 on fork protection would be to reduce the number of available origins. The depletion of MCM complexes has been shown to limit the number of back-up origins that are licensed and leads to sensitivity to replication stress (Ibarra et al., PNAS 2008). If GNL3 depletion results in increased number of origins, this effect should be prevented by the partial depletion of MCM complexes. *

      __Response: __This is also an excellent point. We will test if MCM depletion decreases resection upon GNL3 depletion and treatment with HU. In addition, we will integrate in the manuscript experiments that we have done recently that show that treatment with roscovitine, a CDK inhibitor that impairs origin firing, decreases the level of resection observed in absence of GNL3. We think this experiment strengthens the results obtained with CDC7 inhibitors.

      *Alternatively, the authors could try to modulate the depletion of GNL3. Origin licensing takes place in the G1 phase and thus the depletion of GNL3 by siRNA could affect the following S phase. Using an inducible degron for GNL3 depletion would allow to deplete GNL3 in G1 or S phase specifically. If the model is correct, the removal of GNL3 in S phase should not affect fork protection but removing GNL3 in the previous G2/M phase should reduce the number of licensed origins and lead to impaired fork protection. *

      __Response: __This is obviously a good point given the fact that GNL3 deletion is not viable (see responses to reviewer 2). We tried to develop an auxin induced degron of GNL3, but we could not obtain homozygous clones, meaning that our clones had always an untagged GNL3 allele. Since GNL3 is essential its tagging may impair its function, explaining why we could not obtain homozygous clones. However, we are planning to optimize the design using other degrons system (for instance Halo-tag) to address the role of GNL3 specifically during S-phase. But we think this is above the scope of the present study.

      *In addition to the connection GNL3-origin firing-fork protection, it is unclear how the lack of GNL3 in the nucleolus and the change in the sub nuclear localization of ORC2 controls origin firing and resection. The strong interaction observed between GNL3-dB and ORC2, and the subsequent change in ORC2 localization does not explain how origin licensing can be affected. In this sense, the authors could address: *

      1. *Does the depletion of GNL3 and the expression of GNL3-dB affect the formation of the ORC complex, its subnuclear localization or its binding to chromatin? The authors have not explored if the interaction of GNL3 with ORC2 is established in the context of the ORC complex. An IF showing NOP1 with PLA data from GNL3-dB and ORC2 is needed to analyse how the expression of increasing amounts of GNL3-dB affects ORC2. * __Response: __We tested if GNL3 depletion impacts ORC2 and ORC1 recruitment on chromatin, but we could not observe significant differences. No clear differences were observed upon GNL3-dB expression either. One reason for this may be due to the excess of ORC complex on the chromatin, in addition chromatin fractionation is likely not sensitive enough to observe small differences. We think that quantitative ChIP-seq of ORC2 or other ORC subunits upon GNL3 depletion is required to visualize such differences, but this is above the scope of the study, and this constitutes the following of this project. We also tried to look at subnuclear localization of ORC2 using immunofluorescence, but the signal was not specific enough to observe differences. We think that the increased interaction (PLA) of ORC2 with GNL3-dB (Figure 5E) demonstrates a change in ORC2 subnuclear localization. To confirm this, we will perform the excellent experiment proposed by the reviewer to test if increasing level of GNL3-dB affects its interaction with ORC2 using PLA.

      We do not think that the interaction between ORC2 and GNL3 is established in the context of the ORC complex since only ORC2 (and not the other ORC) was significantly enriched in the GNL3 Bio-ID experiment. The full list of proteins from the Bio-ID experiment (Figure 4A) will be provided in the revised version. Therefore, we think that either GNL3 regulates ORC2 subnuclear localization that in turns impact the ORC complex or GNL3 regulates ORC2-specific functions. More and more evidences show that ORC2 plays roles possibly independently of the ORC complex (see Huang et al. 2016 https://doi.org/10.1016/j.celrep.2016.02.091 or Richards et al. 2022 https://doi.org/10.1016/j.celrep.2022.111590 for instance). Future work should uncover how these ORC2 functions may regulate origins activity.

      *In order to confirm if the mislocalization of ORC2 by the expression of GNL3-dB increases origin firing and mediates the effects on fork protection the authors could check DNA resection levels inhibiting CDC7 in high GNL3-dB conditions. Also, the levels of MCM2, phosphor-MCM2, CDC45, have not been analysed upon expression of GNL3-dB. *

      __Response: __This is a good point; we will test if the resection observed upon expression of GNL3-dB is dependent on origin firing using CDC7 inhibitor. We have not measured the level of the cited proteins but instead we performed DNA combing to measure Global Instant Fork Density. We now show that expression of GNL3-WT suppresses the increased origin firing observed upon GNL3 depletion, in contrast expression of GNL3-dB does not suppress it. This important result indicates that origin firing is increased upon GNL3-dB expression, providing a link between aberrant localization and increased firing. These data will be part of the revised version of the manuscript.

      The data in the paper suggest that GNL3 may affect the role of ORC2 in centromeres. Since depletion of GNL3 leads to increased levels of gH2AX, it would be interesting to address if this damage is due to incomplete replication in centromeres by analysing the co-localization of g*H2AX and centromeric markers both in unstressed conditions and upon the induction of replication stress. *

      __Response: __This is indeed and interesting comment, however since it has been previously shown that gH2AX signal is rather strong upon GNL3 depletion (see Lin et al. 2013. https://pubmed.ncbi.nlm.nih.gov/24610951/ ; Meng et al. 2013. https://pubmed.ncbi.nlm.nih.gov/23798389/) we do not think that co-localization experiments with CENP-A for instance will be informative given the high number of gH2AX foci.

      *Minor points: *

      1. In the initial esiRNA screen the basal levels of g*H2AX should also be shown. * Response: Our negative control is the transfection of an esiRNAs that targets EGFP (a gene that is not expressed in the tested cell line). This esiRNAs is ranked at the end of the list and therefore constitutes the basal level of gH2AX signal. In any case it is well-established that GNL3 depletion increases gH2AX signal (see Lin et al. 2013. https://pubmed.ncbi.nlm.nih.gov/24610951/ ; Meng et al. 2013. https://pubmed.ncbi.nlm.nih.gov/23798389/).

      *Figure EV1B: I think the rank needs another RS mark to see better the effect of each esiRNA on DNA lesions (high variability in all the conditions showed). *

      __Response: __We understand this issue, but we cannot repeat this set of experiments for technical reasons (reagents and cost mainly). Anyway, we believe that the role of GNL3 is response to replication stress is extensively addressed by other experiments of this manuscript.

      *Figure 1C and Figure EV1D/E: the quantification of the pCHK1/CHK1 levels could be included to show that there are no changes in phosphorylation upon GNL3 depletion. *

      Response: it is a good point; we will put quantification in the revised version.

      *In the first section of the results, at the end Figure 4B is incorrectly called for. *

      __Response: __Thanks for the comment, we will modify accordingly.

      The levels of GLN3 expression in 293 cells should be already included in section GNL3 interacts with ORC2.

      __Response: __We will add a figure that shows the level of expression in 293 cells.

      The full MS data needs to be included for both GNL3 and ORC2.

      __Response: __This will be integrated in the revised version.

      Figure 4B should be improved, since there is a faint band in the IgG mouse control.

      __Response: __it is true that the figure is not perfect, but we believed that our Bio-ID and PLA experiments fully demonstrate the interaction between GNL3 and ORC2.

      __Reviewer #1 (Significance (Required)): __

      *The work is nicely written, the figures are well presented and the experiments have the necessary controls. It provides relevant information to understand how replication stress is controlled and linked to replication fork protection through origin firing. These results are relevant to the field, linking GNL3 to origin firing and with potential to help understand the role of GNL3 in cancer. They provide new information and can give rise to new studies in the future. Many of the conclusions of the manuscript are well supported. Additional support for some of the main claims would strengthen the results and also increase the impact providing a bigger conceptual advance by performing some of the suggested experiments. *

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)): __

      *This manuscript explores the role of GNL3/nucleostemin in DNA replication and specifically in the response of DNA replication to DNA damage. GNL3 is a predominantly nucleolar protein, previously characterised as a GTP-binding protein and shown to be necessary for effective recruitment of the RAD51 recombinase to DNA breaks. The entry point for this report is a mini screen, based on proteins identified previously by the authors to associate with replication forks by iPOND, for factors that increase gamma-H2Ax (an indicator of DNA damage) after treatment with the Top1 inhibitor camptothecin (CPT). In this mini-screen GNL3 emerged as the top hit.

      The authors put forward the hypothesis that GNL3 is able to sequester the replication licensing factor ORC2 in the nucleolus and that failure of this mechanism leads to excessive origin firing and DNA resection following CPT treatment.*

      • The model put forward is interesting, but currently rather confusing. However, for the reasons upon which I expand below, I do not believe that the data provide a compelling mechanistic explanation for the effects that are reported and I am left not being certain about some of the links that are made between the various parts of the study, even though individual observations appear to be of good quality. *

      *Specific points: *

      *The knockdown of GNL3 is very incomplete. In this regard, the complementation experiments are welcome and important. However, is it an essential protein? Can it be simply deleted with CRISPR-Cas9?

      *__Response: __There are obviously variations between experiments but overall, the depletion of GNL3 using siRNA seems good in our opinion. Deletion of GNL3/nucleostemin leads to embryonic lethality in mouse (Beekman et al. 2006. https://pubmed.ncbi.nlm.nih.gov/17000755/ ; Zhu et al. 2006. https://pubmed.ncbi.nlm.nih.gov/17000763/). ES cells deleted for GNL3 can be obtain but do not proliferate probably because of their inability to enter in S-phase (Beekman et al. 2006. https://pubmed.ncbi.nlm.nih.gov/17000755/). We wanted to test if it was the case in our cellular model and we tried to delete it using CRISPR-Cas9. We managed to obtain few clones deleted for GNL3, but they grow really poorly prevented us to do experiments. To bypass this, and as suggested by the reviewer 1, we tried to make an auxin-induced degron of GNL3. Unfortunately, we did not manage to obtain homozygous clones, only heterozygous. One possibility could be that the tagging induced a partial loss of function of GNL3, and since GNL3 is essential, it may explain why we did not obtain homozygous clones. We may also want to use alternative degron systems such as Halo-Tag, but we believe this is out of the scope of the study.

      __ __*Global instant fork density is not quite the same as actually measuring origin firing. Ideally, it would be good to see some more direct evidence of addition origin firing e.g. by EdU-seq (Macheret & Halazonetis Nature 2018) but this would be quite a significant additional undertaking. However, given the authors have performed DNA combing with DNA counterstain, they should be able to provide accurate measurements of origin density and inter-origin distance. *

      __Response: __As indicated by the reviewer EdU-seq would need a lot of development since we are not using this approach in our team. In addition, this method can detect replication origins only if performed in the beginning of S-phase, meaning that only the early firing origins will be detected and not the others. GIFD measurement is actually directly linked with origin firing since it is counting the forks to duplicate the genome. The measurements of IODs have at least two main limitations: (1) there is a bias for short IODs due to the length of analyzed fibers and (2) it focuses only on origins within a cluster not globally. Overall, we believe that GIFD is the method of choice to measures origins firing. In addition, these experiments have been done by the lab of Etienne Schwob (see acknowledgments), a leader in the field.

      *'Replication stress' is induced with CPT. This term is frequently used to describe events that lead to helicase-polymerase uncoupling (e.g. O'Connor Mol Cell 2015) but that is not the case with CPT, which causes fork collapse and breaks. Are similar effects seen with e.g. UV or cisplatin? Additionally, a clear statement of the authors definition of replication stress would be welcome. *

      __Response: __We will better define the term ‘replication stress’ in the revised version of the manuscript. It should be understood, in our case, that any impediment that leads to replication fork stalling and measurable by DNA combing or Chk1 phosphorylation. We have not performed experiments using UV and cisplatin.

      *It is really not clear how the authors explain the link between potential changes in origin firing and resection. i.e. What is the relationship between global origin firing and resection at a particular fork, presumably broken by encounter with a CPT-arrested TOP1 complex. What is the link mechanistically? This link needs elaborating experimentally or clearly explaining based on prior literature. *

      • *__Response: __Most of our results on resection has been performed with hydroxyurea, but it is true that we saw resection in absence of GNL3 in response to CPT. Treatment with HU or CPT reduces fork speed and activates additional replication origins (see Ge et al. 2007 https://pubmed.ncbi.nlm.nih.gov/18079179/ for HU or Hayakawa et al. 2021 https://pubmed.ncbi.nlm.nih.gov/34818230/ for CPT ). When GNL3 is depleted, more forks are active, meaning more targets for HU and CPT. In addition, it is likely that the firing of additional origins in response to HU and CPT is stronger in absence of GNL3. Because of this we believe that factors required to protect stalled forks may be exhausted explaining why resection is observed. This is inspired by the work of Lukas’ lab (Toledo et al. 2013 https://pubmed.ncbi.nlm.nih.gov/24267891/) and is described in the figure 6. One obvious candidate that may be exhausted is RPA, to test this we will check if resection upon GNL3 depletion and treatment with HU is still occurring in cell lines provided by Lukas’ lab that overexpress RPA complex (described in Toledo et al.). We will explain our model more carefully in the revised version.

      *Related to this, I remain unconvinced that the experiments in Figure 3 show that the effects of ATRi and Wee1i on origin firing and on resection are contingent on each other. I do not believe that the authors have adequately supported the statement (end of pg 9) 'We conclude that the enhanced resection observed upon GNL3 depletion is a consequence of increased origin firing.' The link between origin firing and resection needs really needs further substantiation and / or explanation.

      *__Response: __Our rational was the following. Inhibition of ATR or WEE1 increase replication origin firing, a situation that may be like the one observed for GNL3 depletion. In Toledo et al, they show that inhibition of WEE1 or ATR induces exhaustion of RPA. This exhaustion is reduced in presence of CDC7 inhibitor, roscovitine (a CDK inhibitor that inhibits origin firing) or depletion of CDC45, indicating that this is due to excessive origin activation. In our case we show that the resection observed upon WEE1 or ATR inhibition is reduced upon treatment with CDC7 inhibitor. We conclude that excessive replication origin firing induces DNA resection. Since we observed the same thing upon GNL3 depletion (but not upon BRCA1 depletion) we conclude that excessive origin firing favors DNA resection likely through exhaustion of RPA. As indicated above we will test this hypothesis by overexpressing RPA. In addition, we now show that treatment with roscovitine decreases resection upon GNL3 depletion (this will be part of the revised manuscript), an experiment that we believe confirms that excessive replication origins firing is responsible for resection upon GNL3 depletion. As suggested by reviewer 1, we will also test if depletion of MCM also reduces resection observed in absence of GNL3.

      *It is not clear whether the binding of ORC2 to GNL3 also sequesters other components of the origin recognition complex? Does loss of the ability of GNL3 to bind ORC2 actually lead to more ORC bound to chromatin? How does GNL3 contribute to regulation of origin firing under normal conditions? Is it a quantitatively significant sink for ORC2 and what regulates ORC2 release? *

      Response: The results of GNL3 Bio-ID were extremely clear, we could not significantly detect any other ORC subunits than ORC2 (these data were not present in the manuscript but will be added in the revised version), therefore we believe that GNL3 may sequester/regulate only ORC2. We tried to see if GNL3 depletion was changing the binding of ORC1 and ORC2 to the chromatin, but we could not see any difference, one possibility may be that small differences are not detectable by chromatin fractionation. We believe that ChIP-seq or ORC2 or other ORC subunits in absence of GNL3 is required but this it out of the scope of the study. GNL3 may regulates the stability of the ORC complex on chromatin via ORC2 but GNL3 may also regulates other ORC2 functions, at centromeres for instance. It has been shown indeed that ORC2 plays roles possibly independently of the ORC complex (see Huang et al. 2016 https://doi.org/10.1016/j.celrep.2016.02.091 or Richards et al. 2022 https://doi.org/10.1016/j.celrep.2022.111590 for instance). How exactly this is affecting origin firing is still mysterious. This is something we are planning to address in the future.

      We do not know if it is a quantitatively sink for ORC2 or how this is regulated, however we believe that the ability of GNL3 to accumulate in the nucleolus may sequester ORC2. Consistent with this, we show that a mutant of GNL3 (GNL3-dB) that diffuses in the nucleoplasm interacts more with ORC2 in the nucleoplasm suggesting a release. As suggested by reviewer 1 we will now test if the interaction between ORC2 and GNL3-dB is dependent on the level of expression of GNL3-dB. In addition, we now show that expression of GNL3-dB increases replication origin firing like GNL3 depletion (data that will be added in the revised version), suggesting that regulation of ORC2 is the major cause of increased firing upon GNL3 depletion.

      *Minor points: *

      *All blots should include size markers *

      __Response: __We will add them

      *Some use of language is not sufficiently precise. For instance: ** - the meaning of 'DNA lesions' at the end of the first paragraph of the introduction needs to be more explicit. *

      * - the approach to measurement of these 'lesions' (monitoring gamma-H2Ax) needs to be spelled out explicitly, e.g. line 4 of the last paragraph of the introduction. *

      *

      • 'we observed that the interaction between GNL3-dB and ORC2 was stronger' ... I do not see how number of foci indicates necessarily the strength of an interaction. *

      * - in many places throughout 'replication origins firing' should be 'replication origin firing' (or 'firing of replication origins'). *

      __Response: __We will correct these language mistakes.

      __Reviewer #2 (Significance (Required)): __

      The model put forward here has the potential to shed light on an important facet of the cellular response to DNA damage, namely the control of origin firing in response to replication stress that will certainly be of interest to the DNA repair / replication community and possibly more widely. The roles of GNL3 are poorly understood and this study could improve this state of affairs. However, the gaps in the mechanism outlined above and somewhat confusing conclusions do limit the ability of the paper to achieve this at present.

      __Reviewer #3 (Evidence, reproducibility and clarity (Required)): __

      *In this study, Lebdy et al propose a new mechanism to regulate the resection of nascent DNA at stalled replication forks. The central element of this mechanism is nucleolar protein GNL3, whose downregulation with siRNA stimulates DNA resection in the presence of stress induced by HU (Figure 1). Resection depends on the activity of nucleases MRE11 and CtIP, and can be rescued by reintroducing exogenous GNL3 protein in the cells (Figure 1G). GNL3 downregulation decreases fork speed and increases origin activity, without any strong effect on replication timing (Figure 2). Inhibition of Dbf4-dependent kinase CDC7 (a known origin-activating factor) also restricts fork resection (Figure 3). GNL3 interacts with ORC2, one of the subunits of the origin recognition complex, preferentially in nucleolar structures (Figure 4). A mutant version of GNL3 (GNL3-dB) that is not sufficiently retained in the nucleoli fails to prevent fork resection as the WT protein (Figure 5). In the final model, the authors propose that GNL3 controls the levels of origin activity (and indirectly, stalled fork resection) by maintaining a fraction of ORC2 in the nucleoli (Figure 6). *

      This model is interesting and provocative, but it also relies on a significant degree of speculation. The authors are not trying to "oversell" their observations, because the Discussion section entertains different interpretations and possibilities, and the model itself contains several interrogative statements (e.g. "ORC2-dependent?"; "exhaustion of factors?").

      • While the article is honest about its own limitations, the major concern remains about its highly speculative nature. I have some questions and suggestions for the authors to consider that could contribute to test (and hopefully support) their model. *

      • *If GNL3 downregulation induces an excess of licensed origins and mild replicative stress resulting in some G2/M accumulation (Figure 2), what is the consequence of longer-term GNL3 ablation? Do the cells adapt, or do they accumulate signs of chromosomal instability? (micronuclei, chromosome breaks and fusions, etc) * __Response: __This is an important point also raised by Reviewer 2: deletion of GNL3 leads to embryonic lethality in mouse and ES cells deleted for GNL3 do not proliferate and fail to enter into S-phase. Consistent with this, the clones deleted for GNL3 that we obtained using CRISPR-Cas9 grow poorly, thus preventing us to do experiments. To our knowledge micronuclei and chromosome breaks have never been analyzed upon transient depletion of GNL3 using siRNA. However, it is well established that depletion of GNL3 induces phosphorylation of H2A.X) and the formation of ATR, RPA32 and 53BP1 foci due to S-phase arrest (Lin et al. 2013. https://pubmed.ncbi.nlm.nih.gov/24610951/ ; Meng et al. 2013. https://pubmed.ncbi.nlm.nih.gov/23798389/). DNA lesions have also been visualized by comet assay (Lin et al. 2019. https://pubmed.ncbi.nlm.nih.gov/30692636/). Consistent with this we observed a weak increased of DNA double-strand breaks upon GNL3 depletion using pulse-field gel electrophoresis as well as mitotic DNA synthesis (MiDAS). We can integrate this data in the revised version of the manuscript if required. To sum up, it is clear that GNL3 depletion is inducing problems during S-phase that may lead to possible genomic rearrangements.

      • The model relies on the link between origin activity and stalled fork resection that is almost exclusively based on the results obtained with CDC7i (Figure 3). But CDC7 has other targets besides pre-RC components at the origins, such as Exo1 (from the Weinreich lab, cited in the study), MERIT40 and PDS5B (from the Jallepalli lab, also cited). The effect of CDC7i could be exerted through these factors, which are linked to fork stability and DNA resection. The loss of BRCA1 (Figure 3F) could somehow entail the loss of control over these factors. Could the authors check the possible participation of these proteins?*

      __Response: __It is true that CDC7 has other targets than pre-RC components. We therefore decided to inhibit origin firing using roscovitine, a broad CDK inhibitor, a strategy previously used in Lukas lab (Toledo et al. 2013. https://pubmed.ncbi.nlm.nih.gov/24267891/). We observed that treatment with roscovitine decreased significantly resection observed upon GNL3 depletion, confirming the link between origin activity and stalled fork resection. This will be integrated in the revised version of the manuscript. As asked by Reviewer 1, we will also perform depletion of MCM to strength our model.

      Exo1 is indeed a target of CDC7 as shown by the Weinreich lab (Sasi et al. 2018. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6111017/) however the authors do not formally demonstrate that Exo1 phosphorylation is required for its activity. We observed that depletion of Exo1 significantly reduced resection upon GNL3 depletion (data that will be added in the revised version), indicating that the effect of CDC7 inhibitor could be exerted via the control of Exo1. This is why our BRCA1 control is important, it is well stablished that Exo1 is required for nascent strand degradation upon BRCA1 depletion (Lemaçon et al. 2017. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5643552/) but CDC7 inhibition has no effect on resection upon BRCA1 depletion suggesting that resection by Exo1 may not be regulated by CDC7 in our context.

      As stated by the reviewer MERIT40 and PDS5B are targets of DDK kinases (Jones et al. 2021 https://doi-org.insb.bib.cnrs.fr/10.1016/j.molcel.2021.01.004) and seem to be required for protection of nascent DNA and in response to HU. However, little is known about the role(s) of these proteins and we think that adding them will complicate message. We hope the reviewer understands this.

      The model also relies on the fact that GNL3-dB mutant (not retained in the nucleoli) is not sufficient to counteract fork resection induced by HU (Figure 5G). The authors should test directly whether GNL3-dB induces extra origin activation, using their available DNA fibers-based technique.

      __Response: __This is an excellent point. We have now GIFD (Global Instant Fork Density) data that shows that the number of active forks is increased upon dB GNL3-dB expression. It demonstrates that when GNL3 is no longer retained in the nucleolus more origins are active. These data will be integrated in the revised version of the manuscript, and we believe further support the regulation of ORC2 by GNL3.

      *Finally, the model implies an exquisite regulation of the amount of ORC2 protein, which could influence the number of active origins and the extent of fork resection in case of stress. In this scenario, one could predict that ORC2 ectopic expression would have similar, or even stronger effects, than GNL3 downregulation. Is this the case? *

      __Response: __We completely agree with this prediction. However, we are afraid that overexpression of ORC2 may have indirect effects due to the many described functions of ORC2, therefore it may be difficult to interpret the data. We will give a try anyway.

      *Even if the connection between origins and fork resection could be firmly established, the molecular link between them remains enigmatic. The authors hint (as "data not shown") that it is neither mediated by RPA nor RAD51. Unfortunately, the reader is left without a clear hypothesis about this point. *

      __Response: __We will add data that show that RPA and RAD51 recruitment is not affected by GNL3 depletion. However, the sensitivity of chromatin fractionation approach may be too weak to detect low differences. Based on the work of Lukas Lab (Toledo et al. 2013 https://pubmed.ncbi.nlm.nih.gov/24267891/) one possible mechanism may be exhaustion of the pool of RPA. This may link the excessive activation of origins observed upon GNL3 depletion and resection. To test this, we will check if resection upon GNL3 depletion and treatment with HU is still occurring in cell lines that overexpress RPA complex (described in Toledo et al.) that we obtained from Lukas’ lab.

      __ __ **Referees cross-commenting**

      __ __In addition to each reviewer's more specific comments, the three reviews share a main criticism: the lack of mechanistic information about the proposed link between origin activity and resection of nascent DNA at stalled forks.

      __Reviewer #3 (Significance (Required)): __

      In principle, this study would appeal to the readership interested in fundamental mechanisms of DNA replication and the cellular responses to replicative stress.

      For the reasons outlined in the previous section, I believe that in its current version the study is not strong enough to provide a new paradigm about origins being regulated by partial ORC2 sequestering at nucleoli. The other potentially interesting advance is the connection between frequency of origin activity and the extent of nascent DNA resection at stalled forks, but the molecular link between both remains unknown.


    1. All art is quite useless.

      From ELIMIMIAN 626: By saying "All art is quite useless," Wilde is not only using paradox or a rhetorical style, but enforces the concept that contraries do not imply a negation. By claiming "Vice and Virtue" form suitable "material for the artist," Wilde encourages us, as readers, to think that the realm of artistic discourse is limitless. Art enables us to view life clearly, but art, by itself, is incapable of any permanent rendering. Similarly, when we interpret art, we interpret life, since life's colors are enshrined in it. Thus, Wilde would argue Art and life are not contingent but may complement each other.

      From WINWAR 171: This line struck right in the face of Victorian materialism and was coined indiscriminately; Its defenders interpreted it as an exaltation of art, but its opposers claimed it was dangerous to established values.

      From DUGGAN 61: In this one sentence, Wilde encapsulates the complete principles of the Aesthetic Movement popular in Victorian England. That is to say, real art takes no part in molding the social or moral identities of society, nor should it. Art should be beautiful and pleasure its observer, but to imply further-reaching influence would be a mistake.

    1. Reviewer #2 (Public Review):

      The authors had two aims in this study. First, to develop a tool that lets them quantify the synaptic strength and sign of upstream neurons in a large network of cultured neurons. Second, they aimed at disentangling the contributions of excitatory and inhibitory inputs to spike generation.

      For the quantification of synaptic currents, their methods allows them to quantify excitatory and inhibitory currents simultaneously, as the sign of the current is determined by the neuron identity in the high-density extracellular recording. They further made sure that their method works for nonstationary firing rates, and they did a simulation to characterize what kind of connections their analysis does not capture. They did not include the possibility of (dendritic) nonlinearities or gap junctions or any kind of homeostatic processes. I see a clear weakness in the way that they quantify their goodness of fit, as they only report the explained variance, while their data are quite nonstationary. It could help to partition the explained variance into frequency bands, to at least separate the effects of a bias in baseline, the (around 100 Hz) band of synaptic frequencies and whatever high-frequency observation noise there may be. Another weak point is their explanation of unexplained variance by potential activation of extrasynaptic receptors without providing evidence. Given that these cultures are not a tissue and diffusion should be really high, this idea could easily be tested by adding a tiny amount of glutamate to the culture media.

      For the contributions of excitation and inhibition to neuronal spiking, the authors found a clear reduction of inhibitory inputs and increase of excitation associated with spiking when averaging across many spikes. And interestingly, the inhibition shows a reversal right after a spike and the timescale is faster during higher network activity. While these findings are great and provide further support that their method is working, they stop at this exciting point where I would really have liked to see more detail. A concern, of course is that the network bursts in cultures are quite stereotypical, and that might cause averages across many bursts to show strange behaviour. So what I am missing here is a reference or baseline or null hypothesis. How does it look when using inputs from neurons that are not connected? And then, it looks like the E/(E+I) curve has lots of peaks of similar amplitude (that could be quantified...), so why does the neuron spike where it does? If I would compare to the peak (of similar amplitude) right before or right after (as a reference) are there some systematic changes? Is maybe the inhibition merely defining some general scaffold where spikes can happen and the excitation causes the spike as spiking is more irregular?<br /> The averaged trace reveals a different timescale for high and low activity states. But does that reflect a superposition of EPSCs in a single trial or rather a different jittering of a single EPSC across trials? For answering this question, it would be good to know the variance (and whether/ how much it changes over time). Maybe not all spikes are preceded by a decrease in inhibition. Could you quantitify (correlate, scatterplot?) how exactly excitation and inhibition contributions relate for single postsynaptic spikes (or single postsynaptic non-spikes)? After all, this would be the kind of detail that requires the large amount of data that this study provides.

      For the first part, the authors achieved their goal in developing a tool to study synaptic inputs driving subthreshold activity at the soma, and characterizing such connections. For the second part, they found an effect of EPSCs on firing, but they barely did any quantification of its relevance due to the lack of a reference.

      With the availability of Neuropixels probes, there is certainly use for their tool in in vivo applications, and their statistical analysis provides a reference for future studies.<br /> The relevance of excitatory and inhibitory currents on spiking remains to be seen in an updated version of the manuscript.

      I feel that specifically Figures 6 and 7 lack relevant detail and a consistent representation that would allow the reader to establish links between the different panels. The analysis shows very detailed examples, but then jumps into analyses that show population averages over averaged responses, losing or ignoring the variability across trials. In addition, while their results themselves pass a statistical test, it is crucial to establish some measure of how relevant these results are. For that, I would really want to know how much spiking would actually be restricted by the constraints that would be posed by these results, i.e. would this be reflected in tiny changes in spiking probabilities, or are there times when spiking probabilities are necessarily high, or do we see times when we would almost certainly get a spike, but neurons can fire during other times as well.<br /> I would agree that a detailed, quantitative analysis of this question is beyond the scope of this paper, but a qualitative analysis is feasible and should be done. In the following, I am detailing what I would consider necessary to be done about these two Figures:

      Figure 6C is indeed great, though I don't see why the authors would characterize synchrony as low. When comparing with Figure 4B, I'd think that some of these values are quite high. And it wouldn't help me to imagine error bars in panel 6D.<br /> Figure 6B is useful, but could be done better: The autocovariance of a shotnoise process is a convolution of the autocovariance of underlying point process and the autocovariance of the EPSC kernel. So one would want to separate those to obtain a better temporal resolution. But a shotnoise process has well defined peaks, and the time of these local maxima can be estimated quite precisely. Now if I would do a peak triggered average instead of the full convolution, I would do half of the deconvolution and obtain a temporally asymmetric curve of what is expected to happen around an EPSC. Importantly, one could directly see expected excitation after inhibition or expected inhibition after excitation, and this visualization could be much better and more intuitively compared to panel 6E.<br /> Panel D needs some variability estimate (i.e. standard deviation or interquartile range or even a probability density) for those traces.<br /> Figure 6E: Please use more visible colors. A sensitivity analysis to see traces for 2E/(2E+I) and E/(E+2I) would be great.<br /> Figure 6F: with an updated panel B, we should be able to have a slope for average inhibition after excitation for each of these cells. A second panel / third column showing those slopes would be of interest. It would serve as a reference for what could be expected from E-I interactions alone.<br /> Figure 6G: Could the authors provide an interquartile range here?

      Figure 7A: it may be hard to squeeze in variability estimates here, but the information on whether and how much variance might be explained is essential. Maybe add another panel to provide a variability estimate? The variability estimate in panel 7B and 7D only reflect variability across connections, and it would be useful to add panels for the timecourses of the variability of g (or E/(E+I) respectively).

      As a suggestion for further analysis, though I am well aware that this is likely beyond the scope of this manuscript, I'd suggest the following analysis:<br /> I would split the data into the high and low activity states. Then I would compute the average of E/(E+I) values for spikes. Assuming that spikes tend to happen for local maxima of E/(E+I) I would find local maxima for periods without spike such that their average is equal to the value for actual spikes. Finally, I would test for a systematic difference in either excitation or inhibition.<br /> If there is no difference, you can make the claim that synaptic input does not guarantee a spike, and compare to a global average of E/(E+I).

    1. Author Response

      Reviewer #1 (Public Review):

      First, we thank the reviewer for his instructive remarks. In the following we address the queries of Reviewer 1.

      1.1) At several points, the authors make claims that I believe extend beyond the data presented here. For instance, in the Abstract (line 27), the authors state "the development of adult songs requires restructuring the entire HVC, including most HVC cell types, rather than altering only neuronal subpopulations or cellular components." The gene ontology analyses performed do suggest that there is a progression from cellular transcriptional changes to organ-level changes, however caution should be taken in claiming that "most HVC cell types" exhibit transcriptional changes. In fact, according to Fig. 3D most of the transcriptional changes appear restricted to neurons. As the authors themselves note elsewhere, claims at this resolution are difficult without support from single-cell approaches. I do not suggest that the authors need to perform single-cell RNA-seq for this work, but strong claims like this should be avoided.

      We have revised our claim to more accurately reflect our findings. Our intended message is that testosterone treatment leads to extensive transcriptional changes in the HVC, likely affecting a majority of neuronal subpopulations rather than solely targeting specific cellular components. The revised text in lines 29-32 now reads: "Thus, the development of adult songs stimulated by testosterone results in widespread transcriptional changes in the HVC, potentially affecting a majority of neuronal subpopulations, rather than altering only specific cellular components."

      1.2) Similarly the Abstract states that parallel regulation "directly" by androgen and estrogen receptors, as well as the transcription factor SP8, "lead" to the transcriptional and neural changes observed after testosterone treatment of females. However, experiments that demonstrate such a causal role have not been performed. The authors do perform a set of bioinformatic analyses that point in this direction - enrichment of androgen and estrogen receptor binding sites in the promoters of differentially expressed genes, high coexpression of SP8 with other genes, and the enrichment of predicted SP8 binding sites in coexpressed genes. However, further support for direct regulation, at the level that the authors claim, would require some form of transcription factor binding assay, e.g. ChIP-seq or CUT&RUN. I am fully aware that these assays are enormously challenging to perform in this system (and again I don’t suggest that these experiments need to be done for this work); however, statements of direct regulation should be tempered. This is especially true for the role of SP8. This does appear to be a compelling target, but without some manipulation of the activity of SP8 (e.g. through knockdowns) and subsequent analysis of gene expression, it is too much to claim that this transcription factor is a regulatory link in the testosterone-driven responses. SP8 does appear to be a highly connected hub gene in correlation network analysis, but this alone does not indicate that it acts as a hub transcription factor in a gene regulatory network.

      We appreciate the reviewer's comment and have revised the statement concerning the role of SP8. Indeed, we document the coexpression of ESR2 and SP8, and our bioinformatics analysis suggests that SP8 might play an important role in transcriptomics. We have rephrased the statement in line 29-32 as follows: "Parallel gene regulation directly by androgen and estrogen receptors, potentially amplified by coexpressed transcription factors that are themselves steroid receptor regulated, leads to substantial transcriptomic and neural changes in specific behavior-controlling brain areas, resulting in the gradual seasonal occurrence of singing behavior." In addition, we have included discussions regarding limitations of promoter sequence analyses (lines 414 to 427).

      1.3. Along these lines, the in-situ hybridizations of ESR2 and SP8 presented in Figure 5 need significant improvement. The signals in the red and green channels, SP8 and ESR2, look suspiciously similar, showing almost identical subcellular colocalization. This signal pattern usually suggests bleed-through during image acquisition, as it’s highly unlikely that the mRNA of both genes would show this degree of overlap. I would suggest that control ISHs be run with one probe left out, either SP8 or ESR2, and compare these ISHs with the dual label ISHs to determine if signal intensity and cellular distribution look similar. Furthermore, on lines 354-356 the authors write, "The fact that the two genes were expressed nearby in the same cell may indicate physical interactions between the gene pair and warrant further investigation into the nature of their relationship.". Yet, even if the overlap between ESR2 and SP8 shown in Figure 5 is confirmed, close localization of transcripts does not imply that the protein products physically interact. The STRING bioinformatic analysis is more convincing that there is a putative regulatory interaction between ESR2 and the SP8 locus, and this suggestion of protein-protein interaction is weak and should be omitted. In addition, the authors note that ESR2 has not been detected in the songbird HVC in a previous study. To further demonstrate the expression of ESR2 (and SP8) in HVC, it would be useful to plot their expression from the microarray data across the different testosterone conditions.

      We repeated the coexpression study using confocal microscopy and fluorescent RNAScope in situ hybridization, which is now reflected in the revised Figure 5 and a new Figure 5 - Supplement Figure 1. We have also moderated our statement regarding the sparse co-expression of ESR2 and SP8 in HVC neurons. While the presence of co-expressing neurons may provide some anatomical basis for the bioinformatic findings, we have been cautious in our interpretation and have stated that "SP8 and ESR2 mRNAs exhibited low expression levels in HVC, co-localizing in a subset of cells, predominantly GABAergic cells" (lines 369-370). We have removed the speculation about potential protein interaction based on mRNA distribution. Additionally, we have highlighted that SP8 and ESR2 were differentially upregulated at T14d (lines 362-363).

      1.4) My final concern lies in the interpretation of these results as generalizable to other sex hormone-modualated behaviors. On lines 452-455, the authors write, "This suggests that the testosterone (or estrogen)-triggered induction of adult behaviors, such as parental behavior and courtship, requires a much more extensive reorganization of the transcriptome and the associated biological functions of the brain areas involved than previously thought.". The experiments and argument likely apply to other neural systems to undergo large seasonal fluctuations in sex hormones and similar morphological changes. However, the authors argue that the large number of transcriptional changes seen here may generalize broadly to sex hormone modulated adult behaviors. I think there are a couple of problems with this argument. First, as described here and in past work, testosterone drives major morphological changes the song system of adult canaries; such dramatic changes are not seen for instance in sex hormone-receptive areas underlying mating behavior in adult mammals. Similarly, the study introduced testosterone into female birds which drives a greater morphological change in HVC relative to similar manipulations in males, which again may account for the large number of differentially expressed genes. I would temper the generality of these results and note how the experimental and biological differences between this system and other sex hormone-responsive systems and behaviors may contribute to the observed transcriptional differences.

      We modified this statement in lines 473-478: “The testosterone-driven changes in female HVC morphology and function represent some of the most notable modifications known in the vertebrate brain. However, how this extensive, testosterone-induced gene regulation in the HVC applies to other seasonally testosterone-sensitive brain areas remains to be seen. Endpoint analysis of testosterone-induced singing in male canaries during the non-reproductive season also indicates considerable regulation of HVC transcriptomes (Frankl-Vilches et al., 2015; Ko et al., 2021)”.

      Reviewer #2 (Public Review):

      First, we would like to express our gratitude to Reviewer #2 for the constructive feedback. We have addressed the concerns in detail below:

      2.1). The bulk of the manuscript details WGCNA, GO terms, and promoter ARE/ERE motif abundance, using the initial pairwise comparisons for each timepoint as input lists. However, there are no p/adjp values provided for these pair-wise comparisons that form the basis of all subsequent analyses. Nor are there supplementary tables to indicate how consistent the replicates are within each group or how abundantly the genes-of-interest are expressed. With the statistical tests used here, and the lack of relevant information in the supplementary tables, I cannot determine if the data support the authors’ conclusions. These omissions mar what is otherwise a conceptually intriguing line of investigation.

      We appreciate the reviewer’s concerns. Please refer to our response addressing this point and the subsequent one (2.2) together in the section below.

      Reviewer #3 (Public Review):

      We appreciate the positive feedback from the reviewer and below addressed the issues pointed out by the reviewer.

      3.1) My biggest concern is the sample size. Most of the time points only have 5 or 6 individuals represented, and I question whether these numbers provide sufficient statistical power to uncover the effects the authors are trying to explore. This is a particular problem when it comes to evaluating the supposed "transient" of testosterone on gene expression. There is currently little basis for distinguishing such effects from noise that accrues because of low power. This can be a major problem with studies of gene expression in non-model species, like canaries, where among-individual variability in transcript abundance is quite high. Thus, it is possible that one or two outliers at a given time point cause the effect testosterone at this time point to become indistinguishable from the controls; if so, then a gene may get put into the transient category, when in fact its regulation was not likely transient.

      We acknowledge that our sample sizes may appear moderate. To address the concern regarding temporal regulation analysis, we followed Reviewer 3's suggestion and conducted a probe-level power analysis (point 2 of recommendations for the authors; labelled as point 3.9 below). We then excluded differentially expressed genes with a power less than 0.8 prior to conducting temporal classification. Consequently, 93% of our differentially expressed genes demonstrated a power ≥ 0.8 (9025/9710). Following further classification by temporal regulation pattern, we identified 29 constantly upregulated, 41 constantly downregulated, 39 dynamically regulated, and 8916 transiently regulated genes. If we apply a stricter constraint by requiring each differentially expressed gene to have at least two probe-sets with a power ≥ 0.8, 83% of differentially expressed genes (8033/9710) still have sufficient power.

      We recognize that our sample size may not be sufficient to detect weakly differentially expressed genes. However, we have intentionally excluded these genes from the beginning (those with |log2(fold change)| ≤ 0.5 were excluded).

      The scenario outlined by the reviewer, where outliers might cause the effect of testosterone to blend with controls, leading to misclassification, is indeed plausible. This could occur either because the genes are weakly regulated, or because the power to detect differential expression is insufficient, thus preventing these genes from surpassing the threshold to be deemed significantly differentially expressed. However, this also illustrates that the effect of testosterone does not regulate every gene in the same way.

      We have appended a column indicating high power genes (≥ 0.8) in the DiffExpression.tsv file, available in the Dryad repository. The power analysis has been incorporated to the method section at lines 801-808 and result section at lines 188-192.

      3.2) More on the transient categorization. Would a gene whose expression is not immediately upregulated (within 1 hour), but is upregulated later on (say in the 14d group) be considered transient? If so, this seems problematic. Aren’t the authors setting the null expectation of "non-transient" as a gene that does not increase immediately after 1 hour of treatment? The authors even recognize that it is quite surprising that gene expression changes after an hour. It may be that some genes whose regulation is classified as transient are simply slower to upregulate; but, really, would we say their expression in transient per se? Maybe I’m misunderstanding the categorizations?

      We appreciate the reviewer's insightful discussion regarding the transient categorization. We understand that it is indeed more challenging for a gene to be classified as constantly regulated than transiently regulated, due to smaller effects by testosterone or being undetectable owing to low power. To address this concern, we further dissected the transiently regulated category by reporting the number of time points at which a gene is differentially expressed in Figure 2 - Figure supplement 1. Approximately half of the transiently regulated genes were only regulated at one time point, further illustrating that the effect of testosterone on gene expression was not constant during the time window we examined (see lines 184 - 187).

      3.3) The authors don’t fully explain the logic for using females in this study to measure a "male-typical" behavior (singing). My understanding is that females have underlying circuitry to sign, and T administration triggers it; thus, this situation that creates a natural experiment in which we can explore T’s on brain and behavior, unlike in males which have fluctuating T. First, it might be good to clarify this logic for readers, unless perhaps I’m misunderstanding something. Second, I found myself questioning this logic a little. Our understanding of basic sex differences and the role that steroid hormones play in generating them has changed over the last few decades. There are, for example, a variety of genetic factors that underlie the development of sex differences in the brain (I’m especially thinking about the incredible work from Art Arnold and many others that harness the experimental power of the four core genotype mice). Might some of these factors influence female development, such that T’s effects on the female brain and subsequent ability to increase HVC size and sing is not the same as males.

      Indeed, sex-chromosome dosage compensation is absent in birds leading to higher Z-chromosomal gene expression in males. We demonstrated substantial sex differences in gene expression in our earlier work [Ko, M.-C., Frankl-Vilches, C., Bakker, A., Gahr, M., 2021. The Gene Expression Profile of the Song Control Nucleus HVC Shows Sex Specificity, Hormone Responsiveness, and Species Specificity Among Songbirds. Frontiers in Neuroscience 15].

      We have revised the introduction (lines 96-98) to clarify our rationale for using female canaries as a model for adult behavioral development, not as a model for male canaries. After testosterone treatment, these females start to sing, with song structure developing over time, similar to male seasonal progression. This approach eliminates the confounding effect of fluctuating testosterone levels seen in males, supported by distinct HVC transcriptomes in testosterone-implanted singing female canaries compared to males (Ko et al., 2021).

      The revised paragraph reads as below: Female canaries (Serinus canaria) are typically non-singers, with their spontaneous songs displaying less complexity than their male counterparts (Hartley et al., 1997; Herrick and Harris, 1957; Ko et al., 2020; Pesch and Güttinger, 1985). Despite their infrequent singing, these females possess the necessary underlying circuitry that can be activated by testosterone. Following testosterone treatment, these females start to produce simple songs, which gradually evolve in structure over weeks—paralleling the seasonal progression of male singing (Hartog et al., 2009; Ko et al., 2020; Shoemaker, 1939; Vallet et al., 1996; Vellema et al., 2019). Moreover, testosterone induces the differentiation of song control-related brain nuclei in adult female canaries, a critical step for song development (Fusani et al., 2003; Madison et al., 2015; Nottebohm, 1980). In this study, we focus on these testosterone-treated female canaries as a model for adult behavioral development rather than a model for male canaries. This unique model allows us to examine transcriptional cascades in parallel with the differentiation of the song control system and the progression of song development, without the confounding impact of fluctuating testosterone levels seen in males, which often results in considerable individual differences in the non-reproductive season baseline singing behavior. This approach is backed by the observation that the HVC transcriptomes of testosterone-implanted singing female canaries are distinct from those of singing males (Ko et al., 2021).

      3.4) I was surprised by the authors assertion that testosterone would only influence several tens or hundreds of genes. My read of the literature says that this is low, and I would have expected 100s, if not 1,000s, of genes to be influenced. I think that the total number of genes influenced by T is therefore quite consistent with the literature.

      We apologize for any confusion caused by our statement. We did not mean to imply that testosterone only influences several tens or hundreds of genes, but rather that we did not expect such an extensive transcriptional regulation in the HVC by testosterone. We have clarified this in our revised manuscript, specifically in lines 450-451. Thank you for helping us to clarify this point.

      3.5) I found the GO analyses presented herein uncompelling. As the authors likely know, not all GO terms are created equally. Some GO terms are enriched by hundreds of genes and thus reflect broad functional categories, whereas other GO terms are much more specific and thus are enriched by only a few genes. The authors report broad GO terms that don’t tell us much about what is happening in the HVC functionally. This is particularly the case when a good 50% of the genome is being differentially regulated.

      We appreciate the reviewer's comment. We have added KEGG pathway enrichment analysis in Figure 3 - Figure supplement 1 as an alternative. However, we believe that the GO term enrichment results still provide valuable insights, and therefore we have retained them in Fig. 3.

      3.6) The Genomatix analyses are similarly uncompelling. This approach to finding putative response elements can uncover many false positives, and these should always be validated thoroughly. Don’t get me wrong-I appreciate that these validations are not trivial, and I value the authors response element analysis.

      We appreciate the reviewer's comment on the presence of AR or ER motifs in promoters and acknowledge that in mammals, AR and ER predominantly bind at distal enhancers rather than promoters. Our analysis focused on promoter regions due to the limitations of available tools and resources for our study species. We understand that this approach may not capture the full complexity of AR and ER regulation. We have revised our manuscript to note the limitations of our approach and clarify that the presence of AREs and EREs alone is not indicative of active receptor binding or direct regulation (lines 416-427).

      3.7) I’m sceptical about the section of the paper that speculates about modification of steroid sensitivity in the HVC. These conclusions are based on analyses of mRNA expression of AKR1D1, SRD5A2, and the like. However, this does not reflect a different in the capacity to metabolize steroids, or at least there is little evidence to suggest this. Note that many of these transcripts have different isoforms, which could also influence steroidal metabolism.

      We agree that mRNA expression levels of AKR1D1, SRD5A2, and other transcripts involved in steroid metabolism do not necessarily reflect changes in steroid metabolizing capacity. However, we believe that these changes in mRNA expression are indicative of potential changes in steroid sensitivity in the HVC, which could affect the neural response to steroids. We acknowledge that isoform differences of these transcripts may influence steroid metabolism and further studies are necessary to confirm our findings and elucidate the mechanisms underlying the observed changes in gene expression. In response to this comment, we have amended the text in lines 245-249 to reflect this consideration.

    2. Reviewer #3 (Public Review):

      I found this paper fascinating. It is a study that needed to be done in the field of behavioral endocrinology, as it addresses our understanding of exactly how steroid hormone action might regulate behavioral output like few other published studies. For decades, researchers have been implanting animals with steroids and observing corresponding changes in behavior, noting that some behavioral traits are immediately expressed, while others take time to be expressed. Why would this be? The answer lies in the temporal dynamics of steroid action, but few have ever addressed this. Having said this, I do have several issues with the manuscript that I think need to be addressed.

      1) My biggest concern is the sample size. Most of the time points only have 5 or 6 individuals represented, and I question whether these numbers provide sufficient statistical power to uncover the effects the authors are trying to explore. This is a particular problem when it comes to evaluating the supposed "transient" of testosterone on gene expression. There is currently little basis for distinguishing such effects from noise that accrues because of low power. This can be a major problem with studies of gene expression in non-model species, like canaries, where among-individual variability in transcript abundance is quite high. Thus, it is possible that one or two outliers at a given time point cause the effect testosterone at this time point to become indistinguishable from the controls; if so, then a gene may get put into the transient category, when in fact its regulation was not likely transient.

      2) More on the transient categorization. Would a gene whose expression is not immediately upregulated (within 1 hour), but is upregulated later on (say in the 14d group) be considered transient? If so, this seems problematic. Aren't the authors setting the null expectation of "non-transient" as a gene that does not increase immediately after 1 hour of treatment? The authors even recognize that it is quite surprising that gene expression changes after an hour. It may be that some genes whose regulation is classified as transient are simply slower to upregulate; but, really, would we say their expression in transient per se? Maybe I'm misunderstanding the categorizations?

      3) The authors don't fully explain the logic for using females in this study to measure a "male-typical" behavior (singing). My understanding is that females have underlying circuitry to sign, and T administration triggers it; thus, this situation that creates a natural experiment in which we can explore T's on brain and behavior, unlike in males which have fluctuating T. First, it might be good to clarify this logic for readers, unless perhaps I'm misunderstanding something. Second, I found myself questioning this logic a little. Our understanding of basic sex differences and the role that steroid hormones play in generating them has changed over the last few decades. There are, for example, a variety of genetic factors that underlie the development of sex differences in the brain (I'm especially thinking about the incredible work from Art Arnold and many others that harness the experimental power of the four core genotype mice). Might some of these factors influence female development, such that T's effects on the female brain and subsequent ability to increase HVC size and sing is not the same as males.

      4) I was surprised by the authors assertion that testosterone would only influence several tens or hundreds of genes. My read of the literature says that this is low, and I would have expected 100s, if not 1,000s, of genes to be influenced. I think that the total number of genes influenced by T is therefore quite consistent with the literature.

      5) I found the GO analyses presented herein uncompelling. As the authors likely know, not all GO terms are created equally. Some GO terms are enriched by hundreds of genes and thus reflect broad functional categories, whereas other GO terms are much more specific and thus are enriched by only a few genes. The authors report broad GO terms that don't tell us much about what is happening in the HVC functionally. This is particularly the case when a good 50% of the genome is being differentially regulated.

      6) The Genomatix analyses are similarly uncompelling. This approach to finding putative response elements can uncover many false positives, and these should always be validated thoroughly. Don't get me wrong-I appreciate that these validations are not trivial, and I value the authors response element analysis.

      7) I'm sceptical about the section of the paper that speculates about modification of steroid sensitivity in the HVC. These conclusions are based on analyses of mRNA expression of AKR1D1, SRD5A2, and the like. However, this does not reflect a different in the capacity to metabolize steroids, or at least there is little evidence to suggest this. Note that many of these transcripts have different isoforms, which could also influence steroidal metabolism.

    1. Author Response

      Reviewer #1 (Public Review):

      This manuscript confirms previous studies suggesting a great deal of heterogeneity of gene expression at the neural plate border in early vertebrate embryos, as neural, placodal, neural crest, and epidermal lineages gradually segregate. Using scRNA-seq, the study expands previous studies by using far larger numbers of genes as evidence of this heterogeneity. The evidence for this heterogeneity and the change in heterogeneity over time is compelling.

      Many studies have suggested that there is considerable heterogeneity of gene expression in the developing neural plate border as the neural, neural crest, placodal and epidermal lineages segregate. Although the evidence for such heterogeneity was strong, until the advent of scRNA-seq, the extent of this heterogeneity was not appreciated. By using scRNA-seq at different stages of chick development, the authors sought to characterize how this heterogeneity develops and resolves over time.

      The work is technically sound, and the level of analysis of gene expression, clustering, synexpression groups, and dynamic changes in gene modules over time is state-of-the-art. A weakness of the results as they stand now is that the conclusions of the analysis are not tested by the authors and thus, are over-interpreted. Such tests could be performed in future studies either by gain- and loss-of-function experiments or by using lineage tracing to demonstrate that the cell states the authors observe - especially the "unstable progenitors" they characterize - are biologically meaningful. The data will nevertheless be a useful resource to investigators interested in understanding the development of different cell lineages at the neural plate border.

      We thank the reviewer for the positive assessment of our work. We agree that our models will need to be tested experimentally in the future, however, this will require a substantial amount of work. We therefore opted to share our data as a resource to be used by the community.

      Reviewer #2 (Public Review):

      The study of Thiery et al. aims to elucidate how cells undergo fate decisions between neural crest and (pan-) placodal cells at the neural plate border (NPB). While several previous single-cell RNA-Seq studies in vertebrates have included neural plate border cells (e.g. Briggs et al., 2018; Wagner et al., 2018; Williams et al., 2022), these previous studies did not provide conclusive insights on cell fate decisions between neural crest and placodes, due to either the limited number of genes recovered, the limited number of cells sampled or the limited numbers of stages included. The present study overcomes these limitations by analyzing almost 18,000 cells at six stages of development ranging from gastrulation until after neural tube closure (8 somite-stage), with an average depth of almost 4000 genes/cell. Using this extensive and high-quality data set, the study first describes the timing of segregation of neural crest and placodal lineages at the NPB suggesting that at late neural fold stages (somite stage 4) most cells have decided between placodal and neural crest fates. It then identifies gene modules specific for neural crest and placodal lineages and characterizes their temporal and spatial expression. Focusing on an NPB-specific subset of cells, the study then shows that initially most of these cells co-express neural crest and placodal gene modules suggesting that these are undecided cells, which they term "border-located unstable progenitors" (BLUPs). The proportion of BLUPs decreases over time, while cells classified as placodal or neural crest cells increases, with few BLUPs remaining at late neural fold stages (and a few scattered BLUPs even at somite stage 8). Based on these findings, the authors propose a new model of cell fate decisions at the NPB (termed the "gradient border model"), according to which the NPB is not defined by a specific transcriptional state but is rather a region of undecided cells, which diminishes in size between gastrulation and neural fold stages due to more and more cells committing to a placodal or neural crest fate based on their mediolateral position (with medial cells becoming specified as neural crest and lateral cells as placodal cells).

      The study of Thiery et al. provides an unprecedentedly detailed, methodologically careful, and well-argued analysis of cell fate decisions at the NPB. It provides novel insights into this process by clearly demonstrating that the NPB is an area of indecision, in which cells initially co-express gene modules for ectodermal fates (neural crest and placodes), which subsequently become segregated into mutually exclusive cell populations. The paper is very well written and largely succeeds in presenting the very complex strategy of data analysis in a clear way. By addressing the earliest cell fate decisions in the ectoderm and one of the earliest cell fate decisions in the developing vertebrate embryo, this study will have a significant impact and be of interest to a wide audience of developmental biologists. There are, two conceptual issues raised in the paper that require further discussion.

      We thank the reviewer for the positive comments on our work and its significance; we have addressed the conceptual issues below and in the revised version of the manuscript.

      First, the authors suggest that their data resolve a conflict between two previously proposed models, the "binary competence model" and the "neural plate border model". The authors correctly describe, that the binary competence model proposed by Ahrens and Schlosser (2005) and Schlosser (2006) suggests that the ectoderm is first divided into two territories (neural and non-neural), which differ in competence, with the neural territory subsequently giving rise to the neural plate and neural crest and the non-neural territory giving rise to placodes and epidermis (sequence of cell-fate decisions: ([neural or neural crest]-[epidermal or placodal]). This model was proposed as an alternative to a "neural plate border state model", which instead suggests that initially the NPB is induced as a territory characterized by a specific transcriptional state, from which then neural crest and placodes are induced by different signals (sequence of cell fate decisions: neural-[placodal or neural crest]-epidermal) (see Schlosser, 2006, 2014). Instead in this paper, the authors contrast the binary competence model with a model they call the "neural plate border" model according to which the NPB can give rise to all four ectodermal fates with equal probability. However, I think this misses the main point of contention since all previously proposed models are in agreement that initially the neural plate border region is unspecified and can give rise to all four fates and that lineage restrictions only appear over time. "Binary competence" and "Neural plate border state" model, differ, however, in their predictions about the sequence, in which these fate restrictions occur.

      We appreciate the reviewer's thoughtful feedback, but respectfully disagree with their comment regarding the sequence of events predicted by the neural plate border (NPB) model. While the NPB model does suggest that the NPB is a transcriptionally distinct state, it does not make specific predictions about the sequence of fate decisions. Although several papers cited in the Schlosser 2006 and 2014 reviews suggest that the NPB gives rise to all four ectodermal fates, none of them (and, to the best of our knowledge, no other primary paper referring to the NPB model) specifically defines the sequence of fate specification from the NPB.

      The key points of the NPB model are that the NPB is defined by overlapping expression of early neural/non-neural markers (which is also observed in Xenopus – see Pieper et al., 2012 supplementary material), contains progenitors for all four ectodermal fates, and that this "state" exists prior to the emergence of definitive neural crest and placodal cells.

      To investigate the heterogeneity in the order of cell fate decisions at the NPB, we carried out additional pairwise co-expression analyses of forebrain, mid-hindbrain, neural crest, and placodal gene modules, which reveals multiple different hierarchies of cell fate choice depending on a cell's axial positioning, as shown in Figure 6-figure supplement 1.

      Considering these findings, we have expanded our discussion of the previously proposed binary competence and neural plate border models to highlight how neither of these models is sufficient to fully characterize the heterogeneity in cell fate decisions observed in our study. We hope this clarification will help address any concerns the reviewer may have had about the NPB model and its implications for our results.

      Second, the authors should be more careful when relating their data to the specification or commitment of cells. Questions of specification and commitment can only be tested by experimental manipulation and cannot be inferred from a transcriptome analysis of normal development. So the conclusion that the activation of placodal, neural and neural crest-specific modules in that sequence suggests a sequence of specification in the same temporal order (lines 706-709) is not justified. Studies from the authors' own lab previously showed that epiblast cells from pre-gastrula stages are specified to express a large number of NPB border markers including neural crest and panplacodal markers, when cultured in vitro (Trevers et al., 2018; see also Basch et al., 2006 for early specification of the neural crest), which is not easily reconciled with this interpretation. I am not aware of any experimental evidence that shows that a panplacodal regulatory state is specified prior to neural crest in the chick (although I may have missed this). In Xenopus, experimental studies have shown instead that neural crest is specified and committed during late gastrulation, while the panplacodal states are specified much later, at neural fold stages (Mancilla and Mayor, 2006; Ahrens and Schlosser, 2005). It may well be the case that the relative timing of neural crest and panplacodal specification is different between species (and such easy dissociability may even be expected from the perspective of the binary competence model).

      We very much agree with the reviewer that the definitions and correct terminology is important and apologise for lack of clarity. We have reworded the text carefully.

      The reviewer is correct: specification of neural crest, placodes and neural plate is observed very early in chick, prior to gastrulation. However, in specification experiments tissue is removed from its normal environment to reveal what it does autonomously in the absence of additional signals. In the current study, we assess the activation of gene modules in normal development. We have therefore reworded the text to avoid ‘specification’ in this context.

      Reviewer #3 (Public Review):

      The goal of this work was to better understand how cell fate decisions at the neural plate border (NPB) occur. There are two prevailing models in the field for how neural, neural crest and placode fates emerge: (i) binary competence which suggests initial segregation of ectoderm into neural/neural crest versus placode/epidermis; (ii) neural plate border, where cells have mixed identity and retain the ability to generate all the ectodermal derivatives until after neurulation begins.

      The authors use single-cell sequencing to define the development of the NPB at a transcriptional level and suggest that their cell classification identified increased ectodermal cell diversity over time and that as cells age their fate probabilities become transcriptionally similar to their terminal state. The observation of a placode module emerging before the neural and neural crest modules is somewhat consistent with the binary competence model but the observation of cells with potentially mixed identity at earlier stages is consistent with the neural plate border model.

      Differences in the timing of analyses and techniques used can account for the generation of these two original models, and in essence, the authors have found some evidence for both models, possibly due to the period over which they performed their studies. However, the authors propose recognizing the neural plate border as an anatomical structure, containing transcriptionally unstable progenitors and that a gradient border model defines cell fate choice in concert with spatiotemporal positioning.

      The idea that the neural plate border is an anatomical structure is not new to most embryologists as this has been well-recognized in lineage tracing and transplantation assays in many different species over many decades.

      We appreciate the reviewers comment and agree that the neural plate border has previously been characterised anatomically. However, many studies have applied the term literally in reference to a transcriptional state which is specified through the expression of ‘neural plate border specifiers’, prior to segregation of the placodes and neural crest. Here we highlight that treating the neural plate border as a definitive transcriptional state which can be identified through the expression of ‘neural plate border specifiers’ is false. Instead, we find these ‘specifiers’ are upregulated within either neural crest, placodal or neural cell lineages over time. Cells at the neural plate border co-express these alternate lineage markers and therefore predicted to be undecided.

      The authors don't provide molecular evidence for transcriptional instability in any cells. It's a molecular term and phenomenon inaccurately applied to these cells that are simply bipotential progenitors.

      We thank the reviewer for pointing this out; we have therefore refrained from using the term unstable and instead refer to the cells as ‘undecided’ as suggested by reviewer 2.

      Lastly, there's no evidence of a gradient that fits the proper biochemical or molecular definition. Graded or sequential are more appropriate terms that reflect the lineage determination or segregation events the authors characterize, but there's no data provided to support a true role for a gradient such as that achieved by a concentration or time-dependent morphogen.

      We agree with the reviewer that ‘gradient’ was misleading. We have now replaced ‘gradient’ with ‘graded’ and expanded figure 6 to highlight the graded co-expression of gene modules associated with alternate fates. We have changed the title to reflect this.

      A limitation of the study is that much of it reads like a proof-of-principle because validation comes primarily from known genes, their expression patterns in vivo, and their subsequent in vivo functions. Thus, the authors need to qualify their interpretations and conclusions and provide caveats throughout the manuscript to reflect the fact that no functional testing was performed on any novel genes in the emerging modules classified as placode versus neural or neural crest.

      We agree with the reviewer that we do not provide any functional data to validate our predictions; it is for this reason that we submitted the manuscript as a ‘resource’ to make our data available to the community.

      Lastly, a limitation of gene expression studies is that it provides snapshots of cells in time, and while implying they have broad potential or are lineage fated, do not actually test and confirm their ultimate fate. Therefore, in parallel with their studies, the authors really need to consider, the wealth of lineage tracing data, especially single-cell lineage tracing, which has been performed using the embryos of the same stage as that sequenced in this study, and which has revealed critical data about the potential cells through when and where lineage segregation and cell fate determination occurs.

      The reviewer rightly points out the significance of the classical experiments in the context of the neural plate border. However, only one of the mentioned studies (Bronner-Fraser and Fraser, 1989), analyses cells at a single-cell level and does not assess placodes, while the remaining studies use tissue transplantation or cell population labelling. Although these studies provide valuable insights, they do not examine the fate or potential of single cells, nor do they reveal the transcriptional signature of these progenitors.

      Our findings emphasize the transcriptional heterogeneity at the neural plate border, suggesting that distinct subsets of neural plate border progenitors undergo varying sequences of fate restrictions. The upcoming challenge will be to conduct clonal analysis alongside scRNAseq to determine if neural plate border progenitors with similar transcriptional signatures experience the same fate restrictions or if external factors, such as cell-cell signalling, dictate cell fate choices.

      We have amended the manuscript to clarify that predictions of fate decisions require future validation through lineage tracing. Additionally, we have acknowledged in the introduction that previous studies have demonstrated the intermingling of neural, neural crest, and placodal progenitors at the neural plate border.

    1. Author Response

      Reviewer #2 (Public Review):

      Kim et al. examined the properties of neuronal connections responsible for inhibitory cell activation to show that the characteristics examined were similar in humans and rodents. This is important, as it suggests that the many rodent studies carried out over the past decades are physiologically relevant to humans.

      Strengths

      1) Human brain tissues are difficult to obtain, hence the study provides valuable insights

      2) An impressive multipronged approach was used for cell classifications

      3) Despite the lack of novel findings, the revelation of the similarities between human and rodent synapses is important and has far-reaching implications. This important finding suggests the knowledge generated from rodent research is, at least partly, physiologically relevant to and transferrable to humans.

      Weaknesses

      1) The study is descriptive by design, and hence provides limited conceptual advances, especially with the retrospect that synaptic properties are similar between humans and rodents (although see strength #3). For example, very similar findings and techniques have already recently been reported by a number of the same authors in the Campagnola et al., Science 2022 paper.

      We agreed that stimulus protocols of connectivity assays with multiple patch-clamp recordings in this study had been adapted from the recent publication (Campagnola et al., Science 2022). In this previous study, especially for human synaptic connectivity data, the main cell type categorization was at the level of excitatory and inhibitory neurons which identified based on morphological features and observed PSP characteristics (e.g., direction of membrane potential changes) when it connected each other. However, we went further to identify interneuron subclasses in the connectivity assays using virally labeled slice cultures and post-hoc HCR staining in addition to intrinsic classifier, which is not investigated from the recent publication (Campagnola et al., 2022). Therefore, following scientific findings and their implications are not the same shown in the previous study and we think this study provides a significant advance of our understanding in human cortical circuits organization.

      2) Despite the fact that normal physiology was reported, the use of pathological human brain tissue could affect the results.

      We agreed that the use of pathological human brain tissue to investigate normal physiology is not ideal, however, as mentioned in the METHODS below (section of “Acute slice preparation”), our surgically resected neocortical tissues show minimal pathology, and we believe these tissue preparations can be used to address normal physiological properties of human neurons. Importantly, we saw no effect of disease state (epilepsy vs. tumor) on the intrinsic or synaptic properties that we measured. Our METHODS state that “Surgically resected neocortical tissue was distal to the pathological core (i.e., tumor tissue or mesial temporal structures). Detailed histological assessment and using a curated panel of cellular marker antibodies indicated a lack of overt pathology in surgically resected cortical slices (Berg et al., 2021).”. We also state in the RESULTS that “These tissues were distal to the epileptic focus or tumor, and have shown minimal pathology when examined (Berg et al., 2021). Brain pathology was evaluated using six histological markers that were independently scored by three pathologists. Surgically resected tissues have been used extensively to characterize human cortical physiology and anatomy (Berg et al., 2021).”. Lastly, this is the best possible human tissue available for us to conduct physiological experiments. It is an unavoidable caveat of this work that our healthy brain tissue was derived from a donor brain exhibiting a serious disease.

      3) The manuscript may not be easy to understand for the uninvited, because many concepts and abbreviations were not properly introduced.

      Thank you for pointing this oversight out. We updated our manuscript and made sure that we fully describe all abbreviations. We now changed the abbreviation of MPC back to multiple patch-clamp recording, and some other abbreviations such as LAMP5, SLC17A7, DLX are now better explained. We have also changed the order of multiple figures (i.e., Figure 5 – Figure supplements to Figure 3 – Figure supplements) and removed some complicated figures (e.g., Figure 1 – Figure supplement 1) to present the data in a fashion that can be understood by a more general reader.

      4) The statistical treatment is not ideal, so some conclusions may not be valid.

      We performed additional statistical analyses as suggested and implemented in the text of the RESULTS.

      Furthermore, we also made additional Figure supplements (Figure 4 – Figure supplement 3, Figure 4 – Figure supplement 4, Figure 6 – Figure supplement 2, and Figure 6 – Figure supplement 3) to support our conclusions.

      5) The mixed usage of acute and cultured slices is not ideal and likely affects the outcome.

      We agree that the mixed usage of acute and cultured slices is not ideal, and it could affect the interpretation of outcome. Therefore, we performed additional analyses to see if there is any correlated change of synaptic property (i.e., paired pulse ratio) along the days after slice culture (now implemented in Figure 4 – Figure supplement 4 and Figure 6 – Figure supplement 3) and we didn’t find any significant correlation. However, we noticed the short-term synaptic dynamics are rather differentiated between acute and slice culture condition shown in Figure 4 – Figure supplement 1d. We think this is due to sampling bias rather than tissue preparation difference and these points are now more carefully described in the DISCUSSION as “This difference we observed in this study, i.e., more facilitating synapses were detected in slice cultures than in acute slices, could either reflect an acute vs. slice culture difference. However, we believe it is more likely to reflect a selection bias for PVALB neurons when patching in unlabeled acute slices, and that the AAV-based strategy with a pan-GABAergic enhancer allows a more unbiased sampling of interneuron subclasses whose properties are preserved in culture. In support of this, PPR analysis as a function of days after slice culture shows no relationship to acute versus slice culture preparation (Figure 4 – Figure supplement 4, Figure 6 – Figure supplement 3). Furthermore, we have observed that viral targeting of GABAergic interneurons greatly facilitates sampling of the SST subclass in the human cortex compared to unbiased patch-seq experiments (Lee et al., 2022), and this selection bias likely explains synapse type sampling differences in cultured slices compared to acute preparations.”.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      In this manuscript, Wang and colleagues demonstrate that a single systemic injection of a high dose of Akkermansia muciniphila (A.m.) lysate drives a rapid pancytopenia followed by prolonged anaemia and hepatosplenomegaly with late-onset extramedullary hematopoiesis (EMH). The latter, as well as the splenomegaly, were likely mediated through activation of pattern recognition receptors and IL-1R signalling pathways. This was demonstrated through the partial and full phenotype reversal in Tlr2;4-/- and MyD88;Trif-/- mice, respectively. Moreover, the phenotype was partially reversed following IL-1R antagonism. After performing multiplex protein assays and flow cytometry, the authors conclude that EMH, in this model, is mediated by IL-1a produced within the spleen by local monocytes and DC.

      Overall, the manuscript by Wang et al. is quite well presented, the experiments are mostly well controlled, the methods are well reported, and the data fit a clearly defined story with clinical relevance. Nevertheless, there are several major concerns that if addressed would greatly increase the strength of the authors conclusions.

      Major comments

      1. The "two wave" hypothesis of hematopoiesis - first in the bone marrow (BM) and then in the spleen - is interesting. However, although an early wave of BM hematopoiesis would make sense, under these circumstances, I don't think the data are strong enough to support this hypothesis as they stand. For example, although the frequency of LSK cells increase, the numbers of most LSK subsets decrease. Given the decrease in the absolute number of BM cells 1d after A.m. injection, isn't it possible that the LSK cells are only proportionally increased relative to the remaining Lin- cells? What happens to the absolute number of LSK cells following A.m. injection?

      Also, describing "two distinct waves of HSPC increase in the A.m.-injected spleen" (Fig 2 & S2 titles) and describing a "first wave" of HSPC expansion in the BM (lines 396, 399, 402 etc.) is misleading for the following reasons: (i) the data strongly support a single wave of increasing HSPC in the spleen, peaking at d14, and (ii) there is no evidence HSPC are increased in the BM until d56, although there does appear to be an early increase in MPP. The language should be changed accordingly. 2. The flow cytometry panel is not comprehensive enough to fully characterize the mature hematopoietic cell populations to the levels that are claimed here. For example, it is a stretch to assume that all B220- CD3- CD11c- cells are DC (splenic NK cells, eosinophils, monocytes and red pulp macrophages, for example, can express CD11c, particularly following inflammatory insult), or that CD11b+ F4/80+ SSC-hi cells are eosinophils, especially when eosinophils should be F4/80-lo are not known to express Ly6C in the spleen (For reference, see Immgen). These gating issues may explain the conspicuous absence of macrophages (should be F4/80+CD11b+Ly6C- and would also have a higher SSC than monocytes) in the plots. The B cell gate will also contain PDC, which express B220 (but can be easily excluded using Ly6C and CD11c). With respect to assessing the mature leukocyte populations in the spleen, relabelling the gates (CD11c+ cells instead of DC, F4/80+ myeloid cells instead of eosinophils) would suffice, however, these issues become a problem when trying to identify which cell populations express IL-1a.

      Due to the limited antibody panel used here, there is not enough evidence to suggest that DC and monocytes are producing IL-1a. Moreover, the histograms showing the changes in expression of IL-1a on the "DC" and "Mo" are not very convincing. How does the IL-1a staining look on a dot plot? Is there good separation between positive and negative? These plots need to be included. What happens if you gate on the IL-1a+ cells first, then phenotype them?

      Macrophages and splenic stromal cells are also likely candidates for IL-1a production. To assess which cell types are the true source of IL-1a, the authors need to repeat these experiments (namely, injecting A.m. and assessing IL-1a expression by leukocytes (and ideally also mesenchymal cells)) at d1 and d14, using a more comprehensive panel. Consider adding MHCII, CD64, Siglec F and CD24 to help differentiate between DC, MF, eosinophils and monocytes. CD45+ vs CD45- could be used as a minimum to assess the expression of IL-1a on leukocytes vs. stroma.

      OPTIONAL: The mechanism could be better defined using bone marrow chimeras to assess the different contribution of TLR2/4 signalling and IL-1R signalling on the hematopoietic vs. mesenchymal cell compartments. 3. From these experiments, it is difficult to fully rule out a contribution from the adaptive immune system to the splenomegaly phenotype due to the marked difference in the size of BALB/c and MSTRG spleens at steady state. The authors should show the differences in spleen weight and total cell number as a % increase from control. The no of HSPC should also be normalized per gram of tissue weight or represented as a fold change compared to the relevant control groups. 4. When using fluorescent imaging to compare the abundance of HSPC and other cell populations in the spleen, the authors should provide absolute quantification from multiple FOV and multiple mice. 5. Finally, although the experiments are adequately replicated, the stats are not always appropriate. For example, a t-test shouldn't be used when there are >2 groups, or for a time course. This needs to be amended.

      Minor comments

      • Line 82-83: I'm fairly certain monocytes and inflammatory Ly6Chi cells are the same thing.
      • Line 83-84: "IL-1a is crucial for sustaining inflammatory responses, recruiting myeloid cells to infected tissue and inducing hematopoietic stem and progenitor cell (HSPC) mobilization and expansion both in vitro and in vivo" - I don't believe IL-1a has been shown to be crucial for either, even if it has been shown to play a role. If I am mistaken, please reference with a manuscript showing relevant phenotypes using KO mice.
      • Line 214: "Thus, we decided to use 200ug of lysate for the rest of all experiments." - is this what was usen for Figures 1A-C? This is not mentioned anywhere.
      • Line 227: "containing both non-hematopoietic cells and immature HSPCs" Please reference Fig. 1H here. Otherwise, it is unclear how you identified the "HSPC and other cell types" in Fig. 1G.
      • Figure S2A is described in text before Supp 1I-O and Fig S1H is not referenced in text at all.
      • It would be interesting to include what happens to hepatomegaly in MSTRG, Tlr2;4-/- and MyD88;Trif-/- mice.
      • Please define WBM. Presumably whole bone marrow?
      • Notably, CCL2 is increased in spleen lysate, BM lysate and serum. Given is role in myeloid cell mobilization from the BM, I would expect its role in the phenotype described here to at least be discussed.
      • HSPC LT gate includes MPP1, and should be labelled as such.

      Significance

      General assessment: The manuscript provided by Wang et al. describes, for the first time, a prolonged anaemia and hepatosplenomegaly with late-onset extramedullary hematopoiesis following a single systemic injection of A.m. lysate. The EMH phenotype appears robust and the data implicating TLR-signalling and IL-1a production are compelling. The work has clinical relevance as it increases our understanding of the factors driving EMH.

      There are two key limitations that let this study down. Firstly, the lack of depth in the flow cytometry panel used for immunophenotyping means it is not at all clear which cell types are producing IL-1a. Secondly, the authors use an enormous dose of bacterial lysate that is well above physiological levels, even following a loss of barrier integrity (e.g., in patients with IBD). This makes me question the biological relevance of the study, particularly with respect to Akkermansia translocation.

      Advance: With some improvement, this study will advance the field, in general. Previous work has looked at EMH following LPS injection, or live E. coli infection, however; the authors are able to demonstrate a distinct Akkermansia-specific effect that differs to that of LPS, membrane components of a different gram-negative bacteria, B. theta. The advancements implicate IL-1a in the modulation of EMH, for the first time, providing some mechanistic insight into this phenomenon.

      Audience: This work will likely be of interest to basic researchers interested in EMH. It may also be of interest to clinical researchers of pathologies where EMH is a known complication, such as rheumatoid arthritis and cancer. The impact of the work will depend on whether or not EMH contributes to pathogenesis, or is an epiphenomenon. To my knowledge, this has not been fully established, although this is not my area of research.

      I am a basic researcher with expertise in immunology focused on host-microbe interactions, both within the intestine and at distal tissues. I have knowledge of BM hematopoiesis and the microbial factors that influence if although my knowledge on extramedullary hematopoiesis is limited.

    1. “Consider a future device …  in which an individual stores all his books, records, and communications, and which is mechanized so that it may be consulted with exceeding speed and flexibility. It is an enlarged intimate supplement to his memory.”
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript presents a detailed numerical model of blood flow in a region of the zebrafish vasculature.

      The results section is quite intense and detailed. it is difficult to understand what the authors are after. I think a rewrite would beneficial. The authors present simulations for a wild type and a couple of phenotypes. For each of these they speculate on the possible adaptation mechanism leading to the discussed phenotype, as preservation of constant wall shear stress. However, the comparison between experiments and numerical simulations is really elusive as the conclusions on those mechanisms. Overall we suggest a rewrite with clearer organisation in a way that the reader is not overflown with useless details.

      We thank the reviewer for the advice on the general writing standard and data organization. We have reanalyzed experiment data and interpreted the findings more conservatively for application into the simulation models. As a result, some conclusions to the results sections have changed. Accordingly, we have done a major revision of the entire Results, Discussion and Models and Methods sections in the paper to articulate these reinterpretations while removing superfluous details that may obfuscate the data.

      It is not always clear what info of the experiments are used in the simulations on top of the anatomy. Our understanding is that the pressure boundary conditions are set to match the red blood cel velocity observed in experiments. Is this always the case for the three phenotypes and which vessels ?

      We thank the reviewer for the question. Only WT and Marcksl1 KO have been matched for peak velocities in the CA, CV and ISVs between experiments and simulations. WT results were compared to both the experimental reference of 27 embryos in Table 3 and also to the current experiment pool of WT (5 embryos) in Table 6. Marcksl1 KO simulation models 1, 2 and 3 were compared against the average level seen in the low and moderate perfusion Marcksl1 KO phenotypes (8 embryos) from the experiment (Table 5 and Table 6). Additionally, we also have represented the similar level of RBC hematocrit in the CA for WT model to WT experiment data from the reference cited in Table 3.

      In addition to the velocity comparisons, we now use the experimentally observed trend of decreased flow rate in the CA of Marcksl1 KO experiment data to assess the model boundary conditions amongst Marcksl1 KO models 1, 2 and 3 that best reflect the experimental observations:

      Page 11, lines 1 to 20

      The Marcksl1 OE cannot be matched because we do not have the experiment data for that, the same goes for PlxnD1 where we have no experiment flow data. These two networks represent more conceptual discussions, particularly in PlxnD1 case where we have explicitly stated in the new discussion section:

      Page 15, lines 24 to 34

      There are about 7 inlets and outlets where to impose pressure boundary conditions. Can the author comment on the uniqueness of this problem?

      Can different combination of pressure boundary condition leading to the same result ? In how many points/vessels is the measured velocity matched ?

      We thank the reviewer on this insightful concern. Indeed, the uniqueness of flow and pressure field can be a problem without careful consideration. We have tried to address this to some extent, because CA, CV are connected by the ISV and DLAV network, to match flow velocity in all regions, the pressure distribution ought to be unique to the particular setting we employed.

      As shown in table 3, average systolic peak flow velocities in the entire CA and CV encompassing the 5 ISV segment domain is matched between the simulation and the population-averaged experimental data from the experimental reference (27 fish sampled in the cited reference) for the same regions in WT network. Average systolic peak flow velocities for the 10 ISVs in the simulation were matched against WT experiment population-averaged systolic peak flow velocities in arterial and venous ISVs in the same caudal region.

      Additionally, we also compared the flow velocities to the experiment conducted within this study (5 WT, and embryos). This comparison data is shown in Table 6. Admittedly the discrepancy was large for CV and ISVs regions likely due to a smaller data set sampled in this study and biological variations that happen from one experiment to another. We have acknowledged this deficiency in the revised discussion section:

      Page 15, lines 3 to 9

      The argument that similar beating frequency in the WT and GATA1 MO suggest pressure does not change is not clear. If the heart was a volumetric pump it would impose the same flow rate, not the same pressure. It would be more useful to measure the cardiac output in terms of flow rate in the Dorsal Aorta. Previous measurements by Vermot suggested the latter would not change much in gata1 MO. It could be that the cardiac output is the same but the vasculature network is different in a way that the shear stress remain the same. It does not look like this was checked by the authors.

      We thank the reviewer for this insight. In accordance with the reviewer’s suspicion, we have estimated the flow rates in the CA of gata1 MO injected embryos and found the level to be similar to WT. This supports the reviewer’s opinion that the heart rate similarity indicates cardiac output similarity and not arterial pressure similarity as we previously put forward. Furthermore, we have checked that the gata1 morphants do in fact present reduced ISV diameters. In light of this reinterpretation, we performed an additional zero hematocrit model (model 3 in section 2.1). We have consequently rewritten the entire section on how RBC hematocrit modulates hemodynamics in a microvascular network:

      Page 6, line 18 to page 8 line 10.

      Additionaly, it would be useful to provide an effective viscosity for the different vessels, and an effective hydraulic impedance relating DP and Q to interpret the results.

      We have followed the reviewer’s advice and have analyzed for vessel hydraulic impedance and effective viscosity in all the network models presented. This is included in the main figures and discussion. The vessel impedances are discussed for the various models in these following parts of the manuscript:

      Page 9, lines 20 to 29

      Page 11, lines 28 to 30

      Page 12, line 1 to page 13 line 10

      Is the hydraulic impedance of the vessels kept constant in the smooth-geometry model? This needs clarification

      The SGM diameters have been determined based on geometric averages and not impedance equivalency. The reason why we did this is because the impedance will not be known until the CFD is performed for the WT network. This is because without a pressure distribution (which cannot be determined experimentally) we cannot calculate vessel impedance since only flow can be measured and both flow and pressure are requirements to impedance calculation. Our intention with the SGM is to highlight how geometric averaging of morphological characteristics lead to incorrect flow and stress predictions. However, we understand the reviewer’s sensibility and have revised the entire section of the SGM results. We have now discussed three SGM models with varying degrees of geometry simplification. The SGM1 in the revised manuscript matches WT network impedance in the ISVs by including both axial variation in lumen diameter of the WT network and the elliptical fit representation of cross-sectional skewness seen in WT ISV lumens. SGM 2 has representation of axial variation but not luminal skewness and SGM3 has only geometric average similarity to WT ISVs. The new findings and discussion can be found in the revised manuscript here:

      Page 8, line 19 to page 9 line 36.

      As mentioned by the authors they propose a very complex and time expensive simulation. However the results they report are kind of intuitive. Given the availability of the experimental results, would it be useful to use a simpler red blood cell model in the future, to make their simulation more practical? Or clarify when such demanding simulations can add something new?

      We agree that the intuition feedback depends on the expertise of the investigator. The boundary condition selection is intuitive from the experimental findings and key data like pressures in the network cannot be measured. Furthermore, population-averaged flow data does not always match the flow-to-geometry situations that vary from sample to sample, thus demonstrated by the high margin of prediction discrepancy for flow velocities in table 6. We have discussed these challenges and our recommendations for improvement in the Discussion section:

      Page 15, lines 3 to 9

      Page 15, lines 35 to 40

      Page 16, lines 12 to 15

      On the topic of RBC model simplification, we agree with the reviewer that our work suggests the methodology would benefit from a further coarse-graining approach to the RBC phase. Accordingly, we discussed the possibility of using a low-dimensional RBC model already published in literature:

      Page 14, lines 13 to 17

      The authors should check their references as this is not the first time work has been done on the topic. Would be good to have a check in the work of Freund JB and colleagues, as well as Dickinson and colleagues and Franco and colleagues to discuss how the work compares. There may be interesting work in modelling cardiac flow forces in the embryo too.

      Thank you for referring us to other publications that are related to our study. To our knowledge and after performing publication search on these authors, we find that although Dickinson and colleagues performed experiments to examine the effects of perturbed blood flow on vessel remodelling (Udan et al., 2013), they did not perform any numerical modelling to calculate hemodynamic forces such as WSS and luminal pressure. Instead, changes in vessel morphogenetic process were only correlated with blood flow velocity. In our study, we attempt to quantitatively correlate WSS and pressure distributions within a vascular network. Franco and colleagues (Bernabeu et al., 2014) developed PoINet to model haemodynamic forces in mouse retina model of angiogenesis. From what we understand, PoINet is different from our 3D CFD model by 1) not having red blood cells incorporated in their model and as such, the blood viscosity prediction is modelled using shear-rate dependent formulation and not through red blood cell hematocrit, 2) cross sections of blood vessels are assumed to be circular and therefore have no irregularity and 3) live imaging of blood flow is difficult in mouse retina therefore preventing accurate boundary conditions for the model.

      We have included the reference to work of Franco and colleagues:

      Page 14, line 28 to line 31

      Page 9, lines 12 to 14

      Freund JB indeed has had extensive work on RBC and cellular flow in microvessels. We have included a reference of his work in:

      Page 14, lines 22 to 25.

      Reviewer #1 (Significance (Required)):

      The authors discuss the applicability of a detailed numerical model of blood flow in a region of the zebrafish vasculature.

      We are not expert in the lattice boltzmann method used here, but the results are what it would be expected from a physical stand point, and together with the information from the method section, we do not have major concerns about the numerics.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: The authors report corroborating numerical-experimental studies on the relationship between morphological alterations (e.g. vessel lumen dilation/constriction, network mispatterning) and hemodynamical changes (e.g. variation in flow rate, pressure, wall shear stress) in the vascular network of zebrafish trunk circulation. Various physiological or pathological adaptation scenarios were proposed and tested, with a range of simulation and experiment models. Where I found it a solid piece of work supported by abundant data, certain aspects need to be clarified/enhanced to improve the scientific rigor and potential impact of the manuscript. Below are my detailed comments in the hope of helping the authors improve the manuscript's quality.

      Major comments:

      1. Cellular blood flow in vascular networks has been extensively studied in recent years by existing computational models (some of which were published open-source) with similar methods and features to the one proposed by the present work. Can the authors be more explicit about the original contributions of the current model, and provide evidence accordingly (e.g. Github repository or code resources)

      The RBC model is essentially the model developed by Fedosov and colleagues (Fedosov, et al., 2010). Likewise, the LBM solver for fluid flow calculation is not. Following the reviewer’s advice, we have removed the details of these non-novel aspects of the methodology and placed them in sections E and F of supplementary material instead. The new Models and methods now show condensed descriptions of the three numerical solvers used and the addition of a grid independence matrix discussion section:

      Page 17, line 8 to page 20, line 33.

      Crucial details for the simulation setup and model configuration are missing. What were the exact boundary conditions (e.g. inlet and outlet pressures) and initial conditions (e.g. feeding hematocrit of RBCs), and how the numerical-experimental validation process of "to match the velocities of various segments of the network by iteratively altering the pressure inputs ..." as stated on page 13 (lines 1-2) was performed for simulations in this work?

      We apologize for the vagueness of our description on how numerical to experimental validations were performed. As replied to reviewer 1 for a similar clarification, we have indicated in Table 3 how average systolic peak flow velocities in the entire CA and CV encompassing the 5 ISV segment domain were matched between the simulation and the population-averaged experimental data for the same regions in WT network. Average systolic peak flow velocities for the 10 ISVs in the simulation were matched against WT experiment population-averaged systolic peak flow velocities in arterial and venous ISVs in the same caudal region.

      With regards to what iterative alterations of pressure inputs mean, we monitored the average systolic peak velocities and hematocrit levels in CA, CV and ISVs in intervals of 5 cardiac cycle intervals before manually correcting the pressure input levels to better match average systolic peak velocities in these vessels from the experiment averages. Since we are using population averaged flow data, we do not expect their levels to match the levels in a particular fish-specific geometry, the degree of discrepancy between experiment averages and the model predictions of systolic velocities can be large (Table 6). Admittedly, this is one of the weaknesses of our approach and this limitation is stated in the Discussion section:.

      Page 15, lines 3 to 9

      As RBC flow typically requires roughly 5 cardiac cycles of flow to reach flow development this process of iterative correction typically takes place over 10 to 20 cardiac cycles. We understand that validation may be a subject of keen interest to readers, hence we have now briefly described the solution initialization and flow development protocol in our modeling approach here:

      Page 6, lines 5 to 8

      What lattice resolution was used for the flow solver and was the RBC membrane mesh chosen accordingly? Were there any sensitivity analysis (regarding pressure input) or grid-independence study (regarding lattice resolution)

      We originally decided on the grid (∆X) and time (∆T) discretization resolutions (0.5 µm and 0.5 µs) based on the acceptable computing turnaround time for each model within our scale of resources. We have now included a section on the grid independence matrix in Models and Methods:

      Page 19, line 20 to page 20, line 33

      Details of the statistical tests (type of tests used, assessment of data normality, sample size etc.) should be given in the figure caption where applicable (e.g. Fig. 3C, Figs. 7-9).

      We apologize for the lack of clarity. All statistical tests used have now been mentioned at least once in each section of results and also in Figure captions wherever significance bars are displayed.

      The regression models should also be used with caution, e.g. in Fig. 4B, why should data from two different fish types, namely Gata1 MO and WT, be grouped to fit a linear regression model?

      We understand the reviewer’s concern that two population data sets should not be carelessly pooled together for regression analysis without adequate justification. In this case we are utilizing gata1 morpholino injection as a means to alter hematocrit level. There is no reported side-effect as to the best of our knowledge, only hematocrit and possibly hemodynamics and morphological response related to hematocrit level should be affected. Moreover, we have mislabelled the companion set to the gata1 morpholino as WT, the data is in fact data from control morphants and not WT. This change has been applied to Fig. 3 graphs and Table 4 and results section:

      Page 7, lines 3 to 16

      Finally, as we want to generate a continuum range of varying hematocrit for embryos of the same developmental age. In this regard, we think that within the scope of our intentions and well-accepted usage of gata1 morpholino as a hematocrit reduction protocol it is reasonable to pool the two data sets together for regression analysis.

      4.I found the data presented in Fig. 7 insufficient to confidently exclude the numerical models 2, 3 but favor model 1 as the adaptation scenario for the Marcksl1KO case. The first question is, how are the threshold RBC perfusion levels determined to categorize the experimented Marcksl1KO fishes into four groups, i.e. "high", "moderate", "low", "zero"? The authors also need to justify why the "high", "moderate", "low" groups can be mapped to the three modelling scenarios (namely models 1, 2, 3) is it just because "a qualitative match with the experimental trend of ascending CA blood velocity" (Fig. 7F)?

      We thank the reviewer for his interpretation of our results. Firstly, we apologize for generating the confusion but we are not trying to map simulation models 1, 2 and 3 to high moderate and low groups respectively in Fig. 7. The high, moderate and low categorizations of experimental Marcksl1 KO phenotypes are based on RBC flux levels observed experimentally. We are trying to ascertain which Marcksl KO phenotype the models 1, 2 and 3 fit, if they do fit the experiment trend at all.

      Second, in Fig. 7C, it is shown that no significant difference exists between the "high" group and the WT in their average ISV diameter, then what is defining that group as Marcksl1KO type ?

      We apologize for the confusion generated. High flow phenotype is similar to WT flow, the diameter is also similar to WT. In Marcksl1 KO mutants we don’t always see clear phenotyping and often a range is presented from mutant to mutant. Hence the high group is essentially morphometrically and hemodynamically similar to WT, the only reason we know it is a mutant because we have genotyped the zebrafish (marcksl1a-/-;marcksl1b1-/-).

      Third, a central assumption here is using heart rate as a measure of the pressure drop in different fish individuals (Fig. 7D). Can't two fishes with similar heart rate have distinct pressure drops in the trunk due to difference in network architecture and topology, vice versa?

      We agree with the reviewer’s opinion and now feel that our initial proposition was naïve. After addressing the interpretation of heart rate similarity in the gata1 morphants with more convincing CA flow rate estimations, we now believe that heart rates might not be useful indicators of flow or pressure levels in the network. Instead, cardiac output in the form of CA flow rate as reviewer 1 has suggested might be a better indicator. As the reanalysis has dismantled the earlier interpretation, and found that based on the flow rate estimation for the CA, Marcksl1 KO networks have reduced blood flow rates in the CA.

      Page 11, lines 9 to 20

      This finding has been incorporated into the consideration of flow adaptation scenarios predicted by the simulation models accordingly in the revised manuscript:

      Page 12, line 1 to page 13, line 10

      Fourth, the authors should explain why a power-law fit (note that it is not "exponential" as stated on page 10, line 3) should be adopted for the regression analysis in Figs. 7E-v,vi (a useful reference may be Joseph et al. eLife 2019: 10.7554/eLife.45077).

      We thank the reviewer for the useful reference and the careless mislabeling of regression curve used. This figure has been redone and a linear regression is instead used that does not attempt to imply any physical law for a power or exponential fitting.

      Change made: Fig. 7C

      Minor comments:

      1. The state of art of cell-resolved blood flow models employed to simulate microcirculatory hemodynamics is not accurately described in the introduction (page 4). More recent works should be cited and critically reviewed to present a fair view on the novelty of the computational model developed herein.

      We apologize that the models were mentioned in a passing manner. However ,the need for brevity in introduction somewhat limits their expansion. We have instead gave more direct discussion on similar studies and their relevance to our present work in the Discussion section:

      Page 14, lines 13 to 31

      It is unclear what "realistic representation of local topologies in the network" (page 7, lines 28-31) means as a claim of novelty. If it means vessel "diameter variation", this geometric feature has been modeled by the works the author referenced (namely Roustaei et al. 2022, Zhou et al. 2021). If it means something else, for example, unsmooth or non-circular vessel surface (or "irregularity of the local endothelium surface" as mentioned on page 5, line 2), then strangely the effects of such features are actually not described in the manuscript.

      We apologize for not meeting the expectation of novelty as claimed. We see value in the SGM study matrix have now generated data on three SGM scenarios. The SGM1 in the revised manuscript matches WT network impedance in the ISVs by including both axial variation in lumen diameter of the WT network and the elliptical fit representation of cross-sectional skewness seen in WT ISV lumens. SGM 2 has representation of axial variation but not luminal skewness and SGM3 has only geometric average similarity to WT ISVs. Essentially the comparison between SGM1 and SGM2 highlights the role of luminal cross-sectional shape skewness while SGM2 to SGM3 highlights the role of axial variation in luminal diameter. With this new SGM data set, we think we can better qualify the aspiration of demonstrating how vessel shape “irregularities” can alter network hemodynamics. The new findings and discussion can be found in the revised manuscript here:

      Page 8, line 19 to page 9 line 36.

      Why should Fig. 8 contain data from Marcksl1KO model 2? The scenario underlying model 2 was rejected earlier in the manuscript (see point 6 above), and the Marcksl1KO model 2 data are not mentioned in the text when describing the results of Fig. 8, either.

      We have reanalyzed the experiment trend and rewritten the outcome of this results section. In summary, both models 1 and model 2 meet the trend of flow rate reduction (with respect to WT levels) in the CA observed in the experiment. Hence, model 2 inclusion is relevant to the WSS analysis. The changes pertaining to this can be found here:

      Page 11, line 9 to page 13 line 10.

      It is a dense article with loads of data, which is an advantage but only if appropriately streamlined. More subheadings should be considered, especially for section 2.3 (for which the current subsections appear mistaken, 2.3.1 followed by 2.4.2) The manuscript could also benefit from restructuring through optimal combination of simulation visualizations and quantitative analyses. For example, in Fig. 6, not all simulation snapshots are needed here (it is difficult to visually compare the changes between different cases), whereas some quantification in the form of histograms or boxplots will be handy for the readers to note the variation of WSS magnitudes and ranges.

      Thank you for the advice, we removed the unnecessary graphical plots and refer to simulation videos in supplementary data instead for such cases. The bad indexing of results subsections has been fixed, while new subsections have been made for better directional narrative to the paper. These changes are colored in red throughout the revised results section:

      Page 4, line 37 to page 13 line 39

      Related to point 8, the authors could also consider integrating or synthesizing the analyses for individual aISVs and vISVs presented in various figures. Current descriptions for the ISV data appear scattered with frequent exceptions to the summarized trends or relationships. Some minor formatting issues should also be addressed, e.g. the confusing color codes in Figs. 9D-i, E-i.

      Thank you for the advice, we have now pooled aISVs together into one group and vISVs into another, instead of discussing data trends on each of the 10 ISVs.

      The mispattening case presented in the end of the results section (section "2.4.2") is interesting but appears loosely connected to the preceding contents. Also, it seems not even mentioned in the discussion section.

      We agree that the mispatterning case has been only tangentially relevant to the rest of the manuscript. We have linked the topic thematically by network alterations transforming network flows. It is also now included in the discussion section here:

      Page 15, lines 30 to 34

      Finally, apart from the effect of topological features on local blood flow, the authors should consider the global flow redistribution arising from the network structure (useful refs. Include Chang et al. PLOS Computational Biology 2017: 10.1371/journal.pcbi.1005892; Meigel et al. Physical Review Letters 2019: 10.1103/PhysRevLett.123.228103; Schmid et al. eLife 2021: 10.7554/eLife.60208).

      Thank you for the additional references. These are solid pieces of work that have been added to the discussion here:

      Page 16, lines 3 to 10

      **Referees cross-commenting**

      This review report resonates with mine from an experimental perspective and I agree with all points made regarding issues of the current manuscript that the authors need to address with a revised version.

      Reviewer #2 (Significance (Required)):

      Significance: The particular merit of the work lies in its comprehensiveness of design and abundance of data, which will be of great interest to both the computational and experimental communities in this research field. However, some crucial details (especially with respect to the modelling aspects) are missing, thus hampering the scientific rigor and potential impact of the work. Furthermore, certain justifying statements appear speculative and inconclusive to explain the obtained data, especially regarding the effect of boundary conditions and systemic parameters. The citation of references (some not cited, some cited already but not properly discussed) also needs to be enhanced with engaging discussions to better bridge the findings of the current work (e.g. RBC partitioning in vascular network, effect of WSS on vasculature morphogenesis) with recent works on this research topic.

      References

      Fedosov DA, Caswell B, Karniadakis GE. 2010. A Multiscale Red Blood Cell Model with Accurate Mechanics, Rheology, and Dynamics. Biophys J 98:2215–2225. doi:10.1016/j.bpj.2010.02.002

      Freund JB, Goetz JG, Hill KL, Vermot J. 2012. Fluid flows and forces in development: functions, features and biophysical principles. Dev Camb Engl 139:1229–45. doi:10.1242/dev.073593

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript presents a detailed numerical model of blood flow in a region of the zebrafish vasculature.

      The results section is quite intense and detailed. it is difficult to understand what the authors are after. I think a rewrite would beneficial. The authors present simulations for a wild type and a couple of phenotypes. For each of these they speculate on the possible adaptation mechanism leading to the discussed phenotype, as preservation of constant wall shear stress. However, the comparison between experiments and numerical simulations is really elusive as the conclusions on those mechanisms. Overall we suggest a rewrite with clearer organisation in a way that the reader is not overflown with useless details.

      It is not always clear what info of the experiments are used in the simulations on top of the anatomy. Our understanding is that the pressure boundary conditions are set to match the red blood cel velocity observed in experiments. Is this always the case for the three phenotypes and which vessels ? There are about 7 inlets and outlets where to impose pressure boundary conditions. Can the author comment on the uniqueness of this problem? Can different combination of pressure boundary condition leading to the same result ? In how many points/vessels is the measured velocity matched ?

      The argument that similar beating frequency in the WT and GATA1 MO suggest pressure does not change is not clear. If the heart was a volumetric pump it would impose the same flow rate, not the same pressure. It would be more useful to measure the cardiac output in terms of flow rate in the Dorsal Aorta. Previous measurements by Vermot suggested the latter would not change much in gata1 MO. It could be that the cardiac output is the same but the vasculature network is different in a way that the shear stress remain the same. It does not look like this was checked by the authors.

      Additionaly, it would be useful to provide an effective viscosity for the different vessels, and an effective hydraulic impedance relating DP and Q to interpret the results.

      Is the hydraulic impedance of the vessels kept constant in the smooth-geometry model? This needs clarification

      As mentioned by the authors they propose a very complex and time expensive simulation. However the results they report are kind of intuitive. Given the availability of the experimental results, would it be useful to use a simpler red blood cell model in the future, to make their simulation more practical? Or clarify when such demanding simulations can add something new?

      The authors should check their references as this is not the first time work has been done on the topic. Would be good to have a check in the work of Freund JB and colleagues, as well as Dickinson and colleagues and Franco and colleagues to discuss how the work compares. There may be interesting work in modelling cardiac flow forces in the embryo too.

      Significance

      The authors discuss the applicability of a detailed numerical model of blood flow in a region of the zebrafish vasculature.

      We are not expert in the lattice boltzmann method used here, but the results are what it would be expected from a physical stand point, and together with the information from the method section, we do not have major concerns about the numerics.

    1. Author Response:

      The following is the authors' response to the original reviews.

      We would like to thank all Reviewers for their careful evaluation of our work. Below please find our responses and comments.

      Reviewer #1 (Recommendations For The Authors):

      1) The detection of cell-released GLP-1 is addressed in an indirect, averaged way in Fig. 2 - Supplement 1. This question seems like a good opportunity for an antagonist experiment (Exendin-9), which presumably would require much lower concentrations than those used to antagonize a saturating dose of GLP-1. It would also be much more convincing if GLPLight1 could be used to detect stimulated release of GLP-1 from the GLUTag cells.

      We tried multiple times to acutely stimulate GLUTag cells using Forskolin and IBMX, but unfortunately we did not observe any robust fluorescence increase of GLPLight1. The only observation that was consistent was the higher baseline fluorescence of GLPLight1, and the reduced maximal response to saturating GLP-1 when GLPLight1 expressing HEK cells were cultured overnight with GLUTag cells. We considered this assay to be at best qualitative and — despite the aforementioned attempts — could not determine quantitative values.

      2) The excitation-ratiometric response of the sensor, shown in Fig. 1D, is usually accompanied by strong pH-dependence of sensor function. It would be valuable to characterize this pH-dependence, using permeabilized cells in which the pH is changed; the ability of small (0.2-0.5 unit) pH changes to produce changes in fluorescence, as well as to affect the dynamic range of the sensor, should be characterized. This will prevent the misidentification of agents that affect cellular pH as having (for instance) an inhibitory effect on the binding of GLP-1 to GLPLight.

      The pH sensitivity of cpGFP-based sensors is a valid concern. However, considering that the cpGFP module from GLPLight1 is intracellular (and thus largely protected from potential extracellular pH changes) we assume that GLPLight1 signal should be robust in most in-vivo or cell-based assays. In fact we have previously characterized this for a similarly-built neuropeptide sensor (PMID: 35145320) and believe that this will be the case also for GLPLight1.

      3) The reported Kd for Exendin-9 is in the low nM range. Please explain the partial response at 1000x the concentration (including a discussion of the Kd of GLP-1 itself, as well as its off kinetics, and a comparison of this assay to the assays used previously).

      The partial response is due to the presence of 1 uM GLP-1 in the imaging buffer, which is in constant competition with Exendin-9 for the binding to GLPLight1. Because GLP-1 has similar affinity as Exendin9 (see for example PMIDs: 34351033 and 21210113) and both are present at saturating concentration, we did expect to observe a partial response from GLPLight1. In this study, we did not exactly determine the on and off kinetics of both GLP-1 and Exendin9 on the GLPLight1 sensor due to technical challenges: to perform these experiments, we would need to set up a perfusion system where we could remove the unbound ligand and either wash off the bound ligand with buffer or compete it out with an antagonist. Unfortunately, we currently do not have access to such a set up.

      4) Are the turn-on kinetics in Fig. 2C limited by drug application or by association? Are the on-rates much slower for the lower concentrations used for Fig. 2C? This is important for knowing how fast responses are likely to be at the lower concentrations likely to be achieved by endogenous release.

      If we consider Fig 2B and 2C, we assumed the on-kinetics to be mostly driven by association since the ligand is expected to be homogeneously distributed.

      The on-rate kinetics are indeed slower when lower concentrations of GLP-1 are used as shown in (Figure 2b) where we observe a TauOn of 4.7s with 10 uM GLP-1 and much slower kinetics when GLP-1 is applied a 1 uM for example (Figure 3d). As a result, we chose to incubate the ligand with GLPLight1 expressing cells for at least 30 minutes before the measurement of the dose-response to be close to equilibrium.

      5) The parameters for the fitted dose-response curves in Fig 2C should be listed. The ~4x discrepancy between the dose-response in HEK-293 cells and neurons should be discussed. Are there known auxiliary subunits, dimerization, or lipid dependence that might account for this? It seems important to understand this if the sensors are to be used in an assay that may compare different systems.

      We added the EC50 values to Fig 2C as requested. We did not consider a 4x discrepancy to be significant, because the measurement error in the EC50 region is relatively high and this difference seemed to be within the error range. In fact, the 95% confidence interval ranges are 7.8 to 11.1 nM in Neurons and 23.8 to 32.1 nM for HEK cells, if we consider the upper and lower boundaries of each, the difference drops to around 1-fold. We also performed a statistical test to compare the two fits (Extra sum of squares F-test) that confirmed the two fits were not significantly different (P value = 0.3736). Of course, the interaction partners and membrane composition are different in HEK cells and neurons and probably have an influence on the EC50 of GLPLight1, but their exact influence is unclear.

      6) It seems surprising that removal of the endogenous N-terminal secretory sequence is actually helpful for membrane expression. Do the authors have any suggested explanation for this?

      GLPLight1 contains an N-terminal hemagglutinin (HA) secretory motif. The hmGLP1R sequence that we chose also contained an endogenous secretory sequence that most likely interfered with the membrane transport mechanism and resulted in a lower sensor expression with both secretory sequences. We thus decided to keep the HA instead of endogenous to remain consistent with other sensors created in-house.

      7) In Fig. 1, supplement 3, are the transient responses real? Do they occur with the control construct?

      While we have not measured the G-protein recruitment on GLPLight-ctr, we have often observed this phenomenon for various receptors and ligands. The transient responses are thus most likely an artifact after manual addition of the ligand possibly due to:

      -       Temperature difference

      -       Exposure of the plate to ambient light before resuming measurement (phosphorescence)

      -       Re-suspension of the cells affecting the proximity to the detector

      -       Other unknown variables

      If these responses were real, we would also expect them to be more sustained over time.

      8) Please include a sentence or two explaining the luminescence complementation assay, and a reference.

      We updated the results section of the manuscript with a section describing the luminescence complementation assay along with a reference:

      “Next, we compared the coupling of GLPLight1 and its parent receptor (WT GLP1R) to downstream signaling. We first measured the agonist-induced membrane recruitment of cytosolic mini-G proteins and β-arrestin-2 using a split nanoluciferase complementation assay (Dixon et al., 2016). In this assay both the sensor/receptor and the mini-G proteins contains part of a functional luciferase (smBit on the sensor/receptor and LgBit for Mini-G proteins) that becomes active only when these two partners are in close proximity (Wan et al., 2018).”

      Bravo to the authors for already making the sensor plasmids available at addgene.com. It would be helpful to include the plasmid IDs and/or a URL in the manuscript.

      We would like to thank Reviewer #1 for noticing this. We have updated the data availability section of the manuscript and added the AddGene plasmid numbers of the constructs generated in this study.

      Reviewer #2 (Recommendations For The Authors):

      1) There are some parts of the introduction that need clarification. For example, GLP1 is quoted as an anorexigenic peptide, however, that is probably only true for centrally- derived GLP1. There is no evidence that enteroendocrine-derived GLP1 (the major pool) is anorexigenic- it is likely to be substantially degraded by DPPIV before reaching the brain. In any case, the discovery of GLP1 was always one of glucose-dependent insulin secretion, with the brain system being described decades later. Overall, the intro needs to be slightly reframed. While the tools presented here are more useful for assessment of central GLP1-releasing circuitry, they are ultimately based upon GLP1R signaling that is much better validated in the periphery.

      We have slightly reframed the introduction accordingly.

      2) "The human GLP1R (hmGLP1R) is a prime target for drug screening and drug development efforts, since GLP-1 receptor agonists (GLP1RAs) are among the most effective and widely-used weight-loss drugs available to date (Shah and Vella, 2014)." GLP1R was for two decades the breakthrough drug for treatment of type 2 diabetes mellitus and correction of glucose tolerance as assessed through HbA1c. It is only through reporting on millions of patients receiving GLP1RA that the weight loss effects were noted, leading to Phase1-3 trials and eventual approval for obesity indication. Again, some slight reframing of the introduction is required here.

      Also for this point, we have slightly reframed the introduction accordingly.

      3) GLP1 was applied at a maximal dose of 10 uM, which is 10-fold higher than maximal. Can the authors confirm absence of cytotoxic effects of exposing to peptide at such concentration? Ex4 (9-39) at such concentrations is usually cytotoxic at least in primary tissue.

      We did not observe any obvious cytotoxic effect of GLP-1 at this concentration in HEK293T cells or Neurons.

      4) "As expected, GLPLight1 responded to both GLP1RAs with almost maximal activation, on par with GLP1 (Figure 2a)." Such a claim is difficult to interpret without concentration-response curves, since the maximal concentration of liraglutide and semaglutide might not have been achieved in these experiments.

      We agree with this statement is difficult to interpret without further clarification. We know from the literature that GLP-1, liraglutide and semaglutide all have very high affinity to the hmGLP1R (PMID: 31031702). We also proved that GLPLight signal saturates at concentrations above 1 uM of GLP-1 (figure 2C), we thus applied a 10x excess of all ligands and considered this signal as maximal.

      5) "These results indicate that GLPLight1 can serve as a direct readout of pharmacological drug action on the hmGLP1R with higher temporal resolution than previously available approaches, such as downstream signaling assays (Zhang et al., 2020)." Many investigators use cAMP imaging to investigate GLP1R signaling, which is arguably of similar spatiotemporal resolution, also with the advantage of FRET quantification in some cases (e.g. EpacVV). Direct GLP1R signaling can also be inferred using cell lines heterologously-expressing GLP1R. Thus, the advantage of the current probes is that they can be used to readout direct GLP1R activation in native cells/tissues where promiscuous class B binding might limit signaling measures or where endogenous GLP1 release needs to be investigated.

      We have edited the manuscript text accordingly.

      6) "State-of-the-art techniques for detecting endogenous GLP-1 or glucagon release in vitro from cultured cells or tissues consist of costly and time-consuming antibody- based assays (Kuhre et al., 2016) or analytical chemistry procedures (Amao et al., 2015)." Agreed, but non-specificity/cross-reactivity of such assays is more prohibitive/problematic (e.g. against glicentin).

      We have edited the introduction accordingly.

      7) The studies using co-culture of GLUTag and GLP1Light1-HEK293 cells, whilst interesting, are not entirely convincing in their current form. Firstly, co-culture could influence GLP1Light expression levels (can the authors label FLAG?). Secondly, specificity of the response is not tested e.g. by adding Ex4 (9-39). Thirdly, titration with GLUTag conditioned media is not performed.

      We partially addressed this issue in the answer to comment #1 from Reviewer #1. We previously performed a FLAG staining of GLPLight1 in the presence or absence of GLUTag cells and we did not notice any obvious difference. This goes in line with the fact that GLPLight1 is signaling inert, and the presence of GLP1 should not interfere with the surface expression of the sensor. We also checked that HEK293T cells did not express high levels of GLP1R according to the BioGPSCell line Gene Expression profile (https://maayanlab.cloud/Harmonizome/gene_set/HEK293/BioGPS+Cell+Line+Gene+Expression+Profiles).

      We also tried to add GLUTag media after stimulation in bolus to GLPLight1 expressing cells and observed no response. This indicated that the “sniffer” cells must be present in close proximity to GLUTag cells for an extended period of time to observe any substantial difference in response, justifying our choice of experimental setup.

      8) "Given that our photocage was placed at the very N-terminus of photo-GLP1, our results show that this caging approach prevents the peptide's ability to activate GLP1R but, at the same time, preserves its ability to interact with the ECD." An alternative hypothesis is that PhotoGLP1 does activate GLP1R, but this is undetectable with the sensitivity of GLP1Light. PhotoGLP1 cAMP concentration-response assays are needed (uncaged versus cage) to properly characterize and validate the compound (as would be standard for any newly-described GLP1R peptide ligand).

      While we agree that there is a chance that Photo-GLP1 could activate GLP1R at high concentrations, we think that the characterization of Photo-GLP1 has to be determined by the end user directly with the technique of choice (GLPLight1 in our case) in order to get a reliable comparison of potency and efficacy. We modified the text accordingly to more accurately reflect the direct conclusions from our data, as follows:

      “our results show that this caging approach prevents the peptide's ability to activate GLPLight1”.

      9) "Surprisingly, GLPLight1 shows a fluorescent response in all three uncaged areas, while its fluorescence remained unaltered throughout the rest of the FOV, indicating high spatial localization of the response to GLP-1 (Figure 3f)." Why is this surprising?

      We agree that this result is, indeed, not surprising and would like to thank Reviewer #2 for spotting this mistake, which has now been corrected in the manuscript.

      10) The localized PhotoGLP1 experiments are interesting and show the utility of the ligand. There is however activation outside of the region of uncaging, which would argue against a pre-bound ECD mode of action. Possibly some PhotoGLP1 is pre- bound to the ECD, and some is freely diffusing? Alternatively, the scan area might be below the diffraction limit/accuracy of the microscope?

      We would like to thank Reviewer #2 for this comment and agree with their observation. There could be some free Photo-GLP1 that gets photo-activated and binds regions around the uncaging area (similar to what has been observed for Photo-OXB:,PMID: 36481097). The activation around the uncaging area could also be due to lateral diffusion of the activated receptor on the membrane. There is also most likely some light diffraction at the uncaging area that could account for this phenomenon. To increase the spatial resolution, future studies could involve uncaging during sensor imaging via two-photon microscopy.

      11) What was the rationale for caging native GLP1, which is then susceptible to DPPIV-mediated degradation? Would the N-terminal cage and first 2 amino acids also not be cleaved by DPPIV, thus rendering the tool of limited in vivo application? Conversely, PhotoGLP1 provides a template for similar light-activated (stabilized) GLP1R agonists such as Ex4 or liraglutide.

      Thank you for making us aware of this (in vivo) limitation. We designed photoGLP1 as a tool for neurobiological experiments in the brain, where DPPIV expression would be low compared to peripheral organs (https://www.proteinatlas.org/ENSG00000197635-DPP4/tissue). We also envisage that the presence of the photocage would be enough to hinder the binding to DPP4 that cuts the first 2 AA. This hypothesis, however, was never tested experimentally, and we, therefore, acknowledge the limitation in the manuscript. We would furthermore like to thank the reviewers for his comment on additional photo-caged GLP1 agonists, which could be developed future studies.

      12) It wasn't clear how GLP1Light could be used as a HTS screen for drug discovery? Surely, conventional systems (e.g. GLP1R + BAR/Ca2+/cAMP reporting) allow signal bias, an important component of GLP1RA action, to be assessed. Or could GLP1Light1 be used as a pre-screen to exclude any ligands that do not orthosterically bind GLP1R?

      We would like to thank Reviewer #2 for this comment and would like to offer some clarification. We indeed thought that GLPLight1 could be used as a first line of screening to exclude ligands that do not bind in the orthosteric pocket. It is also a rather flexible method as the fluorescence increase of those sensors can be monitored using various techniques/devices that are available in most labs (e.g. microscopy, plate reader, flow cytometry).

      13) Limitations of GLP1Light1 and PhotoGLP1 are not acknowledged in the discussion.

      We would like to thank Reviewer #2 for pointing out the lack of description of the limitations of these tools, which have now been added to the Discussion.

      14) Full characterization of PhotoGLP1 is missing, to include UV/Vis, Tr and HRMS.

      PhotoGLP1 was fully characterized by UV/Vis and HRMS, and all experimental and analytical data was uploaded as supplementary data when the manuscript was initially submitted for publication in eLife.

      Reviewer #3 (Recommendations For The Authors):

      1) The ~1000 fold lower EC50 for GLP1 of GLPLight1 compared with native GLP1R needs to be openly acknowledged as a major limitation of the sensor, as this will substantially reduce the types of experiment for which it will be useful. Because it needs 1000 times higher GLP1 levels than wild type GLP1R to be activated, it is unlikely, for example, to be useful for monitoring the dynamics of activation of native GLP1R in vivo. The claim that the sensor could be used for in vivo imaging for fibre photometry is therefore an exaggeration.

      We would like to first thank Reviewer #3 for this comment and to further provide some clarification. We recognized that the data presented in this manuscript might have been confusing when comparing the affinity of GLP1R (using cAMP) and GLPLight1 (using the fluorescence increase because there is no coupling to cAMP). We believe that the low EC50 measured in the cAMP assay cannot accurately be compared to GLPLight1 response because it is an enzymatically amplified process. In order to support this claim, we included another set of experiments where we titrated agonist- induced recruitment of miniGs protein to the GLP1R receptor and found an EC50 of 3.8 nM for native GLP-1 using this assay (added as panel l in Figure1 Supplement 3). We thus confirmed that the nature of the assay itself has a drastic influence on the EC50 measured and it is not unusual to observe 100x fold difference of EC50 for the same receptor-ligand pair.

      We believe that the miniGs protein recruitment is a better comparison to GLPLight1 because it is not enzymatically amplified. This assay reveals that GLPLight1 has around 8-fold lower affinity to GLP1 compared to its parent receptor, which is in line with the EC50 loss observed previously for other GPCR-based sensors of this class. We are thus confident that GLPLight1 has to potential to be used in vivo under specific circumstances, specifically in brain tissue. We elaborated on this point in the Discussion part of the manuscript.

      2) Fig2 suppl 1 is described as demonstrating a reduced response of GLPLight1 to GLP-1 when HEK cells with were cultured with GLUTag cells. However, it is speculation to conclude that this is because GLP1Light1 was partially pre-activated by endogenous GLP-1, without demonstrating the response of GLPLight1 before and after GLUTag cell stimulation. Unless additional data are generated, the presented data do not convincingly demonstrate that GLP1Light1 can detect GLP1 released from GLUTag cells.

      We would like to thank Reviewer #3 for this comment which has been addressed already in the replies to Comment#1 from Reviewer #1 and Reviewer #2.

      3) The authors should openly acknowledge that photo-uncaging the GLP1 probe might not be very helpful for monitoring the temporal dynamics of the GLP1-GLP1R interaction, because unless all the photocaged glp1 is released by the light stimulus, the activation of photo-released GLP1 will be slowed by the remaining caged GLP1, and the dynamics will be slower than for native GLP1. This makes it unsuitable for many temporal questions, although it might be useful to deliver GLP1 in a spatial restricted manner.

      We do agree that the biggest advantage of Photo-GLP1 is its ability to be activated in a very localized manner. We also agree that the presence of caged Photo-GLP1 will influence the binding of the uncaged GLP-1. Nevertheless, there is still an advantage of using Photo-GLP1 in some assays such as pharmacological activation on brain slices. In fact, we have shown for our Photo-OXB molecule that the perfusion of OXB was much slower at eliciting neuronal depolarization compared to uncaging of Photo- OXB (see PMID: 36481097). We think that this was mainly due to the slow diffusion kinetics of the peptide into the brain tissue. We also think that uncaging can provide a more controlled activation with varying laser power and uncaging duration.

      4) To claim (as currently in the discussion) that GLPLight1 has potential to be used for investigating the dynamics of endogenous GLP1, the authors would need to compare the dynamics of the GLP1Light sensor with wild type GLP1R. We do not know that its activation dynamics will reproduce native glp1r.

      We would like to thank Reviewer #3 for this comment and would like to offer some clarification. Since GLPLight1 does not couple to intracellular signaling, it was impossible to compare its activation kinetics to GLP1R WT using the same assay. However, we can offer a relative comparison since we know that GLPLight1 takes around 50 seconds to be activated using 1 µM GLP-1 (figure 2B) and that it takes a similar time for GLP1R to be activated in the miniG protein recruitment assay (Fig 1 Supplement 3) using 100 nM GLP-1. Considering that GLPLight1 has a lower affinity than the GLP1R (8-10x lower), we think that the activation kinetics of both the sensor and GLP1R are comparable.

      Additional comments:

      1) In fig 2A,B, it is not clear whether the trace shows a partial reversal of GLP1- triggered activation by Ex9, or Ex9-independent receptor desensitization. A control trace is required to show the kinetics of GLP1-triggered activation without the addition of Ex9.

      We would like to thank Reviewer #3 for this comment. We can exclude the possibility of Ex9-independent desensitization because GLPLight1 has been shown to be signaling inert to all G-proteins, Beta arrestin-2 and cAMP. Moreover, we have observed that the fluorescence signal was stable for more than 30 minutes for the GLP-1 titrations, even at high concentrations of ligand.

      2) It would be helpful if the pEC50 for WT GLP1 were also shown in table 1, for comparison with the GLP1 mutants.

      We would like to thank Reviewer #3 for this comment, and we have now added the respective pEC50 for WT GLP1 to Table 1.

      3) Fig2 suppl 1. The methods and analysis for this figure are inadequately explained. To show that the HEK-GLPLight1 cells are responding to GLP1 released from GLUTag cells, the GLPLight1 response needs to be shown before and after GLUTag cell stimulation with an agent that should trigger GLP-1 release.

      We would like to thank Reviewer #3 for this comment which has been partially addressed already in the replies to Comment#1 from Reviewer #1 and Reviewer #2.

      Since we did not observe any response to acute stimulation of GLUTag cells we considered the high glucose concentration present in the culture media being a stimulation agent for GLUTag cells, which has been previously reported (PMID: 17643200).

      4) Fig 3g and others: The end of the photo activation period needs to be represented correctly on the timeline. In 3g, the bar that should indicate when photoactivation was applied does not end at the zero time point (which is labelled as the time relative to photoactivation).

      We would like to thank Reviewer #3 for pointing this out. The shaded area representing the photo-activation has been matched accordingly.

      5) Discussion para 1: the authors claim their data show that ligand induced activation of human GLP1R occurs more slowly than others similar GPCR sensors - they should give actual data to substantiate this claim, since the time course of glp1r activation has not been analysed and compared with other sensors in the manuscript.

      We added data to support this claim to the discussion: “As a reference, other previously-characterized class-A GPCR-based neuropeptide biosensors showed sub- second activation kinetics (Duffet et al., 2022a; Ino et al., 2022).”

      6) Methods: what wavelength was used for recording emission from GLP1Light1? The excitation wavelength is given, but I can't see the emission wavelength(s). In fig 1d, the excitation and emission spectra should be depicted in different colours/line properties, otherwise this figure is very confusing.

      We updated figure1d and changed the colors to improve data visualization. Regarding the missing wavelength, we would like to clarify that both wavelengths were already described in the methods section as: “The excitation and emission spectra were measured at λem =560nm and λex\= 470nm, respectively, on a TECAN M200 Pro plate reader at 37 °C. “. We would be happy to rewrite this paragraph, if necessary, shall it remain unclear to the reader.

    1. Sites like 4chan and 8chan bill themselves as sites that support free-speech, in the sense that they don’t ban trolling and hateful speech, though they may remove some illegal content, like child pornography. One thing these sites do ban though, is spam. While much of spam is certainly legal, and a form of speech, this speech is restricted on these sites. If the chat boards filled up with spam, the users would find it boring and leave, so for practical reasons, these sites still moderate for spam (though they may allow some uses of ironic spam, copypasta)

      I think there is definitely an interesting point to be made if we should restrict certain parts of free-speech. Do we get rid of hate-speech and spam even when they are harmful? Does that contradict democracy or uphold it? I don't know that I can answer these questions as online free-speech would set precedent to a slew of things (online +in person) and requires much more knowledge than I can provide as a freshman in college lol.

    1. Author Response:

      Reviewer #1 (Public Review):

      This manuscript features a key technical advance in single-molecular force spectroscopy. The critical advance is to employ a click chemistry (DBCO-cycloaddition) for making a stable covalent connection between a target biomacromolecule and solid support in place of conventional antigen-antibody binding. This tweak dramatically improves the mechanical stability of the pulling system such that the pulling/relaxation can be repeated up to a thousand times (the previous limit was a few hundred cycles at best). This improvement is broadly applicable to various molecular interactions and other types of single-molecule force spectroscopy allowing for more statistically reliable force measurements. Another strength of this method is that all conjugation steps are chemically orthogonal (except for Spy-catcher conjugation to the termini of a target molecule) such that the probability of side reactions could be reduced.

      The reliability of kinetic and thermodynamic parameters obtained from single-molecule force spectroscopy depends on statistics, that is, the number of pulling measurements and their distribution. By extending the number of measurements, this robust method enables fundamental/critical statistical assessment of those parameters. That is, it is an important and interesting lesson from this study that ~200 repeats can yield statistically reasonable parameters.

      The authors carried out carefully designed optimization steps and inform readers of the critical aspects of each. The merit, quality, and rigor as a method-oriented manuscript are impressive. Overall, this is an excellent study.

      We appreciate for the positive evaluation for our work. Additionally, the minor suggestions were helpful to improve our manuscript. Thank you!

      Reviewer #2 (Public Review):

      In this study, the authors have developed methods that allow for repeatedly unfolding and refolding a membrane protein using a magnetic tweezers setup. The goal is to extend the lifespan of the single-molecule construct and gather more data from the same tether under force. This is achieved through the use of a metal-free DBCO-azide click reaction that covalently attaches a DNA handle to a superparamagnetic bead, a traptavdin-dual biotin linkage that provides a strong connection between another DNA handle and the coverslip surface, and SpyTag-SpyCatcher association for covalent connection of the membrane protein to the two DNA handles.

      The method may offer a long lifetime for single-molecule linkage; however, it does not represent a significant technological advancement. These reactions are commonly used in the field of single-molecule manipulation studies. The use of multiple tags including biotin and digoxygenin to enhance the connection's mechanical stability has already been explored in previous DNA mechanics studies by multiple research labs. Additionally, conducting single-molecule manipulation experiments on a single DNA or protein tether for an extended period of time (hours or even days) has been documented by several research groups.

      One of the unique features of our work is the development of a robust single-molecule tweezer method that is applicable to membrane proteins, rather than simply making another stable system. As re-written in Introduction, it is not straightforward as we have to consider the membrane reconstitution. We believe that our work is expected to overcome the bottleneck in membrane protein studies that arises when using single-molecule tweezer methods.

      To improve the delivery of the contextual information, we revised Introduction, Results, and Discussion. The first four paragraphs in the Introduction briefly review previous tweezer methods with an improved stability and delineate where our work is placed. In the first paragraph of the Results, we also briefly discussed how and why our DBCO tethering strategy differs from previous DBCO methods. In the first paragraph of the Discussion, we compared the previous methods regarding the stability improvement.

      Additionally, the revised manuscript now includes new findings – the full dissection of structural transitions of a helical membrane protein, the observation of hidden helix-coil transitions at a constant force, and the estimation of kinetic pre-exponential factors. We believe that the new findings provide important insights into membrane protein folding, in addition to the usefulness of our method itself for membrane protein studies. We extensively edited the main text and Methods accordingly. Relevant figures are Figures 6 and 7, Figure 6–figure supplements 1–3, and Figure 7–source data 1.

      Reviewer #3 (Public Review):

      The authors describe a method to tether proteins via DNA linkers in magnetic tweezers and apply it to a model membrane protein. The main novelty appears to be the use of DBCO click chemistry to covalently couple to the magnetic bead, which creates stable tethers for which the authors report up to >1000 force-extension cycles. Novel and stable attachment strategies are indeed important for force spectroscopy measurements, in particular for membrane proteins that are harder and therefore less studied in this regard than soluble proteins, and recording >1000 stretch and release cycles is an impressive achievement. Unfortunately, I feel that the current work falls short in some regards to exploring the full potential of the method, or at least does not provide sufficient information to fully assess the performance of the new method. Specific questions and points of attention are included below.

      We appreciate for the positive evaluation. We were able to largely improve our manuscript while preparing our responses to the comments. Thank you!

      - The main improvement appears to be the more stable and robust tethering approach, compared to previous methods. However, the stability is hard to evaluate from the data provided. The much more common way to test stability in the tweezers is to report lifetimes at constant force(s). Also, there are actually previous methods that report on covalent attachment, even working using DBCO. These papers should be compared.

      As shown in Figure 4E, we evaluated the robustness of our method in a way suggested by you – the lifetime measurement at a constant force. Specifically, ~12 hours at 50 pN. Definitely, our tweezer approach established here is the most robust method for membrane protein studies. Please refer to the section “Assessing robustness of our single-molecule tweezers” in page 7 and line 31.

      We discussed the previous covalent methods for which quantitative data are presented in light of the system stability. Please refer to the first paragraph of Discussion. We also briefly discussed how and why our DBCO tethering strategy differs from previous DBCO methods, in the first paragraph of Results.

      - The authors use the attachment to the surface via two biotin-traptavidin linkages. How does the stability of this (double) bond compare to using a single biotin? Engineered streptavidin versions have been studied previously in the magnetic tweezers, again reporting lifetimes under constant force, which appears to be a relevant point of comparison.

      The papers in this comment showed that the tethering lifetimes of biotin-streptavidin variants were affected by the asymmetric bead anchoring point. However, the situation does not apply to our work as we do not anchor traptavidin to beads. Besides, the stability comparison between the single- and double-biotin systems is not the main point of our work, so we do not have the answer to the question. However, we cited the reference in the first paragraph of Discussion where we discuss the system stability.

      - Very long measurements of protein unfolding and refolding have been reported previously. Here, too, a comparison would be relevant.

      We briefly discussed the relevant previous works in the first paragraph of Discussion.

      In light of this previous work, the statement in the abstract "However, the weak molecular tethers used in the tweezers limit a long time, repetitive mechanical manipulation because of their force-induced bond breakage" seems a little dubious. I do not doubt that there is a need for new and better attachment chemistries, but I think it is important to be clear about what has been done already.

      The sentence is in Abstract, so we also had to consider the conciseness. By simply adding the phrase “used for the membrane protein studies”, we can place our work into a more proper context.

      In page 2 and line 3, “…However, the weak molecular tethers used for the membrane protein studies have limited long-time, repetitive molecular transitions due to force-induced bond breakage…”

      - Page 5, line 99: If the PEG layer prevents any sticking of beads, how do the authors attach reference beads, which are typically used in magnetic tweezers to subtract drift?

      The PEG layer consists of biotin-PEG and methyl-PEG at a 1:27.5 molar ratio. As the reference beads are coated with streptavidin, they are attached to the PEG layer by the regular biotin-streptavidin interaction. In page 19 and line 7, you can refer to “…The polystyrene beads are attached to the PEG surface via biotin-streptavidin interaction. The beads are used as reference beads for the correction of microscope stage drifts…”

      - Figure 3 left me somewhat puzzled. It appears to suggest that the "no detergent/lipid" condition actually works best, since it provides functional "single-molecule conjugation" for two different DBCO concentrations and two different DNA handles, unlike any other condition. But how can you have a membrane protein without any detergent or lipid? This seems hard to believe.

      We explained the raised point in page 6 and line 18,

      “…Indeed, the best condition was in the absence of any detergents or lipids (Figure 3; no detergents/lipids only during the conjugation step). This situation is possible because membrane proteins are sparsely tethered to the chamber surface, which kept them from aggregating. However, not using detergents or lipids means that the membrane proteins are definitely deformed from their native folds. Therefore, we sought an optimal solubilization condition for membrane proteins during the DBCO-azide conjugation step...”

      Figure 3 also seems to imply that the bicelle conditions never work. The schematic in Figure 1 is then fairly misleading since it implies that bicelles also work.

      The buffer conditions shown in Figure 3 are those ONLY during the DBCO-azide conjugation step. In this step, the bicelle conditions did not work. Therefore, after the conjugation in 0.5% DDM, the buffer was exchanged with a bicelle solution. This process is shown in Figure 2 and the finally assembled system is depicted in Figure 1.

      To clarify this point, we put a note “Buffer conditions only during the DBCO-azide conjugation step” just above the buffer conditions in Figure 3. You can also find for the relevant exchange step in page 6 and line 31, “…Following a 1 h incubation of the beads in the single-molecule chamber at 25°C, unconjugated beads were washed, and the detergent micelles were exchanged with bicelles to reconstitute the lipid bilayer environment for membrane proteins…”

      - When it comes to investigating the unfolding and refolding of scTMHC2, it would be nice to see some traces also at a constant force. As the authors state themselves: magnetic tweezers have the advantage that they "enable constant low-force measurements" (page 8, line 189). Why not use this advantage?<br /> In particular, I would be curious to see constant force traces in the "helix coil transition zone". Can steps in the unfolding landscape be identified? Are there intermediates?

      Yes, please refer to Figure 6. We were able to dissect three distinct transitions from the fully unstructured state to the native state, including the helix-coil transitions. We also reconstructed the folding energy landscape using a deconvolution method.

      Please refer to the pertinent sections in the main text, which are titled “Structural transitions and folding energy landscape over extended time scales” and “Mechanistic dissection of folding transitions”.

      - Speaking of loading rates and forces: How were the forces calibrated? This seems to not be discussed.

      We wrote an additional section in Methods titled “Instrumentation of single-molecule magnetic tweezers”, where we discuss the force calibration. For the actual force calibration data, please see Figure 4–figure supplement 1A.

      In page 20 and line 10, “…The mechanical force applied to a bead-tethered molecule was calibrated as a function of the magnet position using the formula F = k_B_T∙L/δx_2 derived from the inverted pendulum model96, where _F is the applied force, k_B is the Boltzmann constant, _T is the absolute temperature, L is the extension, and _δx_2 is the magnitude of lateral fluctuations…”

      And how were constant loading rates achieved? In Figure 4 it is stated that experiments are performed at "different pulling speeds". How is this possible? In AFM (and OT) one controls position and measures force. In MT, however, you set the force and the bead position is not directly controlled, so how is a given pulling speed ensured?<br /> It appears to me that the numbers indicated in Figures 4A and B are actually the speeds at which the magnets are moved. This is not "pulling speed" as it is usually defined in the AFM and OT literature. Even more confusing, moving the magnets at a constant speed, would NOT correspond to a constant loading rate (which seems to be suggested in Figure 4A), given that the relationship between magnet positions and force is non-linear (in fact, it is approximately exponential in the configuration shown schematically in Figure 1).

      You are correct, so we simply modified the “pulling speed” to “magnet speed” in the figure caption. The loading rates provided in the figure (with the notation <>) were average loading rates in 1–50 pN to provide rough estimates. We actually specified it in the caption as “average force-loading rate”. However, this can be misleading at a glance, so we just deleted all the loading-rate values in the figure and caption.

      - Finally, when it comes to the analysis of errors, I am again puzzled. For the M270 beads used in this work, the bead-to-bead variation in force is about 10%. However, it will be constant for a given bead throughout the experiment. I would expect the apparent unfolding force to exhibit fluctuations from cycle to cycle for a given bead (due to its intrinsically stochastic nature), but also some systematic trends in a bead-to-bead comparison since the actual force will be different (by 10% standard deviation) for different beads. Unfortunately, the authors average this effect away, by averaging over beads for each cycle (Figure 4). To me, it makes much more sense to average over the 1000 cycles for each bead and then compare. Not surprisingly, they find a larger error "with bead size error" than without it (Figure 5A). However, this information could likely be used (and the error corrected), if they would only first analyze the beads separately.

      We might be wrong, but there seems to be a misunderstanding. First, we added Figure 5–figure supplement 1 where you can see individual traces. As expected, the levels of unfolding forces/sizes appear consistent during the progress of pulling cycles. Second, the advantage of averaging for different beads is that you can effectively remove the bead size effect. This “averaging-out” is the key strategy in our kinetic analysis. Based on the error estimation, if you average the values of kinetic parameters obtained from different beads, you can then estimate them with reasonably small errors despite the bead size variations. This becomes more evident after initial hundreds of pulling cycles. The errors for 200 and 1000 cycles are of only ~1% difference, indicating that you do not need to blindly run the pulling cycles. These results are based on the “averaging-out” strategy, which is the merit of our analysis. For more details, please see the section in the main text titled “Assessing statistical reliability of pulling-cycle experiments”, where relevant figures, figure supplements, and Method sections are referred.

      What is the physical explanation of the first fast and then slow decay of the error (Figure 5B)? I would have expected the error for a given bead after N pulling cycles to decrease as 1/sqrt(N) since each cycle gives an independent measurement. Has this been tested?

      If the sampling was from one population (here, unfolding probability profile), the error would follow a 1/√n decay as expected for the standard error. In our analysis, however, we estimated the expected “mean” errors, regardless of detailed shapes of the unfolding probability profiles. To this end, we sampled the data from different possible profiles (shown in Figure 5–figure supplement 5). We then averaged all the error plots to obtain the plot of the mean errors during progress of pulling cycles (black curve in Figure 5D). In this case, the plot does not have to follow the standard error curve represented by the factor 1/√n.

      We tested this by fitting with the model function of y = A/√n, for various lower limit of N = 10, 30, 50, 100, 300, and 500 in the regression analysis (Figure 5–figure supplement 6). The results of the reduced chi-square (χ2) used for a goodness-of-fit test (χ2 = 1 for the best fit) indicates that the two-term exponential model (χ2 = 1.60) shows a better fit than the reciprocal square root model (χ2 = 2.30–6.01). The regression model adopted in our analysis is a phenomenological model that more properly describes the error decay curve. The trend of the first fast and then slow decay is not unusual because it is also expected for the reciprocal square root model – the plot 1/√n decays fast and then slowly, too (Figure 5–figure supplement 6).

    1. Author Response

      Reviewer #1 (Public Review):

      Estimating the effects of mutations on the thermal stability of proteins is fundamentally important and also has practical importance, e.g, for engineering of stable proteins. Changes can be measured using calorimetric methods and values are reported as differences in free energy (dG) of the mutant compared to wt proteins, i.e., ddG. Values typically range between -1 kcal/mol through +7 kcal/mol. However, measurements are highly demanding. The manuscript introduces a novel deep learning approach to this end, which is similar in accuracy to ROSETTA-based estimates, but much faster, enabling proteomewide studies. To demonstrate this the authors apply it to over 1000 human proteins.

      The main strength here is the novelty of the approach and the high speed of the computation. The main weakness is that the results are not compared to existing machine learning alternatives.

      We thank Prof. Ben-Tal for taking the time to assess our work, and for his comments and suggestions below.

      Reviewer 2 (Public Review):

      Summary:

      This work presents a new machine-learning method, RaSP, to predict changes in protein stability due to point mutations, measured by the change in folding free energy ΔΔG.<br /> The model consists of two coupled neural networks, a 3D selfsupervised convolutional neural network that produces a reduceddimensionality representation of the structural environment of a given residue, and a downstream supervised fully-connected neural network that, using the former network's structural representation as input, predicts the ΔΔG of any given amino-acid mutation. The first network is trained on a large dataset of protein structures, and the second network is trained using a dataset of the ΔΔG values of all mutants of 35 proteins, predicted by the biophysics-based method Rosetta.

      The paper shows that RaSP gives good approximations of Rosetta ΔΔG predictions while being several orders of magnitude faster. As compared to experimental data, judging by a comparison made for a few proteins, RaSP and Rosetta predictions perform similarly. In addition, it is shown that both RaSP and Rosetta are robust to variations of input structure, so good predictions are obtained using either structures predicted by homology or structures predicted using AlphaFold2.<br /> Finally, the usefulness of a rapid approach such as RaSP is clearly demonstrated by applying it to calculate ΔΔG values for all mutations of a large dataset of human proteins, for which this method is shown to reproduce previous findings of the overall ΔΔG distribution and the relationship between ΔΔG and the pathological consequences of mutations. The RaSP tool and the dataset of mutations of human proteins are shared.

      Strengths:

      The single main strength of this work is that the model developed, RaSP, is much faster than Rosetta (5 to 6 dex), and still produces ΔΔG predictions of comparable accuracy (as compared with Rosetta, and with the experiment). The usefulness of such a rapid approach is convincingly demonstrated by its application to predicting the ΔΔG of all single-point mutations of a large dataset of human proteins, for which using this new method they reproduce previous findings on the relationship between stability and disease. Such a large-scale calculation would be prohibitive with Rosetta. Importantly, other researchers will be able to take advantage of the method because the code and data are shared, and a google colab site where RaSP can be easily run has been set up. An additional bonus is that the dataset of human proteins and their RaSP ΔΔG predictions, annotated as beneficial/pathological (according to the ClinVar database) and/or by their allele frequency (from the gnomAD database) are also made available, which may be very useful for further studies.

      Weaknesses:

      The paper presents a solid case in support of the speed, accuracy, and usefulness of RaSP. However, it does suffer from a few weaknesses.

      The main weakness is, in my opinion, that it is not clear where RaSP is positioned in the accuracy-vs-speed landscape of current ΔΔGprediction methods. The paper does show that RaSP is much faster than Rosetta, and provides evidence that supports that its accuracy is comparable with that of Rosetta, but RaSP is not compared to any other method. For instance, FoldX has been used in large-scale studies of similar size to the one used here to exemplify RaSP. How does RaSP compare with FoldX? Is it more accurate? Is it faster? Also, as the paper mentions in the introduction, several ML methods have been developed recently; how does RaSP compare with them regarding accuracy and CPU time? How RaSP fares in comparison with other fast approaches such as FoldX and/or ML methods will strongly affect the potential usefulness and impact of the present work.

      Second, this work being about presenting a new model, a notable weakness is that the model is not sufficiently described. I had to read a previous paper of 2017 on which this work builds to understand the self-supervised CNN used to model the structure, and even so, I still don't know which of 3 different 3D grids used in that original paper is used in the present work.

      A third weakness is, I think, that a stronger case needs to be made for fitting RaSP to Rosetta ΔΔG predictions rather than experimental ΔΔGs. The justification put forward by the authors is that the dataset of Rosetta predictions is large and unbiased while the dataset of experimental data is smaller and biased, which may result in overfitting. While I understand that this may be a problem and that, in general, it is better to have a large unbiased dataset in place of a small biassed one, it is not so obvious to me from reading the paper how much of a problem this is, and whether trying to fix it by fitting the model to the predictions of another model rather than to empirical data does not introduce other issues.

      Finally, the method is claimed to be "accurate", but it is not clear to me what this means. Accuracy is quantified by the correlation coefficient between Rosetta and RaSP predictions, R = 0.82, and by the Mean Absolute Error, MAE = 0.73 kcal/mol. Also, both RaSP and Rosetta have R ~ 0.7 with experiment for the few cases where they were tested on experimental data. This seems to be a rather modest accuracy; I wouldn't claim that a method that produces this sort of fit is "accurate". I suppose the case is that this may be as accurate as one can hope it to be, given the limitations of current experimental data, Rosetta, RaSP, and other current methods, but if this is the case, it is not clearly discussed in the paper.

      We thank the reviewer for their detailed comments and suggestions.

      As discussed in our general comments above and also below, we have now added additional benchmarking, making it easier to compare the accuracy of RaSP with other methods. Regarding the model description, we have now added a more detailed description of also the 3D CNN.

      Regarding whether to fit the model to experiments or computational data, we agree that it is not clear cut that the former would also not work. Indeed, a main problem is that in both cases it is hard to answer which approach is better because of the scarcity of experimental data. One major problem with the larger sets of experimental data is, as we mention, the bias and variability; another is the provenance. While some databases exist, they are rarely exactly raw data, and for example may contain ∆∆G values estimated from ∆Tm values. In the revised manuscript we now explain better why we chose to target Rosetta, but also acknowledge that one might also have used experiments.

      As to the question of accuracy, we agree completely that the methods could be better. One problem, however, is that it is very difficult to answer how much better because of problems with experiments. As mentioned also by reviewer 1, variation across different experiments suggest that even a “perfect” predictor would only achieve Pearson correlation coefficients in the range 0.7–0.8 (https://doi.org/10.1093/bioinformatics/bty880). Clearly, this is an issue with imperfect data curation (it is possible to measure ∆∆G quite accurately), but in the absence of larger and better curated experiments, one will not expect much better accuracy than what we report here. This is now discussed in the revised manuscript.

      Reviewer 3 (Public Review):

      The authors present a machine learning method for predicting the effects of mutations on the free energy of protein stability. The method performs similarly to existing methods, but has the advantage that it is faster to run. Overall this is reasonable and a faster method will likely have some potential uses. However, not improving performance beyond the reasonable but not great performance of existing methods of course makes this a less useful advance. The authors provide predictions for a set of human proteins, but the impact of their method would be much greater if they provided predictions for all substitutions in all human proteins, for example. In places the text somewhat overstates the performance of computational methods for predicting free energy changes and is potentially misleading about when ddGs are predicted vs. experimentally measured. In addition, the comparison to existing methods is rather slim and there isn't a formal evaluation of how well RASP discriminates pathological from benign variants.

      We thank the reviewer for taking time to read our work and for their various suggestions.

    1. Author Response

      Reviewer #1 (Public Review):

      Alignment between high dimensional data which express their dynamics in a subspace is a challenge which has recently been addressed both with analytic-based solutions like the Procrustes transformation, and, most interestingly, via deep learning approaches based on adversarial networks. The authors have previously proposed an adversarial network approach for alignment which relied on first dimensionally-reducing the binned neural spikes using an autoencoder. Here, they use an alternative approach to align data without use of an initial dimensional-reduction step.

      The results are fairly clear - the Cycle-GAN approach works better than their previous ADAN approach and one based on dimensionality reduction followed by the Procrustes transform. In general, a criticism of this entire field is to understand what alignment teaches us about the brain or how it specifically will be used in a BCI context.

      There are a few issues with the paper.

      1.) To increase the impact of their work, the investigators have now used it to align data in multiple types of tasks. There was an unanswered question about this related to neuroscience - does alignment in one task predict alignment for another?

      This is a great question! We anticipate that it will be challenging for an alignment learned on one task to be used on another task, because we know that M1 decoders trained on data from one behavior often do not generalize when tested using a different behavior (Naufel et al., 2019)*. The same nonlinearities that prevent zero-shot decoding across tasks are also likely to impair the ability of an aligner trained on data from one task to successfully align data from another task. Furthermore, the results of Naufel et al. indicate that even if neural alignment is successful, we would need a decoder already trained on the new task to produce reliable predictions-- in which case the data needed to train that decoder could simply be used for alignment. A systematic study of the relation between the ability to align and decode from data is well warranted, but beyond the scope of our current work.

      *Naufel, S., Glaser, J. I., Kording, K. P., Perreault, E. J., & Miller, L. E. (2019). A muscle-activity-dependent gain between motor cortex and EMG. Journal of neurophysiology, 121(1), 61-73.

      Action in the text: none.

      2) Investigators use decoding as a way of comparing alignment performance. The description of the cycle GAN was not super detailed, and it wasn't clear whether there was any dynamic information stored in the network that might create questions of causality in actual use. It seems that input is simply the neural activity at a current time point rather than neural activity across the trial, which would alleviate this concern. However, they mention temporal alignment but never describe in detail whether all periods of spikes are properly modeled by the system or if only subsets of data (specific portions of task or non-task time) will work. Perhaps this is more a question of the Wiener filter, for which precise details are missing.

      As intuited by the reviewer, we did only use the neural activity at a current time point as the inputs for Cycle-GAN training, so the system is causal and can be used in real time. We have modified the text to clarify this.

      We apologize for any confusion caused by our use of the term "temporal alignment", which was for the sake of consistency with earlier-published, CCA-based alignment methods (e.g., in Gallego et al., 2020), but is indeed confusing. In the revised manuscript, we have switched to the term ‘trial alignment’ which we believe better reflects this pre-processing step, and we have included additional explanations in the introduction.

      Importantly, while CCA-style trial alignment is not required by our methods, we do still preprocess our data to exclude behaviors not related to the investigated task. Since monkeys were resting or performing task-irrelevant movements during inter-trial period, we chose to use data only from trial start to trial end, but without any explicit trial matching or alignment (see Appendix 1 - Behavior tasks). In the revised manuscript, we now show that our methods still works well when applied even to the continuous recordings, with Cycle-GAN significantly outperforming both ADAN and PAF.

      Action in the text (page 2, lines 72-74): clarifying CCA description and replacing “temporal alignment” with “trial alignment”.

      Action in the text (page 5, lines 191-192): stating that ADAN and Cycle-GAN have no knowledge of dynamics.

      Action in the text (page 6, lines 258-272): documenting performance on full-day recordings without trial matching.

      Action in the text (page 13, lines 647-649): again, stating that Cycle-GAN has no knowledge of dynamics.

      3) In general, precise details of the algorithms should have been provided.

      We appreciate the reviewer noting this-- in the submitted manuscript, the full descriptions of Cycle-GAN and ADAN were included as supplementary methods in Appendix 4, but we did not extensively reference this and it may have been missed. In the revised manuscript, we added more references to Appendix 4 and in the Methods section of the main text. We provided further details on the choice of hyperparameters for each method (including PAF) in Appendix 4 itself.

      Action in the text (page 13, lines 643-644): added “For a full description of the ADAN architecture and its training strategy, please refer to “ADAN based aligner” in Appendix 4 and (Farshchian et al., 2018).”

      Action in the text (page 14, lines 669): added “Further details about the Cycle-GAN based aligner are provided in “Cycle-GAN based aligner”, Appendix 4.” Action in the text (Appendix 4 Tables 1-2): We have added a summary table of hyperparameters for each method in Appendix 4 (ADAN: Appendix 4 Table 1; CycleGAN: Appendix 4 Table 2).

      4) Cross validation for day-0 alignment is not explained.

      As mentioned above, the training and validation details of day-0 models were included in Appendix 4, which was not extensively referenced in the manuscript and may have been missed. We have now added more references to the Appendix in the revised manuscript.

      Action in the text (page 13, lines 627-629): added “(Note that this LSTM based decoder is only used for latent space discovery, not the later decoding stage that is used for performance evaluation (see “ADAN day-0 training” in Appendix 4 for full details)).”

      5) Details of statistical tests is not provided.

      We apologize for this omission. In the revised manuscript, we have added a section in the methods summarizing all the statistical tests. In addition, we added the sample sizes for each stat reported in the results section.

      Action in the text (page 15, lines 754-768): new Methods section added.

      6) (minor) The idea that for neurons that have disappeared that the CycleGAN can "infer their response properties", seems an incorrect description. A proper description should be that it "hallucinates" their response properties?

      We prefer to avoid the term “hallucinate”, due to its recent increased (appropriate) use in the context of large language models describing content generation that is “nonsensical or unfaithful to the provided source content” (as per the Wikipedia article on hallucination in AI). The synthetized “responses” of vanished neurons are not nonsensical, but are indeed, inferred: they are the model’s best estimate of how these neurons would have responded, had they been observed. While not explored further here, this prediction could be of potential scientific use: a strong discrepancy between predicted and observed activity might be a clue to look for further evidence of learning or remodeling of neural representations of behavior.

      Action in the text: none.

      Reviewer #2 (Public Review):

      In this manuscript, the authors use generative adversarial networks (GANs) to manipulate neural data recorded from intracortical arrays in the context of intracortical BCIs so that these decoders are robust. Specifically, the authors deal with the hard problem where signals from an intracortical array change over time and decoders that are trained on day 0 do not work on day K. Either the decoder or the neural data needs to be updated to achieve the same performance as initially. GANs try to alter the neural data from day K to make it indistinguishable to day 0 and thus in principle the decoder should perform better. The authors compare their GAN approach to an older GAN approach (by an overlapping group of authors) and suggest that this new GAN approach is somewhat better. Major Strengths are multiple datasets from behaving monkeys performing various tasks that involve motor function. Comparison between two different GAN approaches and a classical approach that uses factor analysis. The weakness is insufficient comparison to another state-of-the-art approach that has been applied on the same dataset (NoMAD, Karpowicz et al. BioRxiv.)

      The results are very reasonable and they show their approach, Cycle GANs, does slightly better than the traditional GAN approach. However, the Cycle GANs have many more modules and also as I understand it performs a forward backward mapping of the day - 0 and day - k and thus theoretically better. But, it seems quite slow.

      We are concerned that the reviewer may have mistaken the Cycle-GAN training time (the time it takes to find an alignment, Figure 4B) with its inference time (the time it takes to transform data once an alignment has been found). Whereas inference time is critical for practical deployment of a model, we argue that Cycle-GAN's somewhat longer training time is not a substantial barrier to use: it is still reasonably fast (a few minutes) and training will only need to be performed on the order of once per day. We have modified the y-axis label of Figure 4B to make this distinction clearer.

      We have also now added information on the inference speed of trained models to the paper: we find that both Cycle-GAN and ADAN perform the inference step in under 1 ms per 50 ms sample of data – this is because the forward map in both models consists of a fully connected network with only two hidden layers. We also note that while forward-backward mapping between days does occur during Cycle-GAN training, only the forward mapping is performed during inference.

      Action in the text (page 7, lines 303-306): added inference time for Cycle-GAN and ADAN.

      I think the results are interesting but as such, I am not sure this is such a fundamental advance compared to the Farashcian et al. paper, which introduced GANs to improve decoding in the face of changing neural data. There are other approaches that also use GANs and I think they all need to be compared against each other. Finally, these are all offline results and what happens online is anyone's real guess. Of course, this is not just a weakness of this study but many such studies of its ilk.

    1. Author Response

      Reviewer #1 (Public Review):

      The manuscript by Gochman and colleagues reports the discovery of a very strong sensitization of TRPV2 channels by the herbal compound cannabidiol (CBD) to activation by the synthetic agonist 2aminoethoxydiphenyl borate (2-APB). Using patch-clamp electrophysiology the authors show that the ~100-fold enhancement by micromolar CBD of TRPV2 current responses to low concentrations of 2-APB reflects a robust increase in apparent affinity for the latter agonist. Cryo-EM structures of TRPV2 in lipid nanodiscs in the presence of both drugs report two-channel conformations. One conformation resembles previously solved structures whereas the second conformation reveals two distinct CBD binding sites per subunit, as well as changes in the conformation of the S4-S5 linker. Interestingly, although TRPV1 and TRPV3 are highly homologous to TRPV2 and both CBD binding sites are relatively conserved, the CBD-induced sensitization towards 2-APB is observable only for TRPV3 but not for TRPV1. Moreover, the simultaneous substitution of non-conserved residues in the CBD binding sites and the pore region of TRPV1 with the amino acids present in TRPV2 fails to confirm strong CBD-induced sensitization. The authors conclude that CBD-dependent sensitization of TRPV2 channels depends on structural features of the channel that are not restricted to the CBD binding site but involve multiple channel regions.

      These are important findings that promote our understanding of the molecular mechanisms of TRPV family channels, and the data provide convincing evidence for the conclusions.

      We appreciate the supportive evaluation of the reviewer.

      Reviewer #2 (Public Review):

      In this manuscript, Gochman et al. studied the molecular mechanism by which cannabidiol (CBD) sensitizes the TRPV2 channel to activation by 2-APB. While CBD itself can activate TRPV2 with low efficacy, it can sensitize TRPV2 current activated by 2-APB by two orders of magnitude. The authors showed, via single-channel recording, that the CBD-dependent sensitization arises from an increase in Po when the channel binds to both CBD and 2-APB. The authors then used cryo-EM to investigate how CBD binds to TRPV2 and identified two CBD binding sites in each subunit, with one site being previously reported and the other being newly discovered.

      TRPV1 and TRPV2 are two channels closely related to TRPV2. All three channels can be activated by CBD and 2-APB, but only TRPV2 and 3 are strongly sensitized by CBD. To understand the molecular basis of the different sensitivity to CBD, the authors compared the residues within the CBD binding sites and generated mutants by swapping non-conserved residues between TRPV1 and TRPV2. They then performed patch-clamp recordings on these mutants and found that mutations on non-conserved residues indeed influenced the CBD-dependent sensitization, thereby supporting the observed CBD binding sites.

      Unexpectedly, the authors did not identify the binding site of 2-APB, despite its robust effect in electrophysiology recordings, especially when combined with CBD. Although previous structural studies of TRPV2 have reported 2-APB binding sites, the associated densities in these studies were not wellresolved. Therefore, the authors called on the field to re-examine published structural data with regard to the 2-APB binding sites.

      Overall, this is an important study with well-designed and well-conducted experiments.

      We appreciated the supportive comments of the reviewer.

      Reviewer #3 (Public Review):

      In this paper, Gochman et al examine TRPV1-3 channel sensitization by CBD, specifically in the context of 2-APB activation. The authors primarily used classic electrophysiological techniques to address their questions about channel behavior but have also used structural biology in the form of cryo-EM to examine drug binding to TRPV2. The authors have carefully observed and quantified sensitization of the rat TRPV2 channel to 2-APB by CBD. While this sensitization has been reported previously (Pumroy et al, Nat Commun 2022), the authors have gone into much more detail here and carefully examined this process from several angles, including a comparison to some other known methods of sensitizing TRPV2. Additionally, the authors have also revealed that CBD sensitizes rat TRPV1 and mouse TRPV3 to 2-APB, which has not been reported previously. Up to this point, the work is well thought through and cohesive.

      The major weakness of this paper is that the authors' efforts to track down the structural and molecular basis for CBD sensitization neither give insight into how sensitization occurs nor provide a solid footing for future work on the topic. The structural work presented in this paper lacks proper controls to interpret the observed states and the authors do nothing to follow up on a potentially interesting second binding site for CBD. Overall, the structural work feels detached from the rest of the paper. The mutations chosen to examine sensitization are based on setting up TRPV1 in opposition to TRPV2 and TRPV3, which makes little sense as all three channels show sensitization by CBD, even if to different extents. The authors chose their mutations based on the assumption that response to CBD is the key difference between the channels for sensitization, yet the overall state of each channel or the different modes of activation by 2-APB seem to be more likely candidates. As a result, it is not particularly surprising that none of the mutations the authors make reduce CBD sensitization in TRPV2 or increase CBD sensitization in TRPV1.

      A difficulty in examining TRPV1-3 as a group is that while they are highly conserved in sequence and structure, there are key differences in drug responses. While it does seem likely that CBD would bind to the same location in TRPV1-3, there is extensive evidence that 2-APB binds at different sites in each channel, as the authors discuss in the paper. Without more basic information about where 2-APB binds to each channel and confirmation that CBD does indeed bind TRPV1-3 at the same site, it may not be possible to untangle this particular mode of channel sensitization.

      We appreciate this reviewer’s perspective and we too were disappointed that our approach did not yield more definitive answers to why some TRPV channels are more sensitive to CBD. We have revised the results and discussion sections to more clearly articulate what we think our results reveal. We have also added a section to the discussion to present the idea that the differential sensitivity of TRPV channels to CBD may have more to do with where 2-APB binds and how it activates the channel than CBD. These challenging points are all excellent and they have helped us to present our message more clearly.

    1. Author Response

      Reviewer #1 (Public Review):

      The authors introduce a computational model that simulates the dendrites of developing neurons in a 2D plane, subject to constraints inspired by known biological mechanisms such as diffusing trophic factors, trafficked resources, and an activity-dependent pruning rule. The resulting arbors are analyzed in terms of their structure, dynamics, and responses to certain manipulations. The authors conclude that 1) their model recapitulates a stereotyped timecourse of neuronal development: outgrowth, overshoot, and pruning 2) Neurons achieve near-optimal wiring lengths, and Such models can be useful to test proposed biological mechanisms- for example, to ask whether a given set of growth rules can explain a given observed phenomenon - as developmental neuroscientists are working to understand the factors that give rise to the intricate structures and functions of the many cell types of our nervous system.

      Overall, my reaction to this work is that this is just one instantiation of many models that the author could have built, given their stated goals. Would other models behave similarly? This question is not well explored, and as a result, claims about interpreting these models and using them to make experimental predictions should be taken warily. I give more detailed and specific comments below.

      We thank the reviewer for the summary of the work. We find the criticism “that this is one instantiation of many models [we] could have built” can apply to any model. To quote George Box, “all models are wrong, but some models are useful” was the moto that drove our modeling approach. In principle, there are infinitely many possible models. We chose one of the most minimalistic models which implements known biological mechanisms including activity-independent and -dependent phases of dendritic growth, and constrained parameters based on experimental data. We compare the proposed model to other alternatives in the Discussion section, especially to the models of Hermann Cuntz which propose very different strategies for growth.

      However, the reviewer is right that within the type of model we chose, we could have more extensively explored the sensitivity to parameters. In the revised manuscript we will investigate the sensitivity of model output to variations of specific parameters, as explained below.

      Point 1.1. Line 109. After reading the rest of the manuscript, I worry about the conclusion voiced here, which implies that the model will extrapolate well to manipulations of all the model components. How were the values of model parameters selected? The text implies that these were selected to be biologically plausible, but many seem far off. The density of potential synapses, for example, seems very low in the simulations compared to the density of axons/boutons in the cortex; what constitutes a potential synapse? The perfect correlations between synapses in the activity groups is flawed, even for synapses belonging to the same presynaptic cell. The density of postsynaptic cells is also orders of magnitude of, etc. Ideally, every claim made about the model's output should be supported by a parameter sensitivity study. The authors performed few explorations of parameter sensitivity and many of the choices made seem ad hoc.

      It is indeed important to clarify how the model parameters were selected. Here we provide a short justification for some of these parameters, which will be included in the revised manuscript.

      1) Potential synapse density: We modelled 1,500 potential synapses in a cortical sheet of size 185x185 microns squared. We used 1 pixel per μm to capture approximately 1 μm thick dendrites. Therefore, we started with initial density of 0.044 potential synapses per μm^2. From Author Response Image 1 we can see that at the end of our simulation time ~1,000 potential synapses remain. So in fact, the density of potential synapses is totally sufficient, since not many potential synapses end up connected. The rapid slowing down of growth in our model is not due to a depletion of potential synaptic partners as the number of potential synapses remains high. Nonetheless, we will explore this in the revised manuscript. (this figure will be included in the revised submission):

      2) Stabilized synapse density: Since ~1,000 of the potential synapses in the modeled cortical sheet remain available, ~500 become connected to the dendrites of the 9 somas in the modeled cortical sheet. This means that the density of stable connected synapses is approximately 0.015 synapses per μm^2. This is also the number that is shown in Figure 3b, which is about 60 synapses stabilized per cell. This density is much easier to compare to experimental data, and below we provide some numbers from literature we already cited in the manuscript as well as a recent preprint.

      In the developing cortex:

      • Leighton, Cheyne and Lohmann 2023 https://doi.org/10.1101/2023.03.02.530772 find up to 0.4 synapses per μm in pyramidal neurons in vivo in the developing mouse visual cortex at P8 to P13. This is almost identical to our value of 0.4 synapses per μm.

      • Ultanir et al., 2007 https://doi.org/10.1073/pnas.0704031104 find 0.7 to 1.7 spines per μm in pyramidal neurons in vivo in L2/3 of the developing mouse cortex, at P10 to P20.

      • Glynn et al., 2011 https://doi.org/10.1038/nn.2764 find 0.1 to 0.7 spines per μm^2 in pyramidal neurons in vivo and in vitro in L2/3 of the developing mouse cortex, at P8 to P60.

      In the developing hippocampus:

      Although these values vary somewhat across experiments, in most cases they are in agreement with our chosen values, especially when taking into account that we are modeling development (rather than adulthood).

      3) Soma/neuron density: Indeed, we did not exactly mention this number anywhere in the paper. But from the figures we can infer 9 somas growing dendrites on an area of ~34,000 μm^2. Thus, neuron density would be 300 neurons per mm^2. This number seems a bit low after a short search through the literature. For e.g. Keller et al., 2018 https://www.frontiersin.org/articles/10.3389/fnana.2018.00083/full reports about 90,000 neurons per mm^3, albeit in adulthood.

      We are also performing a sensitivity analysis where some of these parameters are varied and will include this in the revised manuscript. In particular:

      (1) We will vary the nature of the input correlations. In the current model, the synapses in each correlated group receive spike trains with a perfect correlation and there are no correlations across the groups. We will reduce the correlations within group and add non-zero correlations across the groups.

      (2) We will vary the density of the neuronal somas. We expect that higher densities of somas will either yield smaller dendritic areas because the different neurons compete more or result in a state where nearby neurons have to complement each other regarding their activity preferences.

      (3) We will introduce dynamics in the potential synapses to model the dynamics of axons. We plan to explore several scenarios. We could introduce a gradual increase in the density of potential synapses and implement a cap on the number of synapses that can be alive at the same time, and vary that cap. We could also introduce a lifetime of each synapse (following for example a lognormal distribution). A potential synapse can disappear if it does not form a stable synapse in its lifetime, in which case it could move to a different location.

      Point 1.2. Many potentially important phenomena seem to be excluded. I realize that no model can be complete, but the choice of which phenomena to include or exclude from this model could bias studies that make use of it and is worth serious discussion. The development of axons is concurrent with dendrite outgrowth, is highly dynamic, and perhaps better understood mechanistically. In this model, the inputs are essentially static. Growing dendrites acquire and lose growth cones that are associated with rapid extension, but these do not seem to be modeled. Postsynaptic firing does not appear to be modeled, which may be critical to activity-dependent plasticity. For example, changes in firing are a potential explanation for the global changes in dendritic pruning that occur following the outgrowth phase.

      As the reviewer concludes, no model can be complete. In agreement with this, here we would like to quote a paragraph from a very nice paper by Larry Abbott (“Theoretical Neuroscience Rising, Neuron 2008 https://www.sciencedirect.com/science/article/pii/S0896627308008921) which although published more than 10 years ago, still applies today:

      “Identifying the minimum set of features needed to account for a particular phenomenon and describing these accurately enough to do the job is a key component of model building. Anything more than this minimum set makes the model harder to understand and more difficult to evaluate. The term ‘‘realistic’’ model is a sociological rather than a scientific term. The truly realistic model is as impossible and useless a concept as Borges’ ‘‘map of the empire that was of the same scale as the empire and that coincided with it point for point’’ (Borges, 1975). […] The art of modeling lies in deciding what this subset should be and how it should be described.”

      We have clearly stated in the Introduction (e.g. lines 37-75) which phenomena we include in the model and why. The Discussion also compares our model to others (lines 315-373), pointing out that most models either focus on activity-independent or activity-dependent phases. We include both, combining literature on molecular gradients and growth factors, with activity-dependent connectivity refinements instructed by spontaneous activity. We could not think of a more tractable, more minimalist model that would include both activity-independent or activity-dependent aspects. Therefore, we feel that the current manuscript provides sufficient motivation but also a discussion of limitations of the current model.

      Regarding including the concurrent development of axons, we agree this is very interesting and currently not addressed in the model. As noted at the bottom of our reply to point 1.1, bullet (3) we are now revising the manuscript to include a simplified form of axonal dynamics by allowing changes in the lifetime and location of potential synapses, which come from axons of presynaptic partners.

      Regarding postsynaptic firing, this is indeed super relevant and an important point to consider. In one of our recent publications (Kirchner and Gjorgjieva, 2021 https://www.nature.com/articles/s41467-021-23557-3), we studied only an activity-dependent model for the organization of synaptic inputs on non-growing dendrites which have a fixed length. There, we considered the effect of postsynaptic firing and demonstrated that it plays an important role in establishing a global organization of synapses on the entire dendritic tree of the neuron, and not just local dendritic branches. For example, we showed that could that it could lead to the emergence of retinotopic maps which have been found experimentally (Iacaruso et al., 2017 https://www.nature.com/articles/nature23019). Since we use the same activity-dependent plasticity model in this paper, we expect that the somatic firing will have the same effect on establishing synaptic distributions on the entire dendritic tree. We will make a note of this in the Discussion in the revised paper.

      Point 1.3. Line 167. There are many ways to include activity -independent and -dependent components into a model and not every such model shows stability. A key feature seems to be that larger arbors result in reduced growth and/or increased retraction, but this could be achieved in many ways (whether activity dependent or not). It's not clear that this result is due to the combination of activity-dependent and independent components in the model, or conceptually why that should be the case.

      We never argued for model uniqueness. There are always going to be many different models (at different spatial and temporal scales, at different levels of abstraction). We can never study all of them and like any modeling study in systems neuroscience we have chosen one model approach and investigated this approach. We do compare the current model to others in the Discussion. If the reviewers have a specific implementation that we should compare our model to as an alternative, we could try, but not if this means doing a completely separate project.

      Point 1.4. Line 183. The explanation of overshoot in terms of the different timescales of synaptic additions versus activity-dependent retractions was not something I had previously encountered and is an interesting proposal. Have these timescales been measured experimentally? To what extent is this a result of fine-tuning of simulation parameters?

      We found that varying the amount of BDNF controls the timescale of the activity-dependent plasticity (see our Figure 5c). Hence, changing the balance between synaptic additions vs. retractions is already explored in Figure 5e and f. Here we show that the overshoot and retraction does not have to be fine-tuned but may be abolished if there is too much activity-dependent plasticity.

      Regarding the relative timescales of synaptic additions vs. retractions: since the first is mainly due to activity-independent factors, and the second due to activity-dependent plasticity, the questions is really about the timescales of the latter two. As we write in the Introduction (lines 60-62), manipulating activity-dependent synaptic transmission has been found to not affect morphology but rather the density and specificity of synaptic connections (Ultanir et al. 2007 https://doi.org/10.1073/pnas.0704031104), supporting the sequential model we have (although we do not impose the sequence, as both activity-independent and activity-dependent mechanisms are always “on”; but note that activity-dependent plasticity can only operate on synapses that have already formed).

      Point 1.5. Line 203. This result seems at odds with results that show only a very weak bias in the tuning distribution of inputs to strongly tuned cortical neurons (e.g. work by Arthur Konnerth's group). This discrepancy should be discussed.

      First, we note that the correlated activity experienced by our modeled synapses (and resulting synaptic organization) does not necessarily correspond to visual orientation, or any stimulus feature, for that matter.

      Nonetheless, this is a very interesting question and there is some variability in what the experimental data show. Many studies have shown that synapses on dendrites are organized into functional synaptic clusters: across brain regions, developmental ages and diverse species from rodent to primate (Kleindienst et al. 2011; Takahashi et al. 2012; Winnubst et al. 2015; Gökçe et al., 2016; Wilson et al. 2016; Iacaruso et al., 2017; Scholl et al., 2017; Niculescu et al. 2018; Kerlin et al. 2019; Ju et al. 2020). Interestingly, some in vivo studies have reported lack of fine-scale synaptic organization (Varga et al. 2011; X. Chen et al. 2011; T.-W. Chen et al. 2013; Jia et al. 2010; Jia et al. 2014), while others reported clustering for different stimulus features in different species. For example, dendritic branches in the ferret visual cortex exhibit local clustering of orientation selectivity but do not exhibit global organization of inputs according to spatial location and receptive field properties (Wilson et al. 2016; Scholl et al., 2017). In contrast, synaptic inputs in mouse visual cortex do not cluster locally by orientation, but only by receptive field overlap, and exhibit a global retinotopic organization along the proximal-distal axis (Iacaruso et al., 2017). We proposed a theoretical framework to reconcile these data: combining activity-dependent plasticity similar to the BDNF-proBDNF model that we used in the current work, and a receptive field model for the different species (Kirchner and Gjorgjieva, 2021 https://www.nature.com/articles/s41467-021-23557-3). We can mention this aspect in the revised manuscript.

      Point 1.6. Line 268. How does the large variability in the size of the simulated arbors relate to the relatively consistent size of arbors of cortical cells of a given cell type? This variability suggests to me that these simulations could be sensitive to small changes in parameters (e.g. to the density or layout of presynapses).

      As noted at the bottom of our reply to point 1.1, bullet (3) we are now revising the manuscript to include changes in the lifetime and location of potential synapses.

      Point 1.7. The modeling of dendrites as two-dimensional will likely limit the usefulness of this model. Many phenomena- such as diffusion, random walks, topological properties, etc - fundamentally differ between two and three dimensions.

      The reviewer is right about there being differences between two and three dimensions. But a simpler model does not mean a useless model even if not completely realistic. We have ongoing work that extends the current model to 3D but is beyond the scope of the current paper. In systems neuroscience, people have found very interesting results making such simplified geometric assumptions about networks, for instance the one-dimensional ring model has been used to uncover fundamental insights about computations even though highly simplified and abstracted.

      Point 1.8. The description of wiring lengths as 'approximately optimal' in this text is problematic. The plotted data show that the wiring lengths are several deviations away from optimal, and the random model is not a valid instantiation of the 2D non-overlapping constraints the authors imposed. A more appropriate null should be considered.

      We did not use the term “optimal” in line with previous literature. We wrongly referred to the minimal wiring length as the optimal wiring length, but neurons can optimize their wiring not only by minimizing their dendritic length (e.g. work of Hermann Cuntz). In the revised manuscript, we will replace the term “optimal wiring” with “minimal wiring”. Then we will compare the wiring length in the model with the theoretically minimal wiring length, the random wiring length and the actual data.

      Point 1.9. It's not clear to me what the authors are trying to convey by repeatedly labeling this model as 'mechanistic'. The mechanisms implemented in the model are inspired by biological phenomena, but the implementations have little resemblance to the underlying biophysical mechanisms. Overall my impression is that this is a phenomenological model intended to show under what conditions particular patterns are possible. Line 363, describing another model as computational but not mechanistic, was especially unclear to me in this context.

      What we mean by mechanistic is that we implement equations that model specific mechanisms i.e. we have a set of equations that implement the activity-independent attraction to potential synapses (with parameters such as the density of synapses, their spatial influence, etc) and the activity-dependent refinement of synapses (with parameters such as the ratio of BDNF and proBDNF to induce potentiation vs depression, the activity-dependent conversion of one factor to the other, etc). This is a bottom-up approach where we combine multiple elements together to get to neuronal growth and synaptic organization. This approach is in stark contrast to the so-called top-down or normative approaches where the method would involve defining an objective function (e.g. minimal dendritic length) which depends on a set of parameters and then applying a gradient descent or other mathematical optimization technique to get at the parameters that optimize the objective function. This latter approach we would not call mechanistic because it involves an abstract objective function (who could say what a neuron or a circuit should be trying to optimize) and a mathematical technique for how to optimize the function (we don’t know of neurons can compute gradients of abstract objective functions).

      Hence our model is mechanistic, but it does operate at a particular level of abstraction/simplification. We don’t model individual ion channels, or biophysics of synaptic plasticity (opening and closing of NMDA channels, accumulation of proteins at synapses, protein synthesis). We do, however, provide a biophysical implementation of the plasticity mechanism though the BDNF/proBDNF model which is more than most models of plasticity achieve, because they typically model a phenomenological STDP or Hebbian rule that just uses activity patterns to potential or depress synaptic weights, disregarding how it could be implemented.

      Reviewer #2 (Public Review):

      This work combines a model of two-dimensional dendritic growth with attraction and stabilisation by synaptic activity. The authors find that constraining growth models with competition for synaptic inputs produces artificial dendrites that match some key features of real neurons both over development and in terms of final structure. In particular, incorporating distance-dependent competition between synapses of the same dendrite naturally produces distinct phases of dendritic growth (overshoot, pruning, and stabilisation) that are observed biologically and leads to local synaptic organisation with functional relevance. The approach is elegant and well-explained, but makes some significant modelling assumptions that might impact the biological relevance of the results.

      Strengths:

      The main strength of the work is the general concept of combining morphological models of growth with synaptic plasticity and stabilisation. This is an interesting way to bridge two distinct areas of neuroscience in a manner that leads to findings that could be significant for both. The modelling of both dendritic growth and distance-dependent synaptic competition is carefully done, constrained by reasonable biological mechanisms, and well-described in the text. The paper also links its findings, for example in terms of phases of dendritic growth or final morphological structure, to known data well.

      Weaknesses:

      The major weaknesses of the paper are the simplifying modelling assumptions that are likely to have an impact on the results. These assumptions are not discussed in enough detail in the current version of the paper.

      1) Axonal dynamics.

      A major, and lightly acknowledged, assumption of this paper is that potential synapses, which must come from axons, are fixed in space. This is not realistic for many neural systems, as multiple undifferentiated neurites typically grow from the soma before an axon is specified (Polleux & Snider, 2010). Further, axons are also dynamic structures in early development and, at least in some systems, undergo activity-dependent morphological changes too (O'Leary, 1987; Hall 2000). This paper does not consider the implications of joint pre- and post-synaptic growth and stabilisation.

      We thank the reviewer for the summary of the strengths and weaknesses of the work. While we feel that including a full model of axonal dynamics is beyond the scope of the current manuscript, some aspects of axonal dynamics can be included. In a revised model, we will introduce a gradual increase in the density of potential synapses and implement a cap on the number of synapses that can be alive at the same time, and vary that cap. We plan to also introduce a lifetime of each synapse (following for example a lognormal distribution). A potential synapse can disappear if it does not form a stable synapse in its lifetime, in which case it could move to a different location. See also our reply to reviewer comment 1.1, bullet (3).

      2) Activity correlations

      On a related note, the synapses in the manuscript display correlated activity, but there is no relationship between the distance between synapses and their correlation. In reality, nearby synapses are far more likely to share the same axon and so display correlated activity. If the input activity is spatially correlated and synaptic plasticity displays distance-dependent competition in the dendrites, there is likely to be a non-trivial interaction between these two features with a major impact on the organisation of synaptic contacts onto each neuron.

      We are exploring the amount of correlation (between and within correlated groups) to include in the revised manuscript (see also our reply to reviewer comment 1.1, bullet (1)).

      However, previous experimental work, (Kleindienst et al., 2011 https://doi.org/10.1016/j.neuron.2011.10.015) has provided anatomical and functional analyses that it is unlikely that the functional synaptic clustering on dendritic branches is the result of individual axons making more than one synapse (see pg. 1019).

      3) BDNF dynamics

      The models are quite sensitive to the ratio of BDNF to proBDNF (eg Figure 5c). This ratio is also activity-dependent as synaptic activation converts proBDNF into BDNF. The models assume a fixed ratio that is not affected by synaptic activity. There should at least be more justification for this assumption, as there is likely to be a positive feedback relationship between levels of BDNF and synaptic activation.

      The reviewer is correct. We used the BDNF-proBDNF model for synaptic plasticity based on our previous work: Kirchner and Gjorgjieva, 2021 https://www.nature.com/articles/s41467-021-23557-3.

      There, we explored only the emergence of functionally clustered synapses on static dendrites which do not grow. In the Methods section (Parameters and data fitting) we justify the choice of the ratio of BDNF to proBDNF from published experimental work. We also performed sensitivity analysis (Supplementary Fig. 1) and perturbation simulations (Supplementary Fig. 3), which showed that the ratio is crucial in regulating the overall amount of potentiation and depression of synaptic efficacy, and therefore has a strong impact on the emergence and maintenance of synaptic organization. Since we already performed all this analysis, we do not expect there will be any differences in the current model which includes dendritic growth, as the activity-dependent mechanism has such a different timescale.

      A further weakness is in the discussion of how the final morphologies conform to principles of optimal wiring, which is quite imprecise. 'Optimal wiring' in the sense of dendrites and axons (Cajal, 1895; Chklovskii, 2004; Cuntz et al, 2007, Budd et al, 2010) is not usually synonymous with 'shortest wiring' as implied here. Instead, there is assumed to be a balance between minimising total dendritic length and minimising the tree distance (ie Figure 4c here) between synapses and the site of input integration, typically the soma. The level of this balance gives the deviation from the theoretical minimum length as direct paths to synapses typically require longer dendrites. In the model this is generated by the guidance of dendritic growth directly towards the synaptic targets. The interpretation of the deviation in this results section discussing optimal wiring, with hampered diffusion of signalling molecules, does not seem to be correct.

      We agree with this comment. We had wrongly used the term “optimal wiring” as neurons can optimize their wiring not only by minimizing their dendritic length but other factors as noted by the reviewer. In the revised manuscript will replace the term “optimal wiring” with “minimal wiring” and discuss these differences to previous work.

      Reviewer #3 (Public Review):

      The authors propose a mechanistic model of how the interplay between activity-independent growth and an activity-dependent synaptic strengthening/weaken model influences the dendrite shape, complexity and distribution of synapses. The authors focus on a model for stellate cells, which have multiple dendrites emerging from a soma. The activity independent component is provided by a random pool of presynaptic sites that represent potential synapses and that release a diffusible signal that promotes dendritic growth. Then a spontaneous activity pattern with some correlation structure is imposed at those presynaptic sites. The strength of these synapses follow a learning rule previously proposed by the lab: synapses strengthen when there is correlated firing across multiple sites, and synapses weaken if there is uncorrelated firing with the relative strength of these processes controlled by available levels of BDNF/proBDNF. Once a synapse is weakened below a threshold, the dendrite branch at that site retracts and loses its sensitivity to the growth signal

      The authors run the simulation and map out how dendrites and synapses evolve and stabilize. They show that dendritic trees growing rapidly and then stabilize by balancing growth and retraction (Figure 2). They also that there is an initial bout of synaptogenesis followed by loss of synapses, reflecting the longer amount of time it takes to weaken a synapse (Figure 3). They analyze how this evolution of dendrites and synapses depends on the correlated firing of synapses (i.e. defined as being in the same "activity group"). They show that in the stabilized phase, synapses that remain connected to a given dendritic branch are likely to be from same activity group (Figure 4). The authors systemically alter the learning rule by changing the available concentration of BDNF, which alters the relative amount of synaptic strengthening, which in turn affects stabilization, density of synapses and interestingly how selective for an activity group one dendrite is (Figure 5). In addition the authors look at how altering the activity-independent factors influences outgrowth (Figure 6). Finally, one of the interesting outcomes is that the resulting dendritic trees represent "optimal wiring" solutions in the sense that dendrites use the shortest distance given the distribution of synapses. They compare this distribute to one published data to see how the model compared to what has been observed experimentally.

      There are many strengths to this study. The consequence of adding the activity-dependent contribution to models of synapto- and dendritogenesis is novel. There is some exploration of parameters space with the motivation of keeping the parameters as well as the generated outcomes close to anatomical data of real dendrites. The paper is also scholarly in its comparison of this approach to previous generative models. This work represented an important advance to our understanding of how learning rules can contribute to dendrite morphogenesis

      We thank the reviewer for the positive evaluation of the work and the suggestions below.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The current study uses 3D organotypic rafts to culture primary keratinocytes from Foreskin, Tonsil and Cervix. Further the authors looked at the transcriptomic profiles of each tissue types to study similarities and differences depending on the tissue of origin as well as show the similarity in the tissue specific gene signatures and the ex-vivo samples (data from GTEx). As mentioned by authors Skin and Cervix keratinocytes have been previously cultured on collagen rafts however extending it to Tonsil provides resource and possibility of growing more tissue specific epithelial cells in 3D.

      Major comments 1. As the papers focus is to culture epithelial/ epidermal cells on 3D rafts, methods section needs more details about the raft composition, preparation, fibroblast embedding what was the plate size used for raft preparation and culturing of cells on those rafts. What culture media was used for epithelial raft cultures?

      We have a detailed published protocol that highlight these details. However, we will expand on some of these details in the manuscript

      Results: Figure 1, authors show IF staining's for COL17A1 as marker for basal cells and cornulin for differentiated layers. However, it is important to show how many cells in the basal layer are proliferative? (or how many layers of proliferative cells are present in different epithelia analysed here?) after 14 days majority of cells might already start losing their stemness potential (maybe staining for at least ki67 if staining for basal stem cell marker not possible? Along with loricrin or Involucrin might be good idea).

      We will stain for ki67 as suggested. However, based on published data using these raft cultures, we do not expect that many cells will be positive.

      This is also important as from supp fig 3 you can see F1 has higher expression of Loricrin, filaggrin etc as compared to all other samples indicating higher diff in this sample. Also, if authors can comment on what was the passage of cells used? And have they observed any difference in the re-epithelization in early passage versus late passage of keratinocytes?

      We will expand on this is the updated manuscript. Importantly, we grow these cells in a rho-kinase inhibitor that ‘conditionally’ immortalizes these cells as described (DOI: 10.1172/JCI42297).

      It is interesting to see Tonsillar 3D epithelia recapitulate the crypt and surface epithelia and authors also show this with gene expression profile, if possible (Optional), can authors show staining for crypt specific and surface specific markers.

      We agree that this is an important control. This will be included.

      For all the Supplementary tables where only Ensembl ids are represented, please add gene Id column alongside (it is easier to get biological context from gene id for the reader rather than looking up Ensembl ids). Rename the file names to include the Supplementary file 1, 2, 3?

      Since there is 1-to-1 conversion for Ensembl to Gene Id, we elected to not include these. The online app does try tp accommodate this as much as possible. We propose to include two versions of each table. 1 with Ensemble ids only and one with both IDs.

      Its excellent to see that in vitro tissue signature matched the in vivo tissue samples (Figure 8) but it will be interesting to show the gene expression differences if found any between the in vitro and in vivo samples that will give insight on the changes as result of in vitro system.

      Since the in vivo data will be a mixture of epithelial cells and stroma, these comparisons are not straightforward. However, we are currently examining the use of existing scRNA-Seq data to begin addressing these concerns. This data will be included in the next revision.

      Minor comments

      1. Abstract: Give sample number (n?) and brief results about the genes that had tissue specific expression pattens.
      2. Gene names needs to be in Italics throughout.
      3. Introduction: page 5 line 9, authors claim that they based on comparisons they can "identify potential therapeutic targets for various disease" I think this statement either needs experimental evidence or statement / claim needs to be modified.
      4. Data submission to GEO???
      5. Typo (page 15, line 16 should be "HFK-down", same on page 23 "ectocervix", "endocervix", "uterus", so on, please correct, comma needs to be placed after "
      6. Page 24 last line is the heatmap referred here Fig 9B?
      7. Fig. 1 legends please indicate what F1, F2, F3, C1--- T1--- represent. Fig 1C Please add axis range/ values for protein atlas data as well.
      8. Can authors comment in discussion how was current 3D cervix cells on raft method different from Meyers, C., 1996 3D system?

      All these ‘minor’ comments will be addressed.

      Reviewer #1 (Significance (Required)):

      This article does extend and validate the 3D raft culture method to different epithelial tissues in addition to Skin and cervix. This will be useful for the researchers using co culture systems and interested in understanding epithelial cell and immune cell interactions or host pathogen interacts etc

      Describe your expertise: establishing and maintaining primary skin and oral keratinocyte cultures on feeders and 3D cultures on DEDs, Organoid cultures from oral keratinocytes, Oral cancer biology, Histopathology, transcriptomics study, Immuno-oncology.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary Jackson et al.'s manuscript describes an experiment that directly compares 3D organotypic assays created with primary human epithelial cells from foreskin, cervix and tonsil using histological and bulk RNA sequencing approaches. The authors convincingly show the retention of site-specific histological and transcriptomic differences between the stratified epithelial tissues in culture. Differentially expressed genes are identified and pathway analyses suggest genes that might be involved in the different differentiation processes between these tissue sites and differential regulation of ECM and immune pathways. Differentially expressed genes are used to develop a classifier for tissue identification, which is tested using GTEx data.

      Major Comments • The interferon stimulated genes of B cells and macrophages (from Mostafavi et al., 2016) are likely to be very different from those in epithelial cells, so the analysis presented in Figure 9 seems like a stretch to me.

      We will include caveats to this interpretation. We are planning stimulation experiments of each tissue to compare IFN responses. However, depending on the outcomes, these may end up being outside of the scope of the current manuscript.

      • OPTIONAL: Further data comparing the nature and magnitude of the interferon responses of the three epithelia would improve interest in the manuscript but are not necessary for publication of the current dataset.

      See above

      Minor Comments • Details of n numbers and what each point represents should be added to Figure 1C. Are these points measurements from 25 um intervals of just one raft per donor? What are 'fields of view' here? Are measurements from one section or from multiple sections per raft? • Page 12 - provide a figure/panel citation for the "micrograph derived from a tonsillectomy" that is suggested for comparison. • In Figure 1 - Figure Supplement 1, how representative of the whole raft are these images? Does the extent of stratification change near to the edge of the collagen gel, for example? How well matched for location within a raft are the images shown? • Page 24 - clarify uses of the phrase "down-regulated in tonsils". Presumably this section refers to tonsil epithelium in 3D organotypic rafts.

      Typos • Page 3 - "the cervix is lined with stratified squamous epithelia", should be epithelium. • "J.G. Rheinwald" in in text references. • Page 6 - 'or' not 'and' in first sentence of primary cell culture section.

      All these ‘minor’ comments and typos will be addressed.

      Reviewer #2 (Significance (Required)):

      This highly descriptive study provides a detailed analysis of a bulk RNA sequencing experiment comparing foreskin, cervix and tonsil 3D organotypic rafts. Retained histological and transcriptional differences between epithelial tissues of different origins in organotypic assays are well documented in the literature (e.g., parmoplantar vs non-parmoplantar skin, PMID: 36732947; airway tract, PMID: 32526206) so the observed differences between these three very distinct anatomical tissues are unsurprising overall. The data have been made available via SRA and a shiny web app and are likely to be of interest and use to other researchers working on these tissues in culture. The experiment was performed in matched cell culture conditions so replicates are well-controlled, if limited in number (n=3).

      We appreciate this feedback. We agree this is a descriptive study. Nonetheless, we believe there is value in formally demonstrating differences and similarities between these tissues. The provided references will be included to expand our discussion.

      I am an epithelial cell biologist specializing in human cell culture models. I do not have sufficient computational background to comment in detail on the RNA sequencing methods or analysis within the manuscript.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This is very carefully analysed and written study describing the transcriptional differences between in vitro models of epithelia derived from cervix, foreskin and tonsil tissues. Importantly, they compare the findings to in vivo samples using publicly available data. The findings are significant and will be of interest to the scientific community. I cannot fault the analysis pathways or the conclusions, and the manuscript is a pleasure to read. I recommend it is accepted for publication as is.

      Reviewer #3 (Significance (Required)):

      This is an important study that is highly significant for researchers interested in epithelia tissue and infection. The data are clearly presented and the analysis is thorough. The authors state that they will make the data publicly available. This will be an important resource for the community.

      We appreciate the kind words

    1. Author Response

      Reviewer #1 (Public Review):

      The authors present a study of visuo-motor coupling primarily using wide-field calcium imaging to measure activity across the dorsal visual cortex. They used different mouse lines or systemically injected viral vectors to allow imaging of calcium activity from specific cell-types with a particular focus on a mouse-line that expresses GCaMP in layer 5 IT (intratelencephalic) neurons. They examined the question of how the neural response to predictable visual input, as a consequence of self-motion, differed from responses to unpredictable input. They identify layer 5 IT cells as having a different response pattern to other cell-types/layers in that they show differences in their response to closed-loop (i.e. predictable) vs open-loop (i.e. unpredictable) stimulation whereas other cell-types showed similar activity patterns between these two conditions. They analyze the latencies of responses to visuomotor prediction errors obtained by briefly pausing the display while the mouse is running, causing a negative prediction error, or by presenting an unpredicted visual input causing a positive prediction error. They suggest that neural responses related to these prediction errors originate in V1, however, I would caution against over-interpretation of this finding as judging the latency of slow calcium responses in wide-field signals is very challenging and this result was not statistically compared between areas.

      Surprisingly, they find that presentation of a visual grating actually decreases the responses of L5 IT cells in V1. They interpret their results within a predictive coding framework that the last author has previously proposed. The response pattern of the L5 IT cells leads them to propose that these cells may act as 'internal representation' neurons that carry a representation of the brain's model of its environment. Though this is rather speculative. They subsequently examine the responses of these cells to anti-psychotic drugs (e.g. clozapine) with the reasoning that a leading theory of schizophrenia is a disturbance of the brain's internal model and/or a failure to correctly predict the sensory consequences of self-movement. They find that anti-psychotic drugs strongly enhance responses of L5 IT cells to locomotion while having little effect on other cell-types. Finally, they suggest that anti-psychotics reduce long-range correlations between (predominantly) L5 cells and reduce the propagation of prediction errors to higher visual areas and suggest this may be a mechanism by which these drugs reduce hallucinations/psychosis.

      This is a large study containing a screening of many mouse-lines/expression profiles using wide-field calcium imaging. Wide-field imaging has its caveats, including a broad point-spread function of the signal and susceptibility to hemodynamic artifacts, which can make interpretation of results difficult. The authors acknowledge these problems and directly address the hemodynamic occlusion problem. It was reassuring to see supplementary 2-photon imaging of soma to complement this data-set, even though this is rather briefly described in the paper.

      We will expand on the discussion of caveats as suggested.

      Overall the paper's strengths are its identification of a very different response profile in the L5 IT cells compared other layers/cell-types which suggests an important role for these cells in handling integration of self-motion generated sensory predictions with sensory input. The interpretation of the responses to anti-psychotic drugs is more speculative but the result appears robust and provides an interesting basis for further studies of this effect with more specific recording techniques and possibly behavioral measures.

      Reviewer #2 (Public Review):

      Summary:

      This work investigates the effects of various antipsychotic drugs on cortical responses during visuomotor integration. Using wide-field calcium imaging in a virtual reality setup, the researchers compare neuronal responses to self-generated movement during locomotion-congruent (closed loop) or locomotion-incongruent (open loop) visual stimulation. Moreover, they probe responses to unexpected visual events (halt of visual flow, sudden-onset drifting grating). The researchers find that, in contrast to a variety of excitatory and inhibitory cell types, genetically defined layer 5 excitatory neurons distinguish between the closed and the open loop condition and exhibit activity patterns in visual cortex in response to unexpected events, consistent with unsigned prediction error coding. Motivated by the idea that prediction error coding is aberrant in psychosis, the authors then inject the antipsychotic drug clozapine, and observe that this intervention specifically affects closed loop responses of layer 5 excitatory neurons, blunting the distinction between the open and closed loop conditions. Clozapine also leads to a decrease in long-range correlations between L5 activity in different brain regions, and similar effects are observed for two other antipsychotics, aripripazole and haloperidol, but not for the stimulant amphetamine. The authors suggest that altered prediction error coding in layer 5 excitatory neurons due to reduced long-range correlations in L5 neurons might be a major effect of antipsychotic drugs and speculate that this might serve as a new biomarker for drug development.

      Strengths:

      • Relevant and interesting research question:

      The distinction between expected and unexpected stimuli is blunted in psychosis but the neural mechanisms remain unclear. Therefore, it is critical to understand whether and how antipsychotic drugs used to treat psychosis affect cortical responses to expected and unexpected stimuli. This study provides important insights into this question by identifying a specific cortical cell type and long-range interactions as potential targets. The authors identify layer 5 excitatory neurons as a site where functional effects of antipsychotic drugs manifest. This is particularly interesting as these deep layer neurons have been proposed to play a crucial role in computing the integration of predictions, which is thought to be disrupted in psychosis. This work therefore has the potential to guide future investigations on psychosis and predictive coding towards these layer 5 neurons, and ultimately improve our understanding of the neural basis of psychotic symptoms.

      • Broad investigation of different cell types and cortical regions:

      One of the major strengths of this study is quasi-systematic approach towards cell types and cortical regions. By analysing a wide range of genetically defined excitatory and inhibitory cell types, the authors were able to identify layer 5 excitatory neurons as exhibiting the strongest responses to unexpected vs. expected stimuli and being the most affected by antipsychotic drugs. Hence, this quasi-systematic approach provides valuable insights into the functional effects of antipsychotic drugs on the brain, and can guide future investigations towards the mechanisms by which these medications affect cortical neurons.

      • Bridging theory with experiments

      Another strength of this study is its theoretical framework, which is grounded in the predictive coding theory. The authors use this theory as a guiding principle to motivate their experimental approach connecting visual responses in different layers with psychosis and antipsychotic drugs. This integration of theory and experimentation is a powerful approach to tie together the various findings the authors present and to contribute to the development of a coherent model of how the brain processes visual information both in health and in disease.

      Weaknesses:

      • Unclear relevance for psychosis research

      From the study, it remains unclear whether the findings might indeed be able to normalise altered predictive coding in psychosis. Psychosis is characterised by a blunted distinction between predicted and unpredicted stimuli. The results of this study indicate that antipsychotic drugs further blunt the distinction between predicted and unpredicted stimuli, which would suggest that antipsychotic drugs would deteriorate rather than ameliorate the predictive coding deficit found in psychosis. However, these findings were based on observations in wild-type mice at baseline. Given that antipsychotics are thought to have little effects in health but potent antipsychotic effects in psychosis, it seems possible that the presented results might be different in a condition modelling a psychotic state, for example after a dopamine-agonistic or a NMDA-antagonistic challenge. Therefore, future work in models of psychotic states is needed to further investigate the translational relevance of these findings.

      We fully agree that it is unclear how the effects of antipsychotics in mice relate to the drug effects that would be observed in schizophrenic patients. It is also correct that the reduction of the difference between closed and open loop locomotion onset response in L5 IT neurons (Figure 4) is not what we would have expected to find under the assumption that psychosis is characterized by a blunted distinction between predicted and unpredicted stimuli. We are not sure how to interpret this finding. However, it is probably important to note that the difference is only reduced when using a normalized comparison. Looking just at the subtraction of the two curves, the difference between closed and open loop locomotion onset responses remains unchanged before and after antipsychotic drug injection. The finding of a decorrelation of layer 5 activity, however, is easier to interpret under the assumption that layer 5 functions as an internal representation. If speech hallucinations, for example, are the consequence of a spurious activation of internal representations in speech processing areas of cortex, then antipsychotics might reduce the probability of these spurious activation events by reducing the lateral influence between layer 5 neurons in different cortical areas.

      We do indeed plan to address the question of how antipsychotics influence cortical processing in mouse models of schizophrenia in the future.

      • Incomplete testing of predictive coding interpretation

      While the investigation of neuronal responses to different visual flow stimuli Is interesting, it remains open whether these responses indeed reflect internal representations in the framework of predictive coding. While the responses are consistent with internal representation as defined by the researchers, i.e., unsigned prediction error signals, an alternative interpretation might be that responses simply reflect sensory bottom-up signals that are more related to some low-level stimulus characteristics than to prediction errors.

      This is correct – we will expand on the discussion of this point in the manuscript.

      Moreover, This interpretational uncertainty is compounded by the fact that the used experimental paradigms were not suited to test whether behaviour is impacted as a function of the visual stimulation which makes it difficult to assess what the internal representation of the animal actual was. For these reasons, the observed effects might reflect simple bottom-up sensory processing alterations and not necessarily have any functional consequences. While this potential alternative explanation does not detract from the value of the study, future work would be needed to explain the effect of antipsychotic drugs on responses to visual flow. For example, experimental designs that systematically vary the predictive strength of coupled events or that include a behavioural readout might be more suited to draw from conclusions about whether antipsychotic drugs indeed alter internal representations.

      We agree that much additional work will be necessary to identify internal representation neurons. However, it is difficult to envision how behavioral output could be used to make inferences about internal representations in sensory areas of cortex. In humans, for example, there is evidence that internal representations in visual cortex and behavioral output are not always directly related: binocular rivalry activates representations of both stimuli shown in visual cortex, while the conscious experience that drives behavioral output is only of one of the two stimuli. Hence, we would assume that the internal representation in visual cortex does not necessarily relate to behavioral output.

      • Methodological constraints of experimental design

      While the study findings provide valuable insights into the potential effects of antipsychotic drugs, it is important to acknowledge that there may be some methodological constraints that could impact the interpretation of the results. More specifically, the experimental design does not include a negative control condition or different doses. These conditions would help to ensure that the observed effects are not due to unspecific effects related to injection-induced stress or time, and not confined to a narrow dose range that might or might not reflect therapeutic doses used in humans. Hence, future work is needed to confirm that the observed effects indeed represent specific drug effects that are relevant to antipsychotic action.

      We agree that both dosages and a broader spectrum of non-antipsychotic compounds will need to be investigated. We are in the process of building a screening pipeline to perform exactly these types of experiments. We would however argue that the paper already includes a control condition in the form of the amphetamine data (Figure 7). While it is possible that amphetamine might have an effect that exactly cancels out potential i.p. injection- or stress-induced changes, we would argue it is more probable that these changes had no measurable effect on Tlx3 positive L5 IT neuron calcium activity per se. We will provide additional evidence that time or injection stress alone do not result in the observed effects.

      Conclusion:

      Overall, the results support the idea that antipsychotic drugs affect neural responses to predicted and unpredicted stimuli in deep layers of cortex. Although some future work is required to establish whether this observation can indeed be explained by a drug-specific effect on predictive coding, the study provides important insights into the neural underpinnings of visual processing and antipsychotic drugs, which is expected to guide future investigations on the predictive coding hypothesis of psychosis. This will be of broad interest to neuroscientists working on predictive coding in health and in disease.

      Reviewer #3 (Public Review):

      The study examines how different cell types in various regions of the mouse dorsal cortex respond to visuomotor integration and how antipsychotic drugs impacts these responses. Specifically, in contrast to most cell types, the authors found that activity in Layer 5 intratelencephalic neurons (Tlx3+) and Layer 6 neurons (Ntsr1+) differentiated between open loop and closed loop visuomotor conditions. Focussing on Layer 5 neurons, they found that the activity of these neurons also differentiated between negative and positive prediction errors during visuomotor integration. The authors further demonstrated that the antipsychotic drugs reduced the correlation of Layer 5 neuronal activity across regions of the cortex, and impaired the propagation of visuomotor mismatch responses (specifically, negative prediction errors) across Layer 5 neurons of the cortex, suggesting a decoupling of long-range cortical interactions.

      The data when taken as a whole demonstrate that visuomotor integration in deeper cortical layers is different than in superficial layers and is more susceptible to disruption by antipsychotics. Whilst it is already known that deep layers integrate information differently from superficial layers, this study provides more specific insight into these differences. Moreover, this study provides a first step into understanding the potential mechanism by which antipsychotics may exert their effect.

      Whilst the paper has several strengths, the robustness of its conclusions is limited by its questionable statistical analyses. A summary of the paper's strengths and weaknesses follow.

      Strengths:

      The authors perform an extensive investigation of how different cortical cell types (including Layer 2/3, 4 , 5, and 6 excitatory neurons, as well as PV, VIP, and SST inhibitory interneurons) in different cortical areas (including primary and secondary visual areas as well as motor and premotor areas), respond to visuomotor integration. This investigation provides strong support to the idea that deep layer neurons are indeed unique in their computational properties. This large data set will be of considerable interest to neuroscientists interested in cortical processing.

      The authors also provide several lines of evidence that visuomotor information is differentially integrated in deep vs. superficial layers. They show that this is true across experimental paradigms of visuomotor processing (open loop, closed loop, mismatch, drifting grating conditions) and experimental manipulations, with the demonstration that Layer 5 visuomotor integration is more sensitive to disruption by the antipsychotic drug clozapine, compared with cortex as a whole.

      The study further uses multiple drugs (clozapine, aripiprazole and haloperidol) to bolster its conclusion that antipsychotic drugs disrupt correlated cortical activity in Layer 5 neurons, and further demonstrates that this disruption is specific to antipsychotics, as the psychostimulant amphetamine shows no such effect.

      In widefield calcium imaging experiments, the authors effectively control for the impact of hemodynamic occlusions in their results, and try to minimize this impact using a crystal skull preparation, which performs better than traditional glass windows. Moreover, they examine key findings in widefield calcium imaging experiments with two-photon imaging.

      Weaknesses:

      A critical weakness of the paper is its statistical analysis. The study does not use mice as its independent unit for statistical comparisons but rather relies on other definitions, without appropriate justification, which results in an inflation of sample sizes.

      We will expand on both analyses and justifications throughout.

      For example, in Figure 1, independent samples are defined as locomotion onsets, leading to sample sizes of approx. 400-2000 despite only using 6 mice for the experiment. This is only justified if the data from locomotion onsets within a mouse is actually statistically independent, which the authors do not test for, and which seems unlikely. With such inflated sample sizes, it becomes more likely to find spurious differences between groups as significant. It also remains unclear how many locomotion onsets come from each mouse; the results could be dominated by a small subset of mice with the most locomotion onsets. The more disciplined approach to statistical analysis of the dataset is to average the data associated with locomotion onsets within a mouse, and then use the mouse as an independent unit for statistical comparison. A second example, for instance, is in Figure 2L, where the independent statistical unit is defined as cortical regions instead of mice, with the left and right hemispheres counting as independent samples; again this is not justified. Is the activity of cortical regions within a mouse and across cortical hemispheres really statistically independent? The problem is apparent throughout the manuscript and for each data set collected.

      This may partially be a misunderstanding. Figures 1F-1K indeed use locomotion onsets as a unit, but there were no statistical comparisons. In these Figures we were addressing the question of whether locomotion onsets in closed loop differ from those in open loop. Thus, we quantify variability as a unit of locomotion onsets. The question of mouse-to-mouse variability of this analysis is a slightly different one. We did include the same analysis (for visual cortex) with the variability calculated across mice as Figure S2. We will expand this supplementary figure with the equivalent data of Figure 3 to further address this concern.

      For Figure 1L (we assume the reviewer means Figure 1L, not Figure 2L), the unit we used for analysis was cortical area. We will update and improve the analysis. This was indeed not optimal, and we will replace the statistical testing with hierarchical bootstrap (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7906290/) to account for nested data.

      An additional statistical issue is that it is unclear if the authors are correcting for the use of multiple statistical tests (as in for example Figure 1L and Figure 2B,D). In general, the use of statistics by the authors is not justified in the text.

      We will update and improve the analysis shown in Figure 1L.

      In Figures 2B and 2D, we think adding family-wise error correction would be slightly misleading. We could add a correction – our conclusions would remain unchanged almost independent of the choice of correction (most of the significant p values are infinitesimally small, see Table S1). However, our interpretation is not focusing on one particular comparison (of many possible comparisons) that is significant - all comparisons between closed and open loop data points were significant in the L5 IT recordings and none of them were significant in the recordings in C57BL/6 mice that expressed GCaMP brain-wide.

      Finally, it is important to note that whilst the study demonstrates that antipsychotics may selectively impact visuomotor integration in L5 neurons, it does not show that this effect is necessary or sufficient for the action of antipsychotics; though this is likely beyond the scope of the study it is something for readers to keep in mind.

      We fully agree, it is still unclear how the effects we observe in our work relate to the treatment relevant effects in patients. We will expand on this point in the discussion.

    1. The idea here focuses a lot on memory retention of past events which I think can be approached in two ways:

      1. At the level of the EDR itself: just like Dinil already suggested, instead of simply checking a threshold of maliciousness, we can monitor the gradient as well to effectively raise signals with an increasing maliciousness of event sequences. The idea of aggregated events across boots or even distributed analysis across similar clients also fit into this scenario

      2. At the ML level using such architectures as LSTMs, attention layers, memory based graph networks, etc.

      We may need to use both approaches here

    1. single pathways are exclusionary.

      I agree with this statement. There are so many factors to consider when creating something that is to be used by various kinds of people. It may feel as though it is fair in the moment, but we need to stop and think of whether we missed anything or disregarded a disability because we ourselves do not have it.

    1. Author Response:

      Assessment note: “Whereas the results and interpretations are generally solid, the mechanistic aspect of the work and conclusions put forth rely heavily on in vitro studies performed in cultured L6 myocytes, which are highly glycolytic and generally not viewed as a good model for studying muscle metabolism and insulin action.”

      While we acknowledge that in vitro models may not fully recapitulate the complexity of in vivo systems, we believe that our use of L6 myotubes is appropriate for studying the mechanisms underlying muscle metabolism and insulin action. As mentioned below (reviewer 2, point 1), L6 myotubes possess many important characteristics relevant to our research, including high insulin sensitivity and a similar mitochondrial respiration sensitivity to primary muscle fibres. Furthermore, several studies have demonstrated the utility of L6 myotubes as a model for studying insulin sensitivity and metabolism, including our own previous work (PMID: 19805130, 31693893, 19915010).

      In addition, we have provided evidence of the similarities between L6 cells overexpressing SMPD5 and human muscle biopsies at protein levels and the reproducibility of the negative correlation between ceramide and Coenzyme Q observed in L6 cells in vivo, specifically in the skeletal muscle of mice in chow diet. These findings support the relevance of our in vitro results to in vivo muscle metabolism.

      Finally, we will supplement our findings by demonstrating a comparable relationship between ceramide and Coenzyme Q in mice exposed to a high-fat diet, to be shown in Supplementary Figure 4 H-I. Further animal experiments will be performed to validate our cell-line based conclusions. We hope that these additional results address the concerns raised by the reviewer and further support the relevance of our in vitro findings to in vivo muscle metabolism and insulin action.

      Points from reviewer 1:

      1. Although the authors' results suggest that higher mitochondrial ceramide levels suppress cellular insulin sensitivity, they rely solely on a partial inhibition (i.e., 30%) of insulin-stimulated GLUT4-HA translocation in L6 myocytes. It would be critical to examine how much the increased mitochondrial ceramide would inhibit insulin-induced glucose uptake in myocytes using radiolabel deoxy-glucose.

      Response: The primary impact of insulin is to facilitate the translocation of glucose transporter type 4 (GLUT4) to the cell surface, which effectively enhances the maximum rate of glucose uptake into cells. Therefore, assessing the quantity of GLUT4 present at the cell surface in non-permeabilized cells is widely regarded as the most reliable measure of insulin sensitivity (PMID: 36283703, 35594055, 34285405). Additionally, plasma membrane GLUT4 and glucose uptake are highly correlated. Whilst we have routinely measured glucose uptake with radiolabelled glucose in the past, we do not believe that evaluating glucose uptake provides a better assessment of insulin sensitivity than GLUT4.

      We will clarify the use of GLUT4 translocation in the Results section:

      “...For this reason, several in vitro models have been employed involving incubation of insulin sensitive cell types with lipids such as palmitate to mimic lipotoxicity in vivo. In this study we will use cell surface GLUT4-HA abundance as the main readout of insulin response...”

      1. Another important question to be addressed is whether glycogen synthesis is affected in myocytes under these experimental conditions. Results demonstrating reductions in insulin-stimulated glucose transport and glycogen synthesis in myocytes with dysfunctional mitochondria due to ceramide accumulation would further support the authors' claim.

      Response: We have carried out supplementary experiments to investigate glycogen synthesis in our insulin-resistant models. Our approach involved L6-myotubes overexpressing the mitochondrial-targeted construct ASAH1 (as described in Fig. 3). We then challenged them with palmitate and measured glycogen synthesis using 14C radiolabeled glucose. Our observations indicated that palmitate suppressed insulin-induced glycogen synthesis, which was effectively prevented by the overexpression of ASAH1 (N = 5, * p<0.05). These results provide additional evidence highlighting the role of dysfunctional mitochondria in muscle cell glucose metabolism.

      These data will be added to Supplementary Figure 4K and the results modified as follows:

      “Notably, mtASAH1 overexpression protected cells from palmitate-induced insulin resistance without affecting basal insulin sensitivity (Fig. 3E). Similar results were observed using insulin-induced glycogen synthesis as an ortholog technique for Glut4 translocation. These results provide additional evidence highlighting the role of dysfunctional mitochondria in muscle cell glucose metabolism (Sup. Fig. 5K). Importantly, mtASAH1 overexpression did not rescue insulin sensitivity in cells depleted…”

      We will add to the method section:

      “L6 myotubes overexpressing ASAH were grown and differentiated in 12-well plates, as described in the Cell lines section, and stimulated for 16 h with palmitate-BSA or EtOH-BSA, as detailed in the Induction of insulin resistance section.

      On day seven of differentiation, myotubes were serum starved in plain DMEM for 3 and a half hours. After incubation for 1 hour at 37C with 2 µCi/ml D-[U-14C]-glucose in the presence or absence of 100 nM insulin, glycogen synthesis assay was performed, as previously described (Zarini S. et al., J Lipid Res, 63(10): 100270, 2022).”

      1. In addition, it would be critical to assess whether the increased mitochondrial ceramide and consequent lowering of energy levels affect all exocytic pathways in L6 myoblasts or just the GLUT4 trafficking. Is the secretory pathway also disrupted under these conditions?

      Response: As the secretory pathway primarily involves the synthesis and transportation of soluble proteins that are secreted into the extracellular space, and given that the majority of cellular transmembrane proteins (excluding those of the mitochondria) use this pathway to arrive at their ultimate destination, we believe that the question posed by the reviewer is highly challenging and beyond the scope of our research. We will add this to the discussion:

      “...the abundance of mPTP associated proteins suggesting a role of this pore in ceramide induced insulin resistance (Sup. Fig. 6E). In addition, it is yet to be determined whether the trafficking defect is specific to Glut4 or if it affects the exocytic-secretory pathway more broadly…”

      Points from reviewer 2:

      1. The mechanistic aspect of the work and conclusions put forth rely heavily on studies performed in cultured myocytes, which are highly glycolytic and generally viewed as a poor model for studying muscle metabolism and insulin action. Nonetheless, the findings provide a strong rationale for moving this line of investigation into mouse gain/loss of function models.

      Response: The relative contribution of the anaerobic (glycolysis) and aerobic (mitochondria) contribution to the muscle metabolism can change in L6 depending on differentiation stage. For instance, Serrage et al (PMID30701682) demonstrated that L6-myotubes have a higher mitochondrial abundance and aerobic metabolism than L6-myoblasts. Others have used elegant transcriptomic analysis and metabolic characterisation comparing different skeletal muscle models for studying insulin sensitivity. For instance, Abdelmoez et al in 2020 (PMID31825657) reported that L6 myotubes exhibit greater insulin-stimulated glucose uptake and oxidative capacity compared with C2C12 and Human Mesenchymal Stem Cells (HMSC). Overall, L6 cells exhibit higher metabolic rates and primarily rely on aerobic metabolism, while C2C12 and HSMC cells rely on anaerobic glycolysis. It is worth noting that L6 myotubes are the cell line most closely related to adult human muscle when compared with other muscle cell lines (PMID31825657). Our presented results in Figure 6 H and I provide evidence for the similarities between L6 cells overexpressing SMPD5 and human muscle biopsies. Additionally, in Figure 3J-K, we demonstrate the reproducibility of the negative correlation between ceramide and Coenzyme Q observed in L6 cells in vivo, specifically in the skeletal muscle of mice in chow diet. Furthermore, we have supplemented these findings by demonstrating a comparable relationship in mice exposed to a high-fat diet, as shown in Supplementary Figure 4 H-I (refer to point 4). We will clarify these points in the Discussion:

      “In this study, we mainly utilised L6-myotubes, which share many important characteristics with primary muscle fibres relevant to our research. Both types of cells exhibit high sensitivity to insulin and respond similarly to maximal doses of insulin, with Glut4 translocation stimulated between 2 to 4 times over basal levels in response to 100 nM insulin (as shown in Fig. 1-4 and (46,47)). Additionally, mitochondrial respiration in L6-myotubes have a similar sensitivity to mitochondrial poisons, as observed in primary muscle fibres (as shown in Fig. 5 (48)). Finally, inhibiting ceramide production increases CoQ levels in both L6-myotubes and adult muscle tissue (as shown in Fig. 2-3). Therefore, L6-myotubes possess the necessary metabolic features to investigate the role of mitochondria in insulin resistance, and this relationship is likely applicable to primary muscle fibres”.

      We will also add additional data - in point 2 - from differentiated human myocytes that are consistent with our observations from the L6 models. Additional experiments are in progress to further extend these findings.

      1. One caveat of the approach taken is that exposure of cells to palmitate alone is not reflective of in vivo physiology. It would be interesting to know if similar effects on CoQ are observed when cells are exposed to a more physiological mixture of fatty acids that includes a high ratio of palmitate, but better mimics in vivo nutrition.

      Response: Palmitate is widely recognized as a trigger for insulin resistance and ceramide accumulation, which mimics the insulin resistance induced by a diet in rodents and humans. Previous studies have compared the effects of a lipid mixture versus palmitate on inducing insulin resistance in skeletal muscle, and have found that the strong disruption in insulin sensitivity caused by palmitate exposure was lessened with physiologic mixtures of fatty acids, even with a high proportion of saturated fatty acids. This was associated, in part, to the selective partitioning of fatty acids into neutral lipids (such as TAG) when muscle cells are exposed to physiologic lipid mixtures (Newsom et al PMID25793412). Hence, we think that using palmitate is a better strategy to study lipid-induced insulin resistance in vitro. We will add to results:

      “In vitro, palmitate conjugated with BSA is the preferred strategy for inducing insulin resistance, as lipid mixtures tend to partition into triacylglycerides (33)”.

      We are also performing additional in vivo experiments to add to the physiological relevance of the findings.

      1. While the utility of targeting SMPD5 to the mitochondria is appreciated, the results in Figure 5 suggest that this manoeuvre caused a rather severe form of mitochondrial dysfunction. This could be more representative of toxicity rather than pathophysiology. It would be helpful to know if these same effects are observed with other manipulations that lower CoQ to a similar degree. If not, the discrepancies should be discussed.

      Response: We conducted a staining procedure using the mitochondrial marker mitoDsRED to observe the effect of SMPD5 overexpression on cell toxicity. The resulting images, displayed in the figure below (Author Response Figure 1), demonstrate that the overexpression of SMPD5 did not result in any significant changes in cell morphology or impact the differentiation potential of our myoblasts into myotubes.

      Author Response Figure 1.

      In addition, we evaluated cell viability in HeLa cells following exposure to SACLAC (2 uM) to induce CoQ depletion (left panel). Specifically, we measured cell death by monitoring the uptake of Propidium iodide (PI) as shown in the right panel. Our results demonstrated that Saclac-induced CoQ depletion did not lead to cell death at the doses used for CoQ depletion (Author Response Figure 2).

      Author Response Figure 2.

      Therefore, we deemed it improbable that the observed effect is caused by cellular toxicity, but rather represents a pathological condition induced by elevated levels of ceramides. We will add to discussion:

      “...downregulation of the respirasome induced by ceramides may lead to CoQ depletion. Despite the significant impact of ceramide on mitochondrial respiration, we did not observe any indications of cell damage in any of the treatments, suggesting that our models are not explained by toxic/cell death events.”

      1. The conclusions could be strengthened by more extensive studies in mice to assess the interplay between mitochondrial ceramides, CoQ depletion and ETC/mitochondrial dysfunction in the context of a standard diet versus HF diet-induced insulin resistance. Does P053 affect mitochondrial ceramide, ETC protein abundance, mitochondrial function, and muscle insulin sensitivity in the predicted directions?

      Response: We would like to note that the metabolic characterization and assessment of ETC/mitochondrial function in these mice (both fed a high-fat (HF) and chow diet, with or without P053) were previously published (Turner N, PMID30131496). In addition to this, we have conducted targeted metabolomic and lipidomic analyses to investigate the impact of P053 on ceramide and CoQ levels in HF-fed mice. As illustrated in the figures below (Author Response Figure 3), the administration of P053 led to a reduction in ceramide levels (left panel) and an increase in CoQ levels (right panel) in HF-fed mice, which is consistent with our in vitro findings.

      Author Response Figure 3.

      We will add to results:

      “…similar effect was observed in mice exposed to a high fat diet for 5 wks (Supp. Fig. 4H-I further phenotypic and metabolic characterization of these animals can be found in (41))”

      We will further perform more in-vivo studies to corroborate these findings.

    1. I think we’re about to enter a stage of sharing the web with lots of non-human agents that are very different to our current bots – they have a lot more data on how behave like realistic humans and are rapidly going to get more and more capable.Soon we won’t be able to tell the difference between generative agents and real humans on the web.Sharing the web with agents isn’t inherently bad and could have good use cases such as automated moderators and search assistants, but it’s going to get complicated.

      Having the internet swarmed by generative agents is unlike current bots and scripts. It will be harder to see diff between humans and machines online. This may be problematic for those of us who treat the web as a space for human interaction.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their comments and constructive suggestions to improve the manuscript. We are encouraged to see that both reviewers acknowledge how the results from our manuscript uses state-of-art technologies to advance molecular underpinnings of centriole length, integrity and function regulation. Both reviewers also highlighted that the manuscript is well laid out and presents clear, rigorous, and convincing data. Reviewer#1 described our manuscript of highest experimental quality and broad interest to the field of centrosome and cell biology form a basic research and genetics/clinical point of view. Here, we explain the revisions, additional experimentations and analyses planned to address the points raised by the referees. We will perform most of the experimentations and corrections requested by the reviewers. We have already made several revisions and are currently working on additional experiments.

      Our responses to each reviewer comment in bold are listed below. References mentioned here are listed in the references section included at the of this document.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary: __In this manuscript, Arslanhan and colleagues use proximity proteomics to identify CCDC15 as a new centriolar protein that co-localizes and interacts with known inner scaffold proteins in cell culture-based systems. Functional characterization using state-of-the-art expansion microscopy techniques reveals defects in centriole length and integrity. The authors further reveal intriguing aberrations in the recruitment of other centriole inner scaffold proteins, such as POC1B and the SFI1/centrin complex, in CCDC15-deficient cells, and observe defects in primary cilia. __

      We thank the reviewer for the accurate summary of the major conclusions of our manuscript.

      Major points:

      1) The authors present a high-quality manuscript that identifies a novel centriolar protein by elegantly revealing and comparing the proximity proteomes of two known centriolar proteins, which represents an important component for the maintenance of centrioles.

      We thank the reviewer for highlighting that our manuscript is of high quality and presents important advances for the field.

      __2) Data are often presented from two independent experiments (n = 2), which is nice, but also the minimum for experiments in biology. It is strongly recommended to perform at least three independent experiments. __

      We agree with the reviewer that analysis of data form three experimental replicates is ideal for statistical analysis. We performed three replicates for the majority of experiments in the manuscript. However, as the reviewer pointed out, we included analysis from two experiments for the following figures:

      • Fig. 4H: quantification of CCDC15 total cellular levels throughout the cell cycle by western blotting
      • Fig. 5A: CCDC15-positive centrioles in control and CCDC15 siRNA-transfected cells
      • Fig. 6B: % centriolar coverage of POC5, FAM161A, POC1B and Centrin-2 in control and CCDC15 siRNA-transfected cells
      • Fig. 6C, 6E: Centrin-2 or SFI1-positive centrioles in control and CCDC15 siRNA-transfected cells
      • Fig. 6J, K: normalized tubulin length and percentage of defective centrioles in cells depleted for CCDC15 or co-depleted for CCDC15 and POC1B
      • Fig. 7F, H: SMO-positive cilia and basal body IFT88 levels in control and CCDC15 siRNA-transfected cells
      • Fig. S3H: centriole amplification in HU-treated control and CCDC15 siRNA-transfected cells (no)
      • Fig. S3A: centrosomal levels upon CCDC15 depletion There are two reasons for why we performed two experimental replicates for these experiments: 1) results from the two experimental replicates were similar, 2) quantification of data by U-ExM is laborious. To address the reviewer’s comments, we will perform the third experimental replicate for the sets of data that led to major conclusions of our manuscript, which are Figures 4H, 6C, 6E, 6J, 6K, 7F, 7H and S3A.

      3) The protein interaction studies presented in Fig. 3 could be of higher quality. While it is great that the authors compared interactions to the centriolar protein SAS6, which is not expected to interact with CCDC15, the presented data raise many questions.

      __a) In most cases, co-expression of tagged CCDC15 stabilizes the tested interaction partners, such that the overall abundance seems to be higher. The increase in protein abundance is substantial for Flag-FAM161A (Fig. 3D) and GFP-Centrin-2 (Fig. 3E) and is even higher for the non-interactor SAS6 (Fig. 3G), while it cannot be assessed for GFP-POC1B (Fig. 3F). Hence, the higher expression levels under these conditions make it more likely that these proteins are "pulled down" and therefore do not represent appropriate controls. __

      We agree with the reviewer that the differences in protein abundance of the prey proteins upon expression of CCDC15 relative to control might impact the interpretation of the interaction data. To address this concern, we will perform the following experiments:

      • To account of the potential stabilizing effects of CCDC15 expression, we will change the relative ratio of plasmids expressing proteins of interest and assess the expression of bait and prey protein levels. We will then repeat the co-immunoprecipitation experiments in conditions where prey expression levels are similar.
      • To avoid the potential stabilizing effects of CCDC15 overexpression, we will perform immunoprecipitation experiments in cells expressing GFP or V5-tagged inner scaffold proteins and assess their potential physical or proximity interaction by blotting for endogenous CCDC15. __b) All Co-IP experiments are lacking negative controls in the form of proteins that are not pulled down under the presented conditions. __

      For the co-IP experiments, we only included a specificity control for the interaction of the bait protein with the tag of the prey protein (i.e. GBP pulldown of GFP or GFP-CCDC15-expressing cells). As the reviewer suggested, we will also include a specificity control for the interaction of bait with the tag of the prey protein for co-immunoprecipitation experiments (i.e. GFP pulldown of cells expressing GFP-CCDC15 with V5-BirA* or V5-BirA*-FAM161A).

      __c) The amounts of co-precipitation of the tested proteins appears very different. Could this reflect strong or weak interactors, or does it reflect the abundance of the respective proteins in centrioles? __

      We agree with the reviewer that the quantity of the co-precipitated prey proteins might be a proxy for the interaction strength if the abundance of the bait proteins is similar. However, the expression levels of bait and prey proteins in co-transfected cells are different and thus, cannot be used to derive a conclusion on the interaction strength. For the revised manuscript, we will repeat the IP experiments and comment on this in the discussion section.

      __4) The observation that IFT88 is supposedly decreased at the base of cilia in CCDC15-depleted cells requires additional experiments/evidence. Fig. 7G shows the results of n = 2 and more importantly, a similar reduction of gamma-tubulin in siCCDC15. Could the observed reduction in IFT88 be explained by a decrease in accessibility to immunofluorescence microscopy? Would the reduction in IFT88 at the base also be apparent when the signals were normalized to gamma-tubulin signals? __

      To address the reviewer’s concern, we quantified the basal body gamma-tubulin and IFT88 levels in control and CCDC15-depleted cells and plotted the basal body IFT88 levels normalized to gamma-tubulin levels in Fig. 7H. Similar to the reduction in IFT88 levels, gamma-tubulin-normalized IFT88 levels was significantly less relative to control cells. Moreover, the gamma-tubulin basal body levels were similar between control and CCDC15 cells. We revised the gamma-tubulin micrographs in Fig. 7G to represent this. These results indicate that the reduction in basal body IFT88 levels upon CCDC15 depletion in specific.

      __5) The observed Hedgehog signaling defects are described as follows: "CCDC15 depletion significantly decreased the percentage of SMO-positive cells". It is similarly described in the figure legend. If this was true, the simplest explanation would be that it reflects the reduction in ciliation rate (which is in a similar range). If SMO-positive cilia (instead of "cells") were determined, the text needs to be changed accordingly. __

      As the reviewer pointed out, we quantified SMO-positive cilia, but not cells. We are sorry for this typo. We corrected SMO-positive cells as SMO-positive cilia in the manuscript text, Fig. 7 and figure legends.

      __6) OPTIONAL: While expansion microscopy is slowly becoming one of the standard super-resolution microscopy methods, which is particularly well validated for studying centrioles, the authors should consider confirming part of their findings (as a proof of principle, surely not in all instances) by more established techniques. This could serve to convince critical reviewers that may argue that the expansion process may induce architectural defects of destabilized centrioles, as observed after disruptions of components, such as in Fig. 6. Alternatively, the authors could cite additional work that make strong cases about the suitability of expansion microscopy for their studies, ideally with comparisons to other methods. __

      • SIM imaging was previously successfully applied for nanoscale mapping of other centriole proteins including CEP44, MDM1 and PPP1R35 (Atorino et al., 2020; Sydor et al., 2018; Van de Mark et al., 2015). To complement the U-ExM analysis, we have started imaging cells stained for CCDC15 and different centriole markers (i.e. distal appendage, proximal linker, centriole wall) using a recently purchased 3D-SIM superresolution microscope. We already included the SIM imaging data for CCDC15 localization in centrosome fractions purified from HEK293T cells in Fig. S5B. In the revised manuscript, we will replace confocal imaging data in Fig. 3A and 3B with SIM imaging data.
      • As the reviewer noted, expansion microscopy has been successfully used for the analysis of a wide range of cellular structures and scientific questions including nanoscale mapping of cellular structures across different organisms. In particular, U-ExM of previously characterized centrosome proteins various centriole proteins have significantly advanced our understanding of centriole ultrastructure. In our manuscript, we used the U-ExM protocol that was validated for centrioles by comparative analysis of U-ExM and cryo-ET imaging by our co-authors (Gambarotto et al., 2019; Hamel et al., 2017). To clarify these points, we included the following sentence along with the relevant references in the introduction: “Application of the U-ExM method to investigate known centrosome proteins has started to define the composition of the inner scaffold as well as other centriolar sub-compartments (Chen et al., 2015; Gambarotto et al., 2021; Gambarotto et al., 2019; Kong and Loncarek, 2021; Laporte et al., 2022; Mahen, 2022; Mercey et al., 2022; Odabasi et al., 2023; Sahabandu et al., 2019; Schweizer et al., 2021; Steib et al., 2022; Tiryaki et al., 2022; Tsekitsidou et al., 2023).”

      Minor points:

      1) Text, figures, and referencing are clear and accurate, apart from minor exceptions.

      We clarified and corrected the points regarding text, figures and references as suggested by the two reviewers.

      __ 2) The title suggests a regulator role for CCDC15 in centriole integrity and ciliogenesis, which has formally not been shown. __

      We revised the title as “CCDC15 localizes to the centriole inner scaffold and functions in centriole length control and integrity”.

      __3) As the authors observe changes in centriole lengths in the absence of CCDC15, it would be very insightful to compare these phenotypes to other components that affect centriolar length, such as C2CD3, human Augmin complex components (as HAUS6 is identified in Fig. 1) or others. These could be interesting aspects for discussion, additional experiments are OPTIONAL. __

      We agree with the reviewer that comparative analysis of centriole length phenotypes for CCDC15 and other components that regulate centriole length will provide insight into how these components work together at the centriole inner core. To this end, we phenotypically compared CCDC15 loss-of-function phenotypes to that of other components of the inner scaffold (POC5, POC1B, FAM161A) that interact with CCDC15. In agreement with their previously reported functions in U2OS or RPE1 cells, we found that POC5 depletion resulted in a 4% slight but significant increase in centriole length and POC1B depletion resulted in a 15% significant decrease. In contrast, FAM161A depletion did not alter centriole length (siControl: 447.8±59.7 nm, siFAM161A 436.3±64 nm). Together, our analysis of their centriolar localization dependency and regulatory roles during centriole length suggest that CCDC15 and POC1B might form a functional complex as positive regulators of centriole length. In contrast, POC5 functions as a negative regulator and might be part of a different pathway for centriole length regulation. We integrated the following sub-paragraph in the results section and also included discussion of this data in the discussion section:

      “Moreover, we quantified centriole length in control cells and cells depleted for POC5 or POC1B. While POC5 depletion resulted in longer centrioles, POC1B resulted in shorter centrioles (POC5: siControl: 414.1 nm±38.3, siPOC5: 432.7±44.8 nm, POC1B: siControl: 400.6±36.1 nm, siPOC1B: 341.5±44.39 nm,). FAMA161A depletion did not alter centriole length (siControl: 447.8±59.7 nm, siFAM161A 436.3±64 nm). Together, these results suggest that CCDC15 might cooperate with POC1B and compete with POC5 to establish and maintain proper centriole length.”

      __ 4) While the reduced ciliation rate in the absence of CCDC15 is convincing, the authors did not investigate "ciliogenesis", i.e. the formation of cilia, and hence should re-phrase. The sentence in the discussion that "CCDC15 functions during assembly" should be removed. __

      To clarify that we only investigated the role of CCDC15 in the ability of cells to form cilia, we replaced sentences that indicates “CCDC15 functions in cilium assembly” with “CCDC15 is required for the efficiency of cilia formation”.

      __5) The existence of stably associated CCDC15 pools with centrosomes (Fig. 2) requires further evidence. The recovery of fluorescence after photobleaching in FRAP experiments is strongly dependent on experimental setups and is only semi-quantitative. A full recovery is unrealistic, hence, it is ideally compared to a known static or known mobile component. I personally think this experiment -as it is presented now- is of little value to the overall fantastic study. The authors may consider omitting this piece of data. __

      We agree with the reviewer that FRAP data by itself does not prove the existence of stably associated CCDC15 pool. As controls in these experiments, we use FRAP analysis of GFP-CCDC66, which has a 100% immobile pool at the cilia and 50% immobile pool at the centrosomes as assessed by FRAP (Conkar et al., 2019). To address these points, we toned down the conclusions derived from this experiment by revising the sentence as follows:

      Additionally, we note that the following data provides support for the stable association of CCDC15 at the centrioles:

      • About 49.6% (± 3.96) of the centrioles still had CCDC15 fluorescence signal at one of the centrioles upon CCDC15 siRNA treatment (Fig. 5A, 5B). The inefficient depletion of the mature centriole pool of CCDC15 is analogous to what was observed upon depletion of other centriole lumen and inner scaffold proteins including WDR90 and HAUS6 (Schweizer et al., 2021; Steib et al., 2020). __6) The data that CCDC15 is a cell cycle-regulated protein is not very convincing (see Fig. 3H), as the signals area weak and the experiment has been performed only once (n= 1). This piece of data does not appear to be very critical for the main conclusions of the manuscript and may be omitted. Otherwise, this experiment should be repeated to allow for proper statistical analysis. __

      We will perform these experiments two more times, quantify cellular abundance of CCDC15 in synchronized populations from three experimental replicates and plot it with proper statistical analysis.

      __7) Experimental details on how "defective centrioles" are determined are missing. __

      We included the following experimental details to the methods section:

      “Centrioles were considered as defective when the roundness of the centriole was lost or the microtubule walls were broken or incomplete. In the longitudinal views of centrioles, defective centrioles were visualized as heterogenous acetylated signal along the centriole wall or irregularities in the cylindrical organization of the centriole wall (Fig. 5F). We clarified these points in the methods section.

      __ 8) For figures, in which the focus should be on growing centrioles (see Fig. 4), it could be helpful to guide the reader and indicate the respective areas of the micrographs by arrows. __

      We added arrows to point to the respective areas of the micrographs in Fig. 4F.

      __ 9) Page18: "centriole length shortening" could be changed to "centriole shortening". __

      We corrected this description as suggested.

      __10) It is unclear how the authors determine distal from proximal ends of centrioles in presented micrographs (see Fig. 5D). __

      We determined the proximal and distal ends of the centrioles by taking the centriole pairs as a proxy. Even though we only represent a micrograph containing a single centriole in some of the U-ExM figures including Fig. 5D, the uncropped micrographs contain two centrioles, which are oriented orthogonally and tethered to each other at their proximal ends in interphase cells. We added the following sentence to the methods section to clarify this point:

      *“Since centrioles are oriented orthogonally and tethered to each other at their proximal ends in interphase cells, we also used the orientation of the centriole pairs as a proxy to determine the proximal and distal ends of the centrioles.” *

      __11) Fig. 7A is missing scale bars and Fig.7 overall is lacking rectangle indicators of the areas that are shown at higher magnification in the insets. __

      We added scale bar to Fig. 7A and rectangle indicators for zoomed in regions in Fig. A, E, G.

      12) Fig. 7C displays cilia that appear very short, especially when comparing to the micrographs and bar graphs presented. The authors may want to explain this discrepancy.

      We thank the reviewer for the comment. The zoomed in representative cilia is 4.1 µM in control cells and 1.4 µM in CCDC15-depleted cells. Therefore, the representative cilia is in agreement with the quantification of cilia in Fig. 7C.

      Reviewer #1 (Significance (Required)):From a technical point of view the authors use two state-of-the-art technologies, namely proximity labeling combined with proteomics and ultrastructure expansion microscopy, that are both challenging and very well suited to address the main questions of this study. ____ • General assessment: The presented study is of highest experimental quality. Despite being very challenging, the expansion microscopy and proximity proteomics experiments have been designed and performed very well to allow solid interpretation. The results of the central data are consistent and allow strong first conclusions about the putative function of the newly identified centriolar protein CCDC15. The study presents a solid foundation for future hypothesis-driven, mechanistic analysis of CCDC15 and inner scaffold proteins in centriole length control and maintaining centriole integrity. The only limitation of the study is that the technically simpler experiments should be repeated to allow proper statistical assessment, which can be addressed easily. • Advance: This is the first study that identifies CCDC15 as a centriolar protein and localizes it to the inner scaffold. It further describes a function for CCDC15 in centriole length control and shows its importance in maintaining centriole integrity with consequences for stable cilia formation in tissue culture. The study provides further functional insights into the interdependence of inner scaffold proteins and the role of CCDC15 in the recruitment of the SFI1/centrin distal complex. • Audience: The manuscript will be of broad interest to the fields of centrosome and cell biology, both from a basic research and genetics/clinical point of view due to the association with human disorders. The state-of-the-art technologies applied will be of interest to a broader cell and molecular biology readership that studies subcellular compartments and microtubules. • Reviewer's field of expertise: Genetics, imaging, and protein-protein interaction studies with a focus on centrosomes and cilia.

      We thank the reviewer for recognizing the importance of our work and for supportive and insightful comments that will further strengthen the conclusions of our manuscript. Our planned revisions will address the only major technical limitation raised by the reviewer that requires adding one more experimental replicate for analysis of the data detailed in major point#1. Notably, we also thank the reviewer to specifying the experiments that are not essential or will be out of the scope of our manuscript as “optional”.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary:

      __In this study, Arslanhan et al. propose CCDC15 as a novel component of the centriole inner scaffold structure with potential roles in centriole length control, stability and the primary cilium formation in cultured epithelial cells. Using proximity labelling they explore the common interactors of Poc5 and Centrin-2, two resident molecules of the centriole inner scaffold, to hunt for novel regulators of this structure. The authors leverage expansion microscopy-based localization and siRNA-dependent loss-of-function experiments to follow up on one such protein they identify, CCDC15, with the aforementioned roles in centriole and cilia biology.

      This study is designed and laid out nicely; however, to be able to support some of the important claims regarding their proximity labelling results and exploration on the roles of CCDC15, there are several major technical and reproducibility concerns that deem major revision. Similarly, the introduction (perhaps inadvertently) omits much of the recent studies on centriole size control that have highlighted the complexity of this biological problem. As such, addressing the following major points will be essential in further considering this work for publication. __

      __We thank the reviewer for recognizing the importance of our work and appreciate the positive reflections on our manuscript and the feedback comments that were well thought-out and articulated and will further strengthen the conclusions of our manuscript. Our planned revisions focus on addressing the reviewer’s comments especially in further supporting our conclusions for proximity-labeling, phenotypic characterization and immunoprecipitation experiments, examining CCDC15 centriole localization in an additional cell line and investigating how CCDC15 works together during centriole length control with known components of the inner scaffold. __

      Major points:

      __1a) The authors use Poc5 and Centrin-2 molecules as joint baits to reveal the interactome of the centriole inner scaffold, however the work lacks appropriate experimental and analytical controls to argue that this is a proximity mapping "at the centriole inner scaffold". In its current state, it is simply an interactome of total Poc5 and Centrin-2, and it might be misleading to call it an interactome at the centriole inner scaffold (the statistical identification of shared interactors cannot do full justice to their biology at the centrosome). Appropriate expression data needed to delineate how large the centrosomal vs. cytoplasmic (or nucleoplasmic) fraction is for either of these molecules, both without and upon the addition of biotin (to see whether the bulk of interaction data stem from the cytoplasm/nucleoplasm or the centrioles themselves). The authors can test this by selectively blotting a lysate fraction containing the centrosomes after centrifugation, and compare them with the simultaneous blot of the supernatant (which were readily used for the blots presented in Fig. 1B). This experiment also becomes very relevant for the case of Centrin-2, as it also heavily localizes to the nucleoplasm as the authors found out (see Fig. 1A and Fig. S1A). __

      __ Additionally, an orthogonal approach should be taken to perform bio-image analysis on their biotin/streptavidin imaging data to demonstrate the exact ratios between the centrosomal vs. cytoplasmic/nucleoplasmic biotin activation with appropriate signal normalization between the biotin/streptavidin images. This is particularly important, as although the authors claim that these cells stably express the V5BirA*, it seems that there is partial clonality to the expression. Some cells in both the Poc5 and Centrin-2 fusion constructs appear to lack the V5/Streptavidin signals upon Biotin addition (such as the two cells in the centre right in Poc5, and again a cell in the centre right for Centrin-2 images). In its current form, Fig. 1A lacks signal quantification and does not report any information about the replicates and distributions of the data. I worry that this may raise concerns on the reproducibility if published in its current form. __a) We agree with the reviewer that the proximity maps of POC5 and

      a) Centrin-2 are not specific to the centriole inner scaffold and thus, do not represent the inner scaffold interactome. The proximity maps identified interactions across different pools of POC5 and Centrin-2 in nucleus, cytoplasm and centrosomes (Fig. 1, S1). To highlight these important points, we already included extensive analysis of the different cellular compartments and biological processes identified by the POC5 and Centrin-2 proximity maps in the results section (pg. 9-10).

      We think that there are two reasons that caused the misinterpretation of the use of these proximity maps as the “inner scaffold interactome”: 1) the way we introduced the motivation for proximity mapping studies, 2) proposing the use of the resulting interactomes as resources for identification of the full repertoire of the inner scaffold proteins. To clarify these points, we revised the manuscript in all relevant parts that might have led to misinterpretation. Following are the specific revisions:

      • To clarify that the proximity maps are not specific to the inner scaffold pools of POC5 and Centrin-2, we revised the title of the results section for Fig. 1 and 2 as follows: “Proximity mapping of POC5 and Centrin-2 identifies new centriolar proteins”.

      • To indicate that POC5 and Centrin-2 localizes to the cytoplasm and/or nucleus in addition to the centrosome, we added the following sentence to the result section: In addition to centrosomes, both fusion proteins also localized to and induced biotinylation diffusely in the cytoplasm and/or nucleus (Fig. 1A).”

      • In the introduction, we revised the following sentence “Here, we used the known inner scaffold proteins as probes to identify the molecular makeup of the inner scaffold in an unbiased way.” as follows: *“Here, we used the known inner scaffold proteins as probes to identify new components of the inner scaffold”. *

      • To highlight the different cellular pools of POC5 and Centrin-2 and identification of their interactors in these pools, we included the following sentence in the results section: “As shown in Fig. S1, Centrin-2 and POC5 proximity interactomes were enriched for GO categories that are relevant for their published functions during centrosomal, cytoplasmic and/or nuclear biological processes and related cellular compartments (Azimzadeh et al., 2009; Dantas et al., 2013; Heydeck et al., 2020; Khouj et al., 2019; Resendes et al., 2008; Salisbury et al., 2002; Steib et al., 2020; Yang et al., 2010; Ying et al., 2019).”

      • We replaced the “interactome” statement with “proximity interaction maps” or “proximity interactors” throughout the manuscript to prevent the conclusion that the proximity maps represent the inner scaffold interactome. b) As the reviewer noted, most centrosome proteins have multiple different cellular pools including the centrosome. For most proteins like gamma-tubulin and centrin, their cytoplasmic/nucleoplasmic pools are more abundant than their centrosomal pools (Moudjou et al., 1996; Paoletti et al., 1996). For the Firat-Karalar et al. Current Biology 2015 paper, I compared the biotinylation levels of centrosomal fractions versus cytoplasmic fractions and confirmed that this is also true in cells expressing myc-BirA* fusions of CDK5RAP2, CEP192, CEP152 and CEP63 (unpublished) (Firat-Karalar et al., 2014). For the revised manuscript, we will compare the biotinylation level of centrosomal, nuclear and cytoplasmic pools of V5Bir*-POC5 and V5BirA*-Centrin-2 using the stable lines. To this end, we will use published centrosome purification protocols. We will include this data in Fig. S1 to highlight that the proximity interactomes represent the different pools of the bait proteins and to show the relative levels of the baits across their different pools.

      c) BioID approach has been successfully used to probe centrosome interactions by my lab and other labs in the field. In fact, proximity interaction maps of over 50 centrosome proteins were published as resource papers by Pelletier&Gingras labs (Gheiratmand et al., 2019; Gupta et al., 2015). Analogous to our strategy in this manuscript, these studies generated proximity maps of centrosome proteins by creating cell lines that stably express BioID-fusions of centrosome proteins followed by streptavidin pulldowns from whole cell extracts and mass spectrometry analysis. Since majority of centrosome proteins also have pools in multiple cellular locations, the published BioID proximity maps for centrosome proteins are not specific to centrosomes. However, the proximity maps included all known centrosome proteins and identified new proteins, which shows that centrosome interactions are represented in pulldowns form whole cell lysates. Moreover, maps form whole cell lysates are also advantageous as they are are unbiased and can be used in future studies as resources for studying the functions and interactions of the bait proteins in different contexts.

      In the Firat-Karalar et al. Current Biology 2015 paper, I combined centrosome purifications with BioID pulldowns to enrich for the centrosomal interactions in the proximity maps of centriole duplication proteins(Firat-Karalar et al., 2014). However, I started the purification with cells transiently transfected with the BioID-fusion constructs, which resulted in high ectopic expression of the fusions in the cytoplasm and/or nucleus. Therefore, centrosome enrichments were useful as an additional step before mass spectrometry. Comparative analysis of the data for proximity maps of 4 centrosome proteins generated from stable lines or centrosome fractions of transiently transfected cells substantially overlap as compared in the Gupta et al. Cell 2015 study and were more comprehensive (Table S2) (Gupta et al., 2015). Therefore, we are confident that the proximity interactomes we generated for POC5 and Centrin-2 include their centrosomal interactions.

      __1b) Similarly, it is not clear whether the expression of Poc5 and Centrin-2 fusion molecules somehow interfere with their endogenous interactions or function. At least some loss-of-function (e.g., RNAi) experiments should be performed where the depletion of endogenous proteins should be attempted to rescue by the fusion constructs. This will help evaluate whether the fusion proteins can rescue the depletion of their endogenous counterparts and behave as expected from a wild-type scenario. __

      The reviewer raises an important concern regarding the physiological relevance of the POC5 and Centrin-2 proximity maps. In the manuscript, we showed and discussed the validation of their proximity interactomes by two lines of evidence, which are: 1) the interactomes identified the previously described cellular compartments, biological processes or interactors of POC5 and Centrin-2, 2) the interactomes led to the identification of CCDC15 as a new inner scaffold protein.

      As the reviewer indicated, stable expression of POC5 and Centrin-2 in the presence of their endogenous pools might affect cellular physiology and thereby the landscape of the interactomes. We plan to address this using the following experiments:

      a) We will perform a set of functional assays to assess whether stable V5BirA*-Centrin-2 and V5BirA*-POC5 cells behaves like control cells in terms of their centrosome number, cell cycle profiles and mitotic progression. We will specifically quantify:

      • centrosome number (immunofluorescence analysis for gamma-tubulin and centrin)
      • their mitotic index (immunofluorescence analysis by DAPI)
      • spindle polarity and percentage of multinucleation (immunofluoerescence analysis for microtubules, gamma-tubulin and DAPI)
      • cell cycle profiles (flow cytometry and immunofluorescence)
      • apoptosis (immunoblotting for caspase 3) Together, results from these experiments indicate that the V5BirA*-POC5 or Centrin-2-expressing stable lines do not exhibit defects associated with their stable expression.

      b) We will perform expansion microscopy in V5BirA*-Centrin-2 and V5BirA*-POC5 cells to assess whether the fusion protein specifically localizes to the centriole inner scaffold, which will provide support for the presence of inner scaffold proteins in their proximity maps. Specifically, we plan to stain the fusion proteins by V5 or BirA antibodies and include the data for the antibody that works for expansion microscopy. This experiment will address whether their stable expression results in specific localization of these proteins at the centriole inner scaffold.

      1c) Overall, as the entire claim around the proximity mapping revolve around its assumption about the centriole inner scaffold, these controls seem imperative to substantiate the ground truth of the biology presented in the manuscript.

      In the revised manuscript, we toned down and made it clear that Centrin-2 and POC5 proximity maps are not specific to the inner scaffold and do not represent the inner scaffold interactome. Since the maps were generated from the whole cell extract, they will provide a resource for future studies aimed at studying functions and mechanisms of POC5 and Centrin-2 across their different cellular pools including the centrosome.

      We would like to also highlight that the proximity maps of POC5 and Centrin-2 are not the major advances of our manuscript. The major advance of our manuscript is the identification of CCDC15 as a new inner scaffold protein that is required for regulation of centriole size and architectural integrity and thereby, for maintaining the ability of centrioles to template the assembly of functional cilia. Importantly, our results identified CCDC15 as the first dual regulator of centriolar recruitment of inner scaffold protein POC1B and the distal end SFI1/Centrin complex and provided important insight into how inner scaffold proteins work together during centriole integrity and size regulation. The new set of experiments we will perform for the revisions of the paper will strengthen these conclusions.

      __2) I am curious about the choices of the cell lines in this work. The proximity mapping to reveal CCDC15 as a candidate protein for centriole inner scaffold was performed in HEK293T cells (human embryonic kidney), however its immunostaining was performed using RPE1 and U2OS cells (human retinal and osteosarcoma epithelial cells respectively). This raises questions regarding the generality of CCDC15 as a centriole inner scaffold protein. Could CCDC15 be simply unique to the centriole inner scaffold of epithelial cells such as RPE1 and U2OS cells? Or could the authors demonstrate any information/data on whether it's similarly localized to the inner scaffold in embryonic kidney cells or other cell types? If not, the claims should be moderated to reflect this fine detail. __

      To test whether CCDC15 localizes to the inner scaffold in other cell types, we performed U-ExM analysis of CCDC15 localization relative to the centriolar microtubules in differentiating multiciliated epithelial cultures (MTEC). As shown in Fig. S3A, CCDC15 localized to the inner scaffold in the centrioles in MTEC ALI+4 cells. Given that the inner scaffold proteins including CCDC15 and previously characterized ones have not been studied in multiciliated epithelia, this result is important and provides support for potential role of the inner scaffold in ensuring centriole integrity during ciliary beating. Additionally, we examined CCDC15 localization by 3D-SIM in centrosomes purified from HEK293T cells, which showed that CCDC15 localizes between the distal centriole markers CEP164 and Centrin-3 and proximal centriole markers gamma-tubulin and rootletin (Fig. S3B).

      3) Discussions and data on the localization of CCDC15 to centriolar satellites appear anecdotal and not fully convincing (Fig. S2D). Given that the authors test the relevance of PCM1 for CCDC15's centriolar localization, it is key to have quantitative data supporting their claim that centriolar satellites can help recruit CCDC15 to the centriole. Could the authors quantify what proportion of CCDC15 localize to the centriolar satellites? One way to do this could be to quantify the colocalization coefficience of CCDC15 and PCM1 signals.

      We only observed co-localization of CCDC15 with the centriolar satellite marker PCM1 in cells transiently transfected with mNG-CCDC15. In Fig. S2E, we included the quantification of the percentage of U2OS and RPE1 cells that exhibit co-localization of PCM1 (100% of U2OS cells, about 80% of RPE1 cells). Like CCDC15, ectopic expression of WDR90 revealed its centriolar satellite localization, suggesting a potential link between centriolar satellites and inner scaffold proteins that can be investigated in future studies (Steib et al., 2020). We now included these results in the discussion section as follows:

      As assessed by co-localization with the centriolar satellite marker PCM1, mNG-CCDC15 localized to centriolar satellites in all U2OS cells and in about 80% of RPE1 cells (Fig. S2C-E). Association of CCDC15 with centriolar satellites is further supported by its identification in the centriolar satellite proteomes(Gheiratmand et al., 2019; Quarantotti et al., 2019).”

      Even though endogenous staining for CCDC15 did not reveal its localization to centriolar satellites, following lines of data support the presence of a dynamic and low abundance pool of CCDC15 at the centriolar satellites: 1) CCDC15 was identified in the centriolar satellite proteome and interactome (Gheiratmand et al., 2019; Quarantotti et al., 2019). 2) CCDC15 centrosomal targeting is in part regulated by PCM1 (Fig. S2F, S2G). For majority of the proteins identified in the centriolar satellite proteome, their satellite pool can only be observed upon ectopic expression. This might be because their centriolar satellite pool is of low abundance and transient as satellite interactions are extensively identified only in proximity mapping studies, but not in traditional pulldowns

      __4) Similar to above (#3), there is no quantitative information on the co-localization or partial co-localization of the signal foci in Fig. 3A and 3B. The authors readily study CCDC15's localization in wonderful detail in their expansion microscopy data, so they could actually consider taking out Fig. 3A and 3B, as the data seem redundant without any quantification. __

      To address the reviewer’s concern, we included plot intensity profile analysis of CCDC15 and different centriole markers along a line drawn at the centrioles in Fig. 3A and 3B, which shows the extent of their overlap. As part of our revision plan, we will replace the confocal imaging data in Fig. 3A and 3B with 3D-SIM imaging data of CCDC15 relative to different centriole markers together with plot profile analysis. We already included 3D-SIM imaging of centrosomes purified form HEK293T cells in Fig. S3B. 3D-SIM imaging data will complement the localization data revealed by U-ExM.

      __5) Do the authors also feel that CCDC15 localize to the core lumen in a somehow helical manner (Fig. 1A, Fig. 1F top and bottom panels, Fig. 5A etc.)? Le Guennec et al. 2020's helical lattice proposal for the inner scaffold further reaffirms that CCDC15 is indeed a likely major component of the inner scaffold. In my view, authors should state this physical similarity explicitly to further support their findings on CCDC15. __

      As the reviewer indicated, cryo–electron tomography and subtomogram averaging of centrioles from four evolutionarily distant species showed that centriolar microtubules are bound together by a helical inner scaffold covering ~70% of the centriole length (Le Guennec et al., 2020). Although U-ExM data do not have enough resolution to show that CCDC15 localizes in a helical manner, we agree with the reviewer that the discussion of this possibility is important and thus we included the following sentence in the results:

      “Longitudinal views suggest potential helical organization of CCDC15 at the inner scaffold, which is consistent with its reported periodic, helical structure (Le Guennec et al., 2020).”

      __6a) The data on the link between the CCDC15 recruitment and the centriole growth (Fig. 4F) or the G2 phase of the cell cycle (Fig. 4H) are not fully convincing without quantitative data. For Fig. 4F, the authors should consider plotting the daughter centriole length vs the daughter CCDC15 intensities against each another, to see whether more elongated daughters truly tend to have more CCDC15. __

      To address the reviewer’s concern, we will plot the daughter centriole length versus CCDC15 intensity at different stages of centriole duplication. In asynchronous cultures that we analyzed with U-ExM, we were not able to find enough cells to perform such quantification. To overcome this limitation, we will perform U-ExM analysis of cells fixed at different points after mitotic shake-off and stained for CCDC15 and tubulin. We will include minimum 10 different representative U-ExM data for different stages of centriole duplication in the revised manuscript along with quantification of length versus signal.

      As detailed in the results section, the goal of these experiments was to determine when CCDC15 is recruited to the procentrioles during centriole duplication, but not to suggest a role for CCDC15 in centriole growth. We clarified this by including the following sentence:

      “To investigate the timing of CCDC15 centriolar recruitment during centriole biogenesis, we examined CCDC15 localization relative to the length of procentrioles that represent cells at different stages of centriole duplication (Fig. 4F).”

      __6b) For Fig. 4H, the argument regarding the cell cycle regulation requires quantification of the bands from several WB repeats, normalized to the expression of GAPDH within each blot (this is particularly relevant, as the bands of CCDC15 do not look dramatically different enough to draw conclusions by eye). __

      We will perform these experiments two more times, quantify cellular abundance of CCDC15 in synchronized populations from three experimental replicates and plot it with proper statistical analysis.

      __7a) The authors find herein that CCDC15 depletion lead to centrioles that are ~10% shorter than the controls. With the depletion of Poc5 and Wdr90 (other proposed components of the inner scaffold), the centrioles end up larger however (Steib et al., 2020). If the role of inner scaffold in promoting centriole elongation is structural, why are these two results the opposite of each other? I realize there is a brief discussion about this at the end of the paper, however, this requires a detailed discussion and speculation on the relevance of these findings. It would be key to clarify whether the inner scaffold as a structure inhibits or promotes centriole growth - or somehow both? If so, how? __

      We agree with the reviewer that comparative analysis of centriole length phenotypes for CCDC15 and other components that regulate centriole length will provide insight into how these components work together at the centriole inner core. To this end, we phenotypically compared CCDC15 loss-of-function phenotypes to that of other components of the inner scaffold (POC5, POC1B, FAM161A) that interact with CCDC15. In agreement with their previously reported functions in U2OS or RPE1 cells, we found that POC5 depletion resulted in a 4% slight but significant increase in centriole length and POC1B depletion resulted in a 15% significant decrease. In contrast, FAM161A depletion did not alter centriole length (siControl: 447.8±59.7 nm, siFAM161A 436.3±64 nm). Together, our analysis of their centriolar localization dependency and regulatory roles during centriole length suggest that CCDC15 and POC1B might form a functional complex as positive regulators of centriole length. In contrast, POC5 functions as a negative regulator and might be part of a different pathway for centriole length regulation. We integrated the following sub-paragraph in the results section in pg. 19 and also included discussion of this data in the discussion section in pg. 23:

      “Moreover, we quantified centriole length in control cells and cells depleted for POC5 or POC1B. While POC5 depletion resulted in longer centrioles, POC1B resulted in shorter centrioles (POC5: siControl: 414.1 nm±38.3, siPOC5: 432.7±44.8 nm, POC1B: siControl: 400.6±36.1 nm, siPOC1B: 341.5±44.39 nm,). FAMA161A depletion did not alter centriole length (siControl: 447.8±59.7 nm, siFAM161A 436.3±64 nm). Together, these results suggest that CCDC15 might cooperate with POC1B and compete with POC5 to establish and maintain proper centriole length.”

      __7b) There might be some intriguing opposing regulatory action of Poc5 and CCDC15 as demonstrated here, where CCDC15 depletion leads to slightly over-recruitment of Poc5, and vice versa. Does this suggest that a tug-of-war going on between different molecules that localize to the inner scaffold? Does this provide some dynamicity to this structure, which might in turn regulate centriole length both positively and negatively? This may be analogous to how opposing forces of dyneins and kinesins provide robust length control for mitotic spindles. I am speculating here, but hopefully these may provide some useful grounds for further discussion in the paper. If the authors deem it interesting experimentally, they can test whether the two molecules indeed regulate centriole length by opposing each other's action, by a double siRNA of CCDC15 and Poc5 to see if this retains the centriole length at its control siRNA size (like how they do a similar test for Poc1's potential co-operativity with CCDC15 in Fig. 6J). __

      We thank the reviewer for proposing excellent ideas on how inner scaffold proteins work together to regulate centriole length. As proposed by the reviewer, different proteins oppose each other analogous to how dynein and kinesin regulate mitotic spindle length. Loss-of-function and localization dependency data support that CCDC15 cooperates with POC1B, which was supported by phenotypic characterization of co-depleted cells (Fig. 6I-K).

      The increase in POC5 levels and coverage at the centrioles upon CCDC15 depletion and vice versa (Fig. 7B, 7G) suggest that CCDC15 and POC5 compete with each other in centriole length regulation. As suggested by the reviewer, we attempted to test this by comparing centriole length in cells co-depleted for CCDC15 and POC5 relative to their individual depletions. Although we tried different depletion workflows, we were not able to co-deplete CCDC15 and POC5. Specifically, we tried transfecting cells with CCDC15 and POC5 siRNAs at the same time or sequentially for 48 h or 96 h. The centrioles in cells that survived co-depletion were positive for both CCDC15 and POC5. This might be because co-depletion of both proteins is toxic to cells. Since CCDC15 and POC5 are likely part of two different pathway in regulation of centrioles and also have other cellular functions, this might have caused cell death. We included the following statement in the discussion to address the excellent model proposed by the reviewer:

      “Taken together, our results suggest that CCDC15 cooperates with POC1B and competes with POC5 during centriole length regulation. Moreover, they also raise the exciting possibility that centriole length can be regulated by opposing activities of inner scaffold proteins. Future studies that explore the relationship among centriole core proteins are required to uncover the precise mechanisms by which they regulate centriole integrity and size.”

      __8) In their introduction section, the authors discuss how relatively little is known about the size control of centrioles, however they fail to mention a series of recent primary literature that uncover striking, new mechanisms and novel molecular players that highlight the complexity of centriole size control. This complexity appears to arise from the existence of multitude of length control mechanisms that influence the cartwheel or the microtubule length individually, or simultaneously via yet-to-be further explored crosstalk mechanisms. a. As such, when the authors talk about the procentriole size control in the introduction, they should discuss and refer to the following studies, in terms of: • How theoretical and experimental work demonstrate that procentriole length may vary dependent on the levels of its building block Sas-6 in animals (Dias Louro et al., 2021 PMID: 33970906; Grzonka and Bazzi, 2022 bioRxiv). • How a homeostatic Polo-like kinase 4 clock regulates centriole size during the cell cycle (Aydogan et al., 2018 JCB PMID: 29500190), and how biochemistry and genetics coupled with mathematical modelling unravel a conserved negative feedback loop between Cep152 and Plk4 that constitutes the oscillations of this clock in flies (Boese et al., 2018 PMID: 30256714; Aydogan et al., 2020 PMID: 32531200) and human cells (Takao et al., 2019 PMID: 31533936). __

      __b. Similarly, when the authors refer to centriole size control induced by microtubule-related proteins, they should highlight the further complexity of this process by referring to: • How a molecule located at the microtubule wall, Cep295/Ana1, can regulate centriole length in flies (Saurya et al., 2016 PMID:27206860) and human cells (Chang et al., 2016 PMID:27185865) - like all the other centriolar MT molecules that the authors discuss in the manuscript. • How a crosstalk between Cep97 and Cep152 influences centriole growth in fly spermatids (Galletta et al., 2016 PMID:27185836). • How a crosstalk between CP110-Cep97 and Plk4 influences centriole growth in flies (Aydogan et al., 2022 PMID:35707992), and this molecular crosstalk is conserved, at least biochemically, in human cells (Lee et al., 2017 PMID:28562169). __

      We thank the reviewer for highlighting the papers that uncovered new mechanisms and players of centriole size and integrity control as well as for the detailed explanation of how different studies led to these discoveries in different organisms. We should have discussed these proteins, functional complexes and mechanisms in our manuscript and cited the relevant literature. We inadvertently focused on literature that uncovered centriole length regulation by MAPs and the inner scaffold. In the introduction section of the revised manuscript where we introduced centriole size regulation in pg. 5, we summarized the major findings on the role of different MAPs, cartwheel and PLK4 homeostatic clock in ensuring formation of centrioles at the correct size in different organisms.

      __Minor points: __

      __1) Introduction section: Literature reference missing for the sentence starting with "Importantly, the stable nature of centrioles enables them to withstand...". __

      We cited research articles that show the importance of centriole motility during ciliary motility and cell division.

      “Importantly, the stable nature of centrioles enables them to withstand mechanical forces during cell division and upon ciliary and flagellar motility (Abal et al., 2005; Bayless et al., 2012; Meehl et al., 2016; Pearson et al., 2009).

      __2) Fig. S1 legend: A typo as follows: CRAPome banalysis should read CRAPome analysis. __

      We corrected this typo.

      __3) Fig. S2: Info on the scale bar in the legend is missing in Fig. S2A. Scale bars for different panels are missing in general in Fig. S2A. __

      We added scale bar information for Fig. S2A and to all other supplementary figure legends that lack scale bar information.

      __4) Fig. 3A and 3B: When displaying the data, coloured cartoon diagrams would be beneficial to guide the reader who are not fully familiar with the spatial orientation of these proteins. __

      As suggested by the reviewer, we will remove the confocal imaging data for CCDC15 localization from Fig. 3A and 3B. For the revised version, we will include 3D-SIM imaging data along with a diagram that represents the spatial orientation of CCDC15 relative to the chosen centriole markers.

      __5) Fig. 3H: No information about the sample number (number of cells or technical repeats examined) reported. __

      We included information on the number of experimental replicates and cells analyzed.

      __6) Fig. S3B legend: A typo as follows: CCD15-depelted RPE1 cells should read CCDC15-depleted RPE1 cells. __

      We corrected this typo.

      __7) Fig. S3B legend: A typo as follows: cellswere fixed with should read cells were fixed with. __

      We corrected this typo.

      __8) There are many spelling mistakes and typos throughout the paper. I have listed a few examples above, but please carefully read through the manuscript to correct all the errors. __

      Thank you for indicating the spelling mistakes we missed to correct for initial submission. In the revised manuscript, we carefully read through the manuscript to correct the mistakes.

      __9) Fig. S3E: The orange columns depicting % of cells with Sas-6 dots look awkward. Why the columns look larger than the mean line? Please correct as appropriate. __

      The total percentage of cells in the two categories (orange and purple) we counted is 100%, which corresponds to the column value at the y-axis. Therefore, the value for each experimental replicate for the orange category is less than 100% and is marked below the 100% line.

      __10) Although authors provide microscopy information for the U-ExM and FRAP experiments, there is no information about the microscopy on regular confocal imaging experiments which should be detailed in Materials and Methods. Also, there is no information about the lenses, laser lines and the filter sets that were used in the imaging experiments. These should be provided as well. __

      In the methods section, we now included detailed information for the microscopes we used and imaging setup (lenses, laser lines, filter sets, detectors, z-stack size, resolution).

      11)

      • __ Fig. 2A: lacks a scale bar. __
      • __ Fig. 2C legend: lacks info on the scale bar length. __
      • __ Fig. 5A legend: lacks info on the scale bar length. __
      • __ Fig. 7A: lacks a scale bar. __
      • __ Fig. 7G legend: lacks info on the scale bar length. __
      • __ Fig. S2C-E: lack scale bars. __
      • __ Fig. S3D, F and H: lack scale bars. (Fig. S4 in the revised manuscript)__
      • __ Fig. S3J legend: lacks info on the scale bar length. (Fig. S4 in the revised manuscript)__
      • __ Fig. S4A, B, D and E: lack scale bars. (Fig. S5 in the revised manuscript)__
      • __ Fig. S4C legend: lacks info on the scale bar length. (Fig. S5 in the revised manuscript)__
      • __ Fig. S4G legend: lacks info on the scale bar length. (Fig. S5 in the revised manuscript)__ We added the scale bars and the size information to the figures and figure legends for the above figures.

      Reviewer #2 (Significance (Required)): __The findings of this study join among the relatively new literature (e.g., Steib et al., 2020 and Le Guennec et al. 2020) on the nature of centriole inner scaffold and its potential roles in centriole formation, integrity and its propensity to form the primary cilium. Therefore, it will be of interest to a group of scientists studying these topics in the field of centrosomes/cilia.

      My expertise is on the biochemistry and genetics of centriole formation in animals.__

      We thank the reviewer for his/her comments and constructive feedback to improve our manuscript. We are encouraged to see that the reviewer acknowledges how the results from our manuscript advances our understanding of centriole length, integrity and function regulation.

      References

      Abal, M., G. Keryer, and M. Bornens. 2005. Centrioles resist forces applied on centrosomes during G2/M transition. Biol Cell. 97:425-434.

      Atorino, E.S., S. Hata, C. Funaya, A. Neuner, and E. Schiebel. 2020. CEP44 ensures the formation of bona fide centriole wall, a requirement for the centriole-to-centrosome conversion. Nat Commun. 11:903.

      Azimzadeh, J., P. Hergert, A. Delouvee, U. Euteneuer, E. Formstecher, A. Khodjakov, and M. Bornens. 2009. hPOC5 is a centrin-binding protein required for assembly of full-length centrioles. J Cell Biol. 185:101-114.

      Bayless, B.A., T.H. Giddings, Jr., M. Winey, and C.G. Pearson. 2012. Bld10/Cep135 stabilizes basal bodies to resist cilia-generated forces. Mol Biol Cell. 23:4820-4832.

      Chen, F., P.W. Tillberg, and E.S. Boyden. 2015. Optical imaging. Expansion microscopy. Science. 347:543-548.

      Conkar, D., H. Bayraktar, and E.N. Firat-Karalar. 2019. Centrosomal and ciliary targeting of CCDC66 requires cooperative action of centriolar satellites, microtubules and molecular motors. Sci Rep. 9:14250.

      Dantas, T.J., O.M. Daly, P.C. Conroy, M. Tomas, Y. Wang, P. Lalor, P. Dockery, E. Ferrando-May, and C.G. Morrison. 2013. Calcium-binding capacity of centrin2 is required for linear POC5 assembly but not for nucleotide excision repair. PLoS One. 8:e68487.

      Firat-Karalar, E.N., N. Rauniyar, J.R. Yates, 3rd, and T. Stearns. 2014. Proximity interactions among centrosome components identify regulators of centriole duplication. Curr Biol. 24:664-670.

      Gambarotto, D., V. Hamel, and P. Guichard. 2021. Ultrastructure expansion microscopy (U-ExM). Methods Cell Biol. 161:57-81.

      Gambarotto, D., F.U. Zwettler, M. Le Guennec, M. Schmidt-Cernohorska, D. Fortun, S. Borgers, J. Heine, J.G. Schloetel, M. Reuss, M. Unser, E.S. Boyden, M. Sauer, V. Hamel, and P. Guichard. 2019. Imaging cellular ultrastructures using expansion microscopy (U-ExM). Nat Methods. 16:71-74.

      Gheiratmand, L., E. Coyaud, G.D. Gupta, E.M. Laurent, M. Hasegan, S.L. Prosser, J. Goncalves, B. Raught, and L. Pelletier. 2019. Spatial and proteomic profiling reveals centrosome-independent features of centriolar satellites. EMBO J.

      Gupta, G.D., E. Coyaud, J. Goncalves, B.A. Mojarad, Y. Liu, Q. Wu, L. Gheiratmand, D. Comartin, J.M. Tkach, S.W. Cheung, M. Bashkurov, M. Hasegan, J.D. Knight, Z.Y. Lin, M. Schueler, F. Hildebrandt, J. Moffat, A.C. Gingras, B. Raught, and L. Pelletier. 2015. A Dynamic Protein Interaction Landscape of the Human Centrosome-Cilium Interface. Cell. 163:1484-1499.

      Hamel, V., E. Steib, R. Hamelin, F. Armand, S. Borgers, I. Fluckiger, C. Busso, N. Olieric, C.O.S. Sorzano, M.O. Steinmetz, P. Guichard, and P. Gonczy. 2017. Identification of Chlamydomonas Central Core Centriolar Proteins Reveals a Role for Human WDR90 in Ciliogenesis. Curr Biol. 27:2486-2498 e2486.

      Heydeck, W., B.A. Bayless, A.J. Stemm-Wolf, E.T. O'Toole, A.S. Fabritius, C. Ozzello, M. Nguyen, and M. Winey. 2020. Tetrahymena Poc5 is a transient basal body component that is important for basal body maturation. J Cell Sci. 133.

      Khouj, E.M., S.L. Prosser, H. Tada, W.M. Chong, J.C. Liao, K. Sugasawa, and C.G. Morrison. 2019. Differential requirements for the EF-hand domains of human centrin 2 in primary ciliogenesis and nucleotide excision repair. J Cell Sci. 132.

      Kong, D., and J. Loncarek. 2021. Analyzing Centrioles and Cilia by Expansion Microscopy. Methods Mol Biol. 2329:249-263.

      Laporte, M.H., I.B. Bouhlel, E. Bertiaux, C.G. Morrison, A. Giroud, S. Borgers, J. Azimzadeh, M. Bornens, P. Guichard, A. Paoletti, and V. Hamel. 2022. Human SFI1 and Centrin form a complex critical for centriole architecture and ciliogenesis. EMBO J. 41:e112107.

      Le Guennec, M., N. Klena, D. Gambarotto, M.H. Laporte, A.M. Tassin, H. van den Hoek, P.S. Erdmann, M. Schaffer, L. Kovacik, S. Borgers, K.N. Goldie, H. Stahlberg, M. Bornens, J. Azimzadeh, B.D. Engel, V. Hamel, and P. Guichard. 2020. A helical inner scaffold provides a structural basis for centriole cohesion. Sci Adv. 6:eaaz4137.

      Mahen, R. 2022. cNap1 bridges centriole contact sites to maintain centrosome cohesion. PLoS Biol. 20:e3001854.

      Meehl, J.B., B.A. Bayless, T.H. Giddings, Jr., C.G. Pearson, and M. Winey. 2016. Tetrahymena Poc1 ensures proper intertriplet microtubule linkages to maintain basal body integrity. Mol Biol Cell. 27:2394-2403.

      Mercey, O., C. Kostic, E. Bertiaux, A. Giroud, Y. Sadian, D.C.A. Gaboriau, C.G. Morrison, N. Chang, Y. Arsenijevic, P. Guichard, and V. Hamel. 2022. The connecting cilium inner scaffold provides a structural foundation that protects against retinal degeneration. PLoS Biol. 20:e3001649.

      Moudjou, M., N. Bordes, M. Paintrand, and M. Bornens. 1996. gamma-Tubulin in mammalian cells: the centrosomal and the cytosolic forms. J Cell Sci. 109 ( Pt 4):875-887.

      Odabasi, E., D. Conkar, J. Deretic, U. Batman, K.M. Frikstad, S. Patzke, and E.N. Firat-Karalar. 2023. CCDC66 regulates primary cilium length and signaling via interactions with transition zone and axonemal proteins. J Cell Sci. 136.

      Paoletti, A., M. Moudjou, M. Paintrand, J.L. Salisbury, and M. Bornens. 1996. Most of centrin in animal cells is not centrosome-associated and centrosomal centrin is confined to the distal lumen of centrioles. J Cell Sci. 109 ( Pt 13):3089-3102.

      Pearson, C.G., D.P. Osborn, T.H. Giddings, Jr., P.L. Beales, and M. Winey. 2009. Basal body stability and ciliogenesis requires the conserved component Poc1. J Cell Biol. 187:905-920.

      Quarantotti, V., J.X. Chen, J. Tischer, C. Gonzalez Tejedo, E.K. Papachristou, C.S. D'Santos, J.V. Kilmartin, M.L. Miller, and F. Gergely. 2019. Centriolar satellites are acentriolar assemblies of centrosomal proteins. EMBO J.

      Resendes, K.K., B.A. Rasala, and D.J. Forbes. 2008. Centrin 2 localizes to the vertebrate nuclear pore and plays a role in mRNA and protein export. Mol Cell Biol. 28:1755-1769.

      Sahabandu, N., D. Kong, V. Magidson, R. Nanjundappa, C. Sullenberger, M.R. Mahjoub, and J. Loncarek. 2019. Expansion microscopy for the analysis of centrioles and cilia. J Microsc. 276:145-159.

      Salisbury, J.L., K.M. Suino, R. Busby, and M. Springett. 2002. Centrin-2 is required for centriole duplication in mammalian cells. Curr Biol. 12:1287-1292.

      Schweizer, N., L. Haren, I. Dutto, R. Viais, C. Lacasa, A. Merdes, and J. Luders. 2021. Sub-centrosomal mapping identifies augmin-gammaTuRC as part of a centriole-stabilizing scaffold. Nat Commun. 12:6042.

      Steib, E., M.H. Laporte, D. Gambarotto, N. O’lieric, C. Zheng, S. Borgers, V. Olieric, M.L. Guennec, F. Koll, A.M. Tassin, M.O. Steinnmetz, P. Guichard, and V. Hamel. 2020. WDR90 is a centriolar microtubule wall protein important for centriole architecture integrity. eLife.

      Steib, E., R. Tetley, R.F. Laine, D.P. Norris, Y. Mao, and J. Vermot. 2022. TissUExM enables quantitative ultrastructural analysis in whole vertebrate embryos by expansion microscopy. Cell Rep Methods. 2:100311.

      Sydor, A.M., E. Coyaud, C. Rovelli, E. Laurent, H. Liu, B. Raught, and V. Mennella. 2018. PPP1R35 is a novel centrosomal protein that regulates centriole length in concert with the microcephaly protein RTTN. Elife. 7.

      Tiryaki, F., J. Deretic, and E.N. Firat-Karalar. 2022. ENKD1 is a centrosomal and ciliary microtubule-associated protein important for primary cilium content regulation. FEBS J. 289:3789-3812.

      Tsekitsidou, E., C.J. Wong, I. Ulengin-Talkish, A.I.M. Barth, T. Stearns, A.C. Gingras, J.T. Wang, and M.S. Cyert. 2023. Calcineurin associates with centrosomes and regulates cilia length maintenance. J Cell Sci. 136.

      Van de Mark, D., D. Kong, J. Loncarek, and T. Stearns. 2015. MDM1 is a microtubule-binding protein that negatively regulates centriole duplication. Mol Biol Cell. 26:3788-3802.

      Yang, C.H., C. Kasbek, S. Majumder, A.M. Yusof, and H.A. Fisk. 2010. Mps1 phosphorylation sites regulate the function of centrin 2 in centriole assembly. Mol Biol Cell. 21:4361-4372.

      Ying, G., J.M. Frederick, and W. Baehr. 2019. Deletion of both centrin 2 (CETN2) and CETN3 destabilizes the distal connecting cilium of mouse photoreceptors. J Biol Chem. 294:3957-3973.

    1. Some recommendation algorithms can be simple such as reverse chronological order, meaning it shows users the latest posts (like how blogs work, or Twitter’s “See latest tweets” option). They can also be very complicated taking into account many factors, such as: Time since posting (e.g., show newer posts, or remind me of posts that were made 5 years ago today) Whether the post was made or liked by my friends or people I’m following How much this post has been liked, interacted with, or hovered over Which other posts I’ve been liking, interacting with, or hovering over What people connected to me or similar to me have been liking, interacting with, or hovering over What people near you have been liking, interacting with, or hovering over (they can find your approximate location, like your city, from your internet IP address, and they may know even more precisely) This perhaps explains why sometimes when you talk about something out loud it gets recommended to you (because someone around you then searched for it). Or maybe they are actually recording what you are saying and recommending based on that. Phone numbers or email addresses (sometimes collected deceptively) can be used to suggest friends or contacts. And probably many more factors as well!

      I think recommendation algorithms are very interesting and complex because different social media platforms use different algorithms to showcase content for users. For example, when I use Instagram my recommended posts to view can be so different from my friends because we have different interests and interactions with the app. However, I do think these algorithms are become so accurate and complex it can be really creepy.

    1. Author Response

      Reviewer #1 (Public Review):

      This is an interesting manuscript that proposes a new approach to for accounting for viral diversity within hosts in phylogenetic analyses of pathogens. Concretely, the authors consider sites for which a minor allele exist as an additional base in the substitution model. For example, if at a particular site 60% of reads have an C and 40% have a G, then this site is assigned Cg, as opposed to an C which is typical of analysing consensus sequences. Because we typically model sequence evolution as a Markovian process, as is the case here, the data become naturally more informative, given that there are more states in the Markov chain when adding these bases. As a result, phylogenetic trees estimated using these data are better resolved than those from consensus sequences. The branches of the trees are probably also longer, which is why temporal signal becomes more apparent.

      I commend the authors on their rigorous simulation study and careful empirical data analyses. However, I strongly suggest they consider whether treating minor alleles as an additional base is biologically realistic and whether this may have implication for other analyses, particularly when there is very high within-host diversity and the number of states in becomes very large.

      We thank the reviewer for the helpful and thorough review. We have included a paragraph in the Discussion regarding the biological interpretation of the 16-state model (Line 344-351), as well as the consequences when there’s high within-host diversity (Line 398).

      Reviewer #2 (Public Review):

      I agree that minor genetic variation could potentially be used to more accurately infer who-infected- whom in an outbreak scenario. Indeed, the use of minor genetic variation has proven very useful in reconstructing transmission chains for chronic infections such as HIV (e.g., see applications using Phyloscanner). To me, it seems that considering the full spectrum of viral genetic diversity within infected hosts would necessarily do the same if not better than considering only consensus-level viral sequence data. This is because there is a necessarily a loss of data and potentially a loss of information when going from considering the genetic composition of viral populations within a host to only considering the consensus sequences of those viral populations. As such, Ortiz et al.'s hypothesis stated on lines 66-70 is a reasonable one, and I was looking forward to seeing this hypothesis evaluated in detail in this manuscript.

      R2.1 There are several parts of this manuscript I really like. In particular, encoding within-sample diversity as character states and using that alternative representation of sequence data for phylogenetic inference (as shown in Figure 3) is a very interesting idea, I think. There are some limitations that are not explicitly mentioned, however. For example, when using this 16-character state representation for phylogenetic inference, they assume independence between nucleotide sites. This is a major assumption that can be violated when considering longitudinal intrahost data and transmission dynamics in an outbreak setting, given genetic linkage between sites.

      We have generated another set of simulations where the starting tree was a coalescent tree rather than a random phylogeny. This is described in the Results section, Line 228, and Figure 4—figure supplement 2. By using a coalescent tree, we increase the genetic linkage between sites. For all metrics used, the 16-state model performed better than the consensus sequence model. It is also important to note, as the reviewer points out, that longitudinal isolates should be removed from transmission inference, as we do in Figure 7 and Figure 7—figure supplement 2.. This point is now reflected in the Results (Line 286) and Methods (Line 534).

      I have several major concerns about the work as it stands, particularly in the context of the SARS-CoV-2 application.

      Concerns not related to the SARS-CoV-2 application:

      R2.2 Figure 4 shows that a model using within-sample diversity can more accurately reconstruct evolutionary histories than a model that uses only consensus-level genetic data. This is really interesting. The Materials and Methods section (particularly lines 351-354) indicates that the sequence data were generated using certain specified substitution rates. The rates specified seem to be chosen in such a way to facilitate finding an improvement when using within-sample diversity. I don't know whether the relative rates of these 'substitutions' at all mirror "real-life". It would be very useful to have a broader set of analyses here to examine the effect of these 'substitution' rates on the utility of incorporating within-sample diversity into phylogenetic inference. (Also, 1, 100, 200 (line 353) inconsistent with 1, 20, 200 in Supp Table 3)

      We have now corrected Supp Table 3 to reflect the rates described in the Methods section.

      We defined our model with three rates: rate of minor variant acquisition, rate of minor-major variant switch, and rate of minor variant loss. We chose the rates for the simulations (1, 100, 200) to reflect a low rate of minor variant acquisition (1) and high rates of minor-major variant switch (200) and minor variant loss (100). These rates will result in pure bases (A,C,G and T) 100 times more likely to be present than low frequency variants, as seen in the base frequencies in Supp Table 1 and 3, which would in turn minimize the effect of including minor variations. We chose these rates to reflect the high turnover of minor variation often observed in real data and the frequencies of minor alleles in the SARS-CoV-2 dataset, but we agree with the reviewer that this may not always be the case. We also agree with the reviewer that changing the parameters in the simulations also affects the effect of including low frequency variation in the model. As such, we have now included simulations using different sets of rates (Figure 4—figure supplement ):

      1) With a high rate of variant switch and loss compared to acquisition (1, 10, 100), reducing the frequency of minor variation.

      2) With a lower rate of switch and loss (1, 10, 10), promoting a stable landscape of low frequency variation.

      3) With no low frequency variation (Jukes-cantor model)

      R2.3 Figure 5 is very interesting, particularly the results at bottleneck sizes of 1-10. What are the 'substitution' rates that are inferred here from using this simulated dataset? The Material and Methods section also does not mention the within-host viral generation time anywhere, as far as I can see (~line 384 states the mutation rate per base per generation cycle but not the length of the generation cycle anywhere).

      Fastsimcoal2 is a coalescent simulator of population histories over several generations, given a population size and a mutation rate. For our purposes, transmissions are simulated as bottlenecks of constant size, and a generation is represented by each time step in the outbreak simulation, which corresponds to 1 day. This is further clarified in the Methods section (Line 475).

      Concerns related to the SARS-CoV-2 application:

      R2.4 I am very concerned about the testing of this hypothesis on the SARS-CoV-2 data presented. First, 1% is a very low variant calling threshold. Second, analysis of the 17 samples that were resequenced (out of 454) indicated that on average, 39% of iSNVS (intrahost single nucleotide variants) called between duplicate runs were only observed in one of the two runs (line 117). Their analysis in Figure 1 indicates that these discrepant (and seemingly spurious) variants occur at higher levels in high Ct samples (which makes sense; Figure 1b). They therefore decide to limit their analyses to samples with Ct values <= 30. This results in 249 samples. However, if we look at Figure 1b, only ~10% of iSNVs called across duplicate runs with Ct = 30 are shared! That means that 90% of iSNVs in the set appear to be spurious. If we assume that each duplicate run of a sample has approximately the same number of spurious iSNVs, then approximately 82% of iSNVs called in a sample with a Ct of 30 would be spurious. This fraction decreases with samples that have lower Ct values, but even at a Ct of 27, only ~60% of iSNVs called across duplicate runs are shared. All the downstream SARS-CoV-2 analyses based on within-host sample diversity therefore are based on samples where the large majority of considered sample diversity is not real. This leads to me necessarily discounting all of those downstream SARS-CoV-2 results.

      We agree with the reviewer that, as the results show, datasets that incorporate within-sample low frequency variation are expected to have considerably more noise than using exclusively consensus sequences, and perhaps this wasn’t properly discussed in the manuscript. We have incorporated some notes about this in the Discussion section (Line 408-413).

      The 1% variant frequency threshold was used to generate the analysis of Fig. 1 and Supp. Fig. 1-4. Looking at these results, we decided to establish the Ct cutt-off of 30 as mentioned by the reviewer, as well as a variant frequency threshold of 2% (as shown in the x-axis of Fig. 2). We overlooked this second variant frequency threshold in the manuscript, which has been added. As shown in Supp. Fig 4, this variant frequency threshold will increase the concordance between technical replicates, although some level of noise persists.

      R2.5 Lines 153-167: I can't figure out how to square the quantitative results given in this paragraph with what is shown in Figure 2. To me, Figure 2 shows only that Technical Replicates have higher probabilities of sharing a variant than with 'No' relationship. What would also be helpful here so that the reader can get a better feel for the data would be to see the iSNV frequencies plotted over time for the longitudinal replicate samples in the supplement and, for the 'epidemiological' samples to show 'TV plots' in the supplement (as in Fig 3c in McCrone et al. eLife)

      Figure 2 shows that technical replicates, longitudinal replicates, epidemiological samples and, in some instances, from the same department have a higher probability of sharing low frequency variants than those with no relationship (also shown in Supp Figure 5). However, also shown in Figure 2 is that the 95% CI is very wide, and therefore in many instances low frequency variants won’t be shared between epidemiological samples or samples from the same department.

      We have also added Figure 2—figure supplement showing the low frequency variants plotted over time for longitudinal replicates. Unlike McCrone et al, we don’t have proven transmission between pairs of samples, although we believe our analysis also shows a pattern of shared low frequency variants among potential epidemiological links.

      R2.6 Figure 6 and associated text: (a) root-to-tip distance: what units is this distance in? (b) That the authors find a temporal signal in these transmission clusters (where all consensus sequences within a cluster are the same) is interesting but also a bit baffling to me. Given the inference of very small transmission bottlenecks in previous studies (e.g., Martin & Koelle - reanalysis of Popa et al.; Lythgoe et al.; Braun et al.), I don't understand where the temporal signal comes in. Do the samples become more genetically diverse over the outbreak (this seems to be indicated in lines 260-262 but never shown and unlikely given bottleneck sizes)? Additional analyses to help the reader understand WHY within-sample diversity allows for the identification of temporal signal is important. This could involve plotting genetic diversity of the samples by collection date or some other, similar analyses.

      a) The units of the y-axis (root-to-tip distance) are measured in substitutions per genome. This is now reflected in the legend of the figure.

      b) As shown in Figure 5, even at small bottleneck sizes we are able to pick some of the diversity that evolves during the course of an outbreak. As hinted by the reviewer, the smaller the bottleneck the less diversity we can leverage for phylogenetic inference, and in fact for some epidemiological samples all the diversity will be lost during transmission, which is why many of the within-sample variants are not shared between the epidemiologically related samples. Figure 6 is indeed showing that the genetic distance (measured as number of substitutions per genome) increases per collection date. We have also added a Figure 6—figure supplement showing the increase in low frequency variants within outbreaks as the outbreaks progress in time (explained in Line 261 of the Results section), which explain in part the increasing temporal signal in clusters.

      R2.7 Paragraph consisting of lines 229-238 and Figure 7: This analysis stops abruptly. What are the conclusions here? Figure 7a (right) seems inconsistent to me with Figure 7b and 7C results. Also, the main hypothesis put forward in this paper is that within-sample sequence data can better resolve who-infected-whom in an outbreak setting. Figure 7b and 7c however are never compared against analogous panels that use just consensus sequences. (Even though the consensus sequences are the same, according to Figure 7a, the inferences shown in Figures 7b and 7c could use additional data such as collection times, etc. that would provide information even when using exclusively consensus-level data). Also, do the analyses in Figures 7b and 7c use the 16-character state model at all? I think Supp Figure 9 is relevant here but not sure how?)

      We have extended this section of the results to make it more coherent and clear (Line 284-293) and in the Discussion (Line 385-395). As added into the Discussion, we agree with the reviewer that even with equal sequences some inferences about transmission can be made with epidemiological data, specially collection dates. However, such data can’t be used to infer the genetic structure of the cluster, which complicates any analysis that can use a phylogenetic as input.

      Additional concerns:

      R2.8 Some of the stated conclusions, particularly in the Discussion section and in the Abstract, do not seem to be supported by the presented results. For example, line 27: 'within-sample diversity is stable among repeated serial samples from the same host': Figure 2 does not show this conclusively. Line 28: 'within-sample diversity... is transmitted between those cases with known epidemiological links': Figure 2 also does not show this conclusively. Line 29: 'within-sample diversity... improves phylogenetic inference and our understanding of who infected whom': Figure 7b/c results using within-sample diversity is never compared against results that use only consensus, so improvement not demonstrated. Line 272-273: 'samples with shorter distance in the consensus phylogeny were more likely to share low frequency variants'. Line 287: 'We demonstrated that phylogenies... were heavily biased'.

      Line 27 and Line 28: We agree with the reviewer that the genomic analysis of SARS-CoV-2 sequences show only partial congruence within technical replicates and epidemiological links. We have appropriately addressed this in the Abstract.

      Line 29 and Fig 7: Transmission inference using the consensus sequence in Figure 7b/c couldn’t be performed because the lack of any genetic difference between the consensus sequence meant that all sequences had the same transmission likelihood. This is now better explained in the Discussion section, lines 385-395.

      Line 272-273: We have removed this section as we did not perform this analysis, as pointed out by the reviewer.

      Line 287: The conclusion expressed in line 287 (now line 340) has been changed.

      R2.9 The manuscript at times does not cite previous work that is highly relevant and thus overstates the novelty of the current work. For example: lines 21-23: '..conventional whole-genome sequencing phylogenetic approaches to reconstruct outbreaks exclusively use consensus sequences...' Phyloscanner uses within-sample diversity, for example, as does SCOTTI. These are finally cited in the discussion section (~line 310), but because this previous work is not acknowledged earlier in the manuscript, the novelty of the work presented here is somewhat overstated.

      We have included background information in the introduction regarding the use of within-sample diversity for transmission inference (Line 69-73), as well as emphasizing that the novelty of our work lies more in the use of within-sample diversity in phylogenetic inference rather than exclusively transmission inference (Line 74, and other instance along the manuscript).

      In sum, I think that the 16 character-state model is a very interesting model. More analyses on simulated data would be helpful to expand on when below-the-consensus level genetic data would truly be informative of phylogenetic relationships and who-infected-whom in outbreak settings. The SARS-CoV-2 analyses are very worrisome to me, given the inclusion of samples where the majority of considered within-sample genetic diversity is very likely not real. Some of the stated conclusions appear to either be at odds with the results presented or not directly evaluated.

    1. Author Response

      Reviewer #1 (Public Review):

      In this interesting manuscript, Nasser et al explore long-term patterns of behavior and individuality in C. elegans following early-life nutritional stress. Using a rigorous, highly quantitative, high-throughput approach, they track patterns of motor behavior in many individual nematodes from L1 to young adulthood. Interestingly, they find that early-life food deprivation leads to decreased activity in young larvae and adults, but that activity between these times, during L2-L4, is largely unaffected. Further, they show that this "buffering" of stress requires dopamine signaling, as L2-L4 activity is significantly reduced by early-life starvation in cat-2 mutants. The paper also provides evidence that serotonin signaling has a role in modulating sensitivity to stress in L1 larvae and adults, but the size of these effects is modest. To evaluate patterns of individuality, the authors use principal components analysis to find that three temporal patterns of activity account for much of the variation in the data. While the paper refers to these as "individuality types," it may be more reasonable to think of these as "dimensions of individuality." Further, they provide evidence that stress may alter the strength and/or features of these dimensions. Though the circuit mechanisms underlying individuality and stress-induced changes in behavior remain unknown, this paper lays an important foundation for evaluating these questions. As the authors note, the behaviors studied here represent only a small fraction of the behavioral repertoire of this system. As such, the findings here are an interesting and very promising entry point for a deeper understanding of behavioral individuality, particularly because of the cellular/synaptic-level analysis that is possible in this system. This paper should be of interest to those studying C. elegans behavior and also more generally to those interested in behavioral plasticity and individuality.

      We thank the reviewer for finding our results interesting.

      Reviewer #2 (Public Review):

      This paper set out to understand the impact of early life stress on the behavior and individuality of animals, and how that impact might be amplified or masked by neuromodulation. To do so, the authors built on a previously established assay (Stern et al 2017) to measure the roaming fraction and speed of individuals. This technique allowed the authors to assess the effects of early life starvation on behavior across the entire developmental trajectory of the individual. By combining this with strains with mutant neuromodulatory systems, this enabled the authors to produce a rich dataset ripe for analysis to analyze the complicated interactions between behavior, starvation intensity, developmental time, individuality, and neuromodulatory systems.

      The richness of this dataset - 2 behavioral measures continuous across 5 developmental stages, 3 different neuromodulatory conditions (with the dopamine system subject to decomposition by receptor types) and 4 different levels of starvation, with ~50-500 individuals in each condition-underlies the strength of this paper. This dataset enabled the authors to convincingly demonstrate that starvation triggers a behavioral effect in L1 and adult animals that is largely masked in intermediate stages, and that this effect becomes larger with increased severity of starvation. Furthermore, they convincingly show that the masking of the effect of starvation in L2-L4 animals depends on dopaminergic systems. The richness of the dataset also allowed a careful analysis of individuality, though only neuromodulatory mutants convincingly manipulated individuality, recapitulating earlier research. Nonetheless, a few caveats exist on some of their findings and conclusions:

      We thank the reviewer for the constructive comments. In the revised manuscript we include additional analyses and textual changes as detailed below, to address the points raised.

      1) Lack of quantitative analysis for effects within developmental stages. In making the argument for buffered effects of starvation on behavior during periods of larval development, the authors make claims regarding the temporal structure of behavior within specific stages. However, no formal analysis is performed and and the traces are provided without confidence intervals, making it difficult to judge the significance of potential deviations between starvation conditions.

      In the revised manuscript, we include additional analyses of roaming fraction effects across shorter developmental-windows, showing within-stage differences in behavioral patterns following starvation (Figure 1 - figure supplement 1E; Figure 3 - figure supplement 1C). In addition, we further temper and rewrite our conclusions to clearly describe these effects (now- “…while 1 day of early starvation modified within-stage temporal behavioral structures by shifting roaming activity peaks to later time-windows during the L2 and L3 stages…” in p. 4 and “Interestingly, during the L2 intermediate stage the effects on roaming activity patterns were more pronounced during earlier time-windows of the stage…” in p. 8).

      2) Incorrect inferences from differences in significance demonstrating significant differences. The authors claim that there is an increase in PC1 inter-individual variation in tph-1 individuals, however the difference in significance is not evidence of a significant difference between conditions (see Nieuwenhuis et al. 2011). This undermines claims about an interaction of starvation, neuromodulators, and individuality.

      In the revised manuscript we provide now a direct comparison of PCs inter-individual variances between starved and unstarved populations, demonstrating significant differences in inter-individual variation in specific PC individuality dimensions following stress (Figure 6 and Figure 6 - figure supplement 1). These results include the increase in PC1 inter-individual variation in tph-1 mutants following 3 and 4 days of starvation (Figure 6A,E).

      3) Sensitivity of analysis to baseline effects and assumptions of additive/proportional effects. The neuromodulatory and stress conditions in this paper have a mixture of effects on baseline activity and differences from baseline. The authors normalize to the roaming fraction without starvation, making the reasonable assumption that the effect due to starvation is proportional to baseline, rather than an additive effect. This confound is most visible in the adult subpanel of figure 5d, where an ~2-3 fold difference in relative roaming due to starvation is clearly noted, however, this is from a baseline roaming fraction in tph-1 animals that are ~2 fold higher, suggesting that the effect could plausibly be comparable in absolute terms.

      Unavoidably, any such assumptions on the expected interaction between multiple effects will be a gross simplification in complicated nonlinear systems, and the data are largely shown with sufficient clarity to allow the reader to make their own conclusions. However, some of the interpretations in the paper lean heavily on an assumption that the data support a direct interpretation (e.g. "neuronal mechanisms actively buffer behavioral alterations at specific development times") rather than an indirect interpretation (e.g. that serotonin reduces baseline roaming fraction which makes a fixed sized effect more noticeable). Parsing the differences requires either more detailed mechanistic study or careful characterization of the effect of different baselines on the sensitivity of behavior to perturbation-barring that it's worth noting that many of these interactions may be due to differences in biological and experimental sensitivity to change under different conditions, rather than a direct interaction of stress and neuromodulatory processes or evidence of differing neuromodulatory activity at different stages of development.

      In the revised manuscript we added a discussion of the potential complicated interactions between neuromodulation and stress, altering baseline levels and deviations from baseline. We also discuss the interpretation of the results in the context of non-linear systems in which sensitivity of the behavioral response to underlying variations may be modified by specific neuromodulatory and environmental perturbations, without assuming direct differences in neuromodulatory states over development or across individuals (p. 16).

      Reviewer #3 (Public Review):

      In this study, Nasser et al. aim to understand how early-life experience affects 1) developmental behavior trajectory and 2) individuality. They use early life starvation and longitudinal recording of C. elegans locomotion across development as a model to address these questions. They focus on one specific behavioral response (roaming vs. dwelling) and demonstrate that early life (right after embryo hatching) starvation reduces roaming in the first larval (L1) and adult stages. However, roaming/dwelling behavior during mid-larval stages (L2 through L4) is buffered from early life starvation. Using dopamine and serotonin biosynthesis null mutant animals, they demonstrated that dopamine is important for the buffering/protection of behavioral responses to starvation in mid-larval stages, while in contrast, serotonin contributes to early-life starvation's effects on reduced roaming in the L1 and adult stages. While the technique and analysis approaches used are mostly solid and support many of the conclusions made in the manuscript for part 1), there are some technical limitations (e.g., whether the method has sufficient resolution to analyze the behaviors of younger animals) and confounding factors (e.g., size of the animal) that the authors do not yet sufficient address, and can affect interpretation of the results. Additionally, much of the study is descriptive and lacks deep mechanistic insight. Furthermore, the focus on a single behavioral parameter (dwelling vs. roaming) limits the broad applicability of the study's conclusions. Lastly, the manuscript does not provide clear presentation or analysis to address part 2), the question of how early life experience affect individuality.

      We thank the reviewer for these important comments. As described below, in the revised manuscript we include new analyses (following extraction of size data), showing behavioral modifications across different conditions/genotypes also in size-matched individuals (within the same size range) (Figure 1 - figure supplement 1F; Figure 3 - figure supplement 1D,E; Figure 5 - figure supplement 1B,D). We also made edits to the text to describe these results (Methods p. 21 and Results section). In addition, while we can detect behavioral changes using our imaging method even in young L1 worms across conditions and genotypes (described in Stern et al. 2017 and this manuscript), as the reviewer correctly pointed out, we may miss some milder behavioral effects due to lower spatial imaging resolution in younger worms. We are now referring to this spatial resolution limitation in the revised manuscript (discussion part). Lastly, in the revised manuscript we added clearer and more direct analyses of changes in inter-individual variation in multiple PC dimensions following early stress, by directly comparing variation between starved and unstarved individuals within the mutant and wild-type populations (Figure 6; Figure 6 - figure supplement 1). These analyses show significant changes in inter-individual variation within specific PC individuality dimensions following early stress. Also, we made textual changes along the manuscript to increase the clarity of presentation of these results.

    1. Author Response

      Reviewer 1 (Public Review):

      In this paper, Reato, Steinfeld et al. investigate a question that has long puzzled neuroscientists: what features of ongoing brain activity predict trial-to-trial variability in responding to the same sensory stimuli? They record spiking activity in the auditory cortex of head-fixed mice as the animals performed a tone frequency discrimination task. They then measure both overall activity and the synchronization between neurons, and link this ’baseline state’ (after removing slow drifts) of cortex to decision accuracy. They find that cortical state fluctuations only affect subsequent evoked responses and choice behavior after errors. This indicates that it’s important to take into account the behavioral context when examining the effects of neural state on behavior.

      Strengths of this work are the clear and beautiful presentation of the figures, and the careful consideration of the temporal properties of behavioral and neural signals. Indeed, slowly drifting signals are tricky as many authors have recently addressed (e.g. Ashwood, Gupta, Harris). The authors are well aware of the difficulties in correlating different signals with temporal and cross-correlation (such as in their ’epoch hypothesis’). To disentangle such slow trends from more short-lived state fluctuations, they remove the impact of the past 10 trials and continue their analyses with so-called ’innovations’ (a term that is unusual, and may more simply be replaced with ’residuals’).

      The terms ‘innovations’ and ‘residuals’ are sometimes used interchangeably. We used innovations because that’s how they were introduced in the signal processing literature (i.e., Kailath, T (1968). ”An innovations approach to least-squares estimation–Part I: Linear filtering in additive white noise.” IEEE transactions on automatic control). We try to be explicit in the text about the formal definition of this quantity, to avoid problems with terminology.

      I do wonder if this throws out the baby with the bathwater. If the concern is statistical confound, the ’session permutation’ method (Harris) may be better suited. If the concern is that short-term state fluctuations are more behaviorally relevant (and obscured by slow drifts), then why are the results with raw signals in the supplement (Suppfig 8) so similar?

      The concern was statistical confound, although this concern is ameliorated when using a mixed model approach and focusing on fixed effects. However, our approach allowed us to assess the relative importance of slow versus single-trial timescales in the predictive relationship between cortical state (and arousal) and behavior, revealing that, in the conditions of our experiment, only the fast timescales are relevant. Because of this, we think that the baby wasn’t thrown out with the bathwater as, qualitatively, no new phenomenology was revealed when the slow components of the signals were included. In hindsight, it is true that the results we obtained suggest that maybe the effort we made to isolate the fast component of the signals was unjustified. However, this can only be known after both options have been tried, as we did. Moreover, we started using innovations based on the results in Figure 2 where, as we show, the use of innovations does make a difference, even at the level of fixed effects in a mixed model. We agree that we could have used the ‘session permutation’ method, but given the depth at which we have explored this issue in the manuscript already, and the clarity of the results, we think that adding a third method would only make reading the manuscript more difficult without adding any substantially new content.

      While the authors are correct that go-nogo tasks have drawbacks in dissociating sensitivity from response bias, they only cursorily review the literature on 2AFC tasks and cortical state. In particular, it would be good to discuss how the specific method - spikes, EEG (Waschke), widefield (Jacobs) and algorithm for quantifying synchronization may affect outcomes. How do these population-based measures of cortical state relate to those described extensively with slightly different signals, notably LFP or EEG in humans (e.g. work by Saskia Haegens, Niko Busch, reviewed in https://doi.org/10.1016/j.tics.2020.05.004)? This review also points out the importance of moving beyond simple measures of accuracy and using SDT, which would be an interesting improvement for this paper too.

      We thank the reviewer for pointing us towards the oscillation-based brain-state literature in humans. We have expanded the paragraph in the discussion where we compare our results with previous work in order to (i) elaborate on the literature on 2AFC tasks, (ii) specifically address the literature linking alpha power in the pre-stimulus baseline and psychophysical performance, and (iii) mention different methods for assessing desynchronization. Our view is that absence of lowfrequency power is a robust measure which can be assessed using different types of signals (spikes, imaging, LFP, EEG). That said, the relationship between desynchronization and behavior appears subtle and variable, specially within discrimination paradigms. These issues are discussed in the paragraph starting in line 527 in the text.

      Regarding the use of SDT, we had already established that our main finding could be expressed as a significant interaction between FR/Synch and the stimulus-strength regressor, when predicting choice after errors (Supplementary Fig. 4A in original manuscript), which is equivalent to a cortical state-dependent increase in d′ after the mice made a mistake. In order to consider a possible effect of cortical state on the ‘criterion’ (i.e., an effect on the bias of the mice towards either response spout), we re-run this GLMM but adding the cortical state regressors as main effects. The results show that the FR-Synch predictor is only significantly greater than zero as an interaction after errors (p = 0.0025). As a main effect, it’s not significantly different from zero neither after errors (p = 0.28), nor after correct trials (p = 0.97). We have included this analysis as Figure 3-figure supplement 1B (replacing the previous Supplementary Fig. 4A) and commented on them in the text (lines 222-225).

      Reviewer 2 (Public Review):

      The relationship between measures of brain state, behavioral state, and performance has long been speculated to be relatively simple - with arousal and engagement reflecting EEG desynchronization and improved performance associated with increases in engagement and attention. The present study demonstrates that the outcome of the previous trial, specifically a miss, allows these associations to be seen - while a correct response appears less likely to do so. This is an interesting advance in our understanding of the relationship between brain state, behavioral state, and performance.

      This is probably just a typo, but we would like to clarify that the relevant outcome in the previous trial is not a miss, but an incorrect choice in an otherwise valid trial (i.e., a trial with a response within the allowed response window).

      While the study is well done, the results are likely to be specific to their trial structure and states exhibited by the mice. To examine the full range of arousal states, it needs to be demonstrated that animals are varying between near-sleep (e.g. drowsiness) and high-alertness such as in rapid running. The fact that the trials occurred rapidly means that the physiological and neural variables associated with each trial will overlap with upcoming trials - it takes a mouse more than a few seconds to relax from a previous miss or hit, for example. Spreading the rapidity of the trials out would allow for a broader range of states to be examined, and perhaps less cross-talk between adjacent trials. The interpretation of the results, therefore, must be taken in light of the trial structure and the states exhibited by the mice.

      We thank the reviewer for the positive assessment of our work and also for raising this point in particular. This motivated us to look more carefully at this issue, with results that, we believe, strengthen our study.

    1. Author Response

      Reviewer #1 (Public Review):

      In this work, Roche et al. study a 13-year long time series of microbiome samples from wild baboons from Kenya. The data used in this work challenge a previous finding from the same authors that temporal dynamics in microbiome changes are largely individualized. Using a multinomial logistic-normal modeling approach, the authors detect that co-variance in temporal dynamics in microbial pair-wise associations among individuals occurs more frequently between relatives. Furthermore, the authors identify that microbial phylogenetic proximity is associated with consistent co-abundance changes over time and that their metric of universal microbial relationships is robust across hosts and is detected even in human longitudinal data. The authors conduct a thorough statistical revision of publicly available results, highlighting this time (e.g. compared to Björk et al, doi: 10.1038/s41559-022-01773-4) the consistently shared microbial properties between individuals, rather that the individual microbial signatures highlighted in their previous work.

      Thank you for this summary. We would like to briefly clarify that we do not see the current work as inconsistent with our prior finding in Björk et al. that microbiome taxonomic compositions are idiosyncratic and asynchronized. However, this new analysis, which focuses on abundance correlations between pairs of taxa, indicates that the personalized compositions and dynamics we observed in Björk et al. are probably not attributable to personalized microbiome ecologies. In other words, Björk et al. showed that microbial taxa found in the guts of different baboons can be quite distinct (and remain so over time, giving rise to semi-stable individual signatures). The current study shows that, despite this taxonomic individuality, the correlations between pairs of microbes in the baboon gut are often quite consistent. To give a basic example, hot weather and ice cream, when observed, are often observed together (positively correlated), but while some places have a lot of both, some have little of either. This idea is discussed in more detail below (see response R6) and in the revised Discussion section (lines 572 to 586).

      Strengths:

      This work is foundational in its compelling effort to generate a rigorous method to evaluate coabundance dynamics in longitudinal microbiome data. The approach taken will likely inspire developments that will sharpen the capacity to extract co-varying microbial features, taking into account seasonality, diet, age, relatedness, and more. To the best of my understanding, their hierarchical model integrated into the Gaussian process to analyze microbial dynamics is reasonably robust and they clearly explain the implementation. Furthermore, this work introduces and defines the concept of a universality score for microbial taxon pairs. Overall, the work presented is clear and convincing and provides tools for the community to benefit from both methods and results. Furthermore, conceptually, this work stresses the value of consistent and shared microbial dynamics in groups, which enriches our understanding of host-associated microbial ecology, otherwise understood to be largely dependent on external fluctuations.

      Weakness:

      It is not entirely clear the extent to which the presented results revise, refute, or support the previously published analysis performed by the authors on the same dataset (doi: 10.1038/s41559-022-01773-4), which was more focused on individuality.

      We agree the relationship between Björk et al. and the current manuscript was unclear in our original submission. We now elucidate the relationship between these papers in the Discussion (lines 572 to 586). Briefly, Björk et al. found that microbiome taxonomic compositions are idiosyncratic and asynchronized. The current analysis finds that pairwise bacterial abundance correlations are predominantly shared and not highly personalized. We think the most likely explanation is that, as mentioned by Reviewer 2 below, the current analyses do not account for the role that environmental gradients play in the gut. If these environments differ asynchronously across hosts, it could lead to shared abundance correlations, but individualized microbiome compositions and individualized single-taxon dynamics. We discuss this possibility and other potential explanations in the revised Discussion (lines 572 to 586).

      Reviewer #2 (Public Review):

      The authors of this paper identify a knowledge gap in our understanding of the generalizability of ecological associations of gut bacteria across hosts. Theoretically, it is possible that ecological associations between bacteria are consistent within a host organism but differ between hosts, or that they are universal across hosts and their environmental gradients. The authors utilize longitudinal data with a unique temporal resolution, on Amboseli baboons, 56 individuals who were sampled for gut microbiome hundreds of times over a decade. This data allows disentangling ecological dynamics within and across individuals in a way that as far as I know has never been done before. The authors show that ecological relationships among baboon gut bacteria, measure through a correlation based on covariation, are largely universal (similar within and across host individuals) and that the most universally covarying taxa are almost always positively associated with each other. They also compare these results with two sets of human data, finding similar patterns in one human data set but not in the other.

      The main aim of this paper is to establish whether gut microbial ecologies are universal across hosts, and this the authors generally show to be true in a thorough and convincing way. However, some re-assessment or re-assurance on the solidity of their chosen method of estimating co-variation would be needed to fully assess the robustness of subsequent results. Specifically, the authors measure the correlation between microbial taxa from data on their abundance co-variation across samples. While necessary steps have been taken to validate the estimates across spurious correlations due to the compositional nature and autocorrelation structures present in the data, I worry that the sparsity of the data might influence the estimation of positive and negative correlations in a slightly different manner. There exist more microbial taxa than samples in the data and some taxa are present in as few as 20% of the samples, meaning that the covariation data will have a large amount of 0-0 pairs. I worry that the abundance of 0-0 pairs in the data might inflate the measures of positive co-variation, making taxa seem highly positively correlated in abundance when they in fact are missing from many samples. Of course, mutual absence is also a form of biologically meaningful covariation but taking the larger number of taxa than samples and the inability of sequencing technology to detect all low-abundance taxa in a sample, I am currently not convinced that all of the 0-0 pairs are modeled as a realistic and balanced way as a continuum of the other non-zero co-variation between taxa in the data. This may become problematic when positive and negative relationships are compared: The authors state that even though most associations between taxa were negative, the most universally correlated taxa pairs (taxa pairs with strongest correlations in abundance both within and between hosts) were enriched in positive associations. It may be possible that this is influenced by the fact that zero inflation in the data lends more weight to positive links than negative links. Whether these universal positive correlations are driven by positive non-zero abundance covariation or just 0-0 links in the data is currently unclear.

      Thank you for pointing out this weakness in our original analyses. As described in response R1 above, your hunch was correct: zero inflation biased our correlation patterns such that taxa pairs with a high frequency of joint zero observations (i.e., where both members of the pair had very low or zero abundances) tended to be positively correlated (Fig. R1). Consequently, as you suggested, zero inflation in the data lent more weight to positive links than negative links in our data set. To address this problem in the revised manuscript, we now restrict our analyses to taxon pairs whose joint zero-abundance observations were less than 5% of all samples across hosts (pairs to the left of the dashed vertical line in Fig. R1 above). We also restricted our analyses to taxa observed in at least 50% of all samples. The first of these criteria was the most restrictive. As described above, our new filtering procedure retained 1,878 of the original 7,750 ASV-ASV pairs; 57 of the original 66 phylum-phylum pairs; and 473 of the original 666 class/order/family-level pairs.

      Another additional result that would benefit from a more clear context is the result that taxa correlation patterns were more similar between phylogenetically close taxa and between genetically close host individuals. The former notion is to be expected if taxa abundances are driven by environmental (or host physiology-related) selective forces that favor bacteria with similar phenotypes. This yields more support to the idea that covariation is environmentally driven rather than driven by the ecological network of the bacteria themselves, and this could be more clearly emphasized. The latter notion of covariation being more similar in genetically related hosts is currently impossible to disentangle from the notion that covariation patterns were more similar with individuals harboring a more similar baseline microbiome composition since microbiome composition and genetic relatedness were apparently correlated. To understand if something about relatedness was actually influential over correlation pattern similarity, one would need to model that effect on top of the baseline similarity effect. Currently, it is not clear if this was done or not.

      We agree that shared responses to environmental gradients within hosts—especially immune profiles and pH—could explain both of these findings. These ideas are now described in the Discussion in lines 559 to 562.

      We also now report partial Mantel tests to control for baseline similarity in microbiome composition when testing for shared microbial correlation patterns among genetic relatives. Controlling for baseline similarity had little effect on the results, and we now report the statistics for this partial Mantel (Fig. 5B; Table S7; r2=0.009; partial Mantel p-value=0.002). See lines 391-392.

      The authors also slightly overemphasize the generalizability of their results to humans, taking that only one of the human data sets they compare their results to, shows similar patterns. While they mention that the other human data set (that was not similar in patterns to theirs) was different in some key aspects (sampling frequency was much higher), the other human data set was also dissimilar to the other two (it only contained infants, not adults). Furthermore, to back up the statement that higher sampling frequency would be the reason this data set had dissimilar covariation between taxa, one would need to show that the temporal variation in this data set was different from the baboon one and show that these covariation patterns were sensitive to timescale by subsampling either data to create mock data sets with different sampling frequency and see how this would change the inference of ecological associations.

      We have revised the text to tone down the generalizability of our results to humans. For instance, the abstract (line 58) now states that “universality in baboons was similar to that in human infants, and stronger than one data set from human adults” but does not state that our results are generalizable to humans.

      We also considered sub-sampling the data set from Johnson et al., from daily to monthly scales, but unfortunately that data set is only 17 days long, so doing so is impossible. This is now stated in the Discussion in line 619, which states, “However, without the ability to subsample Johnson et al. [7] to monthly scales (this data set is only 17 days long), it is impossible to test this prediction.”

      To the extent that the results are robust, particularly regarding to the main result of the universality of gut microbial ecological associations, the impact of this paper is not small. This question has never been so thoroughly and convincingly addressed, and the results as they stand have the power to strongly influence the expectations of gut microbial ecology across many different systems. Moreover, as the authors point out, evidence for universal gut microbial ecology is important for the future development of probiotics. An important point here, underemphasized by the authors, is that universal gut microbe ecologies will allow specific interventions that use gut microbe ecology to manipulate emergent community properties of microbiomes to be more beneficial for the host, rather than just designing compositional cocktails that should fit all. In addition to the main finding of this study, the unique data set and the methods developed as part of this study (e.g. the universality score, the enrichment measures, the model of log-ratio dynamics, the assessment of covariation from time-ordered abundance trajectories) will doubtlessly be translatable to many other studies in the future.

      Thank you for these suggestions. We now mention these implications in the introduction (line 82-84) and in the discussion in lines 537-539 and line 630.

      Reviewer #3 (Public Review):

      This is a well-executed study, offering thorough analysis and insightful interpretations. It is wellwritten, and I find the conclusions interesting, important, and well-supported.

      Thank you for your supportive comments.

      References

      1. Silverman JD, Roche K, Holmes ZC, David LA, Mukherjee S. Bayesian Multinomial Logistic Normal Models through Marginally Latent Matrix-T Processes. Journal of Machine Learning Research. 2022;23:1-42.
      2. Quinn TP, Richarrson MF, Lovell D, Crowley TM. propr: An R-package for Identifying Proportionally Abundant Features Using Compositional Data Analysis Scientific Reports. 2017;7:16252.
      3. Cao Y, Lin W, Li H. Large covariance estimation for compositional data via compositionadjusted thresholding. . J Am Stat Assoc. 2019:759-72.
      4. Friedman J, Alm EJ. Inferring correlation networks from genomic survey data. PLoS Comput Biol. 2012;8(9):e1002687. doi: 10.1371/journal.pcbi.1002687. PubMed PMID: 23028285; PubMed Central PMCID: PMCPMC3447976.
      5. Risely A, Schmid DW, Muller-Klein N, Wilhelm K, Clutton-Brock TH, Manser MB, et al. Gut microbiota individuality is contingent on temporal scale and age in wild meerkats. Proc Biol Sci. 2022;289(1981):20220609. Epub 20220817. doi: 10.1098/rspb.2022.0609. PubMed PMID: 35975437; PubMed Central PMCID: PMCPMC9382201.
      6. Wilmanski T, Diener C, Rappaport N, Patwardhan S, Wiedrick J, Lapidus J, et al. Gut microbiome pattern reflects healthy ageing and predicts survival in humans. Nat Metab. 2021;3(2):274-86. Epub 20210218. doi: 10.1038/s42255-021-00348-0. PubMed PMID: 33619379; PubMed Central PMCID: PMCPMC8169080.
      7. Johnson AJ, Vangay P, Al-Ghalith GA, Hillmann BM, Ward TL, Shields-Cutler RR, et al. Daily Sampling Reveals Personalized Diet-Microbiome Associations in Humans. Cell Host & Microbe. 2019;25(6):789-802. Epub 2019/06/14. doi: 10.1016/j.chom.2019.05.005. PubMed PMID: 31194939.
      8. Franzosa EA, Huang K, Meadow JF, Gevers D, Lemon KP, Bohannan BJM, et al. Identifying personal microbiomes using metagenomic codes. Proceedings of the National Academy of Sciences. 2015;112(22):E2930-E8. doi: 10.1073/pnas.1423854112. PubMed PMID: WOS:000355832200014.
      9. Faith JJ, Guruge JL, Charbonneau M, Subramanian S, Seedorf H, Goodman AL, et al. The long-term stability of the human gut microbiota. Science. 2013;341(6141):1237439. Epub 2013/07/06. doi: 10.1126/science.1237439. PubMed PMID: 23828941; PubMed Central PMCID: PMC3791589.
      10. Bik EM, Costello EK, Switzer AD, Callahan BJ, Holmes SP, Wells RS, et al. Marine mammals harbor unique microbiotas shaped by and yet distinct from the sea. Nat Commun. 2016;7:10516. Epub 20160203. doi: 10.1038/ncomms10516. PubMed PMID: 26839246; PubMed Central PMCID: PMCPMC4742810.
      11. Caporaso JG, Lauber CL, Costello EK, Berg-Lyons D, Gonzalez A, Stombaugh J, et al. Moving pictures of the human microbiome. Genome Biology. 2011;12(5):R50. doi: Artn R50 Doi 10.1186/Gb-2011-12-5-R50. PubMed PMID: ISI:000295732700014.
      12. Costello EK, Lauber CL, Hamady M, Fierer N, Gordon JI, Knight R. Bacterial community variation in human body habitats across space and time. Science. 2009;326(5960):1694-7. doi: Doi 10.1126/Science.1177486. PubMed PMID: ISI:000272839000053.
      13. Dolinsek J, Goldschmidt F, Johnson DR. Synthetic microbial ecology and the dynamic interplay between microbial genotypes. Fems Microbiology Reviews. 2016;40(6):961-79. doi: 10.1093/femsre/fuw024. PubMed PMID: WOS:000387995000010.
      14. Louca S, Polz MF, Mazel F, Albright MBN, Huber JA, O'Connor MI, et al. Function and functional redundancy in microbial systems. Nat Ecol Evol. 2018;2(6):936-43. Epub 2018/04/18. doi: 10.1038/s41559-018-0519-1. PubMed PMID: 29662222.
      15. Rainey PB, Quistad SD. Toward a dynamical understanding of microbial communities. Philos Trans R Soc Lond B Biol Sci. 2020;375(1798):20190248. Epub 2020/03/24. doi: 10.1098/rstb.2019.0248. PubMed PMID: 32200735; PubMed Central PMCID: PMCPMC7133524. 16. Martiny JB, Jones SE, Lennon JT, Martiny AC. Microbiomes in light of traits: A phylogenetic perspective. Science. 2015;350(6261):aac9323. doi: 10.1126/science.aac9323. PubMed PMID: 26542581.
      16. Debray R, Herbert RA, Jaffe AL, Crits-Christoph A, Power ME, Koskella B. Priority effects in microbiome assembly. Nat Rev Microbiol. 2022;20(2):109-21. Epub 20210827. doi: 10.1038/s41579-021-00604-w. PubMed PMID: 34453137.
      17. Gloor GB, Reid G. Compositional analysis: a valid approach to analyze microbiome highthroughput sequencing data. Can J Microbiol. 2016;62(8):692-703. Epub 2016/06/18. doi: 10.1139/cjm-2015-0821. PubMed PMID: 27314511.
      18. Joseph TA, Pasarkar AP, Pe'er I. Efficient and Accurate Inference of Mixed Microbial Population Trajectories from Longitudinal Count Data. Cell Syst. 2020;10(6):463-9 e6. Epub 20200624. doi: 10.1016/j.cels.2020.05.006. PubMed PMID: 32684275.
      19. Aijo T, Muller CL, Bonneau R. Temporal probabilistic modeling of bacterial compositions derived from 16S rRNA sequencing. Bioinformatics. 2018;34(3):372-80. doi: 10.1093/bioinformatics/btx549. PubMed PMID: 28968799; PubMed Central PMCID: PMCPMC5860357.
      20. Coyte KZ, Rao C, Rakoff-Nahoum S, Foster KR. Ecological rules for the assembly of microbiome communities. PLoS Biol. 2021;19(2):e3001116. Epub 20210219. doi: 10.1371/journal.pbio.3001116. PubMed PMID: 33606675; PubMed Central PMCID: PMCPMC7946185.
      21. Coyte KZ, Schluter J, Foster KR. The ecology of the microbiome: Networks, competition, and stability. Science. 2015;350(6261):663-6. doi: 10.1126/science.aad2602. PubMed PMID:
      22. Palmer JD, Foster KR. Bacterial species rarely work together. Science. 2022;376(6593):581-2. Epub 20220505. doi: 10.1126/science.abn5093. PubMed PMID:
      23. Reese AT, Pereira FC, Schintlmeister A, Berry D, Wagner M, Hale LP, et al. Microbial nitrogen limitation in the mammalian large intestine. Nat Microbiol. 2018. Epub 2018/10/31. doi: 10.1038/s41564-018-0267-7. PubMed PMID: 30374168.
      24. Firrman J, Liu L, Mahalak K, Tanes C, Bittinger K, Tu V, et al. The impact of environmental pH on the gut microbiota community structure and short chain fatty acid production. FEMS Microbiol Ecol. 2022;98(5). doi: 10.1093/femsec/fiac038. PubMed PMID: 35383853.
      25. de Vos WM, Tilg H, Van Hul M, Cani PD. Gut microbiome and health: mechanistic insights. Gut. 2022;71(5):1020-32. Epub 20220201. doi: 10.1136/gutjnl-2021-326789. PubMed PMID: 35105664; PubMed Central PMCID: PMCPMC8995832.
      26. Tamames J, Sanchez PD, Nikel PI, Pedros-Alio C. Quantifying the Relative Importance of Phylogeny and Environmental Preferences As Drivers of Gene Content in Prokaryotic Microorganisms. Front Microbiol. 2016;7:433. Epub 20160331. doi: 10.3389/fmicb.2016.00433. PubMed PMID: 27065987; PubMed Central PMCID: PMCPMC4814473.
      27. Gloor GB, Macklaim JM, Pawlowsky-Glahn V, Egozcue JJ. Microbiome datasets are compositional: and this is not optional. Front Microbiol. 2017;8:2224. Epub 2017/12/01. doi: 10.3389/fmicb.2017.02224. PubMed PMID: 29187837; PubMed Central PMCID: PMCPMC5695134.
      28. Caporaso JG, Lauber CL, Walters WA, Berg-Lyons D, Lozupone CA, Turnbaugh PJ, et al. Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample. Proceedings of the National Academy of Sciences. 2011;108:4516-22. doi: Doi 10.1073/Pnas.1000080107. PubMed PMID: ISI:000288451300002.
    1. Motivation

      Reviewer 3: Wyeth Wasserman

      SYNOPSIS The manuscript describes an updated release of the ReMM regulatory variant mutation scoring system. The paper presents the performance of an updated version of the system and describes how it was applied to the most current release of the reference human genome.

      OVERALL PERSPECTIVE This is a valuable resource for the community of researchers and clinicians working on the interpretation of genetic variants in the human genome. The work appears to be thoughtfully done and appropriate assessments have been provided. The use of the random forest models to weigh the contributions of features was particularly noted for the insights it provided into how features contribute to prediction. My biggest concerns are stylistic, which falls outside the scientific quality of the work. I provide these comments for the authors to consider and do not expect that my stylistic preferences will be uniformly accepted. A fair amount of justification of the manuscript focuses on the value of having a release for version 38 of the human genome, pointing to the field as not having done so broadly. I think this is misguided, as by the time people are reading the manuscript such points will have lost relevance. I suggest a focus on the science be given, as there is no need to justify things based on where other resources have progressed in releasing their version 38 updates. Points below include language/text clarifications that can be assessed by the authors. Writing styles differ, so stylistic comments should be optional.

      MAJOR POINTS None. Well done and clearly presented.

      MINOR POINTS 1. The word "various" is vague and often shows up when people are too busy to provide an accurate statement. Starting the manuscript with it makes a bad impression on this reader. You do not have to change it, but I thought you might appreciate knowing this impression. You could delete it with no harm to the sentence. (Not to get carried away, but the next sentence starting with "some" heightens the impression of 'hand waving'.) 2. I think I understand ", we apply cytogenic band-aware cross-validation using ten folds" but I encourage the authors to provide clearer wording for this point. 3. I would allow the reader to make their own judgement of performance. So please remove "excellent" from "we achieve an excellent performance" 4. "Rather than using ReMM scores for ranking, some users need to specify score thresholds" is confusing. I would change 'need to' to 'choose to' 5. "with lots of false positives" is a bit informal. I suggest "with a high false positive rate" 6. I am confused by "from three genomic regions (genic content and not overlapping with assembly gap changes) " as the brackets include two items, not three. 7. "maybe due to better mapping" - "maybe" should be "may be" 8. I think the language like "seems to be the only tool directly trained on training data and features derived from GRCh38." Is not particularly valuable long term. This is a useful contribution, but many tools are being updated to 38 and by the time this appears and is read, such statements decline in relevance. I would focus on providing this valuable resource, and not try to justify it based on a transient perception of where the field stands in updating versions. 9. "It is worth noting that in the context of extremely unbalanced data…" - you do note it. So I would change the wording to "In the context of extremely unbalanced data…"

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Detailed Answer to the Reviewers

      Reviewer #1

      __Summary __

      The authors used a novel imaging technique to monitor glutamate release and correlated these measurements with gold standard electrophysiological measurements. The genetically encoded glutamate reporter, iGluSnFR, was expressed in mouse spiral ganglion neurons using the approach described in Ozcete and Moser (2021, EMBO J). The iGluSnFR signals and the postsynaptic currents were measured at the endbulb of Held synapse. A small effect of the expression of iGluSnFR on the mEPSC kinetics was found (but see comment 1). Furthermore, deconvolution of the iGluSnFR signals was performed enabling the comparison of some presynaptic properties assessed with either iGluSnFR or electrophysiology.

      We thank the reviewer for her/his appreciation of our work and for the comments that have helped/will help us further improve our manuscript.

      __Major comments __

      1. The central finding of the study is the prolonged decay time constant of the mEPSC. The difference is small but astonishingly significant (0.172 {plus minus} 0.002 and 0.158 {plus minus} 0.001, P=0.003). The SEM is about 100 times smaller than the measured time constant. This is biologically not plausible. Therefore, I am skeptical about the statistical significance of the results.

      We appreciate the feedback of the reviewer. We agree that our presentation of the data was easy to misunderstand and we changed it (see below). We modeled the statistical relationship of kinetic parameters with a mixed effects model (as described in methods). Since the presentation of regression parameters for this kind of data is not very usual in synaptic neuroscience (nor very informative in this study), we instead opted to report SEM and a p-value derived from the fit of the linear mixed effects model. For the SEM, there is no clear way to take into account the clustered nature of the data, so we calculated the SEM over all observations. Since the SEM is proportional to 1/sqrt(n) and the number of recorded mEPSCs is very large, this does indeed yield a very small SEM. We agree that reporting the SEM over all observations is unusual and leads to misunderstandings in this case. Now, we instead report the re-calculated the mean / SEM for all parameters over the median values per cell. We changed the presentation of the data also for the other values presented in the MS in all tables and the relevant parts of the main texts.. We note that the summary statistics do not directly influence the further statistical modeling.

      1. Analysis of the size of RRP with electrophysiology and iGluSnFR is potentially interesting but iGluSnFR recordings could not resolve the spontaneous fusion of single vesicles. Therefore, it is not possible to estimate RRP with these iGluSnFR recordings. This limitation of the approach should be emphasised more clearly.

      Yes, we think the inability to resolve single vesicles is one of the major limitations of the study and we note this in the introduction and in the relevant section of the discussion. We agree that it should be clear in the relevant section that we are not able to measure RRP size without resolving single vesicle release and modified the wording of the relevant results section to reflect this better (line 267, 497). We still believe that the cumulative release analysis is potentially interesting to researchers in the field, as RRP size is not the only parameter that can be estimated in this way. In particular, an estimation of the release probability in resting conditions is possible by dividing the amplitude of the first response (i.e. response to a single stimulus) by the RRP estimate even without knowing the exact number of vesicles that comprise either.

      1. The control conditions (no surgery/no virus injection) are not the correct conditions for comparison with the experimental conditions (surgery/virus injection and sensor expression). The control group should be operated and injected with saline or ideally with a virus expressing GFP at the extracellular membrane. The authors addressed this issue by citing their previous work (Özcete and Moser, 2021). However, I am not convinced that surgery does not induce subtle changes that could explain the small differences in mEPSCs.

      This is an excellent point that should be addressed in further research. A slowed decay would be consistent with the idea that iGluSnFR affects glutamatergic transmission by buffering glutamate, but we cannot rule out subtle changes due to the postnatal surgery or AAV-mediated transgene expression. In response to the reviewer’s comment, we modified the text to reflect the possibility of surgery and / or other parts of the expression system being responsible for the changes. We also discuss further control experiments (line 408). Finally, we believe that our comparison is still relevant for researchers using iGluSnFR in the system, as they will be asking if introducing a measurement system affects the underlying quantity.

      __Minor comments __

      The supplementary figures are not listed in the order in which they appear in the main text.

      We now list the supplementary figures in the order in which they appear in the main text.

      Figure 2B and 3 are not referenced in the main text.

      We now reference the figures in the text.

      The PPR in Figure 3 shows a PPR that cannot be evaluated because of the unusual plot with lines that are too thick.

      We updated Fig.3 and chose a more straight forward way to display the PPRs.

      Line 105: "...while simultaneously monitoring currents in postsynaptic cells". This sentence is not correct given that the EPSCs have not been shown yet at this point of the manuscript.

      We removed this part of the sentence.

      Line 110: "SV and are not cause are cause by spontaneous action potentials...". The sentence does not make sense.

      We corrected the sentence.

      Line 168-9: "...we did not find significant differences in amplitude and kinetics...". According to Table 2 (2mM Ca2+ condition), both Imax and Q appear to be almost twice as high in iGluSnFR as in control (2.05 {plus minus} 0.06 and 1.34 {plus minus} 0.03, respectively; P=0.241). Is this not a significant difference?

      The difference was not significant. The misunderstanding likely stems from the same problem in the presentation of the values as for the mEPSCs. We replaced the SEMs with the SEMs of the cell median to avert this.

      Table 4. 2mM Ca2+ condition. The Rrefill parameter is about an order of magnitude smaller in the iGluSnFR-expressing group. Is this correct or just a typo?

      Thank you for spotting this: it was an error with regards to unit conversion. The value for the control group was off by a factor of 10. We corrected this mistake.

      Referees cross-commenting

      I also agree with the comments of the other reviewers.

      Significance

      General assessment

      This topic is currently of interest because iGluSnFR techniques are widely used. However, the study is preliminary. The scientific progress in terms of quantity and quality is limited. For example, Figs. 1 and 5 show only images and traces with little scientific significance.

      Advance

      The main advance of the study is the implementation of the deconvolution of the iGluSnFR signal and the comparison of the back extrapolation with the first stimulus (Fig. 6). This comparison was similar between electrophysiology and iGluSnFR when deconvolution of the iGluSnFR data was performed. These data therefore argue against saturation of iGluSnFR, as expected from a large number of previous analyses of iGluSnFR.

      There is little methodological improvements compared with the group's previous study (Ozcete and Moser, 2021 EMBO J). In this earlier study, a different synapse was analyzed but the same iGluSnFR was injected into the scala tympani of the right ear through the round window in the same way as in this study. Surprisingly, the authors do not refer to Ozcete and Moser (2021) in the relevant methods section.

      Thank you for spotting this omission. We now cite Özçete and Moser (2021) in the appropriate place in the methods section as well.

      Reviewer #2

      Summary

      In the manuscript 'Optical measurement of glutamate release robustly reports short-term plasticity at a fast central synapse' the authors present a careful analysis of whether direct measurements of transmitter release using the genetically-encoded indicator iGluSnFR, are suitable for assessing changes in transmitter release at the spiral ganglion neuron end bulbs of Held in the mouse cochlear nucleus. What sets this study apart from other studies, which have demonstrated the utility of iGluSnFR measurements, is the use of a camera-based fluorescence readout as opposed to confocal or 2P microscopy methods and that it is performed in the cochlear nucleus.

      The primary methodology is the comparison of electrophysiological measurements of excitatory postsynaptic currents from bushy cells with fluorescence changes in the end bulbs of iGluSnFR expressing auditory nerve fibers with and without stimulation of the auditory nerve fibers. The experiments are technically demanding and introducing genetically encoded indicators in neurons of the cochlea is no small accomplishment. An important observation is that mEPSCs are slightly modified (prolonged) due to expression of iGluSnFR in the presynaptic end bulbs. This is perhaps not surprising as iGluSnFR binds glutamate and may act as a buffer to reduce the peak and slightly prolong the increase in cleft glutamate concentration after release from synaptic vesicles. To my knowledge, others have not reported iGluSnFR effects on mEPSCs. Perhaps earlier studies have not checked as carefully, alternatively previous studies had a too-low fraction of presynaptic terminals expressing iGluSnFR (or less expression of iGluSnFR) to detect a change in EPSC parameters, or this is a synapse-specific phenomenon. However, the authors demonstrate that EPSCs evoked by electrical stimulation of the auditory nerve fibers are unaffected by expression of iGluSnFR in the presynaptic neurons. Further findings are that the determined decay time constant is substantially longer than at other synapses (~16 ms at hippocampal synapses, Dürst et al., 2018). Synaptic depression was robustly reported by iGluSnFR at this synapse, but determination of single quantal events and thus quantal analysis was not really possible at this synapse using iGluSnFR in conjunction with the imaging and analysis techniques presented. The manuscript is carefully written and presented.

      We thank the reviewer for her/his appreciation of our work and for the comments that have helped/will help us further improve our manuscript.

      Major points

      1) The ROIs are selected to be 'outer bounds' of the glutamate spread from the synapses being studied. My concern is that these generously-sized ROIs include signal from many iGluSnFR molecules which are distal to the release sites and thus will be reached only slowly by low concentrations of glutamate or be contributing only noise and no changes in fluorescence. I suggest the temporal resolution could be improved by restricting the analysis of fluorescence changes to fewer pixels within the ROIs with the fastest rising/highest amplitude responses.

      Thanks for this helpful comment: The data in our data set should be well-suited to perform this analysis in addition to the presented analysis and so we added this new analysis to the Revision Plan.

      2) The observation that despite a 2 fold increase in eEPSCs when changing from 2 mM to 4 mM extracellular calcium there is no change in iGluSnFR peak is curious as pointed out by the authors but not really discussed.

      We don’t currently have an obvious explanation but consider saturation of the iGluSnFR peak response likely to contribute. In response to the comment of the reviewer, we have added the analysis of integrated iGluSnFR data, which we previously found to be more robust toward saturation than the peak, to the revision plan. We plan to add the relevant discussion along with the new analysis.

      Are the traces presented in Figure 5 examples from the same recording?

      Traces in fig. 5 are grand averages (wording modified for clarity). Unfortunately, it was not possible to routinely measure iGluSnFR responses from the same cell in 2mM Ca2+ and 4 mM Ca2+ as the time needed for the protocols was rather long which influenced cell stability and imaging conditions would deteriorate during the exchange of the bath solution. We think it is not quite possible to directly compare the absolute iGluSnFR responses at different extracellular Ca2+ levels.

      Assuming the examples are from one cell I first assumed the lack of change of peak was saturation of iGluSnFR but the larger fluorescence change with 100 Hz stimulation suggests otherwise. How many endbulbs are contacting one BC? Do you capture iGluSnFR responses from only one or several? In the previous point I suggested that restricting analysis to the soonest reacting pixels might improve temporal resolution but in the case of detecting the peaks with higher and normal calcium, these fastest reacting signals are probably also more likely to be saturated with glutamate.

      The eEPSCs elicited by this stimulation paradigm are monosynaptic (see methods / electrophysiology section), but there might be other iGluSnFR expressing endbulbs on the same bushy cell. Since we reduce the current just enough such that any further reduction leads to a complete failure to elicit an EPSC, we believe these additional endbulbs are not releasing glutamate. We cannot, however, exclude the possibility that iGluSnFR on neighboring structures captures any potential spillover glutamate.

      Minor points

      • mEPSCs are usually recorded in tetrodotoxin, I didn't find any mention in methods/results

      In this system, sEPSCs are not affected by TTX (Oleskevich and Walmsley, 2002) and thus usually recorded without adding TTX. We discuss this more explicitly and added a clarification to reflect this assumption (on line 111).

      • the large numbers of abbreviations make it difficult in places to follow the manuscript please at least define them again in the figure legends (e.g. BC, AVCN in figure 1, Q, FWHM in figure 2 etc.)

      We went over the manuscript again and removed some abbreviations or redefined them in captions.

      • it is a bit unusual to report results of a Wilcoxon test and at the same time mean and SEM instead of medians, if different tests were used then it is important to indicate this where the p values are given or make the sentence in the methods more definitive

      We agree that the initial presentation of the data was ambiguous. We changed the presentation to reflect this (see also answer to reviewer 1).

      • the liquid junction potential is reported as 12 mV, pretty sure it should be -12 mV (unless the QX-314 or some other of the more exotic ingredients in the extracellular solution is having a dramatic effect on the LJP).

      We follow the usual conventions of P. H. Barry, Methods in Enzymology, Vol. 171, p. 678, as described in E. Neher, Methods in Enzymology, Vol. 207, p. 123, in which the LJP is defined as the potential of the bath solution with respect to the pipette solution. We subtracted this positive potential (+12mV) in the end to obtain the membrane potential which therefore was more hyperpolarized than the nominal potential.

      I wonder if one of the faster/lower affinity iGluSnFR variants would be better suited for studying this synapse.

      We agree with the reviewer that future studies should explore the potential of faster/lower affinity iGluSnFR variants for studying the endbulb synapse. The reasons why we employed the original version include: i) sharing the same mice for studies of cochlear ribbon synapses (Özcete & Moser, 2021) and cochlear nucleus synapses (this MS) for the sake of reducing animal experiments, ii) good signal to background facilitating our first study establishing the recording in brainstem slice, iii) less signal to background and shorter signals with the new variants (as found in preliminary recordings from cochlear ribbon synapses) that would make the endbulb recordings more challenging. We have added the following statement to discussion. “Future imaging studies of glutamate release at calyceal synapses should explore the potential of new iGluSnFR variants with lower affinity that provide more rapid signal decay. This will ideally go along with imaging at higher framerate and might require stronger intensities of the excitation light to boost the fluorescence signal.” on line 430.

      The paper would benefit from a careful reading to shorten the text and to check for clarity. For instance page 15 line 436 I don't understand how 'the results can reduce the likelihood of biologically relevant changes'. I think the authors meant something different

      Thank you for spotting this. We reworded the sentence (now on line 399): "The data on hand suggests that this is not the case. Firstly, even if a larger sample size may uncover more subtle effects neurotransmission of evoked events, our measurements suggest a small effect size. Secondly, even as we did find changes in mEPSC, it is probable that the biological significance is limited"

      • page 5 'width' is misspelled

      Fixed.

      • page 18 'strychnine' is misspelled

      Fixed.

      • on many of the figures is text that it much too small

      We went through the manuscript and increased the text size in the figures, where appropriate.

      __Referees cross-commenting __

      I agree with all the comments of the other reviewers - both raise the point that there should be a 'control' AAV injected for comparison of the mEPSCs which I missed but is of course quite important. See https://pubmed.ncbi.nlm.nih.gov/24872574/ for a study of AAV serotype-dependent effects on presynaptic release.

      We now added a section on other possible factors influencing the results, citing the study above.

      Significance

      The main audience for this paper will be fairly specialized. Researchers interested in properties of presynaptic release and some specialists in synaptic transmission in the auditory system will be the main readers/citers of this work.

      The work is an important technical/methodological report. It highlights an important effect of expressing iGluSnFR and also demonstrates that the effect is overall not very problematic. Additional problems using iGluSnFR are also indicated.

      I am an electrophysiologist, studying synaptic transmission and plasticity with experience using a wide range of optogenetic tools

      Reviewer #3

      __Summary __

      In the present manuscript, the authors explore the information that can be obtained using optical measurement of glutamate release with iGluSnFR on synaptic dynamics in the endbulb of Held.

      They virally express iGluSnFR in presynaptic terminals, patch the postsynaptic cells and combine high-frame-rate optical recordings with electrophysiological measurements. Their first finding is that mEPCSs are prolonged when presynaptic cells express the glutamate indicator, which they interpret as buffering of extracellular glutamate by the indicator. Next, they repeated the experiment, this time with stimulating evoked EPSCs. In contrast to the previously observed effects, iGluSnFR did not affect the time course or the amplitude of the evoked EPSCs. The authors then asked whether iGluSnFR signals can be used to study synaptic dynamics, specifically, synaptic depression. In these experiments, the authors observed a change in the paired-pulse ratio with ISI of 10ms, but not longer intervals. They analyzed presynaptic release and did not find statistically significant differences.

      Can iGluSnFR signals be used for the analysis of synaptic release? When stimulated at a low frequency of 10Hz (allowing the fluorescence to return close to baseline levels in between pulses), iGluSnFR dynamics were somewhat comparable to postsynaptic signals. At higher frequencies, the slow time course of the indicator prevented the identification of individual responses and the resulting fluorescence had a very different shape. To resolve this problem, the authors used deconvolution analysis (fig 6). This analysis revealed a linear relationship between the optical readout and the patch-clamp data.

      I find the manuscript to be clearly written, the findings are well presented and discussed and are novel and of substantial interest to neuroscientists in the field. I do have a number of questions about experiments and analysis that may have an effect on the conclusions of this work.

      We thank the reviewer for her/his appreciation of our work and for the comments that have helped/will help us further improve our manuscript.

      1. In experiments comparing the effects of iGluSnFR expression on release dynamics (figure 1-4), the authors compare infected presynaptic cells to control (uninfected). The assumption is that synaptic buffering by iGluSnFR may affect glutamate diffusion in the synaptic cleft. However, it is possible that viral infection itself changes presynaptic properties. The authors should compare release from cells infected with GFP or a comparable indicator.

      We agree that this is an important control experiment to be done in the future and that causal attribution is not in the scope of this study. A slowed decay would be consistent with the idea that iGluSnFR affects glutamatergic transmission by buffering glutamate, but we cannot rule out subtle changes due to the postnatal surgery or AAV-mediated transgene expression. In response to the reviewer’s comment, we modified the text to reflect the possibility of surgery and / or other parts of the expression system being responsible for the changes. We now also discuss further control experiments (line 408). Finally, we believe that our comparison is still relevant for researchers using iGluSnFR in the system, as they will be asking if introducing a measurement system affects the underlying quantity.

      1. Analysis of pool parameters presented in table 4 indicates almost doubling of RRP size with iGLuSnFR with 2 mM Ca++. While not significant, this result may indicate a real effect that may have been missed due to low power (N=3 and 7 for these experiments). I do not believe the authors did a power analysis in this study. How was the number of experiments determined? I would suggest increasing the number of experiments to avoid type II errors.

      We thank the reviewer for this critical comment. Indeed, we would also have liked to have a greater statistical power for these experiments, but had to face the situation that the establishing the method required more animals than expected and the animal license did not offer further animals for the analysis. Moreover, we note that the obtained RRP size estimates were generally lower compared to previous estimates of our lab for the endbulb synapse (e.g. Butola et al., 2021: ~20 nA for 2 mM Ca2+ in Fig. 5). This can partially be attributed to the use of cyclothiazide in previous studies, which we avoided given reports of presynaptic effects of cyclothiazide. As the series resistances of the included recordings were below 8 MOhm (mean series resistances: 2mM Ca, injected: 5.58 MOhm; 2mM Ca, control: 5.93 MOhm; 4mM Ca, injected: 5.75 MOhm; 4mM Ca, control: 6.0 MOhm) and series resistance compensation was set to 80% we do not expect clamp-quality to contribute to the smaller estimates in the present data set.

      We have now added a statement noting the preliminary nature of these results and indicated that further experiments will be required to more certainly conclude on potential effects of iGluSnFR or the manipulation on endbulb transmission: “Our preliminary train stimulation analysis of vesicle pool dynamics in the presence and absence of AAV-mediated iGluSnFR expression in SGNs has not revealed significant differences between the two conditions. Further experiments, potentially involving faster versions of iGluSnFR and employing trains of different stimulation rates for model based analysis of vesicle pool dynamics (Neher and Taschenberger, 2021) will help to assess the value and impact of iGluSnFR in the analysis of transmission at calyceal synapses.” on line 381.

      1. The deconvolution analysis assumes an instantaneous rise time. Yet previous work (Armbruster et al., 2020) that took into account diffusion, suggested potentially slower rise time dynamics. More importantly, the deconvolved waveforms do not match the shapes of the EPSCs (figures 5 and supp 6-2).What is the aim of the deconvolution? It was not clear from the text, but I assume it shows iGluSnFR binding to glutamate - in which case the slow waveforms are indicative of extrasynaptic iGluSnFR activation.

      The deconvolution analysis was mainly used to recover the average responses to stimuli in the train without contamination by previous responses (see also Taschenberger et al. 2016, their figure 6).

      We did also try to use the average singular response instead of the exponential fit as a kernel for the (Wiener) deconvolution analysis, which more closely resembled the observed (fast) rise. Unfortunately, this led to markedly worse results, likely because of the noise levels in the measurements. We believe that it would be beneficial to model the rise of the signal more precisely if glutamate imaging data is acquired at higher framerates.

      The broad wave forms may be due to extrasynaptic binding of glutamate, but we also note that each frame corresponds to ~10ms and there is only ~10 data points between stimuli, so the responses are unlikely to be as sharp as eEPSCs.

      However, I suppose that the more interesting question is whether iGluSnFR could be deconvolved to reveal the underlying release events, similar to how calcium signals can be used to inform about single action potentials.

      We agree that it would be particularly interesting to use a "mini iGluSnFR" signal to deconvolve the resulting traces. Unfortunately, we failed to detect iGluSnFR signals reporting individual release events at this time, preventing this kind of analysis.

      1. I suggest referencing and discussing (Aggarwal et al., 2022; Srivastava et al., 2022) . These highly relevant papers analyzed iGluSnFR to probe synaptic release.

      References:

      Aggarwal, A., Liu, R., Chen, Y., Ralowicz, A. J., Bergerson, S. J., Tomaska, F., Hanson, T. L., Hasseman, J. P., Reep, D., Tsegaye, G., Yao, P., Ji, X., Kloos, M., Walpita, D., Patel, R., Mohr, M. A., Tilberg, P. W., Mohar, B., Team, T. G. P., . . . Podgorski, K. (2022). Glutamate indicators with improved activation kinetics and localization for imaging synaptic transmission. bioRxiv, 2022.2002.2013.480251. https://doi.org/10.1101/2022.02.13.480251

      Armbruster, M., Dulla, C. G., & Diamond, J. S. (2020). Effects of fluorescent glutamate indicators on neurotransmitter diffusion and uptake. Elife, 9. https://doi.org/10.7554/eLife.54441

      Srivastava, P., de Rosenroll, G., Matsumoto, A., Michaels, T., Turple, Z., Jain, V., Sethuramanujam, S., Murphy-Baum, B. L., Yonehara, K., & Awatramani, G. B. (2022). Spatiotemporal properties of glutamate input support direction selectivity in the dendrites of retinal starburst amacrine cells. Elife, 11. https://doi.org/10.7554/eLife.81533

      We thank the reviewer for the suggestions. Some of these studies were not available when we first drafted the manuscript. We now added a section discussing these studies starting on line 466:

      Optimizing the imaging technique may reduce noise level, while the development of improved GEGIs could improve the signal to a level, at which spontaneous release events can be identified reliably in the cochlear nucleus. In retinal slices, where quantal events have been reliably observed with two-photon imaging, temporal deconvolution was successfully employed to estimate release rates from iGluSnFR signal (Srivastava et al., 2022; James et al., 2019). Subcellular targeting of iGluSnFR variants to the postsynaptic membrane may reduce measurement errors introduced by contributing extrasynaptic iGluSnFR signal and improve spatial resolution of glutamate imaging data(Hao et al., 2023; Aggarwal et al., 2022).

      Referees cross-commenting

      I also agree with the comments made by other reviewers!

      Significance

      Overall, this study addresses an important problem in basic neuroscience research. With the developing reliance on optical measurement of neuronal function, it is important to understand the impact of the indicators on physiological function and the limitations of the technique. The study is well-executed and will be informative to neuroscientists performing optical glutamate recording to study single-cell and circuit function in and beyond the auditory system.

    1. Thank you so much for your paper! Metabolism of amino acids is extremely important to study but also very complex. It's also a really vast field so I really appreciate it when scientists decide to take a deep dive and uncover the existing metabolic pathways. Kudos for that! As excited as I am about L-amino acids, I'm even more excited to understand the metabolism of D-amino acids. I was wondering if you have considered applying your experimental approach to understand the metabolic pathway for D-arginine? May be also other D-amino acids? I think we know little about the metabolism of D-amino acids in B. subtilis and about the regulation of the metabolic enzymes. Thank you for your time!

    1. Author Response

      Reviewer #1 (Public Review):

      It is well established that valuation and value-based decision-making is context-dependent. This manuscript presents the results of six behavioral experiments specifically designed to disentangle two prominent functional forms of value normalization during reward learning: divisive normalization and range normalization. The behavioral and modeling results are clear and convincing, showing that key features of choice behavior in the current setting are incompatible with divisive normalization but are well predicted by a non-linear transformation of range-normalized values.

      Overall, this is an excellent study with important implications for reinforcement learning and decision-making research. The manuscript could be strengthened by examining individual variability in value normalization, as outlined below.

      We thank the Reviewer for the positive appreciation of our work and for the very relevant suggestions. Please find our point-by-point answer below.

      There is a lot of individual variation in the choice data that may potentially be explained by individual differences in normalization strategies. It would be important to examine whether there are any subgroups of subjects whose behavior is better explained by a divisive vs. range normalization process. Alternatively, it may be possible to compute an index that captures how much a given subject displays behavior compatible with divisive vs. range normalization. Seeing the distribution of such an index could provide insights into individual differences in normalization strategies.

      Thank you for pointing this out, it is indeed true that there is some variability. To address this, and in line with the Reviewer’s suggestion, we extracted model attributions per participant on the individual out-of-sample log-likelihood, using the VBA_toolbox in Matlab (Daunizeau et al., 2014). In experiment 1 (presented in the main text), we found that the RANGE model accounted for 79% of the participants, while the DIVISIVE model accounted for 12%. The relative difference was even higher when including the RANGEω model in the model space: the RANGE and RANGEω models account for a total of 85% of the participants, while the DIVISIVE model accounted only for 5%.

      In experiment 2 (presented in the supplementary materials), the results were comparable (see Figure 3-figure supplement 3: 73% vs 10%, 83% vs 2%).

      To provide further insights into the behavioral signatures behind inter-individual differences, we plotted the transfer choice rates for each group of participants (best explained by the RANGE, DIVISIVE, or UNBIASED models), and the results are similar to our model predictions from Figure 1C:

      Author Response Image 1. Behavioral data in the transfer phase, split over participants best explained by the RANGE (left), DIVISIVE (middle) or UNBIASED (right) model in experiment 1 (A) and experiment 2 (B) (versions a, b and c were pooled together).

      To keep things concise, we did not include this last figure in the revised manuscript, but it will be available for the interested readers in the Rebuttal letter.

      One possibility currently not considered by the authors is that both forms of value normalization are at work at the same time. It would be interesting to see the results from a hybrid model. R1.2 Thank you for the suggestion, we fitted and simulated a hybrid model as a weighted sum between both forms of normalization:

      First, the HYBRID model quantitatively wins over the DIVISIVE model (oosLLHYB vs oosLLDIV : t(149)=10.19, p<.0001, d=0.41) but not over the RANGE model, which produced a marginally higher log-likelihood (oosLLHYB vs oosLLRAN : t(149)=-1.82, p=.07, d=-0.008). Second, model simulations also suggest that the model would predict a very similar (if not worse) behavior compared to the RANGE model (see figure below). This is supported by the distribution of the weight parameter over our participants: it appears that, consistently with the model attributions presented above, most participants are best explained by a range-normalization rule (weight > 0.5, 87% of the participants, see figure below). Together, these results favor the RANGE model over the DIVISIVE model in our task.

      Out of curiosity, we also implemented a hybrid model as a weighted sum between absolute (UNBIASED model) and relative (RANGE model) valuations:

      Model fitting, simulations and comparisons slightly favored this hybrid model over the UNBIASED model (oosLLHYB vs oosLLUNB: t(149)=2.63, p=.0094, d=0.15), but also drastically favored the range normalization account (oosLLHYB vs oosLLRAN : t(149)=-3.80, p=.00021, d=-0.40, see Author Response Image 2).

      Author Response Image 2. Model simulations in the transfer phase for the RANGE model (left) and the HYBRID model (middle) defined as a weighted sum between divisive and range forms of normalization (top) and between unbiased (no normalization) and range normalization (bottom). The HYBRID model features an additional weight parameter, whose distribution favors the range normalization rule (right).

      To keep things concise, we did not include this last figure in the revised manuscript, but it will be available for the interested readers in the Rebuttal letter.

      Reviewer #2 (Public Review):

      This paper studies how relative values are encoded in a learning task, and how they are subsequently used to make a decision. This is a topic that integrates multiple disciplines (psych, neuro, economics) and has generated significant interest. The experimental setting is based on previous work from this research team that has advanced the field's understanding of value coding in learning tasks. These experiments are well-designed to distinguish some predictions of different accounts for value encoding. However there is an additional treatment that would provide an additional (strong) test of these theories: RN would make an equivalent set of predictions if the range were equivalently adjusted downward instead (for example by adding a "68" option to "50" and "86", and then comparing to WB and WT). The predictions of DN would differ however because adding a low-value alternative to the normalization would not change it much. Would the behaviour of subjects be symmetric for equivalent ranges, as RN predicts? If so this would be a compelling result, because symmetry is a very strong theoretical assumption in this setting.

      We thank the Reviewer for the overall positive appraisal concerning our work, but also for the stimulating and constructive remarks that we have addressed below. At this stage, we just wanted to mention that we also agree with the Reviewer concerning the fact that a design where we add "68" option to "50" and "86" would represent also an important test of our hypotheses. This is why we had, in fact, run this experiment. Unfortunately, their results were somehow buried in the Supplementary Materials of our original submission and not correctly highlighted in the main text. We modified the manuscript in order to make them more visible:

      Behavioral results in three experiments (N=50 each) featuring a slightly different design, where we added a mid value option (NT68) between NT50 and NT87 converge to the same broad conclusion: the behavioral pattern in the transfer phase is largely incompatible with that predicted by outcome divisive normalization during the learning phase (Figure 2-figure supplement 2).

      Reviewer #3 (Public Review):

      Bavard & Palminteri extend their research program by devising a task that enables them to disassociate two types of normalisation: range normalisation (by which outcomes are normalised by the min and max of the options) and divisive normalisation (in which outcomes are normalised by the average of the options in ones context). By providing 4 different training contexts in which the range of outcomes and number of options vary, they successfully show using 'ex ante' simulations that different learning approaches during training (unbiased, divisive, range) should lead to different patterns of choice in a subsequent probe phase during which all options from the training are paired with one another generating novel choice pairings. These patterns are somewhat subtle but are elegantly unpacked. They then fit participants' training choices to different learning models and test how well these models predict probe phase choices. They find evidence - both in terms of quantitive (i.e. comparing out-of-sample log-likelihood scores) and qualitative (comparing the pattern of choices observed to the pattern that would be observed under each mode) fit - for the range model. This fit is further improved by adding a power parameter which suggests that alongside being relativised via range normalisation, outcomes were also transformed non-linearly.

      I thought this approach to address their research question was really successful and the methods and results were strong, credible, and robust (owing to the number of experiments conducted, the design used and combination of approaches used). I do not think the paper has any major weaknesses. The paper is very clear and well-written which aids interpretability.

      This is an important topic for understanding, predicting, and improving behaviour in a range of domains potentially. The findings will be of interest to researchers in interdisciplinary fields such as neuroeconomics and behavioural economics as well as reinforcement learning and cognitive psychology.

      We thank Prof. Garrett for his positive evaluation and supportive attitude.

    1. Author Response

      Reviewer #2 (Public Review):

      Granell et al. investigated genetic factors underlying wheezing from birth to young adulthood using a robust data-driven approach with the aim of understanding the genetic architecture of different wheezing phenotypes. The association of 8.1 million single nucleotide polymorphisms (SNPs) with wheeze phenotypes derived from birth to 18 years of age was evaluated in 9,568 subjects from five independent cohorts from the United Kingdom. This meta-genome-wide association study (GWAS) revealed the suggestive association of 134 independent SNPs with at least one wheezing subtype. Among these, 85 genetic variants were found to be potentially causative. Indeed, some of these were located nearby well-known asthma loci (e.g., the 17q21 chromosome band), although ANXA1 was revealed for the first time to play an important role in early-onset persistent wheezing. This was strongly supported by functional evidence. One of the top ANXA1 SNPs associated with wheezing was found to be potentially involved in the regulation of the transcription of this gene due to its location at the promoter region. This polymorphism (rs75260654) had been previously evidenced to regulate the ANXA1 expression in immune cells, as well as in pulmonary cells through its association as an eQTL. Protein-protein network analyses revealed the interaction of ANXA1 with proteins involved in asthma pathophysiology and regulation of the inflammatory response. Additionally, the authors conducted a murine model, finding increased anxa1 levels after a challenge with house dust mite allergens. Mice deficient in anxa1 showed decreased lung function, increased eosinophilia, and Th2 cell levels after allergen stimulation. These results suggest the dysregulation of the immune response in the lungs, eosinophilia, and Th2-driven exacerbations in response to allergens as a result of decreased levels of anxa1. This coincides with evidence of lower plasmatic ANXA1 levels in patients with uncontrolled asthma, suggesting this locus is a very promising candidate as a target of novel therapeutic strategies.

      Limitations of this piece of work that need to be acknowledged:

      (1) the manual and visual inspection of Locus Zoom plots for the refinement of association signals and identification of functional elements does not seem to be objective enough;

      This is an important observation and we have now added the following text in the Discussion which can be found on lines 400-2 Revised Main Manuscript:

      “Finally, the manual and visual inspection of Locus Zoom plots for the refinement of association signals and identification of functional elements was an objective approach which might have undermined the findings.“

      (2) the sample size is limited, although the statistical power was improved by the assessment of very accurate disease sub-phenotype;

      This point was already mentioned as a limitation and it can now be found in lines 349-365 Revised Main Manuscript:

      “By GWAS standards, our study is comparatively small and may be considered to be underpowered. The sample size may be an issue when using an aggregated definition (such as “doctor-diagnosed asthma”) but is less likely to be an issue when primary outcome is determined by deep phenotyping. This is indirectly confirmed in our analyses. Our primary outcome was derived through careful phenotyping over a period of more than two decades in five independent birth cohorts, and although comparatively smaller than some asthma GWASs, our study proved to be powered enough to detect previously identified key associations (e.g. chr17q21 locus). Precise phenotyping has the potential to identify new risk loci. For example, a comparatively small GWAS (1,173 cases and 2,522 controls) which used a specific subtype of early-onset childhood asthma with recurrent severe exacerbations as an outcome, identified a functional variant in a novel susceptibility gene CDHR3 (SNP rs6967330) as an associate of this disease subtype, but not of doctor-diagnosed asthma(51). This important discovery was made with a considerably smaller sample size but using a more precise asthma subtype. In contrast, the largest asthma GWAS to date had a ~40-fold higher sample size(7), but reported no significant association between CDHR3 and aggregated asthma diagnosis. Therefore, with careful phenotyping, smaller sample sizes may be adequately powered to identify larger effect sizes than those in large GWASs with broader outcome definitions(52).”

      (3) association signals with moderate significance levels but with strong functional evidence were found;

      We do not think of this as a limitation but as a strength. We were able to support our genetic results with evidence from experimental mouse models.

      (4) no direct replication of the findings in independent populations including diverse ancestry groups was described.

      This point was already mentioned as a limitation and it can now be found in lines 375-391 and 392-399 Revised Main Manuscript.

      “We are cognisant that there may be a perception of the lack of replication of our GWAS findings. We would argue that direct replication is almost certainly not possible in other cohorts, as phenotypes for replication studies should be homogenous(56). However, there is a considerable heterogeneity in LCA-derived wheeze phenotypes between studies, and although phenotypes in different studies are usually designated with the same names, they differ between studies in temporal trajectories, distributions within a population, and associated risk factors(57). This heterogeneity is in part consequent on the number and the non-uniformity of the timepoints used, and is likely one of the factors responsible for the lack of consistent associations of discovered phenotypes with risk factors reported in previous studies(58). This will also adversely impact the ability to identify phenotype-specific genetic associates. For example, we have previously shown that less distinct wheeze phenotypes in PIAMA were identified compared to those derived in ALSPAC(59). Thus, phenotypes that are homogeneous to those in our study almost certainly cannot readily be derived in available populations. This is exemplified in our attempted replication of ANXA1 findings in PIAMA cohort (see OLS, Table E12). In this analysis, the number of individuals assigned to persistent wheezing in PIAMA was small (40), associates of this phenotype differed to those in STELAR cohorts, and the SNPs’ imputation scores were low (<0.60), which meant the conditions for replication were not met.”

      “Our study population is of European descent, and we cannot generalize the results to different ethnicities or environments. It is important to highlight the under-representation of ethnically diverse populations in most GWASs(9). To mitigate against this, large consortia have been formed, which combine the results of multiple ethnically diverse GWASs to increase the overall power to identify asthma-susceptibility loci. Examples include the GABRIEL(6), EVE(60) and TAGC(7) consortia, and the value of diverse, multi-ethnic participants in large-scale genomic studies has recently been shown(61). However, such consortia do not have the depth of longitudinal data to allow the type of analyses which we carried out to derive a multivariable primary outcome.”

      Nonetheless, the robustness and consistency of the findings supported by different analytical and experimental layers is the major strength of this study.

      The authors successfully achieved the aims of the study, strongly supported by the results presented. This study not only provides an exciting novel locus for wheezing with potential implications in the development of alternative therapeutic strategies but also opens the path for better-powered research of asthma genetics, focused on accurate disease phenotypes derived by innovative data-driven approaches that might speed up the process to disentangle the missing heritability of asthma, making use of still useful GWAS approaches.

    1. Author Response

      Reviewer #2 (Public Review):

      The manuscript by Mohebi et al. examines a critical open question regarding the interaction of cholinergic interneurons of the striatum and transmitter release from dopaminergic axons in behaving animals. Activation of cholinergic interneurons in the striatum can evoke dopamine release in brain slices and in vivo as measured with voltammetry. However, it remains an open question in what context and to what extent this acetylcholine-mediated dopamine occurs in behaving animals. Here, the authors argue that CIN activity triggers dopamine release in the nucleus accumbens which encodes the motivation to obtain a reward through increasing "ramps" of dopamine release. Their data suggest that the ramps are not reflected in the firing of dopaminergic neurons. Rather, they provide compelling evidence that the ramps of dopamine release correlate with ramps in cholinergic interneuron activity as measured with GCaMP6. What's more, the authors show that ACh-mediated dopamine release has no paired-pulse depression, a striking result that differs from all prior ex vivo brain slice data. The manuscript is extremely well written and the data are of very high quality. Overall, this study represents an important step forward in our understanding of how ACh-mediated dopamine release regulates behavior, and more broadly how axons can generate behaviors independently from somatic activity.

      Major comments

      1) The complete absence of any short-term plasticity in CIN-mediated dopamine release is a striking result that is important for the field. The authors should strengthen this result with additional quantitative analysis demonstrating the lack of STP. They have analyzed paired-pulse ratios, but they should analyze this for stimuli at the higher frequencies (4 Hz, etc) that are more physiologically relevant. For example, Fig 1e shows a CIN-evoked DA release at many optically-stimulated frequencies. The authors should quantify short-term plasticity by generating fits of the single stimulus signal and comparing the mathematical sum predicted from 4 stim DA signals at different frequencies to the recorded data. A similar analysis has been done with Ca signals (Koester and Sakmann, 2000).

      Thank you for this very helpful suggestion. We have performed this analysis as recommended, and now confirm the lack of STP even at the higher frequencies (see new Supplementary Figure 1).

      2) The authors show that optical activation of CINs results in DA release as measured by dLight. To clearly establish that these signals are generated by DA release driven by nicotinic receptors (and not a partial effect of some unknown artifact), it would be useful to show that the optical CIN-evoked dLight signals shown in Fig. 1 are inhibited by nicotinic receptor antagonists such as DHbE. This control experiment would significantly strengthen the result shown here.

      We agree that combining drug manipulations with photometry would be useful, but as noted above this is not a methodology in our current technical repertoire.

      3) Similarly, the authors show clear correlations between CIN activity and DA release during behavior. The authors should consider determining whether CINs play a causal role in triggering DA release during behavior. For example, does infusion of DHbE in the NAc prevent the light-mediated DA release during behavior? As an alternative hypothesis, some groups have been suggesting that CIN activity has almost no direct influence over DA. Therefore, testing whether a causal relationship exists between CINs and DA release would be an important experiment in addressing these two opposing viewpoints.

      As noted above we are not currently able to combine drug manipulations with photometry in behaving animals.

      4) The ramps that are described in this manuscript are an order of magnitude faster (increasing over 100s of milliseconds) than ramps described in other studies that occur over seconds. In fact, the two signals may be completely different functionally. Discussion of this topic would be helpful.

      Dopamine ramps have indeed been reported over multiple different time scales, and as discussed in Berke 2018, this seems to reflect the duration of the approach behavior. We think further discussion of this topic is better saved for another paper, especially as we are now actively studying ramping over longer time scales (Krausz et al. 2023).

      Reviewer #3 (Public Review):

      This report by Mohebi et al. provides new answers to old questions by showing that the activity of striatal cholinergic interneurons (CINs) escalates progressively during specific reward-related behaviors and that this correlates with previously observed ramps in dopamine (DA) release in the nucleus accumbens core. The report is strong and provides evidence for the authors' hypothesis that DA ramps are independent of DA neuron activity, but are instead the result of CIN activity and corresponding acetylcholine (ACh) release. The authors further demonstrate that the fidelity of CIN activation and consequent driving of DA release is even more robust in vivo than observed ex vivo slice preparations, which is fundamental for understanding the role of ACh-DA interactions in behavior. The findings complement the authors' previous evidence ventral tegmental area (VTA) DA neuron firing patterns do not show a ramping pattern; the previously reported VTA data are appropriately included here (in Fig. 3) to illustrate the absence of VTA firing during the time-locked increases in CIN activity and DA release. The present studies stop short of showing a direct link between CIN activity and DA release, however, which would require examining DA release during behavior in the presence of an antagonist of nicotinic ACh receptors. The authors also extend the understanding of the regulation of DA release by acetylcholine (ACh) by showing that optical activation of CINs in vivo promotes DA release responses that do not attenuate with repetitive stimulation. This contrasts with previous results in ex vivo striatal slices in which ACh-evoked DA release has been found to decline progressively from rundown and/or receptor desensitization. The authors propose that in vivo, AChE may be more effective in curtailing local ACh levels than in slices because of the slightly lower temperature typically used for slice studies, as well as the use of superfusion that might facilitate some AChE washout (AChE inhibitors are still effective in slices, of course). Overall, the report not only provides evidence for the cellular substrate for DA ramps but also shows the robustness of ACh-driven DA release in vivo. A few points to strengthen the report are listed below.

      1) The authors give a few details about how CINs were activated at the beginning of the results, but say only that DA dynamics were monitored using fiber photometry. Given that the methods are at the end, a brief summary should be given here to indicate whether this means direct monitoring of DA or indirect via GCaMP, for example. It would be helpful to note the sensor used in the abstract, as well. In this light, as it were, RdLight1 should be described upon the first mention.

      We have now clarified in both abstract and text that we are using the direct DA sensor RdLight1.

      2) The authors show that infusion of DHbE in the NAc likelihood of decisions to approach the center port, as did antagonism of DA receptors. This supports the authors' argument that ramping of CIN activity and consequent ACh release underlies observed ramps in DA release. However, to show a causal interaction requires testing whether the observed DA ramps are absent after DHbE infusion in the NAc, under the same conditions that attenuated behavior.

      As noted above we are not currently able to combine drug manipulations with photometry in behaving animals.

      3) In Fig. 3, the y-axis title for the upper panels should specify VTA, not simply "rate". This is stated in the legend, but should also be specified in the figure panel.

      We have updated the y-axis titles in this figure.

      4) A recent preprint in BioRxiv by AC Krok, NX Tritsch et al. shows a related correlation between ACh and DA release in vivo in a reward task, as well as differences in other conditions. This report shows also that cortical input to CINs indeed plays a role, as suggested in the concluding sections of the present report. Consideration of the data in the preprint in the context of the present results could be valuable for the field.

      We have also noted those pre-prints with interest, even though they investigated different brain regions using different approaches. There are established differences between CIN-DA interactions in dorsal vs. ventral striatum that we suspect are relevant here. But given the rapid pace of developments in this subfield, we prefer not to speculate too much at this point and instead review the overall body of work once it is published.

    1. Author Response

      Reviewer #1 (Public Review):

      This is a simulation study comparing the performance of two major approaches for dealing with “population structure” when carrying out Genome-wide Association Studies - Principal Component Analysis and Linear Mixed-effects Models - a subject of considerable practical importance. The author correctly notes that previous comparisons have been quite limited. In particular, any study not concluding that LMM was superior has relied on very simple models of structure.

      The paper is clearly written and beautifully reviews the theoretical underpinnings (albeit in a manner that will be difficult to penetrate without deep knowledge of several fields). The simulations are well-designed and far better than previous studies. From a theoretical point of view, the work is somewhat limited by being strongly anchored in a very classical quantitative genetics framework that is focused on allele frequencies and inbreeding coefficients, and totally ignores coalescent theory, but this is a minor quibble. The simulations are limited by utilizing ridiculously small sample sizes by the standards of modern human GWAS. And of course, they do not include all the complexities of real data.

      The quantitative genetics framework we used was ideal for motivating and interpreting LMMs in particular, since they model relatedness with a kinship matrix which consists of IBD probabilities, all of which arose from quantitative genetics.

      We also added the following text to the discussion: “However, our conclusions are not expected to change with larger sample sizes, as cryptic family relatedness will continue to be abundant in such data, if not increase in abundance, and thus give LMMs an advantage over PCA (Henn et al., 2012; Shchur et al., 2018; Loh et al., 2018).”

      The main conclusion of the study is that LMM really are generally superior - as expected on theoretical grounds. However, the authors do address whether switching to LMM really is practicable given the sample size and lack of data sharing that characterize human genetics. Nor is it clear whether the difference in performance matters in real life given that the entire framework used is an idealized one - the fact that real human data suffers from environmental confounders that are correlated with “ancestry” is not addressed, to take the most obvious example. That said, it is surely important to note that the approach routinely used by the majority of users (PCA with 10 PCs) is most used for historical reasons and has little theoretical or empirical justification.

      We added simulations with environment effects correlated with ancestry, which we hope will make our study even more relevant as it does make our evaluations even more realistic than before. In the presence of environment effects, LMM without PCs remains among the best approaches, although occasionally LMM with PCs or PCA will perform slightly better. However, modeling environment directly (with the true variables) improves performance much more than by using PCs to model environment indirectly, so we believe that is not a strong reason for continuing to use PCs (in LMMs or otherwise) unless there is no choice.

      We also added the following text to the discussion: “However, recent approaches not tested in this work have made LMMs more scalable and applicable to biobank-scale data (Loh et al., 2015; Zhou et al., 2018; Mbatchou et al., 2021), so one clear next step is carefully evaluating these approaches in simulations with larger sample sizes.” As stated earlier, we believe that the difference in performance between LMM and PCA will remain in larger sample sizes because cryptic relatedness is more prevalent in that setting.

      We excluded the “lack of data sharing” point from our discussion because it does not align well with the goals of our manuscript. The current solution to the lack of data sharing is meta-analysis, but its use does not give PCA or LMM an inherent advantage, since it can be applied to the summary statistics of either (or even a combination of models, in theory). There is interesting recent work on “federated” PCA and LMM association (both versions exist), that allow a single model to be fit jointly to separate datasets (residing in different buildings across the world) as if they were combined into a single dataset. Thus, these issues do not explain or motivate why PCA or LMM should be used.

      Reviewer #2 (Public Review):

      Yao and Ochoa present a very nice paper examining the age-old question of whether LMM or PCA is a better way to adjust for structure (population, family, admixture). The authors provide a very nice and detailed overview of the previous research addressing this question, summarizing it in a table. They find that LMMs are generally better at accounting for population structure. However, I feel there are a couple of important factors that are missing. One is the consideration of environmental structure. Another is that the relationship between PCA and LMM is usually a bit more complicated in practice than depicted here, where the devil really lies in the details. Also, I think there are a couple of key reasons why LMMs haven’t been adapted as quickly as one might have expected, including case-control imbalance and cohort meta-analyses, which I feel the authors could point out. In fact, I believe LMMs have become sort of popular in recent years (e.g. Japan Biobank GWAS results).

      We added environment simulations, which we agree was an important shortcoming of the previous version of our work.

      We now discuss how the PCA and LMM connection can be more complicated in practice, but as the main difference is in how LD is handled, once that is correctly adjusted, PCs and random effects are still mostly modeling the same relatedness signals. Ultimately, our main conclusion is unchanged, namely that only LMMs can model family relatedness, which is their key advantage.

      We briefly commented on case-control imbalance in our discussion (now made more clear), but since this involves binary traits, which we did not explicitly test in this work, it is out of scope.

      Cohort meta-analysis does not influence whether to use PCA or LMM, since it can be performed with summary statistics from either model (and in theory even a combination of different models per cohort). The broad use of meta-analysis does not in itself prevent users from using PCA or LMM within individual cohorts. The use of meta-analysis is very interesting in its own right, but it is outside the scope of this work.

      Reviewer #3 (Public Review):

      This paper examines the relative performance of linear mixed models (LMMs), principal components (PCA), and their combination (PCA-LMM) for genetic association studies in human populations. The authors claim that previous papers examining this question are inadequate and that: (i) there remains confusion on which method is best and in which context, (ii) that the metrics used in previous evaluations were insufficient, and (iii) that the simulation settings used in previous papers were not comprehensive. To fix these problems the authors perform an extensive set of simulations within several frameworks and suggest two new metrics for evaluating performance.

      Strengths:

      The simulation framework used in this paper and the extensive number of simulations provide an opportunity to examine the relative properties of the three approaches (LMM, PCA, PCA-LMM) in a variety of contexts.

      The parameters of the simulation framework are based on highly diverged populations, which is an increasingly common analysis choice that has not been examined in detail via simulation previously.

      The evaluation metrics used in this paper are AUC and a test of the uniformity of the p-value distribution under the null. This is an improvement over some previous analyses which did not examine power and relied on less sensitive tests of type I error.

      Weaknesses:

      This paper has a limited set of population frameworks just like all papers before it. The breakdown of which method is best (LMM, PCA, PCA-LMM) will be a function of the simulation framework chosen.

      Ameliorating this issue, we added additional simulations with low heritability and with environment effects. We are pleased to report that all of our conclusions hold at low heritability (h2 = 0.3), and for the most part under environment effects (which occasionally give LMM with PCs and PCA a small advantage, but often LMM with no PCs remains best, and we show PCs are no replacement for directly modeling these environment effects).

      The frameworks chosen for this paper are certainly not comprehensive in contemporary human genetic studies. In fact, the authors make a number of unusual choices. For example, the populations in the simulated study have extremely large Fsts. While this is also a strength, the lack of more standard study designs is a weakness. More importantly, there is no simulation of family effects, which is the basis of many of the PCA-LMM papers reported in Table 1.

      We now better motivate in the introduction our focus on association studies of multiethnic and admixed individuals, which are nowadays very common and which have greater FST values than earlier studies. In reference to higher simulated FSTs, we also now cite our recent work, which has found that many previous FST estimates are downwardly biased (Ochoa and Storey, 2021, 2019). We simulated data that was fit to each of our three real datasets using our unbiased methods, so those values that (understandably) appear high are actually more correct (for multiethnic populations such as those in 1000 Genomes, HGDP, etc) than previous estimates in the literature. In our previous work we also determined that only previous pairwise FST estimators are unbiased (under some conditions), and using a previous pairwise FST estimator (from Bhatia et al., 2013) we obtained equally high values between the most diverged human populations (values from a revised version of Ochoa and Storey, 2019 that isn’t on bioRxiv yet): In HGDP, the largest pairwise FST is 0.479, between Pima and PapuanSepik; In Human Origins, the largest estimate is 0.396, between Cabecar and Baining_Malasait; Lastly, in 1000 Genomes, the largest estimate is 0.135, between YRI and JPT. (1000 Genomes was generally less structured than HGDP and Human Origins, because the latter include more diverse populations.) Several previous estimates from the literature, all between one hunter-gatherer Sub-Saharan African subpopulation and one non-African subpopulation resulted in values of about 0.25 (Bowcock et al., 1991, Henn et al., 2011, Bergstrom et al., 2020). FST estimates are also greater from whole-genome sequencing versus array data (revised version of Ochoa and Storey, 2019).

      Family (household) effects is a case where PCA is not expected to outperform LMM, though standard LMMs do not model this effect explicitly either and may not do much better. As this is a feature of family studies that ought to be absent in population studies (as usually only siblings are in the same household, and not more distant relatives), it is also not entirely relevant to the majority of our simulations. In these ways, including such a feature in our simulations does not align with the goals of this present work, but we agree this is an important framework that deserves more attention in future evaluations.

      The discussion (and simulations) of LMM vs PCA, particularly LMMs with PCs as fixed effects misses the critical distinction of whether PCs are in-sample (in which case including PCs as fixed effects effectively serves as a preconditioner for the kinship matrix, speeding up iterative methods such as BOLT), or projections of individuals onto out-of-sample principal axes. There is also no discussion of LOO methods to address “proximal contamination”, also quite relevant in evaluating power as a function of the number of PCs.

      We added the following to our discussion concerning out-of-sample PC projections: “We do not consider the case where samples are projected onto PCs estimated from an external sample (Prive et al., 2020), which is uncommon in association studies, and whose primary effect is shrinkage, so if all samples are projected then they are all equally affected and larger regression coefficients compensate for the shrinkage, although this will no longer be the case if only a portion of the sample is projected onto the PCs of the rest of the sample.”

      We also added the following to the discussion concerning the LOCO approach: “Similarly, the leave-onechromosome-out (LOCO) approach for estimating kinship matrices for LMMs prevents the test locus and loci in LD with it from being modeled by the random effect as well, which is called”proximal contamination” (Lippert et al., 2011, Yang et al., 2014). While LOCO kinship estimates vary for each chromosome, they continue to model family relatedness, thus maintaining their key advantage over PCA.”

      The same new discussion paragraph closes with the following thoughts concerning LOCO and related approaches: “LD effects must be adjusted for, if present, so in unfiltered data we advise the previous methods be applied. However, in this work, simulated genotypes do not have LD, and the real datasets were filtered to remove LD, so here there is no proximal contamination and LD confounding is minimized if present at all, so these evaluations may be considered the ideal situation where LD effects have been adjusted successfully, and in this setting LMM outperforms PCA. Overall, these alternative PCs or kinship matrices differ from their basic counterparts by either the extent to which LD influences the estimates (which may be a confounder in a small portion of the genome, by definition) or by sampling noise, neither of which are expected to change our key conclusion.”

      Lastly, we added the following to a different discussion paragraph: “A different benefit for including PCs were recently reported for BOLT-LMM, which does not result in greater power but rather in reduced runtime, a property that may be specific to its use of scalable algorithms such as conjugate gradient and variational Bayes (Loh et al., 2018).”

      There is no discussion/simulation of spatial/environmental effects or rare vs common PCs as raised in Zaidi et al 2020. There are some open questions here regarding relative performance the authors could have looked at. Same for LMMs with multiple GRMs corresponding to maf/ld bins and thresholded GRMs. For example, it would be helpful to know if multiple-GRM LMMs mitigate some of the problems raised in the Zaidi paper.

      We added simulations with environment effects, which are based on a two-level hierarchy of population labels so they are spatial to the extent that these labels capture spatial relationships between populations. However, our small sample size data are not well suited to study rare variants and their structure, so its out of scope. (The sample size limitation is also covered in a new discussion paragraph.) We hope to tackle this very interesting question in future work.

      We added the following paragraph to our discussion: “Another limitation of this work is ignoring rare variants, a necessity given our smaller sample sizes, where rare variant association is miscalibrated and underpowered. Using simulations mimicking the UK Biobank, recent work has found that rare variants can have a more pronounced structure than common variants, and that modeling this rare variant structure (with either PCA and LMM) may better model environment confounding, improve inflation in association studies, and ameliorate stratification in polygenic risk scores (Zaidi and Mathieson, 2020). Better modeling rare variants and their structure is a key next step in association studies.”

  3. Apr 2023
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01586

      Corresponding author(s): Hammond, Gerald

      1. General Statements

      Our manuscript details a novel homeostatic feedback loop for the master plasma membrane regulatory molecule, PI(4,5)P2. In this loop, the PIP4K family of PI(4,5)P2-synthesizing enzymes act in a novel, non-enzymatic capacity: they sense PI(4,5)P2 levels and directly inhibit the lipid’s synthesis by inhibiting the major enzyme involved in the terminal step of synthesis, PIP5K. The three reviewers seem largely convinced of our data, and provided detailed, insightful and plausible suggestions for revision, which we have now comprehensively provided. This includes substantial new experimental work, including the generation of genomically tagged cell lines to localize all endogenous PIP4K isoforms.

      However, all three reviewers questioned the paper’s novelty and significance based on recent studies in the literature demonstrating PIP5K inhibition by PIP4Ks [refs 25 & 53 in the manuscript]. We feel that this is an inaccurate and somewhat unfair assessment of our findings, since it does not consider our central (and completely unprecedented) finding that PIP4Ks directly sense PI(4,5)P2 levels through low-affinity binding. As well as being a novel finding, this places the previously observed inhibition of PIP5K by PIP4Ks into a completely new paradigm consisting of a complete, enclosed homeostatic feedback loop. This was not demonstrated previously in the literature.

      Of course, the reviewers’ convergent opinions almost certainly reflect a deficit in our articulation of the novel findings in the original manuscript. We have therefore revised the current version to more clearly emphasize our novel findings.

      2. Point-by-point description of the revisions

      Reviewer #1

      __Summary: __In this manuscript, authors address how PIP4K regulates tonic plasma membrane (PM) PI(4,5)P2 levels which are generated by major PI(4,5)P2 synthesis enzyme, PIP5K by using PIP4K and PIP5K overexpressing cells or acutely manipulating PM PI(4,5)P2 levels by the chemically induced dimerization (CID) system. Additionally, authors assessed effect of direct interaction between PIP4K and PIP5K by using supported lipid bilayers (SLBs) and purified PIP4K and 5K. Authors also were successful in monitoring dynamics of endogenous PIP4K by using a split fluorescent protein approach. Through this study, authors propose a model of PI(4,5)P2 homeostatic mechanism that PIP4Ks sense elevated PM PI(4,5)P2 by PIP5Ks, are recruited to the PM, and bind to PIP5Ks to inhibit PIP5Ks activity.

      # 1.1: Although authors mention methods of statistical analysis in materials and methods, they did not present the results of statistical analysis in the figures. The quantitative data should be presented with statistical analysis data, which is important for showing where convincing differences between treatment groups are found.

      We agree that statistics are important to fully interpret the data; we have now included the results of statistical tests (non-parametric statistics were used, as the data are not normally distributed) with correction for multiple comparisons. Significant changes are denoted using asterisk notation in figs. 1A-C, 2B, 5B & 7A. The full results are now reported as tables:

      Fig 1A = table 1; Fig 1B = table 2; Fig. 1C = table 3; Fig. 2B = table 4; Fig. 5B = table 5; Fig 7A = tables 6 & 7.

      __#1.2a: __Fig. 1D. Fig. 1D and Fig. 3A should be presented together because these are exactly same set of cells and information of each PIP4K and PIP5K membrane localization could be important for understanding mechanisms of inhibitory effect of PIP4Ks.

      We struggled when writing the manuscript to reconcile these data into a single figure. The manuscript flows from showing inhibition of PIP5Ks by PIP4Ks in living cells (figs. 1 & 2), then showing low affinity PI(4,5)P2 binding by endogenous PIP4Ks (figs. 3-6) and finally to a direct interaction between PIP4K and PIP5K (fig. 7). We therefore felt that reconciling the data showing attenuated PI(4,5)P2 synthesis with the interaction between PIP4Ks and PIP5Ks, despite being demonstrated in the same experiments, would disrupt the flow of the paper. We therefore request to leave the data in Figs. 2B and 7A, whilst remaining explicit that the data derive from a single experiment.

      #1.2b: Authors claimed that over-expression of all three PIP4K isoforms were able to attenuate the elevated PM PI(4,5)P2 levels caused by PIP5K over-expression. However, in Fig. 3A, PIP4K2A was recruited to PM by both PIP5K1A and PIP5K1C but looks only attenuated PIP5K1A, but not PIP5K1C, overexpression mediated PM PI(4,5)P2 elevation (Fig. 1D). PIP4K2C was less recruited to the PM than PIP4K2A and 2B in PIP5K1A overexpressing cell (Fig. 3A) but PIP4K2A, B and C isoforms equally attenuated increase of PM PI(4,5)P2 in PIP5K1A overexpressing cell (Fig. 1D). It is likely that efficiency of inhibitory effect of each PIP4K isoform is different by co-overexpressed PIP5K isoform. These images should be more carefully documented with Fig. 1D and Fig. 3A together.

      As the reviewer suggests, we have now expanded our description of these data in both results and discussion; firstly, for the attenuating effects on PI(4,5)P2 synthesis, we write on the 3rd paragraph of p4: “We also reasoned that co-expression of PIP4K paralogs with PIP5K might attenuate the elevated PI(4,5)P2 levels induced by the latter. Broadly speaking, this was true, but with some curious paralog selectivity (fig. 2B, statistics reported in table 4): PIP4K2A and PIP4K2B both attenuated PI(4,5)P2 elevated by PIP5K1A and B, but not (or much less so) PIP5K1C; PIP4K2C, on the other hand, attenuated PIP5K1A and was the only paralog to significantly attenuate PIP5K1C’s effect, yet it did not attenuate PIP5K1B at all.”

      On the relative ability of PIP5Ks to localize PIP4Ks we focus on the key result, writing on the 2nd paragraph of p7: “When co-expressing EGFP-tagged PIP5Ks and TagBFP2-tagged PIP4K2s, we found that PIP5K paralogs’ PM binding is largely unaffected by PIP4K over-expression (fig. 7A, upper panel and table 6), whereas all three paralogs of PIP4K are strongly recruited to the PM by co-expression of any PIP5K (fig. 7A, lower panel and table 7)…”

      And finally, we describe a more nuanced discussion of the possible implications for differential inhibition of PIP5K isoforms by PIP4Ks in the discussion, starting in the first paragraph on p. 11: “Despite minor differences in the ability of over-expressed PIP5K paralogs to recruit over-expressed PIP4K enzymes (fig. 7A), we observed major differences in the ability of PIP4K paralogs to inhibit PI(4,5)P2 synthesis when over-expressed alone (fig. 1C) or in combination with PIP5K (fig. 2B). It is unclear what drives the partially overlapping inhibitory activity, where each PIP5K paralog can be attenuated by 2 or 3 PIP4Ks. This is however reminiscent of the biology of the PIPKs, where there is a high degree of redundancy among them, with few unique physiological functions assigned to specific paralogs [49]. There may be hints of paralog-specific functions in our data; for example, enhanced PI(4,5)P2 induced by over-expressed PIP5K1C is only really attenuated by PIP4K2C (fig. 2B). This could imply a requirement for PIP4K2C in regulating PI(4,5)P2 levels during PLC-mediated signaling, given the unique requirements for PIP5K1C in this process [50,51]. Regardless, a full understanding of paralog selectivity will need to be driven by a detailed structural analysis of the interaction between PIP4Ks and PIP5Ks - which is not immediately apparent from their known crystal structures, especially since PIP4Ks and PIP5Ks employ separate and distinct dimerization interfaces [49].

      #1.3: Fig. 1F. It seems that PIP4K2A accelerated PIP5K, but not Mss4, dependent PI(4,5)P2 generation before PI(4,5)P2 reaches 28,000 lipids/um2. Is this significant? If so, why did this happen?

      We have answered this question with a sentence added to the 1st paragraph on p 8: *“The ability of PIP4K to bind to PIP5K on a PI(4,5)P2-containing bilayer also potentially explains the slightly accelerated initial rate of PI(4,5)P2synthesis exhibited by PIP5K1A that we reported in fig. 2C, since PIP4K may initially introduce some avidity to the membrane interaction by PIP5K, before PI(4,5)P2 reaches a sufficient concentration that PIP4K-mediated inhibition is effective.” *

      #1.4: Fig. 3B. In this figure, authors only presented images after Rapa treatment. Therefore, it is not clear what these results mean. Before Rapa treatment, where did bait proteins and NG2-PIP4K2C localize? If ePIP4K2C delta PM intensity (ER:PM/PM) increase, does that mean increase in ER:PM intensity or decrease in PM intensity? According to Figure legend, PI(4,5)P2 indicator TubbycR332H was co-transfected, but those images are not shown in the figure. Images of PI(4,5)P2 indicator also should be presented to show whether after Rapa treatment PI(4,5)P2 increased at ER-PM contact sites, because that could be critical for the conclusion that "The use of Mss4 ruled out an effect of enhanced PI(4,5)P2 generation at contact sites, since this enzyme increases PI(4,5)P2 as potently as PIP5K1A (Fig. 1A), yet does not cause recruitment of PIP4K2C". Is this conclusion consistent with Fig. 2F and G?

      These data now appear in Fig. 7B. We have added images showing the pre-rapamycin state to the revised figure. The reference to tubby­cR332H co-expression was an error. In fact, the cells expressed the ER:PM contact site marker MAPPER, which allowed us to quantify ER:PM contact site localization before and after rapamycin induced capture of the baits at these sites. The revised figure appears as follows:

      The failure of Mss4 to recruit endogenous PIP4K2C is entirely consistent with the old Fig. 2F and G (now 5A and C), since these show PIP4K interaction with PI(4,5)P2 containing lipid bilayers (in Fig. 5C, the PI(4,5)P2 was synthesized by Mss4). We demonstrated that Mss4 is unable to interact with PIP4K2A in Fig. 7D.

      #1.5: Fig. 3C and D. Based on results of Fig. 3C and D, authors concluded that "PIP4K2C binding to PI(4,5)P2-containing SLBs was greatly enhanced by addition of PIP5K to the membranes, but not Mss4". I don't think Fig. 3C and D are comparable because experimental conditions are different. While lipid composition of SLB used in Fig. 3C was 2% PI(4,5)P2, 98% DOPC, in Fig. 3D, it was 4% PI(4,5)P2, 96% DOPC. And also, in Fig. 3C, PIP5K1A was added to SLB at the time about 50 sec, whereas in Fig. 3D, Mss4 was added at 600 sec. It seems that in Fig. 3D, PIP4K2A was already saturated on SLB before adding Mss4. These two experiments must be performed under the same conditions.

      We have repeated these experiments (which now appear in Fig. 7C & D) under identical conditions, with the same result.


      #1.6: Overall results discussed in the text are very compressed referring readers to the 4 multi-panel complex figures with elaborate figure legends. While it is possible to figure out what the authors' studies and results are, it is quite a laborious process.

      We have revised the manuscript to be less compressed and easier to read, with the data now organized as eight figures and the results section split into four sub-sections.

      Minor comments:

      #1.7: Fig. 2D. The purified 5-phosphatase used in Fig. 2D is INPP5E but described in figure legend and materials and methods ass OCRL. Which one is correct?

      Purified OCRL was indeed used in the supported lipid bilayer experiments. The figure (now Fig. 4A) and legend have been corrected – thank you for spotting the error.

      #1.8: Fig. 3B. Indicate which trace represents PIP5K1A, Lyn11 or Mss4.

      The data now appears in Fig. 7B, with the traces separated into separate graphs for greater clarity (see response to #1.4).

      #1.9: Fig. 4C. X-axis label. Is "Time (min)" correct? Or should it be "Time (sec)".

      Thank you for spotting this typo. It should have indeed been seconds, and this is corrected in the new fig. 8C.

      • *

      Reviewer #1 (Significance (Required)): The finding that PIP4K itself is a low-affinity PI(4,5)P2 binding protein and sense increases of PM PI(4,5)P2 generated by PIP5K to control tonic PI(4,5)P2 levels by inhibiting PIP5K activity is a novel concept. However, inhibition of PIP5K by PIP4K and importance of the inhibitory effect of PIP4K in PI3K signaling pathway have previously been reported (ref 24). This reduces the novelty of the current work somewhat however, the authors do provide evidence for dual interactions of PIP4K (PIP2, PIP5K), which the previous report did not.

      We appreciate the reviewer’s insightful comments and overall appreciation of our work. We agree that previous studies did not detect the dual interaction of PIP4Ks with PIP5Ks and PI(4,5)P2; as we argue strongly in the general comments, we think this actually fits as a complete, enclosed homeostatic feedback loop – which is a significant and novel finding.

      • *

      Reviewer #2

      Summary: This paper proposes that the enzyme PIP4K2C is a negative regulator of the synthesis of PI(4,5)P2 and that it does so by dampening the activity of PIP5K which is the enzymatic activity responsible for producing the major pool of PI(4,5)P2 in cells.

      • *

      Reviewer #2 (Significance (Required)): Although the findings of the paper are presented as a major new advance, the observation that PIP4K might acts as a negative regulator of PIP2 synthesis has been previously presented in two previous publications. The significance of this paper is that it also shows the same point in another model system.

      PIP4Ks Suppress Insulin Signaling through a Catalytic-Independent Mechanism

      Diana G Wang 1, Marcia N Paddock 2, Mark R Lundquist 3, Janet Y Sun 3, Oksana Mashadova 3, Solomon Amadiume 3, Timothy W Bumpus 4, Cindy Hodakoski 3, Benjamin D Hopkins 3, Matthew Fine 3, Amanda Hill 3, T Jonathan Yang 5, Jeremy M Baskin 4, Lukas E Dow 6, Lewis C Cantley 7

      PMID: 31091439; PMCID: PMC6619495;DOI: 10.1016/j.celrep.2019.04.070

      and

      Phosphatidylinositol 5 Phosphate 4-Kinase Regulates Plasma-Membrane PIP3 Turnover and Insulin Signaling.

      Sharma S, Mathre S, Ramya V, Shinde D, Raghu P.Cell Rep. 2019 May 14;27(7):1979-1990.e7. doi: 10.1016/j.celrep.2019.04.084.PMID: 31091438

      Both of these studies show that in cells lacking PIP4K, during signalling the levels of PIP2 rise much greater than in wild type cells. Indeed the Cantley lab paper (Wang et.al) have shown that this is likely due to an increase in PIP5K activity, using an in vitro assay. They have further disrupted the interaction between PIP4K and PIP5K and demonstrated the importance of this interaction in the enhanced levels of PIP2.

      Respectfully, we disagree with this assessment, because we believe it doesn’t consider the novel, central findings we report: that PIP4Ks sense PI(4,5)P2 levels through direct interaction with the lipid, and that this is what facilitates PIP5K inhibition. These findings were not reported in the prior studies. Nonetheless, the studies are foundational for ours and were cited in our original manuscript (and are still, as refs 25 and 53).

      • *

      #2.1: Likewise although the authors have claimed that no mechanisms have claimed that there are no mechanisms reported to sense and downregulate PIP2 resynthesis. It is suggested that they read and consider the following recent paper which studies Pip2 resynthesis during GPCR triggered PLC signalling.

      Kumari A, Ghosh A, Kolay S and Raghu P*. Septins tune lipid kinase activity and PI(4,5)P 2 turnover during G-protein-coupled PLC signalling in vivo. Life Sci Alliance. 2022 Mar 11;5(6):e202101293. doi: 10.26508/lsa.202101293. Print 2022 Jun.

      We have now included a full discussion of this paper in the discussion starting on the last paragraph of p 9: “Since this paper was initially submitted for publication, another study has reported a similar homeostatic feedback loop in Drosophila photoreceptors, utilizing the fly homologue of septin 7 as the receptor and control center [38]. This conclusion is based on the observation that cells with reduced septin 7 levels have enhanced PIP5K activity in lysates, and exhibit more rapid PI(4,5)P2 resynthesis after PLC activation. However, changes in septin 7 membrane localization in response to acute alterations in PI(4,5)P2 levels, as well as direct interactions between PIP5K and septin 7, have yet to be demonstrated. Nevertheless, septin 7 has distinct properties as a potential homeostatic mediator; as a foundational member of the septin family, it is essential for generating all major types of septin filament [39]. Therefore, a null allele for this subunit is expected to reduce the prevalence of the septin cytoskeleton by half. Given that septin subunits are found in mammalian cells at high copy number, around ~106 each [29], and the fact that septins bind PI4P and PI(4,5)P2 [40,41], it is likely that septin filaments sequester a significant fraction of the PM PI4P and PI(4,5)P2 through high-avidity interactions. In addition, membrane-bound septins appear to be effective diffusion barriers to PI(4,5)P2 and other lipids [42]. We therefore speculate that septins may play a unique role in systems such as the fly photoreceptor with extremely high levels of PLC-mediated PI(4,5)P2 turnover: The septin cytoskeleton can act as a significant buffer for PI4P and PI(4,5)P2 in such systems, as well as corralling pools of the lipids for use at the rhabdomeres were the high rate of turnover occurs. This is in contrast to the role played by the PIP4Ks, where PI(4,5)P2 levels are held in a narrow range under conditions of more limited turnover, as found in most cells.”

      __#2.2: __Likewise there are other earlier papers in the literature which have studied possible PIP2 binding proteins as sensors for this lipid.

      We are only aware of a single, specific example of a similar negative feedback, which is discussed in the 3rdparagraph of p 10:Curiously, although phosphatidylinositol phosphate kinases are found throughout eukarya, PIP4Ks are limited to holozoa (animals and closely related unicellular organisms) [47]. Indeed, we found the PIP5K from the fission yeast, Saccharomyces cerevisiae, does not interact with human PIP4Ks (fig. 7) and cannot modulate PI(4,5)P2 levels in human cells without its catalytic activity (fig. 1). This begs the question: how do S. cerevisiae regulate their own PI(4,5)P2 levels? Intriguingly, they seem to have a paralogous homeostatic mechanism: the dual PH domain containing protein Opy1 serves as receptor and control center [48], in an analogous role to PIP4K. Since there is no mammalian homolog of Opy1, this homeostatic mechanism appears to have appeared at least twice through convergent evolution. Combined with hints of a role for septins in maintaining PI(4,5)P2 levels [38], the possibility arises that there may yet be more feedback controls of PI(4,5)P2 levels to be discovered.”

      • *

      Technical standards: The work is done to a high technical standard.

      #2.3: Does catalytically dead isoform of PIP4K2B and 2C also yield the same result as a catalytically dead version of PIP4K2A in Fig 1B?

      In a word: yes. We have added these experiments, which are now presented in Fig. 2A:

      The results are described in the results in the 2nd paragraph of p. 4: “To directly test for negative regulation of PIP5K activity by PIP4K in cells, we wanted to assay PI(4,5)P2 levels after acute membrane recruitment of normally cytosolic PIP4K paralogs. To this end, we triggered rapid PM recruitment of cytosolic, FKBP-tagged PIP4K by chemically induced dimerization (CID) with a membrane targeted FRB domain, using rapamycin [27]. As shown in fig. 2A, all three paralogs of PIP4K induce a steady decline in PM PI(4,5)P2 levels within minutes of PM recruitment. Catalytically inactive mutants of all three paralogs produce identical responses (fig. 2A).”

      #2.4: The labelling on y-axis for PI(4,5)P2 biosensor intensity ratio is PM/cell at some places, PM/Cyt or PM/Cyto in some places. It is recommended to make it uniform across all the panels.

      PM/Cyto was a typo, now corrected to PM/Cyt. PM/Cell and PM/Cyt are two subtly different metrics used to normalize PM fluorescence intensity across varying transient expression levels. This is clarified in the methods in the 3rdparagraph on p.22: For confocal images, the ratio of fluorescence intensity between specific compartments was analyzed as described previously [59]. In brief, a custom macro was used to generate a compartment of interest specific binary mask through à trous wavelet decomposition[68]. This mask was applied to measure the fluorescence intensity within the given compartment while normalizing to the mean pixel intensity in the ROI. ROI corresponded to the whole cell (denoted PM/Cell ratio) or a region of cytosol (PM/Cyt), as indicated on the y axis of individual figures.”

      #2.5: The claim that PI(4,5)P2 production is sufficient to recruit PIP4K2C to the PM can be ascertained further if one is able to do an experiment where PI(4,5)P2 is ectopically expressed in some compartment of the cell which is non-native to PI(4,5)P2 and as a consequence of this PIP4K2C is recruited to this non-native compartment.

      We have now removed the assertion that PI(4,5)P2 is sufficient to localize PIP4Ks to the membrane, since our conclusion is that the coincident presence of PI(4,5)P2 and PIP5Ks in the PM is what ultimately localizes the PIP4Ks. We did not detect recruitment of endogenous PIP4Ks to lysosomes when ectopic PI(4,5)P2 synthesis was induced, although fluorescence levels are so low as to be inconclusive, and therefore not appropriate for inclusion in the manuscript.

      #2.6: In the entire figure 2, to establish that PI(4,5)P2 is necessary and sufficient for PM localisation of PIP4K, PIP4K2C is used as the PIP4K isoform on the basis that it is highly abundant in HEK293 cells. But PIP4K2A is localised mainly at the plasma membrane and here we are discussing about PI(4,5)P2 regulation at the PM . Can experiments be done with isoforms 2A and 2B as well? Can acute depletion of PI(4,5)P2 lead to the membrane dissociation of the isoform 2A as well? This will help us in understanding if there is an isoform specific difference in sensing PI(4,5)P2 levels which will help us in targeting specific isoform as therapeutic targets.

      We have now generated endogenously tagged PIP4K2A and PIP4K2B; these cell lines are characterized in the revised fig. 3:

      With the dependence on PI(4,5)P2 for PM binding for all isoforms shown in fig. 4:

      32 cells that were imaged across three independent experiments. (E) Depletion of PI(4,5)P2 causes NG2-PIP4K2C to dissociate from the membrane. As in C, NG2-PIP4K2C (blue) cells were transfected with FKBP-tagged proteins, TubbyC (orange) and Lyn11-FRB, scale bar is 2.5 µm; cells were stimulated with 1µM rapa, as indicated. TubbyC traces represent mean change in fluorescence intensity (Ft/Fpre) ± s.e. The NG2-PIP4K2C traces represent the mean change in puncta per µm2 ± s.e. of > 38 cells that were imaged across three independent experiments. " v:shapes="Text_x0020_Box_x0020_5">

      And increased binding by elevated PI(4,5)P2 levels shown in fig. 5B:

      The results are described in the accompanying results text “PIP4K are low affinity sensors of PM PI(4,5)P2”, pp.4-7. In short, endogenous PIP4K isoforms behave similarly with respect to PI(4,5)P2-dependent PM recruitment.

      • *

      #2.7: In Figure 1A, it is shown that overexpression of a catalytically dead PIP5K 1A/1B/1C is still able to increase PI(4,5)P2 levels. In the figure 2E, expression of homodimeric mutant of PIP5K domain which is a way to increase catalytic activity of PIP5K, increases PI(4,5)P2 levels which is consistent with the inferences from Fig, 1 , but what is surprising is a catalytically dead variant not being able to do so. Why is there a discrepancy between Fig. 1A and Fig. 2E? If the homodimeric mutant is the reason, then it is not clear in the explanation.

      We have added the following clarification to the results on the second paragraph of p.6:We next tested for rapid binding to acutely increasing PI(4,5)P2 levels in living cells, using CID of a homodimeric mutant PIP5K domain (PIP5K-HD), which can only dimerize with itself and not endogenous PIP5K paralogs [34]. This domain also lacks two basic residues that are crucial for membrane binding [35], and only elevates PM PI(4,5)P2 when it retains catalytic activity (fig. 5D), unlike the full-length protein (fig. 1A).” We currently do not fully understand why these well characterized residues of PIP5Ks are necessary for PM binding and inhibition by PIP4K. This is a focus of ongoing studies in the lab for the structural basis of PIP5K inhibition by PIP4K.

      • *

      #2.8: Show the loading control in Fig 2A western.

      We have added the loading control using alpha tubulin in the revised fig. 3B.


      #2.9: In the figure 2D, in the legend OCRL is written. So, the labelling in the panel should also be changed to OCRL from INPP5E. It is intermixed.

      Reviewer 1 also spotted this inconsistency (#1.7): Purified OCRL was indeed in the supported lipid bilayer experiments. The figure (now Fig. 4A) and legend have been corrected – thank you for spotting the error.

      • *

      #2.10: In the figure 2E, can the labelling be changed from HD to something more self-explanatory for homodimeric mutant of PIP5K domain?

      We prefer to keep the “HD” notation in the revised figure 5D for brevity’s sake, but now define the abbreviation in the text in the second paragraph of p.6:…a homodimeric mutant PIP5K domain (PIP5K-HD)…”.

      #2.11: In Fig. 2E, PIP5K expression is acute and in Fig. 2F Mss4 expression is chronic, both of which is able to recruit PIP4K2C to the plasma membrane. How can a likewise argument be drawn out of these two experiments when one is acute and the other one is a chronic expression? It is suggested to do an FRB-FKBP experiment for Mss4 as well.

      We agree with the reviewer that an FKBP-Mss4 would have been an excellent experiment. As can be seen from Fig. __1A, Mss4 is constitutively PM localized in mammalian cells. However, we were unable to identify a truncation of Mss4 that lost constitutive membrane binding whilst retaining catalytic activity. Therefore, we could only perform chronic overexpression as shown in __fig. 5B. The lack of an acute demonstration is why we went on to develop the PIP5K-HD constructs, results of which are reported in __fig. 5D. __

      #2.12: In the text, Fig. 2G and 2H is written for PIP4K2C, but in the corresponding panels and legends, it is an assay for purified PIP4K2A on SLBs. Kindly resolve the discrepancy.

      We thank the reviewer for spotting this discrepancy. PIP4K2A is the protein that was used in the SLB experiments now reported in fig. 5A & C and the accompanying results on pp.5-6. This is now corrected in the manuscript.

      #2.13: Kindly explain a bit in detail why the baits were now targeted to ER-PM contact sites. It is not self-explanatory.

      We have now added a more detailed description to the third paragraph of p. 7: “We therefore sought to distinguish between a direct PIP5K-PIP4K binding interaction versus PI(4,5)P2-induced co-enrichment on the PM. To this end, we devised an experiment whereby a bait protein (either PIP5K or control proteins) could be acutely localized to subdomains of the PM, with the same PI(4,5)P2 concentration. This was achieved using CID of baits with an endoplasmic reticulum (ER) tethered protein, causing restricted localization of the bait protein to ER-PM contact sites – a subdomain of the PM (fig. 7B).”

      • *

      #2.14: The conclusions for Fig. 3 most likely hints towards the possibility of PIP4K and PIP5K interaction being independent of PI(4,5)P2 levels. Well, Fig. 3C and 3D does suggest a direct interaction, but can other protein-protein interaction assays be used to establish the direct interaction of PIP4K with PIP5K such as FRET or Yeast two hybrid as assays scoring for interaction?

      We respectfully diverge from the reviewer’s assessment of the data, presented in the revised fig. 7. Figs. 7A & B__show PIP4K and PIP5K interacting in the context of a PI(4,5)P2 replete PM; __fig. 7C shows this in the context of a PI(4,5)P2 replete SLB. Therefore, we make no assertion that the PIP4K/PIP5K is independent of PI(4,5)P2 levels. We also contend that the latter experiment is a more direct demonstration than a Y2H assay, or even FRET (which can occur among non-interacting proteins localized to a membrane surface, see e.g. 10.1074/jbc.m007194200).

      #2.15: Conceptually a direct interaction can be explained to some extent from Fig. 3 but extending it to be an inhibitory interaction is not right without a direct experiment. Can an experiment be done with PI4P enriched SLB, wherein you put just PIP5K purified protein vs PIP5K+PIP4K combination and measure the % mol of PI(4,5)P2 produced using a probe. That will be suggestive of a negative interaction.

      This is a great experiment, the results of which are reported in fig. 2C, described in the third full paragraph of p. 4: “To more directly examine inhibition of PIP5K by PIP4K, we tested activity of purified PIP5K1A on PI4P-containing supported lipid bilayers (SLBs). Addition of PIP4K2A exhibited delayed inhibition of PIP5K1A activity (fig. 2C): Once PI(4,5)P2 reached approximately 28,000 lipids/µm2 (~2 mol %), PIP5K dependent lipid phosphorylation slowed down, which doubled the reaction completion time (fig. 2C, right). In contrast, we observed no PIP4K dependent inhibition of Mss4 (fig. 2C, inset). These data recapitulate the prior finding that PIP4K only inhibited purified PIP5K in the presence of bilayer-presented substrate [25]. We therefore hypothesized that inhibition of PIP5K by PIP4K requires recruitment of the latter enzyme to the PM by PI(4,5)P2 itself.”

      • *

      __#2.15: __ In Figure 3B, the FRB tagged constructs are magenta coded and PIP4K2C is cyan. Kindly change the labelling of the FRB constructs on the y axis to magenta so that it goes with what is written in the legend. It will also be appreciated to show a colocalization quantification between the magenta (FRB constructs) and cyan (PIP4K2C) post rapamycin addition and not just the intensity for ER-PM recruited PIP4K2C.

      These modifications and some additional points have been added in response to reviewer 1’s #1.4 to the revised fig. 7B. Note, we quantified the co-localization with an ER-PM contact site marker, MAPPER. Co-localization with the FRB-tagged construct would be misleading, because this construct is localized across the membrane at the start of the experiment and would thus have a high degree of co-localization. As can be seen from the inset graphs in the new analysis, however, all FRB-tagged constructs co-localize with MAPPER after rapamycin addition, but only FRB-PIP5K1A causes endogenous PIP4K2C to increase co-localization with this compartment.

      # 2.16: Again, in the text , the description is written for PIP4K2C but in the result panel and legend (Fig. 3C and Fig. 3D), PIP4K2A is mentioned. Kindly resolve the discrepancy

      We have corrected the results text on the last paragraph of p. 7: “Finally, we also demonstrate that PIP4K2A binding to PI(4,5)P2-containing supported lipid bilayers was greatly enhanced by addition of PIP5K to the membranes (fig. 7C), but not by Mss4 (fig. 7D).”

      • *

      # 2.17: In the Fig. 4B, it will be appreciated to show statistical significance in terms of R2 value for commenting on the linear response.

      “Linear response” was not the best description of what we were trying to articulate in the revised fig. 8B; we have now amended the results in the 2nd paragraph of p.8 to read: “Of these, Tubbyc showed the largest degree of change in PM localization across all changes in PI(4,5)P2 levels (fig. 8B).”

      • *

      #2.18: Discussion can be in general a bit more detailed which is suggestive of future experiments to do that can shed more light on the interaction such as which residues in PIP4K interacts with PIP5K to negatively regulate it.

      The revised manuscript contains a greatly expanded discussion, as described in detail in our responses to comments #1.2b, #2.1 and __#2.2. __

      #2.19: In the discussion, more light can be shed on the fact that Mss4 in spite of being a 5- kinase is not negatively regulated by PIP4K and the fact that PIP4K is present only in metazoans suggests that this fine tuning of PI(4,5)P2 levels is specific to metazoans. Another insight could be in the direction, that Fig 4. tells PI3K, but not calcium signaling is modulated by this fine tuning and interestingly class I PI3K is also an enzyme specific to metazoans. Hence, unlike yeast, metazoans rely on growth factor signalling processes, hence regulation of PI(4,5)P2 by PIP4K and hence Class I PI3K and PI(3,4,5)P3 could be a process relevant to metazoans.

      We have addressed the restriction of PIP4K to holozoa as described in our response to #2.2, wherein we describe a previously proposed paralogous mechanism in fungi. The reviewer’s point about the homeostatic process being related to class I PI3K signaling in growth control of multicellular organisms is interesting, but the presence of the PIP4Ks in some unicellular organisms complicates this view. We are of the view that a discussion of this important topic is a little nuanced for inclusion in the current manuscript.

      • *

      Reviewer #3

      __Summary: __Using state of the art imaging techniques the authors try to address how cells sense PI(4,5)P2 levels and regulate PIP5Ks to maintain an optimal level since any dysregulation of PI(4,5)P2 levels can have significant effects on the functioning of the cell and led to numerous disease states, such as cancers.

      The key conclusions are convincing and importantly validate previous disputed findings made by Wang et al. (Cell Reports 2019) using different and more rigorous methods, however unfortunately due to the Wang et al publication the overall novelty of this study is lacking. A suggestion to the authors is to state/explain with text more clearly how their findings are more precise and higher quality than the previous report and why their findings are necessary and significant to drive the field forward.

      We have revised the manuscript to more clearly state our novel finding that PIP4Ks are PI(4,5)P2 sensing proteins that inhibit PIP5Ks on the membrane in a PI(4,5)P2-dependent manner, which was not previously described in the literature.

      Further, experiments in the study were performed in vitro in cultured cells using overexpression methods making the physiological significance a bit unclear and the enthusiasm of the main discovery dampened. With that being said these findings are worthy of publication in order to advance the field and understanding of how the PIP kinase families are regulated and maintain PIP2 homeostasis which is important for life.

      We feel that this assessment is slightly unfair, since most of the key experiments have been validated using purified proteins in supported lipid bilayers, and endogenous proteins were studied using genomic tagging approaches, rather than over-expression.

      Minor and easily addressable experiments should be performed by the authors the following. Further, many of these experimental issues can easily go in supplemental materials

      #3.1: Include western blots for the constructs to compare expression levels.

      We agree that it is important to take into account differences in expression levels for the experiments presented in fig. 1. However, since these are single cell assays, Western blotting of whole populations of transiently transfected cells is not the best control. Instead, having acquired the images under consistent excitation and detection parameters, we compared the fluorescence intensity, expressed as relative expression in Fig. 1A and C, which is discussed in the results text in the first two paragraphs of the results on p. 3: “Notably, expression of the catalytically inactive mutants was usually somewhat less strong compared to the wild-type enzymes, yet effects on PI(4,5)P2 levels were similar (fig. 1A).” and “Again, differences in expression level between isoforms do not explain differences in activity, since all achieved comparable expression levels as assessed by fluorescence intensity (fig. 1C).”

      #3.2: For Figure 1A, what is the source of the observed increase in PI(4,5)P2, how do the authors take into account the role of endogenous PIP5Ks?

      We added a new experiment in the revised Fig. 1B showing that the increased PI(4,5)P2 occurs at the expense of PM PI4P:

      This is described in the first paragraph of the results on p.3: “PI(4,5)P2 levels are expected to increase at the expense of PM PI4P levels when over-expressing any of the three isoforms of human PIP5K (A-C) or the single paralog from the budding yeast, Saccharomyces cerevisiae (Mss4). Indeed, this was precisely what we observed (fig. 1A and B, statistics reported in tables 1 and 2).”

      The role for endogenous PIP5Ks is clarified on the sentence that spans pp. 3-4: “We therefore reasoned that saturation of endogenous, inhibitory PIP4K molecules by PIP5K over-expression, regardless of catalytic activity of the PIP5K, would free endogenous, active PIP5K enzyme from negative regulation (fig. 1D).”

      • *

      #3.3: For Figure 1B, could the authors comment on the intracellular distribution of PI(4,5)P2. How are they able to reliably distinguish their signal between plasma membrane and intracellular localizations and conclude that PIP2 on the plasma membrane is decreased?

      As detailed in the now expanded methods section covering image analysis on p. 22, our analysis specifically quantifies fluorescence in the plasma membrane.

      #3.4: Please include statistics for all image- based quantitation analysis.

      We have added details of statistical analysis and tabulated the results, as detailed in our response to __#1.1. __

      __#3.5: __ Could the authors comment on the ability of PIP4K to have affinity for its own product? How does PIP4K sense membrane PI(4,5)P2 since these kinases are mostly cytoplasmic?

      We have added a comment to the 1st paragraph of the Discussion on p.9: “PIP4K’s low affinity and highly co-operative binding to PI(4,5)P2 makes it an excellent sensor for tonic PI(4,5)P2 levels. It is poised to sense PI(4,5)P2generated in excess of the needs of the lipids’ legion effector proteins, ensuring these needs are met but not exceeded. Nevertheless, the relatively low PIP4K copy number of around 2.5 x 105 per cell [29] is a small fraction of the total PI(4,5)P2 pool, estimated to be ~107 [33], ensuring little impact on the capacity of the lipid to interact with its effectors.”

      __#3.6: __Do the authors have any other experiments to substantiate the binding of the two PIP kinases, similar to the Wang et al findings? Is the N-term motif required? Is it possible to disrupt that interaction and show the phenotype?

      We do not have additional, conclusive experiments to share at this time, and believe that characterization of the inhibitory interaction is beyond the scope of the current manuscript. We do however add a comment on this topic to the 1st paragraph of p. 11: “Regardless, a full understanding of paralog selectivity will need to be driven by a detailed structural analysis of the interaction between PIP4Ks and PIP5Ks - which is not immediately apparent from their known crystal structures, especially since PIP4Ks and PIP5Ks employ separate and distinct dimerization interfaces [50].”

      #3.7: With the overexpression studies in Figure 1, do the authors see any changes in signaling when they just overexpress PIP5Ks versus in combination with PIP4Ks to show that the changes in plasma membrane PI(4,5)P2 can affect downstream signaling?

      We agree with the reviewer that attenuating PIP5K-mediated PI(4,5)P2 increases with PIP4K should affect downstream signaling. However, we believe that these will not add additional insight compared to the already included experiments (fig. 8), whereby signaling output in response to graded changes in PI(4,5)P2 levels was investigated.

      • *

      Reviewer #3 (Significance (Required)): Overall, as mentioned above because of the 2019 Wang et al report the novelty is diminished, however using completely alternate methods and sophisticated microscopy this body of work indeed advances the field and provides further believable evidence of the PIP kinase families communicating in higher organisms which is required to maintain PIP2 levels shedding light on many of the findings that were previously unexplained surrounding the PIP4K studies. Further, the use of biosensors to describe these findings are new and will enable others in the field to begin to use such tools to investigate potential crosstalk between other lipid kinases.

      As we argued in the general comments, we do feel that this evaluation misses the key finding that PIP4Ks are PI(4,5)P2 sensors, and that this regulates PIP5K regulation as part of a feedback loop.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01723

      Corresponding author(s): Daphne Avgousti, Srinivas Ramachandran

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary This study by Lewis et al. examines the role of heterochromatin in the nuclear egress of herpesvirus capsids. They show that heterochromatin markers macroH2A1 and H3K27me3 are enriched at specific genome regions during the infection. They also show that when macroH2A1 is removed or H3K27me3 is depleted (both of which reduce the amount of heterochromatin at the nuclear periphery), the capsids are not able to egress as effectively. This is interesting since it could be argued that heterochromatin acts as a hindrance to the transport of viral capsids to the nuclear envelope and that the loss of it would allow capsids to reach the nuclear envelope more easily. However, this paper seems to show that heterochromatin formation, on the contrary, is necessary for efficient egress. Overall, the study seems comprehensive. The methodology is solid, and the experiments are very well controlled. However, some issues need to be addressed before publication.

      Major comments

      1) In line 49, the authors state, "Like most DNA viruses, herpes simplex virus (HSV-1) takes advantage of host chromatin factors both by incorporating histones onto its genome to promote gene expression and by reorganizing host chromatin during infection". In addition, HSV1 expression can be hindered by the host's interferon response via histone modifications. Ref. Johnson KE, Bottero V, Flaherty S, Dutta S, Singh VV, Chandran B. IFI16 restricts HSV-1 replication by accumulating on the HSV-1 genome, repressing HSV-1 gene expression, and directly or indirectly modulating histone modifications. PLoS Pathog. 2014 Nov 6;10(11):e1004503. doi: 10.1371/journal.ppat.1004503. Erratum in: PLoS Pathog. 2018 Jun 6;14(6):e1007113. PMID: 25375629; PMCID: PMC4223080.

      We agree with the reviewer and have amended our text and added the reference. See line 57.

      2) Reference 5 is misquoted in the sentence, "This redistribution of host chromatin results in a global increase in heterochromatin". In that reference, the amount of heterochromatin is not analyzed in any way. However, that particular paper shows that the transport of capsid through chromatin is the rate-limiting step in nuclear egress, which is important considering this study. Further, the article by Aho et al. shows that when the infection proceeds capsids can more easily traverse from the replication compartment into the chromatin, which means that infection can modify chromatin for easier capsid transport. For that reason, the article is an important reference, but it needs to be cited correctly.

      We agree with the reviewer that this citation was misquoted and have corrected the citation. See lines 55 and 62-64.

      3) The term heterochromatin channel at lines 54, 102, and 303 is misleading since the channels seen in the original referred paper are less dense chromatin areas. Also, this term is not used in the original paper where the phenomenon was first described. These less dense interchromatin channels were found by soft-X-ray tomography imaging and analyses, not by staining.

      We thank the reviewer for pointing out this discrepancy and have amended the text to accurately describe the methods used in the appropriate citations. See lines 65, 115, and 383.

      4) It is difficult to visualize chromatin using TEM microscopy. The values of peripheral chromatin thickness given in Figure 1e (5-15 nm) do not seem realistic given that the thickness of just one strand of histone-wrapped DNA is 11 nm. Why are the two values for WT different? If you can get so different values for WT, it is a bit worrisome (switching the WT results between the top and bottom parts of Fig. 1e would for example result in very different conclusions on the effect of macroH2A1 KO for the thickness of the chromatin layer).

      *We agree with the reviewer that it is difficult to visualize chromatin by TEM. It is also important to note that comparisons can only be made between samples treated on the same day in the same way. Taking this into account, we chose to compare macroH2A1 KO cell stains to controls done at the same time, and the same for H3K27me3 depleted conditions compared to DMSO treated and prepare for EM at the same time. Visually, it is apparent that the staining in the macroH2A1 KO control cells is somewhat different than those of the H3K27me3 depleted control cells, which represents the inherent variability of this method. It is also true that one nucleosome is around 11nm, however, since the cells contain highly compacted chromatin with many other proteins present, this measurement is not appropriate to apply. Adding up the millions of nucleosomes that make up the chromosomes at 11nm each would result in a space much larger than the nucleus, therefore we focus on comparing between control and experimental conditions restricted to this assay as a relative qualitative comparison. Nevertheless, we agree with the reviewer that the notion of changing chromatin is difficult to quantify by EM and so we have taken an additional approach to test our hypothesis and confirm EM interpretations (discussed lines 391-393). We have utilized live capsid trafficking to visualize capsid movement in nuclei in the presence or absence of macroH2A1. The results from these new experiments are presented in new Figure 5 and EV5 and support our model. *

      5) In lines 134-137 it says that "The enrichment of macroH2A1 and H3K27me3 was observed as large domains that were gained upon viral infection (Fig 2a), suggesting that the host landscape is altered upon infection. These gains were reflected in an increase in total protein levels measured by western blot (Fig 2b)." However, the protein levels of H3K27me3 do not seem to increase during infection. In other presented data as well (Figs. 2a, 2b, 2c, S2a) it is difficult to justify the statement that H3K27me3 is enriched in infection. When this is the case, the conclusion that the amount of heterochromatin increases in the infection (the quotation above and the one in line 315) is not supported. The statement in line 315 is also not specific since it is unclear what "newly formed heterochromatin increases" means.

      We agree with the reviewer that our original description was misleading. We now have edited the text to clarify that there is redistribution of macroH2A1 and H3K27me3. In the revised manuscript, we have also included mass spectrometry data mined from Kulej et al. that show peptide counts that reflect increases in the heterochromatin markers described (see new Figure EV1a). Despite this quantitative measure, upon rigorous replicates of our western blots as requested by Reviewer 2, we concluded that the increases originally described are somewhat inconsistent by western blot. This discrepancy between mass spectrometry data and western blot is likely due to the non-linear nature of antibody binding and developing of western blots by the ECL enzymatic reaction. Therefore, our revised manuscript focuses on this redistribution as a reaction to infection and stress responses instead of a global increase as the original manuscript stated. See lines 174, 182, 196, 397 and Fig EV4d in main text and discussion sections.

      • *

      6) Quantitation of viral capsid location in H3K27me3-depleted cells seems somewhat arbitrary. It would have been more robust to calculate the number of capsids per unit length of the nuclear envelope with and without depletion.

      We agree with the reviewer that the quantification of capsids in the H3K27me3-depleted conditions was arbitrary. In our revised manuscript, we have now repeated this quantification to accurately measure the phenotype observed, that is the chains of capsids lined up at the inner nuclear membrane. To do this, we used two measures: 1) the distance from the INM as less than 200nm and 2) the distance from other capsids as less than 300nm. Taking into account these two measures, we quantified the frequency with which multiple capsids lined up at the INM in WT and H3K27me3-depleted conditions. This is represented in the new Figure 5d. In the WT setting, we observe most often 1 single capsid at the INM, with a small fraction of 2 capsids. However, in the H3K27me3-depleted condition, we observe much greater numbers of capsids at the INM more frequently, as many as 16 at a time, leading to an average of 2-3 capsids at any single location. The source data for this figure are also provided. See lines 589 and Fig5d.

      7) In lines 300-302 it says "Elegant electron microscopy work showed that HSV-1 infection induces host chromatin redistribution to the nuclear periphery2,8." However, the redistribution data in reference 8 is based on soft x-ray tomography and not on electron microscopy."

      We have amended the text to accurately describe the methods used in the citations. See line 384.

      8) The authors bundle together the effects of macroH2A1 removal and H3K27me3 depletion by saying that they both decrease the amount of heterochromatin at the nuclear periphery and therefore hinder capsid egress. This seems overly simplistic and macroH2A1 and H3K27me3 seem to act very differently, which is manifested in the drastic difference in nuclear capsid localization between the two cases. This difference needs to be discussed more.

      We agree with the reviewer that there is a nuanced difference in the effect on nuclear egress in the absence of the two heterochromatin marks. Specifically, that macroH2A1 loss results in greater numbers of capsids dispersed throughout the nucleus, whereas depletion of H3K27me3 results in capsids reaching the INM and not escaping. To examine these differences further, we have carried out live imaging of capsid trafficking in macroH2A1 KO cells compared to control and found that capsids move much more slowly, consistent with our model, see new Figure 5h-I and EV5h-i. Conversely, H3K27me3 depletion does not prevent the capsids from reaching the INM, raising the question of whether they are successfully able to dock at the nuclear egress complex (NEC). To investigate this further, we obtained an antibody against the NEC component UL34 and probed during infection in our heterochromatin disrupted conditions. We found that UL34 levels are unchanged upon loss of macroH2A1 or depletion of H3K27me3, suggesting the levels of UL34 do not account for the decrease in titers. These data are now presented in new Figure EV3g-h. Furthermore, we have amended our model to include the two different scenarios upon loss of different types of heterochromatin (see new Figure 6) and discussion of these differences. See line 428.

      Minor comments Line 45: Nuclear replicating viruses -> Nuclear-replicating viruses Line 56: is -> are Line 64: 25kDa -> 25 kDa Line 159: macroH2A1 cells -> macroH2A1 KO cells Line 289: The term gDNA is rarely used for viral DNA. Replace gDNA with viral DNA. Line 405: 8hpi -> 8 hpi Line 449: mm2 -> μm2 "Scale bar as indicated" words can be removed in the figure legends or at least should not be repeated many times within one figure legend.

      We have amended the text to address these comments. See lines 52, 68, 76, 179, 334, 513, and 585.

      Reviewer #1 (Significance (Required)):

      These findings would appeal to a broad audience in the field of virology. Specifically, the researcher in the fields of virus-cell and virus-nucleus interactions. This manuscript analyses herpesvirus-induced structural changes in the chromatin structure and organization in the nucleus that are also likely to affect the intranuclear transport of viral capsids.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The manuscript "HSV-1 exploits heterochromatin for egress" describes the effects of heterochromatin at the nuclear periphery, macroH2A1 or H3K27me3 on HSV-1 replication and egress. Knocking out macroH2A1 or depleting H3K27me3 with high concentrations of tazemetostat depleted heterochromatin at the nuclear periphery, may not have affected HSV-1 protein expression and modestly inhibited the production of cell-free infectivity and HSV-1 genomes. macroH2A1 deposition was affected by infection, creating new heterochromatin domains which did not correlate directly with the levels of expression of the genes in them. The authors conclude that heterochromatin at the nuclear periphery dependent on macroH2A1 and H3K27me3 are critical for nuclear egress of HSV-1 capsids.

      The experiments leading to the conclusion that HSV-1 capsids egress the nucleus through channels in the peripheral chromatin confirm previously published results (https://doi.org/10.1038/srep28844). The previously published EM micrographs show a much larger number of nuclear capsids, more consistent with the images in the classical literature, even in conditions when nuclear egress was not inhibited. Figures 1 and 4 show scarce nuclear capsids, even under the conditions when nuclear egress should be inhibited according to the model and analyses. The large enrichment in nuclear capsids in KO cells predicted by the model is not reflected in figure 4a, which shows only a modest increase in nuclear capsid density (the total number of nuclear capsids would be more informative). The number or density of nuclear capsids is not shown in H3K27 "depleted" cells. The robustness of the analyses of the number of capsids at the membrane in H3K27 "depleted" cells is unclear. For example, the analyses could be repeated with different cut offs, such as 2 or 4. If they are robust, then the conclusions will not change when the cutoff value is changed.

      We appreciate the reviewer’s observation that to number of capsids we show differs from those published in the publication by Myllys et al. (Sci Rep 2016 PMID 27349677). It is important to note there are several differences between our study and that of Myllys et al. that explain the difference. First, as reviewer 1 pointed out, the Myllys et al. study used three-dimensional soft X-ray tomography combined with cryogenic fluorescence and electron microscopy to observe capsids in 3D rendered nuclei. Since our method uses only single ultrathin 50nm slices of cells, we cannot visualize the total number of capsids per nucleus, rather only per slice, which is why we have averaged slices of many nuclei to generate a statistical comparison between macroH2A1 KO or H3K27me3-depleted and control cells treated at the same time (see response to reviewer 1). Furthermore, these other methods are specialized techniques for 3D imaging that are beyond the scope of our study. Second, the Myllys et al. paper used B cells which are much smaller than HFFs, lending themselves to better tomography studies but not commonly used to study HSV-1 biology. Third, the Myllys et al. paper also used a different MOI and time point than we have. Taken together, these differences account for the disparity in visualizing capsids which is why we quantified capsid number across many images.

      We agree with the reviewer that our quantification in the H3K27me3-depleted cells compared to control was somewhat arbitrary. As stated in the response to Reviewer 1 above, in our revised manuscript we have now repeated this quantification to accurately reflect the phenotype observed, that is the chains of capsids lined up at the inner nuclear membrane. To do this, we used two measures: 1) the distance from the INM as less than 200nm and 2) the distance from other capsids as less than 300nm. Taking into account these two measures, we quantified the frequency with which multiple capsids lined up at the INM in WT and H3K27me3-depleted conditions. This is represented in the new Figure 5d. In the WT setting, we observe most often 1 single capsid at the INM, with a small fraction of 2 capsids. However, in the H3K27me3-depleted condition, we observe much greater numbers of capsids at the INM more frequently, as many as 16 at a time, leading to an average of 2-3 capsids at any single location. The source data for this figure are also provided. See lines 589 and Fig 5d.

      Furthermore, we have now also carried out live-imaging analysis of single capsids during infection which show the appropriate number of capsids expected when the full nucleus is visible. These results are presented in the new Figure 5 and EV5.

      The quantitation of the western blots present no evidence of reproducibility and/or variability. The number of biologically independent experiments analyzed must be stated in each figure and the standard deviation must be presented. As presented, the results do not support the conclusions reached. The quality of western blots should also be improved. it is unclear why figure 2b shows viral gene expression in wild-type cells only, and not in KO or H3K27me3 depleted cells, which are only shown in the supplementary information. These blots presented in Figure S5a and S5b are difficult to evaluate as the signal is rather weak and the controls appear to indicate different loading levels. These blots do not appear to be consistent with the conclusions reached. Some blots (VP16, ICP0 in HFF) appear to indicate a delay in protein expression whereas others (VP16, ICP0 in RPE) appear to indicate earlier expression of higher levels. The claimed "depletion of H3K27me3 is not clear in in figure S5d, in which the levels appear to be highly variable in all cases, without a consistent pattern, with no evidence of reproducibility and/or variability, and using a mostly cytoplasmic protein as loading control. All western blots should be repeated to a publication level quality, the number of independent experiments must be clearly stated in each figure, and the reproducibility and/or variability must be indicated by the standard deviation.

      *As reviewer 1 also pointed out, we appreciate that there is some variability with respect to the stated ‘increase’ in these heterochromatin marks during infection. As stated in response to reviewer 1, in our revised manuscript we have included a deeper analysis of these marks from global mass spectrometry that indicates an increase in total levels. Please see response to reviewer 1. *

      • *

      In the revised manuscript, we have now included mass spectrometry data mined from Kulej et al. that show peptide counts that reflect increases in the heterochromatin markers described (see new Figure EV1a). Despite this quantitative measure, upon rigorous replicates of our western blots as requested by Reviewer 2, we concluded that the increases originally described are somewhat inconsistent by western blot. This discrepancy between mass spectrometry data and western blot is likely due to the non-linear nature of antibody binding and developing of western blots by the ECL enzymatic reaction. Nevertheless, our genome-wide chromatin profiling showed consistent, reproducible, and statistically significant redistribution of macroH2A1 and H3K27me3 upon HSV-1 infection. Therefore, our revised manuscript now focuses on this redistribution as a reaction to infection and stress responses instead of a global increase as the original manuscript stated. See lines 174, 182, 196, 397 and Fig EV4b-c.

      • *

      With respect to viral protein levels, although there is slight variation in the levels of VP16 or ICP0 in RPEs compared to HFFs, we do not feel that this difference is biologically significant as several other measures of viral infection progression are unchanged (viral RNA, viral genome accumulation within infected cells). Furthermore, the significant difference in titers we observe is not explained by slight differences in ICP0 or VP16. Nevertheless, to document this variability in western blot and assuage any concern of impact infection progression, we have repeated each western blot presented in the paper three separate times and used these blots to quantify each relevant protein. Graphs of western blot quantitation can be found in each figure accompanying a western blot as follows:

      Western blots:

      Figures 3b-c, 4ab, EV1b, EV5a

      Quantitation of western blots:

      Figures 3d, 4c, EV1c, EV5b-f

      • *

      An enhanced analyses of the RNA-seq data, analyzing all individual genes rather than pooling them together, would provide better support to these conclusions. Then, the western blots are useful to show that the changes in mRNA result in changes in the levels of selected proteins.

      • *

      *We appreciate the reviewer’s interest in the RNA-seq data, however, we feel that reviewer has not understood the analysis we presented in the initial submission. To clarify, we calculated fold changes for individual genes and did not pool RNA-seq data anywhere in the manuscript. We show boxplots of log2 fold changes of individual genes. Boxplots enable summarization of the salient features of a distribution while still representing individual gene analysis. Here, the distribution being plotted is the log2 fold change of individual genes that intersect with macroH2A1 domains that change due to infection. As such, clusters 1-3 of macroH2A1 domains feature a loss in macroH2A1 due to infection and the boxplots show that the majority of genes are upregulated. To highlight this point further, in our revised manuscript we have included volcano plots of genes intersecting with each cluster also showing the split between the number of genes significantly upregulated and downregulated in each cluster at each time point (see new Figure EV3c). As expected from the boxplots, clusters 1-3 feature a much higher fraction of genes are significantly upregulated, whereas cluster 5 features a higher fraction of genes downregulated with concomitant increase in macroH2A1 due to infection. Taken together with the gene ontology analysis (new Figure Sd), these results support our model in which macroH2A1 is deposited in active regions to block transcription and promote heterochromatin formation. To further support these conclusions, we have also carried out analysis of 4sU-RNA data generated upon salt stress or heat shock and found that the regions defined by gain of macroH2A1 (i.e. clusters 5 and 6) also exhibit significant decreases in new transcription at just 1-2 hours after treatment. These data, which are presented in new Figure EV3b-c, strongly support our model in which macroH2A1 is deposited in genes downregulated upon stress response to generate new heterochromatin. *

      Figure S1 raises some questions about the specificity of the macroH2A1 antibody used for CUT&Tag. As expected CUT&Tagging the cellular genome in the KO cells with the specific antibody results in lower signal than with the IgG control antibody. In contrast, viral DNA is CUT&Tagged as efficiently in the KO as in the WT cells, and in both cases significantly above the IgG controls. The simplest interpretation of these results is that the antibody cross-reacts with a protein that binds to HSV-1 genomes. The manuscript must experimentally address this possibility.

      We agree with the reviewer that there is a possibility that antibodies cross react. However, we are confident that this is not the case in this scenario for the following reasons:

      • *

      *1 – We have carried out immunofluorescence analysis of macroH2A1 or H3K27me3 during HSV-1 infection and observe no overlap with ICP8 staining. We have included these images together with a histogram documenting the lack of overlap in the new Figure EV2f-g. *

      • *

      2 – CUT&Tag relies on the Tn5 transposase to insert barcodes into accessible regions of the genome. An inherent limitation of this method during viral infection is that the replicating viral genome is very dynamic and accessible, leading to easier and less specific insertion by the transposase. This is evidenced by the pattern of signal across the viral genome that is completely overlapping in the macroH2A1, H3K27me3 and IgG conditions. Snapshots of the full viral genome are now included in the new Figure EV2c-d.

      • *

      *Furthermore, using CUT&Tag with macroH2A1 antibody, we expect the transposition rate to be identical between WT and macroH2A1 KO conditions for the Ecoli and viral genomes. This is because we assume that the transposition in these two genomes is non-specific since there is no macroH2A1 present. Then, we expect the spike-in normalized CUT&Tag enrichment on the viral genome to be the same between WT and macroH2A1 KO conditions. Since IgG should not be affected by macroH2A1 KO, we expect the IgG enrichment to be same between WT and macroH2A1 KO conditions. Thus, non-specific background would result in higher enrichment in an apparent signal on viral genome in the macroH2A1 KO condition. *

      • *

      Combined with this expectation for background transposition and the following: 1) the distribution of the CUT&Tag signal across the viral genome is virtually identical between IgG, macroH2A1, and H3K27me3 CUT&Tag signal in WT and macroH2A1 KO cells (see new Figure EV2c-d), 2) that there is no colocalization between macroH2A1 or H3K27me3 with viral genomes by immunofluorescence (see new Figure EV2f-g), and 3) the whole genome correlation of the signals across CUT&Tag samples on the viral genome, but not the host, are virtually identical as presented in a heat map (see new Figure EV1g vs EV2e), we conclude that the viral CUT&Tag signal is noise. Therefore, any analysis of the signal on the viral genomes would not be biologically meaningful.

      • *

      Also, Figure S1 shows that the viral genome is CUT&Tag'ed with H3K27me3 antibody as efficiently in macro H2A1 WT and KO cells, and in both cases above the background signal from IgG control antibody. The authors conclude that the signal with the specific antibody "mirrors" that of the control antibody, but "mirroring" is not defined and the actual data show that there is a large increase in signal with the specific antibody. Not surprisingly, the background signal also increases, as the number of genomes increase while infection progresses. The authors conclude that "these results indicated that there was a significant background signal from the viral genome that could not be accounted for", but no evidence supporting this conclusion is presented. The data show clear signal above the background from the viral genome and that this signal is not affected by the presence or absence of macroH2A1. This section of the manuscript has to be thoroughly re-analyzed as there is clear H3K27 signal.

      *We agree with the reviewer that as presented in the current manuscript it seems as though there is a real H3K27me3 signal. However, as stated in the above comment, the pattern of this signal matches that of all other conditions, including IgG, suggesting it is not a real signal, cross-reacted or otherwise, but rather an artifact of the methodology. See new Figure EV2. *

      The concentration of tazemetostat used is high. Normally, concentrations of around 1µM are used in cells, and 10µM is often cytotoxic (for examplehttps://doi.org/10.1038/s41419-020-03266-3; https://doi.org/10.1158/1535-7163.MCT-16-0840). The effects on H3K27me3 presented in figure S1b appear to be normalized to mock infected treated cells. If so, they do not allow to evaluate the effectivity of the treatment. Cell viability after the four days treatment must be evaluated, the claimed "depletion" of H3K27me3 must be clearly demonstrated (the blots in figure S5 are not sufficient as presented), and levels of different histone methylations must be tested to support the claimed specificity of tazemetostat for H3K27me3 at the high concentrations used.

      *While we agree with the reviewer that the cytotoxicity of any inhibitor is an important aspect to take into account, in this instance the reviewer is incorrect. The reviewer has cited papers that highlight the potential use of tazemetostat as a cancer-cell specific treatment for colorectal and B-cell cancers. In both of these cases, the primary conclusion is that tazemetostat’s cytotoxic property is largely corelated to mutation in EZH2. In fact, WT EZH2 treated cells had a more “cytostatic” response, which shows that tazemetostat is not toxic with WT EZH2 (Brach et al. Mol Cancer Ther. 2017, PMID 28835384) as is the case in our system. Furthermore, the Tan et al. study shows a non-transformed human fibroblast (CCD-18co) and embryonic colon epithelial (FHC) as “healthy controls” for their work in colorectal cancer cell lines in Figure 1D. These 2 cell lines, which are comparable to the WT HFF cells we used, show no reduction in viability at a log fold greater concentration than the 10 µM used in our paper. *

      • *

      *Nevertheless, we agree with the reviewer that cytotoxicity should be formally ruled out. In our original experiment, we recorded cell counts at the harvested mock, 4-, 8-, and 12 hpi and found no difference in the number of cells over the course of infection (see new Figure EV3e). We also used trypan blue staining as a measure of cell viability upon tazemetostat treatment and found no toxicity. These results are presented in new Figure EV3f. *

      Furthermore, we agree with the reviewer that total H3 levels by western blot should be included in any comparison of H3 modification. While these were included in some figures, they were unintentionally omitted in others. In our revised manuscript we have now included these blots together with quantification of triplicate biological samples of H3K27me3 levels normalized to total H3. See new Figures 3, 4, EV1, and EV5.

      • *

      Minor comments. Reference No.27 is misquoted in lines 250-251, which state that it shows that "HSV-1 titers, but not viral replication, where reduced upon EZH2 inhibition." The reference actually shows inhibition of HSV-1 infectivity, DNA levels and mRNA for ICP4, ICP22 and ICP27. This reference uses much shorter treatments (12 h and only after infection). It also shows that inhibition of EZH2/1 up regulates expression of antiviral genes.

      *We appreciate that the reviewer has pointed out a discrepancy between our results using an EZH2 inhibitor (tazemetostat) and those from reference 27 (Arbuckle et al., mBio, 2017 PMID 28811345) that requires clarification. The reviewer states that the treatments were 12 hours after infection, however, this is incorrect. In the Arbuckle et al. study, the authors used multiple different inhibitors at high doses for short treatments before infection and noted that this caused an upregulation in antiviral genes that blocked infection progression of multiple viruses including HCMV, Ad5 and ZIKA. Importantly, these genes include multiple immune signaling and interferon stimulated genes. In our study, we specifically use a much lower dose of EZH2 inhibitor, with respect to the IC50 value, and waited 3 days to ensure a steady state. In our system, any initial burst of immune response from the inhibitor would likely have subsided by the time we do our infection. Furthermore, supplemental figure EV1 from the Arbuckle et al. study states that EZH1/2 inhibitors do not affect nuclear accumulation of viral genomes and suppress HSV-1 IE expression in an MOI-independent manner (Arbuckle et al. Supplemental Figure 1). These results in fact support our conclusions that it is not any antiviral effect of inhibition of EZH2 that causes the decrease in titers that we observe. *

      • *

      To clarify, the IC50 value of the inhibitors used in the Arbuckle et al. study are 10 nmol/L (GSK126) and 4 nmol/L (GSK343). The IC50 is a measurement used to denote the amount of drug needed to inhibit a biological process by 50% and is commonly used in pharmacology to compare drug potency. In the Arbuckle et al. study, GSK126 was used at a concentration range of 15-30 µM, that is 1500-3000x more than the IC50 level as converted from nmol/L to µM, and GSK343 was used at a concentration range of 20-35 µM, that is 5000-8750x more than the IC50 level, to see changes in viral mRNA levels. The IC50 value for tazemetostat is 11 nmol/L which means that one would need to use a much higher molarity of tazemetostat, at least 28 µM which would be 2500x the IC50 value, to achieve the comparable biological changes as the inhibitors used in the Arbuckle et al. study. Thus, we are confident that the 10 µM concentration used in our study is an appropriate and non-toxic amount that would not impact antiviral responses at the dose and times that we used. As shown above and reported in multiple studies (for example: Knutson et al. Molecular Cancer Therapy 2014 PMID 24563539, Tan et al. Cell Death and Disease 2020 PMID 33311453 cited above, and Zhang et al. Neoplasia 2021 PMID 34246076, among others) the concentration of tazemetostat that we used is not toxic to the cells. Importantly, it was also reported that a global decrease in H3K27me3 by EZH2 inhibition using a 10 µM concentration of tazemetostat (here referred to by the identifier EPZ6438) did not impact HSV-1 RNA transcript accumulation measured by bulk sequencing (Gao et al. Antiviral Res 2020 PMID 32014498), consistent with our findings.

      • *

      In our revised manuscript, we have now included a discussion of these important points. See lines 409-428.

      HFF are primary human cells but they are fibroblasts whereas the primary target of HSV-1 replication is epithelial cells. The wording used "they represent a common site of infection in humans" must be edited

      We agree with the reviewer and have updated the text. See lines 109.

      Disruption of macroH2A (1 and 2) results in general defects in nuclear architecture, not just peripheral chromatin (https://doi.org/10.1242/jcs.199216;, see also figure 1c and 5a, presenting invaginated and lobulated nuclei). The manuscript would benefit from including a broader discussion of the effects of macroH2A defects on the general nuclear architecture.

      • *

      We agree with the reviewer and our revised manuscript now includes a more in-depth discussion of the impact of macroH2A and other heterochromatin marks on nuclear structure. See lines 373-374 and 394.

      The title should be edited, as "egress" in virology is commonly used to refer to the egress of virions from the cell, not to the nuclear egress of capsids. Adding the words nuclear and capsid should be sufficient to address this issue.

      *We agree with the reviewer and will update the title to read “HSV-1 exploits host heterochromatin for nuclear egress”. Given that we are measuring multiple aspects of infection, we feel that adding the word ‘capsid’ is not necessary. *

      It is unclear why preferential changes in expression of housekeeping genes would indicate "stress responses to infection". The rationale for this conclusion must be fully articulated and supported.

      We agree with the reviewer that it may not be immediately clear as to why changes in house-keeping gene expression represent a stress response. In a recent study that we cite in our manuscript, Hennig et al. (PLOS Path 2018 PMID 29579120) demonstrate that changes in chromatin accessibility and gene transcription during HSV-1 infection resemble those that occur upon heat shock or salt stress. These results strongly support the model that global transcription changes caused upon stress (heat, salt, infection etc.) result in dramatic alterations to chromatin structure. In support of this notion, in our revised manuscript we now include analysis of these datasets based on our macroH2A1-defined clusters. Importantly, we found that the regions defined by gain of macroH2A1 (i.e. clusters 5 and 6) also exhibit significant decreases in new transcription at just 1-2 hours of exposure to salt and heat stress. These data, which are presented in new Figure EV3b-c, strongly support our model in which macroH2A1 is deposited on active genes to generate heterochromatin as a response to the stress of infection. We also discuss these results further in the revised manuscript, see lines 210-220, 233-236, and 424-426.

      Statistical methods must be fully described in materials and methods and the number of biologically independent experiments must be stated in each figure.

      *We agree with the reviewer and have included these details in each figure legend. *

      Reviewer #2 (Significance (Required)):

      The major strengths of the manuscript lie on the comprehensive analyses of the effects of knocking histone macroH2A in the nuclear architecture and chromatin organization. These analyses indicate that peripheral heterochromatin is defective in the KO. Another strength lies on the analyses of the news heterochromatin domains in HSV-1 infected cells. The relationship between the lack of correlation between the changes in gene expression and global heterochromatin domains defined by macroH2A1 with the main conclusion is less clear.

      The major weakness is that the data presented do not strongly support the conclusions. Additional experiments are required to support the main conclusion that the effects in peripheral heterochromatin result in a biologically significant effect on capsid egress. The authors should also consider that the additional experimentation may not support the conclusion that macroH2A or H3K27me3 play critical roles in the nuclear egress of capsids.

      • *

      *To support our conclusions, we have carried out an entirely different set of experiments to track capsid movement. Bosse et al. PNAS 2015 PMID 26438852 and Aho et al. PLOS Path 2021 PMID 34910768 use live-imaging and single-particle tracking to characterize capsid motion relative to host chromatin. These approaches allowed the authors to discover that infection-induced chromatin modifications promote capsid translocation to the INM. They showed that 1) HSV-1 infection alters host heterochromatin such that open space is induced at heterochromatin boundaries, termed "corrals", in which viral capsids diffuse and 2) the movement of viral capsids through the host heterochromatin is the rate limiting step in HSV-1 nuclear egress. *

      • *

      To test our hypothesis that macroH2A1-dependent heterochromatin specifically is required, we collaborated with Dr. Jens Bosse to carry out these same experiments in our macroH2A1 KO and paired control cells. We tracked RFP-VP26 using spinning-disk confocal live imaging to track individual capsid movement within the nucleus. We found that capsids in cells lacking macroH2A1 traveled much shorter distances on average. This is represented graphically by the mean-square displacement (MSD) of capsid movement in macroH2A1 KO cells plateauing at ~0.4 µm2 vs 0.6 µm2 in WT cells, which represents the size of the “corral”, or space through which capsids diffuse. The average corral size in macroH2A1 KO cells is ~300 nm less than the average corral size in WT cells (two-thirds the size). These results are consistent with the finding that macroH2A1 limits chromatin plasticity both in vitro (Muthurajan et al. J Biol Chem 2011 PMID 21532035) and in cells (Kozlowski et al. EMBO Rep 2018 PMID 30177554). These data strongly support our hypothesis that macroH2A1-dependent heterochromatin is critical for the translocation of HSV-1 capsids through the host chromatin to reach the INM. Furthermore, these data support the model in which macroH2A1 allows for the increase of open space induced during infection. Loss of this open space restricts the movement of capsids in the nucleus, as quantified by our live-imaging experiments. These data are now included in the new Figure 5 and EV5 and described in lines 348-372 and 1011-1037.

      • *

      NOTE: These experiments were done in a separate lab using the same cells and MOI we used for our TEM studies. It is important to note that because this was done by live imaging where the full nucleus and cell are visible, the appropriate number of capsids is apparent.

      Another major weakness is that the results of CUT&Tag of the viral genome are dismissed without proper justification. The authors conclude that the results invalidate the assays, but the results are consistent with cross-reactivity of the macroH2A1 antibody with another protein that interacts with the viral genomes and with H3K27me3 being associated with the viral genomes irrespectively of macroH2A1.

      *We agree with the reviewer that as presented the viral genome reads were dismissed without thorough justification. As stated above, we are confident that the patterns we detected do not represent a biologically relevant signal but rather an artifact of the experimental set up. Furthermore, it is well known in the field that normalizing replicating viral genomes during lytic infection in any kind of chromatin profiling technique is fraught with inconsistencies as each cell may have a different copy number of viral genomes at any given time point. Therefore, we feel strongly that any analysis of the viral genome chromatin profile during a lytic replication at this point in time would require single cell sequencing which is beyond the scope of this study. We appreciate that this was not clearly presented in the original manuscript and in our revised submission we have included a full supplemental figure documenting the negative data that support our conclusions (see new Figure EV2). *

      If the authors had additional data supporting the claim that these results do not reflect cross-reactivity or association with the viral genomes, these data must be presented. Without that additional data, the conclusions are not supported and these discussions must be removed from the manuscript. The authors may still opt to not analyze any association with the viral genomes, but they should not dismiss them as artifactual without actual evidence to support this claim. Previously published literature is also misquoted.

      This study makes an incremental contribution to the previously published evidence showing that HSV-1 capsids egress the nucleus through channels in between the peripheral chromatin. It shows that disruption of the heterochromatin at the nuclear periphery, and the nuclear architecture in general, may have a modest effect on capsid egress. This information may be of interest mostly to a specialized audience focused on the egress of nuclear capsids.

      While we agree with the reviewer on many points as stated above, we respectfully disagree that our study is merely an incremental contribution of interest only to a specialized audience focused on nuclear egress. As reviewer 2 states earlier, the strength of our study lies in the “comprehensive analyses of the effects of knocking histone macroH2A in the nuclear architecture and chromatin organization”, which would be of interest to a broad chromatin audience as well as virologists. Together with the new data presented here and a revised manuscript, we feel that our study would be of interest to a broad audience in the chromatin and virology fields as reviewers 1 and 3 also pointed out. Chromatin is generally analyzed in the context of how it might affect gene expression and the impact of chromatin on biological processes such as viral infections, and its structural role in the nucleus is not commonly considered. Here, we demonstrate an important example of the glaring effects of chromatin structure on the biological nuclear process of infection.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Lewis et al. reveal an unexpected role for heterochromatin formation in remodeling the nucleus to facilitate egress of the nuclear-replicating virus HSV1. By performing TEM in HSV1-infected primary human fibroblasts, the authors show that capsids accumulate at the inner nuclear membrane in regions of less densely stained heterochromatin, in agreement with studies in established cell lines. The authors go on to reveal that heterochromatin in the nuclear periphery of HSV1-infected primary fibroblasts was dependent on the histone variant macroH2A1 and is enriched with H3K27me3.CUT & Tag was used to profile macroH2A1 over time during lytic HSV1 infection and showed that both macroH2A1 and H3K27me3 were enriched over newly formed heterochromatic regions 10s-100s of Kb in length in active compartments. Remarkably, loss of macroH2A1 or H3K27me3 reduced released, cell free infection virus progeny and increased intranuclear capsid accumulation without detectably impacting the proportion of mature genome containing capsids, virus genome or protein accumulation. Their finding that newly remodeled heterochromatin forms in HSV infected cells and is a critical determinant for the association of capsids with the inner nuclear membrane is consistent with a critical role in egress.

      I have only relatively minor editorial suggestions listed below to improve the manuscript:

      Line 92: This subtitle should be revised to more precisely state the findings shown in the Fig 1 data. While the first part of the statement "HSV1 capsids associate with regions of less dense chromatin" is consistent with what is shown, the final phrase "...to escape the nucleus" is an interpretation of the data inferred from the static image.

      We agree with the reviewer and have amended our text to more accurately describe the figure. See lines 138-139.

      Line 96: I am not sure the statement that fibroblasts represent a "common" site of infection is supported by ref 15. FIbroblasts do, as indicated in ref 15, express the appropriate receptor(s) for virus entry and in culture support robust virus productive growth. However, in human tissue, infection of dermal fibroblasts appears rare, suggesting it may not be a "common" site of infection (PMCID: PMC8865408). Maybe simply revise wording to indicate fibroblasts represent "a site of infection or can be infected in tissue?".

      We agree with the reviewer, as was also pointed out by reviewer 2, and have amended the text. See lines 109.

      Line 126-127: As written it states that "....regions of the host genome that increase during infection", implying these genome regions are amplified (increase). I think the authors mean that infection increases binding of mH2A1 and H3K27me3 to broad regions of the host genome. Please clarify.

      We agree with the reviewer that this was written ambiguously. As was pointed out by reviewers 1 and 2, the increase in these marks depends on the type of measurement. Therefore, we have modified the text in a revised manuscript to focus instead on the redistribution of these marks during infection. See line 138-139.

      FIgS1, a,b,c,d: please indicate that 4,8,12 indicate hpi, correct? And indicate that in the legend M indicates Mock.

      This is correct and we have updated this in the figure legend. See lines 625-627.

      Line 197: "active compartments". Do the authors mean transcriptionally active compartments? Please clarify

      This is correct and have clarified this in the text. See line 248.

      Line 232: please replace "productive" with "infectious"

      We agree with the reviewer and have amended our text. See line 295.

      Line 233 - The authors conclude mH2A1 is important for egress, ruling out assembly before even bringing it up. As I read on, it is clear the authors addressed this important issue later on in the manuscript. That said, it was a bit jarring to conclude egress is important without addressing the assembly possibility at this juncture in the manuscript. One way to remedy this would be to move the Fig S6 assembly/capsid type data (lines 286-297, Fig S6) and surrounding text earlier to support the conclusion that mH2A1 did not detectably influence assembly, but is important for egress.

      *We agree with the reviewer that the order of presentation makes it difficult to follow. Our revised manuscript now includes these important data within the same figure. See new Figure 5. *

      Line 244: "progeny production" - it would be helpful to specify "cell free or released infectious virus progeny"

      Line 248: change "produced" to released"

      Line 273 replace "productive" with "infectious virus progeny released from infected cells"

      Fig S5c: Was the plaque assay performed on cell free supernatants? This should be indicated.

      We agree with the reviewer and have made all these changes in the text. See lines 285-287.

      Reviewer #3 (Significance (Required)):

      The experiments are well executed, the data are solid with appropriate statistical analysis and their analysis sufficiently rigorous, and the manuscript is clearly written. Moreover, the finding that HSV manipulates host heterochromatin marks to facilitate nuclear egress is significant and exciting. The work reveals an unexpected role for newly assembled heterochromatin in egress of nuclear replicating viruses like HSV1.

    1. Author Response:

      We thank the editors and reviewers for their assessment of our manuscript, and their agreement that we present compelling evidence for post-transcriptional regulation of AURKA through the 3’UTR.

      In response to Reviewer 1, we acknowledge that much of our study is performed exclusively in U2OS cells, and that study of alternative polyadenylation in additional cell lines would serve to further generalize our findings. However, as U2OS are a well-known model cell line for cell cycle studies we believe our demonstration of cell cycle regulation of AURKA through its 3’UTR offers a depth of understanding that is perhaps of greater interest than confirming the existence of alternative AURKA 3’UTRs in additional cell lines, using our methods. We note that the recent rapid growth in RNA seq data resources allows easy confirmation of the broad existence of alternative polyadenylation events on a genome-wide scale. For example, AURKA-specific data extracted from a recent benchmark study of Nanopore long read RNA sequencing (Chen et al., 2021) clearly shows the existence of two distinct AURKA 3’UTRs differentially expressed between a number of different cancer cell lines. In addition, a recent study investigating the landscape of APA at single-cell resolution detected AURKA APA isoforms in HeLa and MDA-MB-468 cell lines (Wang et al., 2022). Their study further identifies AURKA among genes showing negative correlation between generalized distal polyA site usage index (gDPAU) and expression levels, meaning preference to use the proximal polyA site when expression levels increase, and include AURKA in the gene cluster showing slight increase in usage of the distal polyA site from G1 to M phase (Wang et al., 2022). Both studies are in support of the evidence presented in our manuscript.

      We agree with Reviewer 2 that better information on translation rates would improve our understanding of the impact of translation regulation on AURKA levels. Some insight on the translation rate of AURKA in the cell cycle can be derived from inspection of the ribosome profiling dataset published by Tanenbaum et al., 2015. From their analysis, translation efficiency of AURKA mRNA in G2 is 1.59 times that in G1 and in G1 it is 0.69 times that in M phase, whilst in G2 it is 1.10 times higher than in M. Such data reveal a reversible increase in translation of AURKA mRNA, alongside other mitotic regulators, in preparation for M phase (Tanenbaum et al., 2015). These results are in accordance with our findings that translation rates contribute modestly to cell cycle changes in AURKA levels in normal cells, and we concur with Reviewer 3’s comment that the contribution of increased translation rate to AURKA levels at mitosis is less than the change in mRNA levels at this point in the cell cycle.

      We think the significance of the regulatory mechanism we describe lies rather in the large effect it has on AURKA levels in interphase (when AURKA expression is normally repressed at both mRNA and translation rate). We hypothesise that it is interphase regulation that may be relevant to roles of AURKA in cancer (and to the association of APA with cancer) (Bertolin and Tramier, 2020; Naso et al., 2021). It is indeed the case that (i) AURKA regulation by miRNA, (ii) cooperation between APA and translation and (iii) cell-cycle dependent control of AURKA at the translation level, are already known. We believe the novelty of our study lies in drawing together these elements to provide new insight into AURKA regulation, using tools that allow similar investigation of other APA events, and contributing new ideas for future therapeutic interventions for disease proteins regulated via APA.

    1. These findings suggest that ToM-like ability (thus far considered to beuniquely human) may have spontaneously emerged as a byproduct of language models’improving language skills.

      How can we be sure that ToM is uniquely human?

      What kind of tests have been administered on chimps, dolphins etc? We shouldn't equate that they're unable of ToM ability just because they can't tell us what they think some other being is thinking. (language barrier).

      Moreover, ToM ability probably breaks down for humans if they have to infer what a member of another species is thinking (e.g Try to get a human to tell you what a chimp, dolphin or bat is thinking)

    1. Propuso el Memex en As we may Think.

      ¿Podría considerarse Memex como el precedente de Google?, pues si bien no estuvo materializado se parte de la idea primaria de víncular muchos textos al tiempo

    1. Can we devise solutions that aren’t reactive and ad hoc, and aren’t bogged down by accusations of partisan bias? One idea is to treat fake news as a distribution problem, treating it more like spam. Spam is something the platforms already understand and deal with.

      I think this is an extremely interesting aspect of this article. Another way of understanding fake news is to realize that it is a form of spam. Spam emails, calls and texts are almost always labeled spam and wind up in our spam inboxes/folders. If social media companies created better filter systems for weeding out fake news and identifying them as spam like, the circulation of these fake stories may decrease.

    1. In universal design, the goal is to make environments and buildings have options so that there is a way for everyone to use it22. For example, a building with stairs might also have ramps and elevators, so people with different mobility needs (e.g., people with wheelchairs, baby strollers, or luggage) can access each area. In the elevators the buttons might be at a height that both short and tall people can reach. The elevator buttons might have labels both drawn (for people who can see them) and in braille (for people who cannot), and the ground floor button may be marked with a star, so that even those who cannot read can at least choose the ground floor.

      I think this approach is the most widely applicable solution for the disability community. Strategies that contrast with assistive devices that are expensive or try to make them "normal" are changing the group itself. If changes are made from the designer's point of view, this transpersonal strategy can protect the disability community to the broadest extent. Because we need to respect them as they are, not force them to change in order to fit in.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01784

      Corresponding author(s): Felipe, Court

      1. General Statements [optional]

      We submit a revision plan for our manuscript “Senescent Schwann cells induced by aging and chronic denervation impair axonal regeneration after peripheral nerve injury” by Fuentes-Flores et al. from the groups of Felipe Court, Judith Campisi, Jose Gomez, and Ahmet Hoke.

      One of the greatest challenges in the field of peripheral nerve regeneration is the decrease in the nerve regenerative capacity in aged patients or after delayed repair, a condition also known as chronic denervation. For the last two decades, several research groups have focused on understanding this phenomenon, but the main drivers of unsuccessful regeneration and poor functional recovery have been elusive, remaining an important clinical problem.

      In the work described in this manuscript we found an unexpected property of Schwann cells in the denervated nerves. Aged and chronically denervated Schwann cells are not just passive participants in the impaired regeneration process, but they actively inhibit the regeneration of peripheral axons. Using a combination of morphological, behavioral and molecular techniques in a collaborative multi-lab approach we demonstrate for the first time that senescent Schwann cells accumulate in aged or chronically denervated peripheral nerves modifying the nerve environment, increasing proinflammatory and regeneration-inhibitory factors. Elimination of senescent Schwann cells using a systemic intervention with senolytic or genetically targeting p16-positive senescent cells, greatly improve axonal regeneration in both chronic denervation and aging conditions. Importantly, the enhanced axonal regeneration observed after senescent cell elimination is accompanied by improved functional recovery after chronic denervation. Chronic denervation and aging are the main clinical problems associated to peripheral nerve injuries. Our approach, using FDA approved drugs currently in clinicals trials for its application as senotherapeutics, effectively broadens the spectrum of its clinical use and effectiveness.

      We foresee this work will be of interest to a wide audience, including experts in nerve regeneration, senescent cells, aging and those studying the effect of chronic insults in regenerative medicine.

      We have now received the comments from two reviewers and we are prepared to experimentally approach the issues raised. We thank their criticism and suggestions, as well as their very enthusiastic comments. We are extremely pleased as both reviewers recognized the important implication of this work, from reviewer 1:

      “The findings reported in this manuscript are very interesting and will move the field of nerve repair forward. This paper will be of interest for basic science audience in the fields of aging and neurobiology and has also potential interest to the broader clinical and translational fields. Indeed, this paper provides data that Schwann cells entering a senescent stage not only fail to support axon regeneration in aged animals, but actively inhibit axon regeneration…. Furthermore, the use of an FDA approved drug, currently in clinicals trials for its application as senotherapeutics, to increase axon regeneration in aged and chronic denervation conditions will provide new avenues for clinical applications”.

      Which is backed up by reviewer 2:

      “Overall, this is an interesting study that undertake fundamental question in the field of nerve physiopathology and also could open a good opportunity in developing therapeutic strategies for translational research”.

      We understand the reviewers have raised issues associated with the manuscript format and we are prepared to profoundly edit the manuscript as suggested. In addition, after discussion the experimental issues raised by the reviewers, we are prepared to perform all the experiments and controls suggested (some of them are currently underway), including new animal experiments and in vitro work. This information is detailed in the point-by point revision plan below.

      Thank you in advance for the consideration and we look forward to hearing from you in due course. Please do not hesitate to contact me if you want to discuss anything associated to the manuscript and the revision plan.

      2. Description of the planned revisions

      Point-by point Reponses and revision plan in blue

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary This manuscript by Fuentes-Flores et al reports that elimination of senescent Schwann cells by systemic senolytic drug treatment or genetic targeting improves nerve regeneration and functional recovery in aging and chronic denervation. This improved regeneration is associated with an upregulation of c-Jun expression. Mechanistically the authors provide data to show that senescent Schwann cells secrete factors that are inhibitory to axon regeneration. These findings are very interesting and move the field forward beyond the notion that Schwann cells fail to support axon regeneration in aged animals and identify potential targets to enhance nerve repair. The use of a senolytic drug to increase regeneration in aged and chronic denervation conditions provides new avenues for clinical interventions. However, some of the claims are overstated since this is not the first characterization of senescent Schwann cells and the manipulations used in the study are not entirely specific for Schwann cells. The manuscript is also poorly written and difficult to follow, given the complex set of surgeries and terminology, and lack of explanation of the rationale for the surgery model used. Figures are poorly labeled and difficult to follow without figure legends, and figure legends do not match the figures.

      We thank the reviewer for the positive comments, we also acknowledge the problems detected in the manuscript format, including the lack of a clear explanation of the complex procedures used. We are prepared to work carefully on the format, including clear explanations of the procedures and new schemes to complement the text. As detailed below we are also prepared to perform all the experimental work proposed by the reviewer, which will strengthen the conclusion of this manuscript.

      Below are suggestions for improvements:

      Major comments:

      • In the axon regeneration assays, how is the reconnection site defined in longitudinal images stained for SCG10? Of particular concern is that Figure 1B "adult chronic dmg" nerve section image appears to be identical to the image in Figure 5A "Vehicle adult (47dpi)". However, the reconnection site is located at different sites along the nerve. Also, the scale bar appears identical but the legend states different sizes. In Figure 1 chronic damage is 42dpi, and figure 5 is 47dpi, yet with what appears as the same image.

      Revisions incorporated in the transferred manuscript, see section 3, below.

      • The authors need to provide a rationale for the choice of this complex injury model and what are the advantages over other models. Please clearly describe the timepoints for each experiment and why the time points were chosen for analysis. Provide the scheme of injury in the main Figures to ease comprehension. A scheme is provided in what appears to be Figure S2, but the legend of Figure S2 does not match. Please compare same time points between aged and adults. Days post injury is sometimes referred to 7 or 42, and it is difficult to follow if it is days post initial transection of the tibial nerve or days post reconnection of transected tibial to peroneal. Revise all Figure legends and supplementary Figure legends to match figures.

      We thank the reviewer for this comment. In a revised manuscript we will provide a clear explanation for the injury model used, as well as references, including one from the group of Tessa Gordon that describe for the first time this model in rats (PMID: 30215557), and the one applying this model to the mouse from the groups of Rhona Mirsky and Kristjan Jessen (PMID: 33475496). Briefly, this model has two advantages: first, it allows to generate chronic denervation for months and then be able to connect the distal denervated stump with a proximal one without the need of a nerve bridge; And second, neurons in the different groups that have different denervation times (1 week versus 6 weeks) are all damaged at the same time, eliminating variability associated to chronic axonal damage. We will include this information in the results section of a revised manuscript along with the above references.

      We will include schemes of the injuries performed in each experiment in each figure, also adding a timeline. This is an excellent suggestion to clearly understand the different procedures performed. We will also check all figures and legends, including correcting the problem detected by the reviewer (legends of Figures 2 and 5 were swapped). We understand the problem of referring to days post-injury, then we will introduce a new form of referring to the initial transection and the experiment, which include reconnection. Adding schemes per figure will also help to understand the different timelines used in different experiments, including the ones using the senolytics.

      As detailed above, we will perform a very careful revision of the text and legends for consistency.

      • The authors' main conclusion is that Senescent Schwann cells inhibit axon regeneration. The authors need to tune down this statement and acknowledge that their manipulations are not entirely Schwann cells specific. While the data nicely shows a contribution of senescent Schwann cells, it does not sufficiently acknowledge the possibility that other senescent cells in the nerve contribute to this effect. First, the authors refer in the discussion that 60% of the senescent cells are SOX10 negative, and thus represent other cells beyond Schwann cells. This quantification needs to be shown in Figure 1. Second, the genetic and pharmacology manipulation eliminate all senescent cells, including Schwann cells. Third, while the culture experiment may be Schwann cells specific, the authors need to provide detailed information on how they purify these cells, how they induce repair Schwann cells (rSC) as claimed in Figure 3, and demonstrate whether these are pure Schwann cells. This is important because other cells in the nerve contribute to nerve repair, including mesenchymal cells. Finally, the claim that using Mpz-cre will lead to c-jun overexpression only in SC also needs to be demonstrated, since Mpz is also expressed in satellite glial cells in the DRG.

      We thank the reviewer for these comments and suggestions. We will tone down the statement that senescence Schwann cells are the only cell candidates for modulating regeneration. We discussed this in the original manuscript, but we agree we need to review this statement, including new data detailed below.

      We will include the data requested by the reviewer (60% SOX-10 negative senescent cells) in a new graph in Figure 1. Also, we are currently performing new experiments and quantifications using specific markers for macrophages, epithelial cells, and fibroblasts, to identify the cell identity of the 40% SOX-10 negative senescent cells in aging and chronic denervation.

      Regarding in vitro experiments, we will provide detailed methods for Schwann cell purification, and induction of rSC phenotype. Related to the purity of these cultures, in past experiments we have obtained numbers ranging from 95-98% of Schwann cell purity; we will repeat these experiments and quantification for this manuscript and include this data in the method section of a revised text.

      Regarding c-jun overexpression in the Mpz-cre, we agree with the reviewer that there is probably overexpression in satellite glial cells in the DRG. Satellite glial cells (SGCs) are a subset of cells in the Schwann cell (SC) lineage that express several early myelination markers, such as Mbp, Mag, and Plp, and the transcription factor Sox10. SGCs express early SC markers, such as CDH19, and are transcriptionally and morphologically similar to SCs, even in the absence of axonal contact. Regarding the possibility that SGCs are contributing to the enhanced regeneration presented by mice with c-jun overexpression, this issue was somehow approached previously by Wagstaff et al. (PMID: 33475496), as they showed that in this mouse strain, increased axonal regeneration was equally observed in sensory neurons, in contact with SGCs, and motor neurons, which are not associated to SGCs. This observation suggests that the effect is associated with c-jun overexpressing Schwann cells in the distal stump. In addition, in our work, the changes in senescent cells observed in the c-jun overexpressed mouse, were associated to the distal nerve stump, which was mechanically separated from the proximal region. We agree it is important to include this discussion, and we will do so in the revised manuscript.

      • In Figure 5, c-jun is shown after denervation (42 dpi). The results describe 28 days of denervation, 5 days of GCV and 7 days post reconnection, which makes 40 days. If that is not the case, results need to better explain timeline of this procedure. Also, what is the basal c-jun expression in p16-3MR mice? In addition to the number of c-jun positive cells shown in Figure 5G, the authors need to quantify the percent of c-jun puncta that co-localize with Sox10. The size of the c-jun puncta appears different in size in vehicle and GCV, is that an expected phenotype?

      As expressed above, we will include schemes and timelines for all the surgical experiments in a revised manuscript, including detailed information for the experiments using the p16-3MR mice.

      Regarding c-jun expression in the p16-3MR mice, we are currently performing the suggested control experiment which is important to draw conclusions of this research. We will use immunofluorescence, but also include western blots as an extra analytical method in this and other experiments. All this information will be included in a revised manuscript.

      The observation of the apparent difference c-jun puncta is intriguing. Is not an expected phenotype, but it will important to check if there is a quantifiable change in the pattern of expression. We will quantify this in the different groups and include the results in a revised Figure 5.

      Minor comments

      • Improve labeling of Figures or at the very least describe in the Figure legend. For example: Figure 5B-C, which of the graphs is from adult mice and which is from aged mice?

      We are sorry about the lack of clear labeling in the figures; we will carefully review all figures in the manuscript and their corresponding legends, adding better labeling. Labelling of Figure 5B-C has been corrected.

      • The authors need to carefully describe where the high magnification images were taken in the injured nerve and keep the comparison at same site between groups. Please check the scale bar for each image. For example, the images in Figure 1D/F/I/K used same scale, but the cell size and cell morphology are different. The images for split individual channels need to match the merge channel images. For example, the individual channel and merge images are not properly aligned in Figure 4C, ABY-263 group.

      As suggested by the reviewer, we will show the regions in which high magnification were taken. All quantifications were performed in comparable sites, but we will include information in a revised manuscript to clearly describe the methodology used. We will check all scale bars in a revised manuscript.

      For the comment on alignment problems we have incorporated this in the transferred manuscript, see section 3 below.

      • The analysis method used to quantify axon regeneration should be consistent throughout. For example, in Figure 1C, number of axons/nerve width(um) was used for regeneration assay, but axon density (width corrected) was used in Figure 5B-C in regeneration assay.

      Revisions incorporated in the transferred manuscript, see section 3, below.

      **Referees cross-commenting**

      I agree with all Referee #2's comments. Both sets of comments are important, complementary and point to the same major concerns that need to be addressed. Agree as well that both reviewer think this is an interesting and relevant study for the field of nerve repair, if revised appropriately.

      Reviewer #1 (Significance (Required)):

      __Significance____ __The findings reported in this manuscript are very interesting and will move the field of nerve repair forward. This paper will be of interest for basic science audience in the fields of aging and neurobiology and has also potential interest to the broader clinical and translational fields. Indeed, this paper provides data that Schwann cells entering a senescent stage not only fail to support axon regeneration in aged animals, but actively inhibit axon regeneration. While this reviewer raises questions on whether only senescent Schwann cells or other senescent cells in the nerve contribute to this effect, the identification of potential targets to enhance nerve repair is highly significant. Furthermore, the use of an FDA approved drug, currently in clinicals trials for its application as senotherapeutics, to increase axon regeneration in aged and chronic dennervation conditions will provide new avenues for clinical applications.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary The reported study by Fuentes-Flores et al. shows that Schwann cells (SCs) in peripheral nerves undergo senescence with aging or in chronic denervation. This senescent SC phenotype correlates with downregulation of c-Jun expression and axon regeneration capacity and consequently, affecting functional recovery. The study has been undertaken by using in vivo mice model of chronically denervated sciatic nerve and in vitro of rat-primary cell cultures (Schwann cell, DRG-explants, and their coculture). Schwann cell exosome manipulation was also included to exploit their released factor as media for cell cultures.

      Major comments:

      1. __ __ Chronic denervated nerve model and Schwann cell phenotype

      *Tibial nerve transection and chronic denervated nerve: As the referee have no information about this model (no reference is cited), a detailed description should be provided highlighting the interest of such model compared to standard sciatic nerve lesion model.

      We thank the reviewer for this comment. We will provide a clear explanation for the injury model used, as well as references, including one from the group of Tessa Gordon that describe for the first time this model in rats (PMID: 30215557), and the one applying this model to the mouse from the groups of Rhona Mirsky and Kristjan Jessen (PMID: 33475496). Briefly, this model has two advantages: first, it allows to generate a chronic denervation for months and then be able to connect the distal denervated stump with a proximal one without the need of a nerve bridge; second, neurons in the different groups that have different denervation times (1 week versus 6 weeks) are all damaged at the same time, eliminating variability associated to chronic axonal damage. We will include this information in the result section of a revised manuscript along with the above references.

      *Should be shown, histological analysis of denervated tibial branch prior reconnection with freshly cut proximal peroneal branch with specific immunostainings of rSCs v.s. sSC associated with dapi nuclear staining (as used along this study). Specific staining for other cells should be also provided (i.e., macrophage and endothelial cells). In simple words, "how chronically denervated nerve looks like and what is his cellular content? This is necessary to responds to the following main referee question: does the increase/decrease of rSCs or sSC under specific condition all through the study concerns the SCs that have migrated from freshly cut peroneal branch into denervated tibial distal branch, or resident SCs that have survived in chronically denervated tibial distal branch. In other words, whether rSCs that migrate (and accompanying regenerating axons) into chronically denervated branch nerve undergo phenotype change into sSC because of the environment of chronically denervated nerve. This is not clearly described or discussed, and remain confusing for reader.

      We are sorry about the lack of clarity in the text and figures. The immunostaining analysis of denervated tibial branch prior reconnection is included in the original manuscript, specifically in Figure 1D-K, and Figure 4. In a revised manuscript we will include schemes in the Figures to shown the regions analyzed in each case.

      We thank for the suggestion of including staining for other cell types. As suggested by the reviewer we are currently performing these experiments and analysis for macrophages, endothelial cells and fibroblasts, together with staining for cell senescence (p16), in both aged and chronically denervated conditions. We will include this data in a revised manuscript

      About the specific question of the reviewer: “does the increase/decrease of rSCs or sSC under specific condition all through the study concerns the SCs that have migrated from freshly cut peroneal branch into denervated tibial distal branch, or resident SCs that have survived in chronically denervated tibial distal branch”. Our data demonstrate that senescence Schwann cells appear in the distal nerve stump in aged mice and after chronic denervation. The distal stump is physically disconnected from the proximal part of the nerve. Therefore, after reconnection the regenerating axons encounters a tissue which is already populated with senescent cells. To clearly explain this, we will add extra text in the results and discussion section to clarify these findings.

      Support of up- and down-regulation of gene expression illustrated in fig.4

      The conclusions and statements on up- or down-regulation of c-jun, yH2AX, beta-gal, P16, arise from quantitative and qualitative analysis from immunostaining of these specific markers by determining the number of positive cells rSCs vs. sSC. For these strong statements appropriate methods for quantification of protein levels such as western blots are required. For example, the statement of down regulation of c-jun expression, the quantitative graph shows strong increase in c-jun cell number under ABT263 treatment but the histological photo does not illustrate such decrease in number of the cell. It shows rather an increase in brightness of c-jun staining. Thus, only appropriate method for protein level quantification could be conclusive. This would also remove the doubt that some photos are under- or over- exposed as it appears in the figure. For example, in aged animal under vehicle condition, there is no variability in staining intensity. Accordingly, one question to the authors: for quantification of cell number, are weakly stained cells considered as positive cell?

      We agree with the reviewer that a more quantitative method is required to complement the immunofluorescence data. As the reviewer correctly states, quantification of the immunofluorescence data corresponds to cell positive for the specific marker, expressed as % of cell positive for that maker. Therefore, we will perform western blot for c-jun, yH2AX, and p16 for the different models, including treatments with senolytics.

      Regarding the method for quantification, we performed all these quantifications using Imaris software, in which we set up the same threshold for all conditions for a specific antibody marker. Then, in addition to the quantitative western blot analysis, we will include a graph representing the distribution of the labelling (intensity histogram) for all cell number quantification from immunofluorescence data, comparing control with the experimental condition. Finally, the methods used for quantification will be expanded in a revised manuscript.

      The method section should be revised in general

      Methods could be described in brief only when are supported by provided refs in which the reader could find details. Several refs are missing, i.e., 4.4 for thermal allodynia, 4.5 for ABT263 gavage administration; senescence induction, ...

      Quantification methods should be more detailed, several information are missing and not found in result section or legends (i.e., number of nerve section per animal, neurite length, ...).

      We completely agree with the reviewer that the method section was not developed adequately in the original manuscript. As described in previous responses, we will detail the methods used, especially those associated with quantifications performed. We will also include references for the different methods used, including those detailed by the reviewer.

      The use of rat for in vitro DRG and SC culture while in vivo study is undertaken on mice The switch of species from in vivo to in vitro (mice vs. rat), is not justified as mice DRG and SC culture are also commonly used. In addition, the use of transgenic mice (used here only for in vivo) could also be exploited to address specific and reinforce the data.

      We agree with the reviewer that using mouse SC in in vitro experiments will be a better approximation to support our findings. We have been using rat SC in this and other publications as they were the standard model used in in vitro experiments. Nevertheless, as the author states, now there are suitable methods for culturing mouse SC, that we have incorporated in our lab. Therefore, we will perform key in vitro experiments using mouse SC together with mouse DRGs.

      Regarding the use of the transgenic mice (3MR), we thank the reviewer for this suggestion; we will perform new experiments using SC derived from 3MR mice in order to demonstrate induction of senescence (by expression of the red fluorescent protein in this transgenic line) and senolysis in vitro.

      Use of conditioned exosome/media

      Should be explained why the use of exosomes directly in cell culture was not tested. This would be close to physiological condition, regarding the concentration of released factors.

      This is an important point that was not explored. As we have plenty of experience using SC-derived exosomes, we will perform the suggested experiments comparing the effect of exosomes from conditioned media from senescent-induced SC and include the results from these experiments in a revised manuscript.

      The statement on the effect of rSC vs. sSC cell on growth cone dynamic

      The provided data illustrated in fig. 3 are not in support that sSC affect growth cone dynamics. Only what would be "suggested" is that the decrease in neurite length could be associated to changes of growth cone morphology, on fixed tissue, that appeared to be affected. If such statement has to be maintained, time-laps is required. The image does not reflect a retracting neurite nor collapsed growth cone. In addition, other mechanisms could be at the basis of observed decrease in neurite length, which are not evaluated here. This is an important point to address as the authors state that sSC release inhibitory factors.

      We completely agree with the reviewer: we are not exploring growth cone dynamics. We will change the manner these results are presented as we are not demonstrating a dynamic process in our results. We prefer to modify the text associated with these experiments rather than perform a time-lapse analysis at this moment. This is part of a future exploration we want to achieve, that will take some time to develop, and we consider at this moment lies outside the scope of the present work. Included in section 4, below.

      other comments

      • The surgical description is complicated, also annotation to be added in supp fig #2A; provide

      We will work in the description of the model, including references to other papers using this nerve anastomosis model for assessing regenerative potential. As stated above we will also include schemes in all figures to help the reader with the surgical procedure and different timelines used.

      • M&M 4.2: lign #8, error referring to Fig 2A, correct by. Supp fig 2A

      Revisions incorporated in the transferred manuscript, see section 3, below.

      • Review refs list, ref#17, full info needed

      Revisions incorporated in the transferred manuscript, see section 3, below.

      • Fig 3H would be interesting only if contain a column of the 21 proteins exclusively expressed in senescent-induced SCs

      Revisions incorporated in the transferred manuscript, see section 3, below.

      • The immunostaining of Lamin B1 positive nuclear invaginations in fig S4 should better described in results for non-familiar reader

      We will describe better this staining pattern in the result section of the revised manuscript.

      • Title of Fig 3, the expression "neuronal growth" is not appropriate here (neurite outgrowth)

      Revisions incorporated in the transferred manuscript, see section 3 below.

      **Referees cross-commenting**

      I agree with referee#1's comments. He/she has raised complementary and important points that should be taken into account by the authors as well as we share the same major concerns. Furthermore, we both expressed the interest of such study if revised appropriately.

      Reviewer #2 (Significance (Required)):

      Significance

      Overall, this is an interesting study that undertake fundamental question in the field of nerve physiopathology and also could open a good opportunity in developing therapeutic strategies for translational research. However, additional investigations are needed to support the main conclusions.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Reviewer 1

      Major comments

      • In the axon regeneration assays, how is the reconnection site defined in longitudinal images stained for SCG10? Of particular concern is that Figure 1B "adult chronic dmg" nerve section image appears to be identical to the image in Figure 5A "Vehicle adult (47dpi)". However, the reconnection site is located at different sites along the nerve. Also, the scale bar appears identical but the legend states different sizes. In Figure 1 chronic damage is 42dpi, and figure 5 is 47dpi, yet with what appears as the same image.

      We thank the reviewer for detecting this issue. The problem arises as the data shown in Figure 1a corresponds to the controls (vehicle) of Figure 5a. The data in Figure 1 is a known phenomenon in the field of peripheral regeneration (i.e., decreased regeneration in aged animals as well as in chronically denervated nerves); nevertheless, we decided to add this data at the start of the manuscript to clearly shown the reader (thinking in a broader scientific audience) the baseline of the evident decrease in axonal regeneration in these two conditions. To make this very clear, we have included the same image Figure 1 and 5 for the controls and detailed this in the legends of both Figure 1 and 5 in the uploaded manuscript.

      We have checked the scales and modify the scale in Figure 5 as it was not correct. We have also corrected the nomenclature for days post injury in this image as well as in the corresponding legend.

      In addition, for transparency, we have uploaded in a public repository (EBI BioStudies database, https://www.ebi.ac.uk/biostudies/) all the microscopy images used in this work, which is detailed in the uploaded manuscript in a new section named data availability.

      Regarding the localization of the reconnection site, this is identified using the whole z-stack of the nerve and not a single section, the region in the z-stack can be recognized using two parameters: the difference in diameter between the proximal and distal stump and by identifying the filament used to suture both stumps. We have included a description of this procedure in the method section of the revised manuscript. As this is not always a perpendicular line, in the revised Figures we have now used an arrowhead to denote the reconnection site.

      We are sorry for the confusion generated by the labeling of some images. We will review the text and figures and fix errors. In a revised manuscript we will also add schemes for several figures in order to explain better the experimental procedure and timelines.

      Minor comments

      • Improve labeling of Figures or at the very least describe in the Figure legend. For example: Figure 5B-C, which of the graphs is from adult mice and which is from aged mice?

      We have included the suggested labelling for Figure 5B-C.

      • The images for split individual channels need to match the merge channel images. For example, the individual channel and merge images are not properly aligned in Figure 4C, ABY-263 group.

      We thank the reviewer for spotting the error in the split channels, we have now fixed this in Fig 4C, but also corrected other alignment problems detected in Fig 4A and 5F.

      • The analysis method used to quantify axon regeneration should be consistent throughout. For example, in Figure 1C, number of axons/nerve width(um) was used for regeneration assay, but axon density (width corrected) was used in Figure 5B-C in regeneration assay.

      We have included the procedure used to quantify axonal regeneration in the method section of the uploaded manuscript, which is the same throughout the manuscript. We are sorry for the different texts in the axes of graphs included in Figure 1 and Figure 5. In the first version of the figures, we were using the term “axon density (width corrected)”, but then we decided to change it to “number of axons/nerve width (mm)”, which was more precise. Unfortunately, the text of the graph axis in Figure 5 was not changed by mistake. We have now fix this in the revised version.

      Reviewer 2

      • M&M 4.2: lign #8, error referring to Fig 2A, correct by. Supp fig 2A

      We have corrected this error in the uploaded manuscript.

      • Review refs list, ref#17, full info needed

      We have fixed this reference in the uploaded manuscript.

      • Fig 3H would be interesting only if contain a column of the 21 proteins exclusively expressed in senescent-induced SCs

      We are sorry about this omission in Figure 3H. We included a list of all the identified proteins of repair and senescent-induced SCs in Supplementary Table 4 (Table S4) of the original manuscript, including the identity of the 21 proteins exclusively expressed in senescent-induced SCs. In the revised version, we have incorporated the information of the 21 proteins in Figure 3H as suggested by the reviewer.

      • Title of Fig 3, the expression "neuronal growth" is not appropriate here (neurite outgrowth)

      We thank the reviewer for detecting this error, we have changed the expression as suggested in the uploaded manuscript.

      Other changes included in the revised manuscript and Figures

      1. Numeric data for graphs We have included a new supplementary excel file: Supplementary Table 8, including the data for each replicate associated to the graphs of text and supplementary figures.

      Revision Figure 5G.

      During our review on all the individual replicates of the manuscript data to upload into BioStudies, the first author noticed that the data in graph of Figure 5G corresponded to a pilot experiment performed to set up the protocol. The final experiment was not included, then we uploaded the correct images and included the final quantification. Data is comparable, and statistical differences remains.

      Revision Figure 3A.

      We have modified the pseudocolors of the S100 antibody channel from magenta to green for ease of visualization. The image and quantification remain exactly the same.

      4. Description of analyses that authors prefer not to carry out

      Reviewer 2

      The statement on the effect of rSC vs. sSC cell on growth cone dynamic

      The provided data illustrated in fig. 3 are not in support that sSC affect growth cone dynamics. Only what would be "suggested" is that the decrease in neurite length could be associated to changes of growth cone morphology, on fixed tissue, that appeared to be affected. If such statement has to be maintained, time-laps is required. The image does not reflect a retracting neurite nor collapsed growth cone. In addition, other mechanisms could be at the basis of observed decrease in neurite length, which are not evaluated here. This is an important point to address as the authors state that sSC release inhibitory factors.

      We completely agree with the reviewer: we are not exploring growth cone dynamics. We will change the manner these results are presented as we are not demonstrating a dynamic process in our results. We prefer to modify the text associated with these experiments rather than perform a time-lapse analysis at this moment. This is part of a future exploration we want to achieve, that will take some time to develop, and we consider at this moment lies outside the scope of the present work.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      General Statements

      Thank you for providing an initial assessment of our manuscript. We went through all the raised comments and suggestions aiming to improve our manuscript. Our manuscript will benefit from addressing them.

      Our main impression is that the concerns regarding the novelty of our work by Reviewers #1 and #3 come from the fact that we apply a known flexible statistical framework (group factor analysis) to novel applications in single-cell data analysis, namely the estimation of multicellular programs and sample-level unsupervised analysis. The core methodology of our work is indeed based on the popular tool Multi-omics factor analysis (MOFA). We see the novelty of our study in the formulation of these relatively new applications within this framework, and the demonstration of the added value that this formulation provides building on MOFA’s strengths, in particular by expanding the possibilities of downstream analysis of single-cell data including the meta-analysis of distinct single-cell patient cohorts and its integration to complementary bulk and spatial data modalities.

      The simultaneous estimation of multicellular programs together with sample-level unsupervised analysis is only possible with a single available tool, scITD, which is limited by its modeling strategy, based on tensor decomposition: with tensor decomposition, multicellular programs can not be estimated from distinct feature sets across cell-types, making this method less flexible and sensitive to technical effects, such as background expression. We compared our proposed methodology with scITD and showed the benefits of using group factor analysis as implemented in MOFA for this task. Moreover, as of now, no other methodology is able to estimate multicellular programs and perform sample-level unsupervised analysis, simultaneously in multiple independent single-cell atlases. We also showed how multicellular programs are traceable in bulk transcriptomics data and show that they are better fit to classify heart failure patients compared to classic cell-type deconvolution approaches.

      Altogether, we believe that our current manuscript complements existing literature and puts forward an approach with distinct features to analyze single-cell atlases. We will edit the text to make more explicit the novelty and advantages of our proposed methodology, and we will emphasize that our work does not mean to propose a new method, but rather demonstrate how group factor analysis can be used for novel sample-level analysis of single-cell data. We plan to incorporate the suggestions by Reviewer #1 regarding the inclusion of additional datasets, model validations, and novel applications involving a direct modeling of cell-compositions and spatial organization of cells. Moreover, we plan to discuss perspectives on how cell communications can be incorporated in the analysis of multicellular programs as suggested by Reviewer #2. Additionally, we will correct all the figure and text typos identified by the reviewers. Finally, we will provide an R package (https://github.com/saezlab/MOFAcellulaR) and python implementations (https://liana-py.readthedocs.io/en/latest/notebooks/mofacellular.html) that facilitate the use of our approach.

      Please find below the point-by-point response to the reviewers in blue, numbered for convenience.

      __Reviewer #1 (Evidence, reproducibility and clarity (Required)): __

      Remark to authors

      Flores et al. present a pipeline in which they leverage MOFA framework, a matrix factorization algorithm to infer multi-cellular programs (MCPs). Learning and using MCP has already been proposed by others. Yet, authors pursue a similar goals by using MOFA, providing a cell*sample matrix for different cell types as different views (instead of multiple modalities/views) as the input. They later apply MOFA using this data format on a series of applications to analyze acute and chronic human heart failure single-cell datasets using MCPs. Authors further try to expand their analysis by incorporating other modalities.

      Major points:

      1.1 As briefly outlined in the remarks, the current manuscript needs novel findings and methodology to grant a research article which I can' see here. The underlying matrix factorization is the original MOFA (literally imported in the code) with no modification to further optimize the method toward the task. While I appreciate and acknowledge the author's efforts resulting in a detailed analysis of heart samples, I think all of these could have been part of MOFA's existing tutorials.

      Response 1.1 As the reviewer correctly states, we used the framework and code of MOFA. The novelty lies in its application for the unsupervised analysis of samples from cross-condition single-cell data and the inference of MCPs. MOFA is a statistical framework implementing a generalization of group factor analysis with fast inference and its current version fits the task of MCP inference and unsupervised analysis of samples across cell-types that provides a more flexible modeling alternative than current available methods (as presented in Table 1 of the manuscript). Current work on MCP inference is based on the premise of multi-view factorization with distinct statistical modeling alternatives. As mentioned in the discussion of our manuscript, three main points distinguish our discussed methodology from present alternatives and provide evidence about its relevance and uniqueness over available tools:

      Simultaneous unsupervised analysis of samples across cell-types and inference of MCPs, together with comprehensive interpretable descriptions of the reconstruction of the original multi-view dataset. This is only currently possible with scITD (Mitchel et al, 2022) and is compared in the manuscript. DIALOGUE (Jerby-Arnon & Regev, 2022) is limited to the generation of MCPs and Tensor-cell2cell (Armingol et al, 2021) is only focused in cell-communications with limited interpretability.

      Flexible non-overlapping feature set that handles better technical effects such as background expression, as discussed in section “__2.2 Multicellular factor analysis for an unsupervised analysis of samples in single-cell cohorts”. __Moreover, as mentioned by the reviewer in a later point (Reviewer comment 1.2), this enables joint modeling of distinct aspects of the tissue, such as cell compositions, cell communications (preliminary work: https://liana-py.readthedocs.io/en/latest/notebooks/mofatalk.htm) and spatial organization.

      Joint-modeling of independent atlases that enables meta-analysis at the sample level of cross-condition single-cell data. No currently available methodology is capable of performing similar modeling. For these reasons, we believe that our work is worth being discussed and presented to the community as a research article. We will modify the discussion to put more emphasis on the added value of group factor analysis as implemented in MOFA.

      Moreover, we now provide an R package (https://github.com/saezlab/MOFAcellulaR) and python implementations within our analysis framework LIANA (https://liana-py.readthedocs.io/en/latest/notebooks/mofacellular.html) that facilitates the usage of our proposed methodology. The R and python implementations are compatible with current Bioconductor and scverse pipelines, respectively.

      Application of our methodology to heart failure datasets also revealed novel knowledge about heart disease processes:

      In myocardial infarction, we found that our MCPs associated with cardiac remodeling capture cell-state-independent gene expression changes. This provides a novel understanding on the effect of disease contexts in the expression profiles of specialized cells. This finding was not reported in the original atlas publication.

      In chronic heart failure, we identified a conserved MCP of cardiac remodeling across patient cohorts and etiologies, suggesting a common chronic phase between distinct initial causes of heart failure.

      Moreover, we showed that deconvoluted chronic heart failure MCPs from bulk transcriptomics better classify patients in comparison to classic cell-type composition deconvolution of bulk data. To our knowledge, this finding was not presented in any of the manuscripts of other methodologies focused on MCPs.

      Altogether, our current work shows a novel application of group factor analysis for the simultaneous estimation of MCPs and the sample-level unsupervised analysis of cross-condition single cell data. We showed the unique features compared to current available tools. Distinct post-hoc analysis in combination with other data modalities shows the biological relevance of our proposed methodology to complement the tissue-centric knowledge of disease.

      1.2 How can you explain that the results in donor-level analyses are not due to technical artifacts (batch variation)? Can this be used to infer a new patient similarity map? For example, I would test this by leaving out a few patients from training, projecting them, and seeing where they would end up in the manifold or classifying disease conditions for new patients and explaining the classification by MCPs responsible for that condition.

      Response 1.2 When knowledge of the technical batches is available it is possible to test for association between these labels and the factors encoding MCPs as shown in Figure 2.

      In our current applications, we additionally showed the biological relevance of our estimated MCPs by mapping them to spatial and bulk data sets, which is a direct way of testing how generalizable were our findings:

      In the application of MOFA to human myocardial infarction data, we mapped the gene loadings conforming the MCP associated with cardiac remodeling to paired spatial transcriptomics datasets. We showed that in general, the cell-type specific expression of the MCP of cardiac remodeling encompassed larger areas in ischemic and fibrotic samples compared to myogenic (control) samples.

      In the application of MOFA to chronic human end-stage heart failure data, we mapped the gene loadings conforming the MCP associated with cardiac remodeling to 16 independent bulk transcriptomics datasets of heart failure. There we showed that the cell-type specific expression of the MCP of cardiac remodeling separates heart failure patients from control individuals. Regarding the generation of new patient similarity maps, it is possible to estimate the positions of new samples in the manifold formed by the factors representing the MCPs. As suggested by the reviewer we will show this by classifying heart failure single-cell samples using MCPs of two independent patient cohorts (presented in section 2.7).

      1.3 The bulk and spatial analysis are used posthoc after running MOFA, I think since MOFA can use non-overlapping features set, it would be interesting to see if deconvoluted bulk or ST data can be encoded as another view (one view from scRNAseq data for each cell-type and another view from bulk RNA-seq or ST, you can get normalized expression per spot (for ST) or per sample (for bulk) and use them as input.

      Response 1.3 Thanks for the suggestion. We agree that the possibility of using non-overlapping features opens options of complex models that include the cell-type compositional and organizational aspects of tissues. However these features must be quantified in the same sample, thus it is limited to samples profiled simultaneously at different scales.

      We will present the results of a sample-level joint model of multicellular programs together with cell-proportions and spatial dependencies using the myocardial infarction dataset presented in section 2.2. For this dataset based on our previous work we have the compositions of major cell-types and their spatial relationships based on spatially contextualized models (Kuppe et al, 2022). We will run a MOFA model and show how it can be used to find factors associated with structural and molecular features of tissues.

      __Minor: __

      1.4 Some figure references are not correct (e.g., "the single-cell data into a multi-view data representation by estimating pseudo bulk gene expression profiles for each cell-type across samples (Figure 1b)." should be figure 2b)

      Response 1.4 Thanks for pointing this out. We apologize for these mistakes and we will adjust all labels correctly.

      1.5 The paper is well written, but there could be some more clarifications about what authors consider as cell-type and cell-state, condition, MCPs which I think is critical to current analysis (see here https://linkinghub.elsevier.com/retrieve/pii/S0092867423001599) for the reader not familiar with those concepts.

      Response 1.5 We agree with the reviewer that it is important to introduce these concepts in more detail to avoid confusion. We will adapt the current manuscript to incorporate these definitions in the introduction.

      __Reviewer #1 (Significance (Required)): __

      1.6 While I find the concept of MCPs interesting, the current work seems like a series of vignettes and tutorials by simply applying MOFA on different datasets (The authors rightfully state this). However, It needs to be clarified what the novelty is since there is no algorithmic improvement to current MCP methods (because there is no new method) nor novel biological findings. Additionally, even in the current form, the applications are limited to the heart, and the generalization of this proposed analysis pipeline to other tissues and datasets is not explored. Overall, the paper lacks focus and novelty, which is required to grant a publication at this level.

      Response 1.6 As mentioned in response 1.1, we show that group factor analysis as implemented in MOFA has advantages given its flexibility of the feature space, the joint-modeling of independent datasets, and the interpretability of the model. We will make these advantages clearer in the discussion, and we will explicitly mention the disadvantages and lack of functionalities of available methods.

      The applications were mainly done in heart data for consistency although they represent four distinct single-cell datasets, one spatial transcriptomics dataset, and 16 independent bulk transcriptomics datasets. For completeness, as suggested by the reviewer, we will show the application of our methodology to peripheral blood mononuclear cell data of lupus samples (preliminary results: https://liana-py.readthedocs.io/en/latest/notebooks/mofacellular.html)

      __expertise: Computational biology, single-cell genomics, machine learning __

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)): __

      Summary:

      The authors use MOFA, an unsupervised method to analyze multi-omics data, to create multicellular programs of cross-condition multi-sample studies. First, for each cell-type, a pseudobulk expression matrix per sample is created. The cell-type now functions as the separate view, typically reserved for the different omics layers in MOFA. This then results in a latent space with a certain number of factors across samples. The factors, representing coordinated gene expression changes across cell-types, can then be checked for associations with covariates of interest across the samples.

      MOFA is well-suited for this task, as it can handle missing data and it is a linear model facilitating the interpretation of the factors. Users should be aware that MOFA can estimate the number of factors, but the pseudobulk profiles require a rigorous selection of cell-type specific marker genes. The result will be most suited for downstream analysis if there is a clear association with one factor and a clinical covariate of interest. In a final step, a positive or negative gene signature can be created by setting a cut-off on the gene weights for that specific factor.

      The method is applied on 3 separate data sets of heart disease, each time demonstrating that at least one of the factors is associated with a disease covariate of interest. The authors also compare the method to a competitor tool, scITD, and explore to what extent a factor mainly captures variance associated with (i) a general condition covariate or rather (ii) specific cell states.

      The multicellular programs are also mapped to spatial data with spot resolution. Though this analysis does not bring any novel biological insight in the use case, it does support the claim that the programs are associated with the covariate of interest.

      The most interesting applications of MOFA are in my opinion the potential for meta-analysis of single-cell studies and validation of cell-type specific gene signatures with publicly available bulkRNAseq data sets.

      The authors provide various data sets and data types to support their claims and the paper is well written. The relevant code and data has been made available.

      We thank the reviewer for the positive comments to our work.

      __Major comments __

      2.1 What is the added value of the gene signatures obtained from MOFA compared to e.g. a naive univariate approach? In theory, a similar collection of genes or gene signature could be obtained by running a differential gene expression analysis across the samples for each cell-type (e.g. myogenic vs ischemic ) and applying a set of relevant cut-offs or filters on the results. In other words, does MOFA detect genes that would otherwise be missed?

      Response 2.1 Thank you for the relevant comment. The original motivation of our work is the unsupervised analysis of samples based on a manifold formed by a collection of multicellular molecular programs. We envisioned that this unsupervised analysis would be relevant in situations where a clear histological or clinical classification of samples is not possible with reliability. As mentioned by Reviewer #1 in comment 1.2, one advantage of these approaches is that they create patient similarity maps, which have been shown useful to stratify patients in a recent analogous work in multiple sclerosis (Macnair et al, 2022). The cell-type signatures obtained from relevant factors explaining the patient stratification avoid the likelihood of performing “double dipping” by avoiding the need of a direct differential expression analysis between newly formed groups.

      In our applications, the generation of cell-type signatures (here called multicellular programs) associated to a specific clinical covariate (eg. control vs perturbation) are post-hoc analyses of the generated manifold. And as the reviewer correctly points out, these signatures should be similar to performing direct differential expression analysis between those patient conditions. In the related work of scITD (Mitchel et al, 2022) the authors showed high concordance between the cell-type signatures and the results of differential expression analysis. For completion, we will similarly quantify the degree of overlap between genes of our generated signatures with the ones coming from differential expression analysis.

      It is relevant to mention that in complex experimental designs with multiple conditions, our approach facilitates patient ordering, which allows the understanding of one condition in the context of all the others, avoiding the need of multiple testing and the definition of multiple contrasts, as mentioned in the text.

      We will incorporate these points in the discussion section of the manuscript.

      2.2 Could scITD also be used for meta-analysis or could the obtained gene signatures of that method also be mapped to bulkRNAseq data? If so, it would be interesting to show the relative performance with MOFA. If not, this specific advantage should be highlighted.

      Response 2.2 Thank you for pointing this out. scITD does not provide a group-based model to perform meta-analysis, and this feature is one of the main advantages of group factor analysis as currently implemented in MOFA. We will highlight this feature in Table 1 and in the discussion.

      Although scITD signatures of a single study could be mapped to bulk transcriptomics data, the stringent tensor representation leads to the generation of signatures that may be influenced by technical effects as shown in the manuscript section 2.2. Thus we believe that the flexibility of the feature space in MOFA is an advantage for this task. We will add this observation to the discussion.

      2.3 Users need to specify gene set signatures based on the weights for a factor of interest. This might suggest a limitation to categorical covariates of interest. If the authors see potential for a continuous covariate of interest, this should at least be highlighted in the text and if possible demonstrated on a use case.

      Response 2.3 In our applications we limited ourselves to categorical variables, however, it is possible to associate factors to continuous variables. An implementation of the association with continuous variables is already available in our newly created R package “MOFAcellulaR”: https://github.com/saezlab/MOFAcellulaR/blob/main/R/get_associations.R.

      The datasets we analyzed have no continuous clinical covariates to showcase this functionality, but as suggested by the reviewer we will highlight this feature in the text.

      __Minor comments __

      2.4 In Figure 2c the association between factor 2 and the technical factor shows a very strong outlier. Please verify that the association is still significant after applying a more robust statistical test (e.g. non-parametric test as Wilcoxon).

      Response 2.4 Thanks for the observation, we will test these differences with a non-parametric test.

      2.5 For mapping the cell-type specific factor signatures to bulk transcriptomics, the exact performed comparison or model is unclear. There are seven cell-type signatures for each sample in every study. Was there a t-test run for each cell-type or was a summary measure taken across the cell-types? he thresholding is also rather lenient (adj. p-val 0.1).

      Response 2.5 We are sorry for not being clear about our procedure. After identifying the multicellular program associated with heart failure estimated from the two single cell studies meta-analyzed, we calculated the weighted mean expression of the seven cell-type signatures independently to every sample of the 16 bulk studies. In other words each sample within each bulk study will be represented by a vector of 7 values representing the relative expression of a cell-type specific signature (Figure 6D-left). For each bulk transcriptomics study, first, we centered the gene expression data before calculating the weighted mean.

      In supplementary figure 4-e we show the results of performing a t-test of the cell-type scores between heart failure and control samples within each study. Given the relative low sample size of most of the studies (affecting the power of the test), we chose a not so stringent adjusted p-value. For completion, we will show the results of a more classical threshold (adj. p-value

      2.6 typo in abstract: In sum, our framework serves as an exploratory tool for unsupervised analysis of cross-condition single-cell ***atlas*** and allows for the integration of the measurements of patient cohorts across distinct data modalities

      Response 2.6 Thanks for pointing out this typo. We will modify the text.

      2.7 In Figure 4a it is not clear to me why on the one hand we see marker enrichment vs loading enrichment with healthy and disease.

      Response 2.7 We apologize, this is a typo after editing the labels. Both should contain the marker enrichment label. We will fix this.

      2.8 IN Figure 4b it would help if the same color scheme would be maintained throughout the paper (here now black and white) and if for the cell states the boxplots would be connected per condition, emphasizing the (absence) of change across cell states within a condition.

      Response 2.8 We thank the reviewer for the suggestion. We will reorganize the panels showing the gene expression per condition and fix the color scheme.

      __Reviewer #2 (Significance (Required)): __

      __General assessment: __

      2.9 MOFA is well-suited for detecting multicellular programs because it can handle missing data and allows for easy interpretation of the factors as a linear method. It might have particular potential for meta-analysis across multiple studies and reevaluating bulkRNAseq data sets, but in the current manuscript it is unclear to what extent this is a specific advantage of MOFA or could also be done with competitors. The authors show how the obtained results and associations with clinical covariates can be validated across multiple data types. How the resulting multicellular programs can provide additional biological insight or form the starting point for additional downstream analysis (e.g. cell communication) is not covered in the paper.

      Response 2.9 We thank the reviewer for highlighting the methodological advantages of group factor analysis for the estimation of multicellular programs and the unsupervised analysis of samples from cross-condition single-cell atlas. As mentioned in response 1.1 and 2.2, the added value of our methodology is the flexibility of feature views (that goes beyond gene expression) and simultaneous modeling of independent single-cell datasets, a feature not present in any of the currently available methods that facilitates the meta-analysis of datasets across modalities.

      While we interpret the presented multicellular programs in the context of cellular functions and the division of labor of cell states, it is true that we did not attempt to provide mechanistic hypotheses, for example, via cell-cell communication, on how this coordination across cell-types emerges.

      Previous work of the related tool Tensor-cell2cell (Armingol et al, 2021) has presented the idea of the estimation of multicellular programs from cell-cell communications and group factor analysis can also be used for this task (preliminary work: https://liana-py.readthedocs.io/en/latest/notebooks/mofatalk.html). We will discuss in the text perspectives on how the estimation of multicellular programs can be linked to the inference of cell communications from single-cell data together with analysis alternatives previously proposed by scITD and Tensor-cell2cell. However, we believe that this question requires further work and it is out of scope of our current manuscript.

      __Audience: This paper will be mainly of interest to a specialized public interested in unsupervised methods for large scale multi-sample and multi-condition studies. __

      __Reviewer: main background in the analysis of scRNAseq data. __

      __Reviewer #3 (Evidence, reproducibility and clarity (Required)): __

      This manuscript by Saez-Rodriguez and colleagues proposes to repurpose Multi-Omics Factor Analysis for the use of single cell data. The initial open problem stated by the paper is the need for a framework to map multicellular programs (such as derived from factor analysis) to other modalities such as spatial or bulk data. The authors propose to repurpose MOFA for use in single cell data. Case studies involve human heart failure datasets (and focuses on spatial and bulk comparisons).

      There are particular issues with clarity regarding the key methodological contribution (and assessment of it), discussed under significance.

      __Reviewer #3 (Significance (Required)): __

      3.1 I am very puzzled by the repeated claims the manuscript makes that their central methodological contribution and innovation is to use MOFA for single cell data. One of their citations for MOFA is to MOFA+, which is precisely that (in a relatively popular manuscript published by the original authors of MOFA and not overlapping with the present authors). I am left to wonder what I missed.

      Response 3.1 We apologize for the misunderstanding, as mentioned in the response to review 1.1 and explained by reviewer 2’s summary, the main objective of our work is to use the statistical framework of group factor analysis for the inference of multicellular programs and the sample-level unsupervised analysis of cross-condition single-cell data, which is a distinct task to multimodal integration (Argelaguet et al, 2021).

      While it is true that MOFA+ introduced expansions to the model for the modeling of single-cell data, namely fast inference and group-based modeling, the main focus in their applications is the multimodal integration of data, where each cell is represented by a collection of distinct collection of features (e.g. chromatin accessibility and gene expression). Unlike multimodal integration, here we propose a different approach to analyze single-cell data at the sample level instead of the cell level, without modifying the underlying statistical model (see section 2.1 of the manuscript).

      In detail, what we assume is that samples of single-cell transcriptomics data (e.g. tissue from a patient) can be represented by a collection of independent vectors collecting the gene expression information of cell types composing the tissue analyzed. Decomposition of these multiple views with group factor analysis produces a manifold that captures multicellular programs (coordinated expression processes across cell-types), or shared variability across cell-types simultaneously. Altogether, this represents a novel usage of group factor analysis in an application for the inference of multicellular programs, where the main focus is not at the cell-level but at the patient level.

      As a side note, Britta Velten, one of main developers of MOFA and coauthor of both the MOFA and MOFA+ papers, is a contributor and coauthor of this manuscript, and Ricard Argelaguet, who also led both versions of MOFA, gave us helpful feedback and is acknowledged as such on this work.

      3.2 Multimodal integration methods are fairly numerous and even if they're not all exactly factor analyses, it's strange to argue that MOFA fills some unique conceptual gap. I agree it fills something of an interesting gap (except for MOFA+ already filling it), but it's not like the quite popular spatial to single-cell integration approaches aren't doing similar things. If this is a methods paper (as it is presented) then there would have to be very substantially more comparative evaluation to these other approaches.

      Response 3.2 As presented in the previous response (3.1) our current work is not focused on multimodal integration, but rather the inference of multicellular programs and the sample-level unsupervised analysis of single-cell data. Given this, in the current manuscript we compared our proposed methodology with the only three other available methods that address at least partially the inference of multicellular programs (see Table 1 in our manuscript). In response 1.1 and 3.2 we discussed the advantages of our proposed methodology compared to available methods. In the manuscript section 2.2 we compared group factor analysis with tensor decomposition and showed that the former better deals with technical artifacts and better identifies known patient groups.

      We will distinguish our work from multimodal integration explicitly in the introduction and the manuscript section 2.1 to avoid confusions.

      3.3 The biological use cases are comparatively interesting and dominate the manuscript (but are still presented principally as use cases rather than a compelling biological narrative of their own).

      Response 3.3 The focus of our manuscript was the reintroduction of group factor analysis for the novel applications of the inference of multicellular programs and the sample-level unsupervised analysis from single-cell data. Given the distinct possibilities of post-hoc analyses, we mainly used acute and chronic heart failure data to showcase the utility of MOFA to connect spatial and bulk modalities with single-cell data.

      That said, as discussed in response 1.1, our analyses allowed to generate novel hypotheses of these datasets:

      In myocardial infarction, we found that our estimated multicellular programs associated with cardiac remodeling capture cell-state-independent gene expression changes. This provides a novel understanding of the effect of disease contexts in the expression profiles of specialized cells. In other words, we found that cell-states, regardless of their specialized function, share a common response in the tissue context.

      In chronic heart failure, we identified a conserved multicellular program of cardiac remodeling across patient cohorts and etiologies, suggesting a common chronic phase between distinct initial causes of heart failure, which again may be linked to the dominating response to the tissue context that is shared across etiologies.

      These two results support the observation that deconvoluted chronic heart failure multicellular programs from bulk transcriptomics better classify patients in comparison to classic cell-type composition deconvolution of bulk data. To our knowledge, this finding was not presented in any of the manuscripts of other methodologies focused on MCPs. We summarize these results in the third paragraph of the discussion in the manuscript:

      “In an application to a collection of public single-cell atlases of acute and chronic heart failure, we found evidence of dominant cell-state independent transcriptional deregulation of cell-types upon myocardial infarction. This may suggest that while certain functional states within a cell-type are more favored in a disease context, most of the cells of a specific type have a shared transcriptional profile in disease tissues. If part of this shared transcriptional profile is interpreted as a signature of the tissue microenvironment that drives cells in tissues towards specific functions, this result may also indicate that a major source of variability across tissues, besides cellular composition, is the degree in which the homeostatic transcriptional balance of the tissue is disturbed. By combining the results of multicellular factor analysis with spatial transcriptomics datasets, we explored this hypothesis and identified larger areas of cell-type-specific transcriptional alterations in diseased tissues. Given these observations on global alterations upon myocardial infarction, we meta-analyzed single-cell samples from two additional studies of healthy and heart failure patients with multiple cardiomyopathies. Here, we found a conserved transcriptional response across cell-types in failing hearts, despite technical and clinical variability between patients. Further, we could find traces of these cell-type alterations in independent bulk data sets. These observations suggest that our approach can estimate cell-type-specific transcriptional changes from bulk data that, together with changes in cell-type compositions, describe tissue pathophysiology. Altogether, these results highlight how MOFA can be used to integrate the measurements of independent single-cell, spatial, and bulk datasets to measure cell-type alterations in disease.”

      To fully assess the relevance of these observations, they should be investigated in more datasets and analyses, where shared functional cell-states across distinct heart failure etiologies are identified and then compared at their compositional and molecular level. This, in our opinion, represents an independent study on its own.

      3.4 Altogether, I found the framing of this manuscript very puzzling. It is possible the result would be more clearly presented if the use case was the major focus rather than the more conceptual point about factor analysis.

      Response 3.4 Thanks for the suggestion. The major aim of this manuscript is to highlight the versatility of the generalization of group factor analysis as implemented in MOFA for novel applications in single-cell data analysis, beyond multimodal integration of single cells. The definition of multicellular programs from single-cell data and its sample-level unsupervised analysis are relatively new analyses in the field, and thus we believe that it is timely to show how a known statistical framework can be used for these applications.

      We believe that a detailed analysis of single-cell datasets of heart failure deserves its own focus and it is out of scope of our current objective with this manuscript. We apologize for the apparent misunderstanding of the objective of our methodology. We will add these distinctions in the introduction of the manuscript.

      References

      Argelaguet R, Cuomo ASE, Stegle O & Marioni JC (2021) Computational principles and challenges in single-cell data integration. Nat Biotechnol 39: 1202–1215

      Armingol E, Baghdassarian H, Martino C, Perez-Lopez A, Knight R & Lewis NE (2021) Context-aware deconvolution of cell-cell communication with Tensor-cell2cell. BioRxiv

      Jerby-Arnon L & Regev A (2022) DIALOGUE maps multicellular programs in tissue from single-cell or spatial transcriptomics data. Nat Biotechnol 40: 1467–1477

      Kuppe C, Ramirez Flores RO, Li Z, Hayat S, Levinson RT, Liao X, Hannani MT, Tanevski J, Wünnemann F, Nagai JS, et al (2022) Spatial multi-omic map of human myocardial infarction. Nature 608: 766–777

      Macnair W, Calini D, Agirre E, Bryois J, Jaekel S, Kukanja P, Stokar-Regenscheit N, Ott V, Foo LC, Collin L, et al (2022) Single nuclei RNAseq stratifies multiple sclerosis patients into three distinct white matter glia responses. BioRxiv

      Mitchel J, Gordon MG, Perez RK, Biederstedt E, Bueno R, Ye CJ & Kharchenko P (2022) Tensor decomposition reveals coordinated multicellular patterns of transcriptional variation that distinguish and stratify disease individuals. BioRxiv

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The work presented here examined the combined contribution of intermediate gray matter spinal interneurons of the spinal lumbar enlargement (L2-L4) to locomotion in rats. By targeting this region with kainic acid, we were able to produce a specific locomotor signature that was not compensated for over time, indicating the need for cellular replacement therapies in the treatment of such spinal cord injuries leading to the loss of spinal enlargement intermediate gray matter. Further, the newly developed techniques of a combinatorial behavioral assessment using Random Forest classification and a machine learning intermediate gray matter neuronal loss assessment established in this work add an unbiased, in-depth approach that we are making available to others.

      The reviewers have critically evaluated our work and highlighted points of weakness either in the research itself or in connecting with our audience. Below is our detailed response to all the comments as well as our revision plan for submission. We believe we have been able to sufficiently address the concerns that were voiced to strengthen our manuscript and express our gratitude for the feedback.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): __ In this paper, Kuehn and colleagues report on the analysis of functional impairments following intermediate gray matter lesion with kainic acid. The image convincingly show that mostly purely grey matter lesion can be achieved throughout the paper. The authors took care to do a battery of well-designed behavioral tests and sophisticated analysis in order to access functional impairment. They then correlate their behavioral assessment to lesion size, the number of NeuN positive cells in layers V-VII epicenters as well motoneuron numbers and the percentage of white matter. Overall, the manuscript is well written, nicely framed in the existing literature, very clear and the experiments are simple but well designed. The behavioral testing and evaluations including random forest ranking are well performed. The methodology is complete and would allow reproducing the experiments. Statistics are used appropriately. We have however some reserves and comments on some of the results and interpretations. Addressing these comments would not involve new experiments but new re-analysis of the existing datasets.

      Major comments:__

      __ While the claims that grey matter lesions trigger major behavioral impairments is convincing in particular with the refine behavioral experiments performed, the key claim that only interneuron loss in layer V-VII mediates those deficits is currently not supported by the presented data. In particular, we would suggest that the lesions performed, in contrast to the claims, are not purely and selectively impacting layer V-VII but might also impact layers VIII-IX. We think that presenting neuronal counts based on NeuN staining separately for layer I-IV, V-VII, VIII-IX and comparing control vs KA is necessary. Only with these data can conclusions be supported either in the direction suggested by the authors or otherwise.__

      • Although primarily targeting laminae V-VII, we realize this is not exclusively doing so with our lesion model. We understand the value of what you request and are retraining our computer models to be able to do the additional neuronal quantification in laminae I-IV, VIII, IX. We will then combine lamina VIII with laminae V-VII to make up the intermediate gray matter NeuN counts. Completion of all manually validated new analysis is ongoing and will be finished shortly. We plan on adding this additional analysis to the paper, which means much of Figure 6 and Supplementary Figure 3 will be altered and partially for Figure 7, but we won’t know exactly how until we finish the analysis. Tracked changes are shown in the updated manuscript PDF and highlighted text may change depending on results of this analysis.

      Another claim relative to the lack of involvement of motoneurons in the related behavioral deficits is also difficult to resolve with the current data. Motoneurons have been identified based on NeuN staining and size. While this is not the state of the art (ChAT staining would have been preferable), it remains acceptable. However, the data presented figures 7 and 8 show a very wide range in the motoneuron count (15 to 50) indicating either motoneuron loss or a count performed at different lumbar levels in the animals. This raises questions on the model (is it really involving only layers V-VII?) or on the interpretation of the data. Therefore we believe that motoneurons counts need to be presented separately (see above) in control vs KA groups and data need to be discussed in this perspective. Authors should also tone down the specificity of the model and involvement of motoneurons accordingly (page 20 for example).

      • Although we agree with the reviewers that ChAT staining would have been preferable, we had a limited amount of tissue available. Our unbiased, machine-learning-based analysis of neuronal loss by NeuN required much of the existing tissue. However, neuronal staining has been previously established to identify motoneurons based on size inclusion (Hadi et al., 2000; Wen et al., 2015), as we have used here. Additionally, we will be including total neuronal analysis from lamina IX as requested (please see answer to previous comment).

      • By including the Controls along with the KA rats, we postulate that the wide range of motoneuron numbers is due to natural individual variation as well as due to variation at each spinal level, and not due to the KA lesion, as the KA animals have a range of motoneuron counts, sometimes even greater than the controls (Figure 7 and 8). However, as requested, we have split L2, L3 and L4 (graphs below) and still do not see a correlation with behavioral performance (BBB and inclined beam). The variation due to spinal level may partially be explained by the fact that there are different numbers of motoneurons at each spinal level, dependent upon the number of muscles each spinal level is responsible for and the number of motor columns at a given level (Mohan et al., 2015; Nicolopoulos-Stournaras & Iles, 1983). These counts are taken from a given section and not the entirety of the spinal level, adding further possible variation. Moreover, we have removed the controls as suggested (graphed below) for motoneuron analysis and still do not see a correlation between the number of motoneurons and behavioral performance (BBB and inclined beam). We do not find this the correct way to graphically represent the data as it does not allow the reader to see the natural number of motoneurons that exist at each spinal level and variation within as well as knowing that this is not due to injury correlating with behavioral differences, and therefore we would like to keep these graphs with controls in the manuscript.

      We have toned down the specificity of the model and involvement of motoneurons as requested on pages 20-21.

      Most of the conclusions rely on correlations that include control animals (injected with saline hence with no lesions and no behavioral deficits; Fig 6 and 7). This artificially skews the correlations as those animals show no lesions and good performance in the behavioral tests. These correlations need to be performed only with KA injected animals to determine the respective involvements of interneurons and motoneurons.

      • To address your concern, we first did as you asked and removed the controls and performed the correlation analysis for Figure 6, shown below. There are no significant correlations between neurons at each spinal level and behavior. We would further argue that unlike a contusion injury where control animals only receive a laminectomy, our control animals have very minor neuronal loss due to the saline injection itself and therefore do have a minor lesion. An example of this is seen in Figure 6 for the control animal at spinal level L2 where the pipette track is visible. Therefore, to show that the observed behavioral deficits are from the kainic acid and not the injection itself, we would argue that it is important that the control animals remain in the correlation analysis.

      The long-term study (Fig 8) is performed with very few animals and hence, drawing conclusions from these animal numbers is difficult. All correlations are performed including control animals which is even more of a problem here as in Figure 6 and 7 due to the low number of animals. The authors should either add animals or remove the figure. When control animals (injected with saline) are removed (as they do not show any lesion and perform accurately in the behavior), one would actually see a correlation between the number of motoneurons and the behavioral performance (Fig. 8E,F) but not with the lesion size (Fig.8C,D).

      • The long-term study was planned with more animals, but due to exclusion criteria by lesion length, the numbers remain low. We had discussed extensively whether to include this data in the manuscript or not. We decided for several reasons to include it in the manuscript within the main figures. First, it demonstrates that once these interneurons are lost, there are no compensatory mechanisms that restore function, which is quite striking given that the ones that lose weight support by 2 weeks do not regain it over a 3-month observational period. Further indicating that loss of lumbar gray matter interneurons is essential to locomotor function of hindlimbs and should be targeted in SCI replacement therapeutics. However, we do not agree with removing controls to examine the motoneuron number as there is motoneuron number variation within the lesion area and the motoneuron number from the KA animals is within the Control motoneuron range, which can be seen with the graph including the Controls. We can provide the individual spinal lesion level correlations, but this does not provide the entire picture as one level alone has not been found to be essential to the behavioral deficits. We are currently processing these animals to also provide NeuN numbers from laminae I-IV, V-VIII and IX.

      Minor comments:

      __ Figure 1A: if lesions are bilateral, it would be nice to illustrate this on the schematic.__

      • This has been fixed. Figure 1B-D: scale bars are missing

      • This has been fixed. Figure 3H: What represents the y-axis? % of completion or number of completion?

      • This has been fixed. Figure 4 Table: Please specific what the acronym stands for: pLDA.

      • This has been clarified in the figure legend. Figure 6 A: scale bars are missing

      • This will be fixed when the data for the analysis is finished and the figure is redone. Figure 6B/C/D: please add the spinal level analyzed directly on the graphs. This will ease the comprehension.

      • This has been adjusted. Figure 7 and Figure 8: While it is quite convincing that the model is purely a grey matter injury (panel C and D), the data are very much spread out for the number of motoneurons per mice (see major comments above). We would suggest to plot those data to present the number of neurons (interneurons in layer I-IV, V-VII and motoneurons) control vs KA.

      • Thank you for the suggestion. We will plan on presenting the additional neuronal quantification data mentioned above by comparing Controls and KA animals.

      Dots are missing on those figures (probably superimposed on top of each other). This should be changed to see all data points

      • Thank you for the observation. They were superimposed but we have fixed this. Figure 8E,F: the number of motoneurons is very low also in controls. How is this explained?

      • Depending on where the section was taken at each spinal level, there is variation in the number of motoneuron columns innervating targeted muscles (Mohan et al., 2015), Figure 6). Therefore, it is not surprising to see a range of motoneurons. In addition, we would like to clarify that these motoneuron counts are taken from only three sections across the lesion (from the three lesion injection epicenters), not the whole lumbar section. Often the motoneuron number in the KA group was equal to or greater than the Control group, indicating more often variation than motoneuron loss. Regardless motoneuron numbers do not correlate with the observed behavioral deficits.

      __Reviewer #1 (Significance (Required)):

      This paper by Kuehn and colleagues reports on the functional impairments that follow intermediate gray matter lesions using kainic acid. This work is largely confirmatory of previous studies (Magnuson et al., 1999; Hadi et al., 2000) with modern behavioral evaluation. After revision, it would provide a description of the functional impairments following those specific lesions. The paper would be informative for a specific audience in particular scientists in the field of spinal cord injury and spinal interneuron. Our field of expertise is spinal cord injury, inflammation, behavior and axon outgrowth.__

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      This manuscript reports on a pair of well-designed and well-carried out studies investigating a Kainic Acid (KA)-mediated gray matter lesion in the lumbar enlargement of adult female SD rats. The investigators demonstrate, using NeuN immunohistochemistry, that the KA lesion reduces NeuN positive cells along the length of the lumbar spinal cord from rostral to L2 to slightly caudal to L4 following 6 separate injections made on the right and left sides of the spinal cord at L2, L3 and L4. The investigators made significant efforts to avoid depleting neurons in the dorsal and ventral horns, and the evidence provided suggests they were successful. The methodology described is sound and sufficient details are provided to allow the reader to fully understand the studies. It is outstanding that the study was done while following all of the PREPARE and ARRIVE guidelines. A second major component of the work is the use of multiple outcome measures and efforts (using a Forest analysis) to develop a relatively quick, accurate and efficient system to screen or classify the injuries in individual animals within 2 weeks of the injury so that subsequent treatments could be done on animals which received injuries of sufficient severity (within a relatively narrow range) and with balanced experimental and groups. Again, with this effort the investigators were largely successful. The KA lesion results in persistent locomotor and sensorimotor deficits, that plateau early without substantial sensory dysfunction.__

      Major Comments:

      __ Introduction: Overall, the rationale presented and the review of the pertinent literature is solid, with the following exception: The authors state that their model should allow them to thoroughly investigate the behavioral readout of premotor IN loss. It is generally accepted that the designation of premotor interneurons refer to those directly connected to motor neurons, and while the chosen KA lesion certainly targets some premotor neurons, it also targets many other interneurons that do not directly contact motoneurons. Please revise how the lesions are referred to. In the very next paragraph the targets are defined somewhat differently as "INs and propriospinal INs in laminae V-VII in spinal levels L2-L4".__

      • We agree that our wording does cause confusion to the reader and to avoid this we have now made the change from premotor INs to SpINs (pages 3-5).

      • On a side note, we would like to state these are adult female Fischer rats and not adult female SD rats, also described in the methods.

      Spared white matter. In many (but not all) labs, spared white matter at the epicenter is an important measurement because it presumably represents all the spared axons, such that any/all rostrocaudal communication is represented. Thus, it is the single point (or section, in this case) that has the smallest number of axons represented as stained white matter. So, to indicate that you assessed "three epicenters per spinal cord" doesn't make sense in this context, Even if you are referring to three separate KA injection sites (L2, L3 and L4). Thus, averaging three sections also doesn't really make sense because the actual epicenter should be represented by the single cross section that has the smallest area of stained white matter. Also related to spared white matter, in many labs they calculate %SWM based on a section from a control animal, and this should reduce variability because some cords shrink (injured gray matter) more than others after the injury, whether it be a contusion or mild excitotoxic injury. Please either re-calculate your SWM or provide additional justification for your current method.

      • We agree with the reviewer that normally only the epicenter of the lesion needs to be examined for white matter damage as once the connection is severed it does not matter what is rostral or caudal to this site. However, in our case we do not find any significant differences in white matter between the Controls and the KA groups. To be certain we looked at all three lesion epicenters where the damage occurred. If you examine the graphs below, you will notice that in fact the KA animals have a higher % white matter of the CSA than the Controls. Given how this analysis is done we are looking at % white matter of the cross sectional area (CSA). In the KA animals the loss of gray matter causes a collapse that makes it appear as though the white matter covers more of the CSA area than it normally does. Even if we were to normalize to the Controls you would see the same as what you already observe in these graphs.

      For this reason, we have compared the average area of white matter at the three lesion epicenters between the Control and KA groups and did not find significant differences (new Figure 7C). We also evaluated average area of white matter at the individual spinal levels (L2-L4) and did not find significant differences between the two groups and therefore averaged them. This indicates that we are not seeing any white matter alterations with our lesion model.

      Results: Within the results (and elsewhere) there are a number of un-supported statements that should be removed, softened or supported. For example, on page 18 the authors talk about how the CatWalk "further investigates the role of propriospinal INs connecting the cervical and lumbar enlargements" and no reference is provided.

      • The requested references have now been added.

      It is important to note that two animals were not included overall because they were unable to perform the CatWalk assessment. Additional information about these animals might be helpful to further characterize the KA lesions, for example, when they are too large.

      • Yes, we have looked into this. Lesion size appears to play a role (Figure 2C) but does not appear to be the only determining factor as two animals (KA#6, KA#7) with and without weight support had the same lesion length (10,325um). We predict this is due to the amount of neuronal loss; KA#7 had greater neuronal loss in all three levels compared to KA#6.

      Figure 6 brings up a number of questions including how the three "epicenters" were determined and how some KA lesioned spinal cords appear to have more than 100% the number of neurons in the control spinal cords. Yes, there is variability in normal animals, but still this seems unlikely. Is it possible that the KA injection sites were not accurate in these animals? I know it is unlikely, however, the large number of neurons in some animals at L2 is bothersome. Did the investigators always inject L2, L3 and L4 in that order? Pipettes tend to wick up liquid thus diluting the drug/cells/whatever at the tip.

      • We understand your concern of being greater than 100%, therefore we have changed the normalization to the greatest control value vs the average of controls (except for lesion size which is done to largest lesion size overall, new Supplemental Figure 3 and Figures 6 and 7 will be altered once our new analysis is finished).

      • Animal KA #1 that you are referring to could have been a technical error due to injections but it is hard to say at this point as we have found nothing from our surgery records that indicate why this animal would be different from others. Yes, bilateral injections were always performed in the same order (L2, L3, L4). However, we think it is unlikely that this created a significant drug dilution problem as we see animals with more damage in L4 than L3 or L2 (KA #3, #6 and #7 in new Supplemental Figure 3). But clearly L2 in animal KA 1 is not significantly damaged.

      Also for Figure 6, I am not convinced that the color coding is really very useful here. I think what might be more useful would be some higher magnification images of the intermediate gray matter. This figure also appears to show pipette tracks in some sections suggesting that the KA was leaking up the track either during injection or when the pipette was withdrawn. This is not a serious issue, but might be worth mentioning as a confound.

      • First, we would like to clarify that Figure 6 is already a higher magnification image of only laminae V-VII not the entire gray matter (please see figure legend). Figure 1 is a lower magnification but here in Figure 6 we wanted to highlight the region of interest that was analyzed for neuronal loss. Pipette tracks were also observed in the Controls and not thought to be due to KA leakage, as we don’t see neuronal damage beyond the injection tracts in the dorsal horns. With the new figure we will see if the color coding will be added or not dependent on the space available.

      Finally, for Figure 6, the correlations shown are quite poor, and would be even worse of control animals were not included. Too much strength is given to these findings.

      These issues with Figure 6 become even more serious as we move to Figure 7. Here, looking at the correlation to loss of MNs is weak because this reviewer is not convinced that looking at the "three epicenters" is a valid approach. Were the epicenters identified by particular criteria? Also, I think images showing how MNs were identified and counted would be important, in particular since you did not use ChAT staining but relied on NeuN and size.

      • These epicenters were chosen after reviewing all coronal sections in a 1:7 series of the lumbar cord (T12-L5). The three epicenters were the three coronal slices with the greatest neuronal loss (methods, page 12). This is supported by the inflammatory response in these sections (not shown).

      • Please see the schematic below that explains the motoneuron analysis that is also performed in our work which is detailed in the methods. Briefly, the cell soma area of NeuN+ cells in lamina IX were measured in Image J. NeuN+ cells with an area greater than 916µm2 were used for the motoneuron analysis (Wen et al, 2015).

      • We agree that in Figure 6 the correlations for each spinal level although significant are moderate but this is due to the fact that one given spinal level was not found to be responsible for the behavioral deficits. This is supported by our work on correlation with lesion length, the lesion must span multiple levels to produce the behavioral deficit. Finally, the correlations may change when we add in lamina VIII, but we won’t know until the analysis is finished.

      • As for Figure 7, we agree that we do not see correlations and our argument is that motoneuron and white matter area are not responsible for the behavioral deficits we observe (new Figure 7). Therefore, you are reading those correctly, these are not significant correlations.

      Discussion Yes, interneurons in the intermediate gray matter throughout the lumbar enlargement "regulate lower motoneurons" but they also do other things, most notably communicating both intra and intersegmentally (short and long propriospinals). Please adjust this statement.

      • We appreciate this detailed feedback, we have adjusted this statement to the following:

      “Damage to this area, which includes regulation of lower motoneurons leads not only to gross motor deficits (BBB score), but rhythmic and skilled walking (even and uneven horizontal ladders), coordination (BBB subscore), balance (inclined beam) and gait deficits (CatWalk), as well.” (page 25)

      On page 25, you talk again about premotor SpINs. I understand that you are using this term/nomenclature to distinguish these INs from motoneurons, but this is problematic because many if not most of your readers will assume the premotor SpINs synapse directly onto MNs, which of course many of the INs that are eliminated by KA do not. Calling them simply SpINs would be sufficient and still distinguish them from MNs.

      • We have adjusted this to the term “SpIN and premotor circuitry” on pages 26 and 27.

      On page 27 you talk about the RI, and while there is a statistically significant drop in RI, it must be admitted that the RI remains above 90% (0.9) which means that 9 out of 10 steps use a normal sequence. Thus, I think it is misleading to indicate that this indicates a difference for the KA animals. In fact, I think it is more important to consider how these animals were able to maintain an RI in excess of 90% despite the loss of substantial numbers of INs.

      • Thank you for the comment, we have adjusted this in the discussion:

      “In addition to gait rhythm changes, we also saw significant differences in pattern generation. The regularity index (RI) measures correctly sequenced footsteps and is used to analyze recovery in mild to moderate injuries and coordination (Koopmans et al., 2005; Kuerzi et al., 2010; Shepard et al., 2021). While KA-animals have a significantly lower RI in comparison to the controls, the RI remains above 90% which is still relatively high given the amount of neuronal loss. However, we would argue that a single parameter is not the defining factor of gait/coordination, but a combination of parameters and tests provides a more comprehensive picture, as we have seen with our pLDA analysis and Random Forest classification approaches.” (Pages 28-29)

      The rationale for determining classification prior to histological analysis is somewhat weak, and I think it would be worthwhile strengthening this rationale at the beginning of this paragraph...it becomes more obvious later why this classification is important. Is the variability of the KA model greater than an NYU or IH contusion model? If so, why? The early functional plateau is key to this argument.

      • We postulate that less severe SCIs and our milder KA lesion tend to have more variability than more severe SCI models. In the contusion models this is due to the delayed natural compensatory functional recovery plateau that can last up to 5-6 weeks. However with the KA model, variability arises from titrating down KA and adding multiple injection sites increasing variable success rate per injection. In the KA model, the early functional plateau at two weeks allows for correctly excluding or classifying animals into equally lesioned groups prior to treatment with our Random Forest Eco model. We agree that we need to clarify this reasoning in the results and have now done so on page 22. “To test the efficacy of experimental SCI therapies, it is important to effectively evaluate recovery performance through the combination of behavioral tests. In addition to carefully classifying groups at the end of the study, there is a need to provide exclusion criteria and equal sorting of variability between groups prior to treatment (after deficits have stabilized at two weeks).” (page 22)

      Minor Comments:

      __ Heatmap Analysis: The term "lesion size" is insufficiently accurate to be used in this context. Do you mean lesion length?__

      • This term has now been adjusted to lesion length throughout the manuscript and figures.

      Kainic Acid injuries are known to be accompanied by cell division and neurogenesis in the brain, and if that kind of thing is happening in the presented model, it could be an interesting confound/addition to the alluded to cellular replacement __therapies.____

      __

      • KA has been shown to be accompanied by cell division and neurogenesis in the brain, however from our own work and previous work with KA in the spinal cord if this occurs it is not at a level that is relevant to functional recovery as evidenced in our long-term study. A previous study by Magnuson et al compared E14 cerebral rat precursor cell transplantation 40 minutes and 4 weeks post-KA injury and did not find significant differences in cell survival/division (Magnuson et al., 2001). Therefore, we do not believe this would hamper or confound our future work with cellular replacement therapies. In addition, cell transplantation would take place 2 weeks post-KA injury when KA would no longer be able to hamper the transplanted cells.

      __Reviewer #2 (Significance (Required)):

      __

      __ Overall, this is a well-designed and performed set of studies that takes the KA lesion model into new territory, well set-up to perform delayed (sub-acute or early chronic) neuron replacement studies. The work characterizes a multi-segment but mild KA injury model that demonstrates persistent dysfunction that plateaus early, and a rapid and efficient system to classify the injury with a high predictability of long-term dysfunction by 2 weeks post-injury.

      This model should be of interest because it focuses on gray-matter specific tissue loss and functional deficits that should be amenable to neuron replacement strategies without the complications of white-matter dependent functional losses.

      My expertise: I have been using a variety of spinal cord injury models, in rats, for many years including contusions, lacerations and excitotoxic (KA) lesions. I have a lot of experience with locomotor, motor and sensory outcome measures. However, I have very limited experience with the Random Forest analysis employed and am not an expert in statistics.__

      __References: __

      Hadi, B., Zhang, Y. P., Burke, D. A., Shields, C. B., & Magnuson, D. S. (2000). Lasting paraplegia caused by loss of lumbar spinal cord interneurons in rats: no direct correlation with motor neuron loss. J Neurosurg, 93(2 Suppl), 266-275. https://doi.org/10.3171/spi.2000.93.2.0266

      Koopmans, G. C., Deumens, R., Honig, W. M., Hamers, F. P., Steinbusch, H. W., & Joosten, E. A. (2005). The assessment of locomotor function in spinal cord injured rats: the importance of objective analysis of coordination. J Neurotrauma, 22(2), 214-225. https://doi.org/10.1089/neu.2005.22.214

      Kuerzi, J., Brown, E. H., Shum-Siu, A., Siu, A., Burke, D., Morehouse, J., Smith, R. R., & Magnuson, D. S. (2010). Task-specificity vs. ceiling effect: step-training in shallow water after spinal cord injury. Exp Neurol, 224(1), 178-187. https://doi.org/10.1016/j.expneurol.2010.03.008

      Mohan, R., Tosolini, A. P., & Morris, R. (2015). Segmental Distribution of the Motor Neuron Columns That Supply the Rat Hindlimb: A Muscle/Motor Neuron Tract-Tracing Analysis Targeting the Motor End Plates. Neuroscience, 307, 98-108. https://doi.org/10.1016/j.neuroscience.2015.08.030

      Nicolopoulos-Stournaras, S., & Iles, J. F. (1983). Motor neuron columns in the lumbar spinal cord of the rat. J Comp Neurol, 217(1), 75-85. https://doi.org/10.1002/cne.902170107

      Pitzer, C., Kurpiers, B., & Eltokhi, A. (2021). Gait performance of adolescent mice assessed by the CatWalk XT depends on age, strain and sex and correlates with speed and body weight. Sci Rep, 11(1), 21372. https://doi.org/10.1038/s41598-021-00625-8

      Shepard, C. T., Pocratsky, A. M., Brown, B. L., Van Rijswijck, M. A., Zalla, R. M., Burke, D. A., Morehouse, J. R., Riegler, A. S., Whittemore, S. R., & Magnuson, D. S. (2021). Silencing long ascending propriospinal neurons after spinal cord injury improves hindlimb stepping in the adult rat. Elife, 10. https://doi.org/10.7554/eLife.70058

      Wen, J., Sun, D., Tan, J., & Young, W. (2015). A consistent, quantifiable, and graded rat lumbosacral spinal cord injury model. J Neurotrauma, 32(12), 875-892. https://doi.org/10.1089/neu.2013.3321

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      *Randhawa and co-authors have studied various aspects of the regulation of lignocellulose degradation by the filamentous ascomycete fungus Penicillium funiculosum. Over-expression of the well-known transcription factor clr2 (which regulates cellulase gene expression in Neurospora and other ascomycetes) in a delta-mig1 strain did not result in an increase in cellulase activity. However, when combined with an increased Ca2+ concentration the cellulase activity in the medium did increase. Using RNA-Seq, the authors have identified a candidate regulator: Snf1. Indeed, a knockout confirms that this gene is involved in the posttranscriptional regulation of cellulase production, specifically by regulating the secretion of the cellulases. *

      Major comments:

      In general, the topic and results are interesting. There are a few issues that need to be addressed, however. The manuscript would benefit from some careful proofreading. For example, articles ('the', 'a') are frequently missing. Very informal language is sometimes used ('zilch effect'). Put a space between '1000bp', etc. It is 'kDa', not 'kD', etc.

      Response – Thank you very much for the encouraging remarks. We have thoroughly checked the manuscript and have added the articles at appropriate places. We have also improved the manuscript’s language and removed any informal language used.

      I am a bit puzzled by the choice of calcium source: CaCO3, up to 10 g/L. Calcium carbonate does not efficiently dissolve in water unless the pH is low. Fungi generally acidify their culture medium during growth. As such, calcium carbonate likely has a pH buffering effect. Therefore, the described effects may also be attributed to a more neutral pH of the medium, and not necessarily to an increase in calcium ions.

      Response – We completely agree with the reviewer and had the same thought that the pH buffering effect of CaCO3 could be the reason for increased cellulase production. We ruled out this by using 50 mg/l CaCl2 solely in rest of the experiments performed in Fig. 3 and afterwards. We have also mentioned the same in the manuscript (lines 175-178).

      The authors have performed RNA-Seq, but as far as I can tell the data has not been made publicly available. At least, the raw reads should be deposited in the Short Read Archive of NCBI (or a similar repository), and preferably also the expression values in GEO of NCBI (or a similar repository).

      Response – We will comply and deposit the raw reads in the short read archive of NCBI. We will also be providing the differential analysis of transcription factors expressed under glucose and Avicel in NCIM1228 and ∆Mig1 in the supplementary information.

      P21. Very little information is provided in the M&M regarding the gene expression analysis. Provide references to all the tools, as well as the version numbers. Were any non-default parameters used?

      Response – We have added the complete information on tools and procedures used for RNA-seq data analysis. For differential expression profiling, all FPKM values were normalized to the library size using the R package, Edge R. The expression value for the transcript was calculated using the reads aligned & normalised it on library size (Total sequencing reads generated) & transcript length giving us FPKM value (Fragments Per Kilobase of transcript per Million mapped reads), and TPM value (Transcript per million reads), which is regarded as normalized expression value for a particular transcript. We have taken the number of reads which got aligned to the conserved transcripts (Present in both the comparison group i.e Wild Type Glu & Cellulose samples (S1, S2, S7 & S8) Vs MIG1 glu & Cellulose sample (S3, S4, S5 & S6) and performed the differential gene expression between the two groups. The excel sheet having differential expression profiling of transcription factors is available as supplementary data.

      The authors claim that SSP1 CaMKK phosphorylates SNF1 AMPK (last title of the Results section). I don't see any evidence for a direct interaction between these two proteins. I will believe that they are in the same pathway, but if the authors want to claim a direct interaction then additional experiments will be required. E.g. Y2H.

      Response – Ssp1 is known to phosphorylate SNF1 during nutritional stress in S. pombe and they were found to interact directly by Co-IP studies. Based on the literature, we planned to over-express Ssp1 in P. funiculosum.

      Minor comments:

      • Please add line numbers to the manuscript, this facilitates the review process.*

      Response - Line numbers have been added.

      *P14 "in all yeasts and filamentous fungi". I doubt that all fungi have been tested. *

      Response - The phrase has been modified.

      P18. "in diverse yeasts and fungi". Yeasts are also fungi.

      Response - The phrase has been modified.

      P16. "solves dual purpose". I think this is meant: "serves a dual purpose"?

      Response - The phrase has been modified.

      *P17, first paragraph: this seems very speculative to me, so it should probably be labeled as such. *

      Response - The phrase has been modified.

      P21. What reference genome is used? Please cite the paper.

      Response - We have our own reference genome in lab which is yet to be published.

      Fig 1B. These are reported as volcano plots, but to me it looks like an empty graph (no data points), only a number of genes.

      Response - The pictures have been changed.

      Fig 1D. What do the colors on the right represent? The colors on the right represents k-means clustering of the genes of transcription factors.

      Response - The same has been added to the figure legend also.

      On various places in the manuscript the term "three times in triplicate" is used. What is meant here, three technical replicates of each of the three biological replicates?

      Response - Yes we mean the same and the phrase has been modified.

      P46. "We aimed to sought"

      Response - The phrase has been modified.

      Abstract: The sentence "Further, Ca2+-signaling" should be rewritten, because currently is seems to suggest that SSP1 downregulates the phospho-HOG1 levels.

      Response – As suggested by the Western blot in the Fig.4b, Snf1 gets phosphorylated only when dual signal of calcium and cellulose are present. Since we observed upregulated Ssp1 expression in Avicel (Fig. 4a), and increased Ssp1 expression could increase the phosphorylated Snf1 in the cell (Fig. 7i), our data suggests that Ssp1 phosphorylates Snf1 in a Ca2+-dependent manner. Further Hog1 was found in hyperphosphorylated state in ∆Snf1 (Fig 6e), thus we believe Snf1 AMPK downregulates phospho-Hog1 levels.

      Reviewer #1 (Significance (Required)):

      *In general, the topic and results are interesting. There are a few issues that need to be addressed, however. The manuscript would benefit from some careful proofreading. *

      Response – We highly appreciate the encouraging words of the reviewer. We have addressed all the issues raised by the reviewer. The major ones included the language and readability of the text, which has been improvised. We have replaced the volcano plot figures, and will be uploading the RNA-seq data to the SRA database of NCBI and excel sheet of differential expression analysis of transcription factor will be added as a supplementary file to the manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      • Randhawa et al. study the effect of loss of function of Snf1 kinase and calcium on the production of enzymes related to cellulose degradation in the fungus Penicillium funiculosum. *
      • The manuscript is well structured and the researchers have done an enormous amount of work in constructing a number of mutant strains in this fungus. *
      • Transcriptomics and proteomics support the conclusions reached with the strains generated.*

      Response - Thank you very much for showing confidence in our research work, we are highly obliged by positive remarks on the manuscript.

      The manuscript is long and suffers from an excess of results presented in figures. My main criticism focuses on the presentation of data on the cellular distribution of the ER and Golgi apparatus. The micrographs are inconclusive and it is not really clear what the authors are trying to show in these experiments. These results are not really necessary for the article and I suggest that they be removed from the article.

      Response – We agree with the reviewers comments on data on the cellular distribution of the ER and Golgi apparatus. We have removed the micrograph data on the cellular distribution of the ER and Golgi apparatus (earlier Figure 3j and Figure 4r).

      Reviewer #2 (Significance (Required)):

      The authors have done an excellent job in producing a large number of strains carrying null alleles. In addition, they have used two broad analysis techniques that allow them to establish coherent hypotheses and corroborate them with the results.

      Response – Thank you very much for the positive comments

      The manuscript is difficult to understand in some sections because of the excessive amount of data and panels in the figures. The names designating each strain and given in full length in the graphs do not help either.

      Response – Thank you very much for the valuable suggestion. We have reduced the number of graphs by including all enzymes assays in one concise graph in Figure 4. We have also shortened the names of strains and enzymes, in all the figures.

      This work is of interest to all researchers interested in the integrity of signaling and regulatory pathways on extracellular enzymes of biotechnological interest.

      *My interests focus on the cell biology of filamentous fungi, in particular on the molecular mechanisms and subcellular localization of elements involved in intracellular transport, signaling against environmental stresses and changes in transcriptional regulatory patterns. *

      Response – Thank you once again for the encouraging remarks.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The manuscript submitted by Randhawa et al focus on the mechanism of cellulases secretion, the very important and basal question in the filamentous fungi, particularly for cellulases biotechnology. As the author said the molecular basis of cellulases production previously study mainly focuses on regulation mechanism at transcription level, the study of molecular mechanism of cellulases translation and secretion are much rare. Therefore the submitted work is very impressive me on the progress of this area. What they presented shown the Ca2+ is critical for the regulation of cellulases secretion by SNF-1, SSP1 and HOG1. The regulation might be caused by affecting the protein trafficking in ER and Golgi, the manuscript found the development of ER and Golgi shown changes by staining by ER-tracker and Bodipy under different conditions and mutants. The manuscript constructed a model about regulatory mechanism of Ca2+ on cellulases translation and secretion level. The present study is close to make significant progress in the cellulases regulation area.

      Response – We appreciate the positive comments of the reviewer.

      Major comments: I am really impressive for the great work in the manuscript, however, I think the more work do need for give the conclusion of paper.

      1.In terms of dynamics development of ER and Golgi of strains, the very critical data for the conclusion of the paper, the current data is only by chemical staining. It is not robust, it will be needed by other methods, for example, GFP-labeling the marker of ER or Golgi.

      Response – The manuscript focuses on the signaling events governing cellulase production, and secretion. Since ER and Golgi are the sites of protein production and secretion, we hypothesized, if the Ca2+ signaling affects post-transcriptional events, it must have had some impact on the dynamics of these organelles; and microscopy experiments suggested us the same. In the next set of experiments, we proved our hypothesis with the proteomics and functional analysis of Snf1, Ssp1, and Hog1 MAPK. Hog1 MAPK pathway is known to regulate protein trafficking and secretion in yeast. We here showed that Ca2+- dependent regulation of Hog1 MAPK and its downregulation by Snf1 AMPK is crucial to cellulase secretion.

      2.Also the author try to suggest the cellulases were detained in the ER, not went into Golgi, therefore the secretome protein decreased. It is very much possibly but the evidence is not robust either, to trafficking the GFP-labelled CBH1 might be a good experiment to make it clear.

      Response – Thank you very much for raising the query. The manuscript majorly focuses on the role of calcium signaling on cellulase translation and secretion. Further, we have studied two signaling proteins, Snf1 AMPK and Hog1 MAPK which are downstream to calcium signaling, and we found their crosstalk vital to cellulase secretion. We have not talked about cellulases being detained in the ER or Golgi, rather we focused on the signaling events regulating cellulase production and transport.

      Since we had already ruled out the role of calcium in cellulase transcriptional activation, and ER and Golgi being major site of protein production in the cell; we performed microscopy experiments to see if the calcium signaling modifies ER and Golgi morphology during carbon stress. We found under-developed Golgi in the absence of calcium in wild type. This experiment helped us to build a hypothesis that calcium signaling might have role in downstream events like protein translation, and secretion. The hypothesis was proved by functional analysis of signaling proteins, Western blot and proteomics experiments. Further, microscopy experiments further strengthened our observation that Snf1 AMPK is downstream target of calcium signaling and has no role in the cellulase translation, but cellulase secretion.

      Considering that we are not focusing on the protein trafficking of cellulase, the confocal microscopy experiments are not decisive, rather build supporting evidence for our hypothesis, as suggested by the second reviewer. We have proved our hypothesis of Ca2+-dependent post-transcriptional regulation of cellulase by proteomics, and other biochemical experiments. Nevertheless, we plan to perform the confocal experiments again to achieve pictures with higher resolution.

      1.On page 9, please indicate the fold changes of the kinases genes talked about, snf1 and so on.

      Response – We have added the Fold change in the expression of Snf1 and Ssp1 (line number 221).

      2.The quality of microscopic figure is not good, should have one with higher resolution, even consider to present the electron microscope picture to give the er and Golgi dynamics changes the manuscript talked about(optional).

      Response: We agree with the reviewer’s suggestion to add high resolution confocal images of mycelia in Fig 3j and Fig. 4o. We are in the process of repeating the confocal microscopy experiment. We will update the manuscript with improved microscopic pictures.

      *3. The quality of Western plot need to be improved, particularly figure 4f,figure 7i, it is hard to give the conclusion based on the picture presented *

      Response – We have replaced the pictures of western blots (Fig 4f, and Fig 7i) with high resolution images.

      Reviewer #3 (Significance (Required)):

      The manuscript submitted by Randhawa et al focus on the mechanism of cellulases secretion, the very important and basal question in the filamentous fungi, particularly for cellulases biotechnology. As the author said the molecular basis of cellulases production previously study mainly focuses on regulation mechanism at transcription level, the study of molecular mechanism of cellulases translation and secretion are much rare. Therefore the submitted work is very impressive me on the progress of this area. What they presented shown the Ca2+ is critical for the regulation of cellulases secretion by SNF-1, SSP1 and HOG1. The regulation might caused by affecting the protein trafficking in ER and Golgi, the manuscript found the development of ER and Golgi shown changes by staining by ER-tracker and Bodipy under different conditions and mutants. The manuscript constructed a model about regulatory mechanism of Ca2+ on cellulases translation and secretion level. The present study is close to make significant progress in the cellulases regulation area.

      Response - Thank you for the positive comments on the manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Randhawa and co-authors have studied various aspects of the regulation of lignocellulose degradation by the filamentous ascomycete fungus Penicillium funiculosum. Over-expression of the well-known transcription factor clr2 (which regulates cellulase gene expression in Neurospora and other ascomycetes) in a delta-mig1 strain did not result in an increase in cellulase activity. However, when combined with an increased Ca2+ concentration the cellulase activity in the medium did increase. Using RNA-Seq, the authors have identified a candidate regulator: Snf1. Indeed, a knockout confirms that this gene is involved in the posttranscriptional regulation of cellulase production, specifically by regulating the secretion of the cellulases.

      Major comments:

      In general, the topic and results are interesting. There are a few issues that need to be addressed, however. The manuscript would benefit from some careful proofreading. For example, articles ('the', 'a') are frequently missing. Very informal language is sometimes used ('zilch effect'). Put a space between '1000bp', etc. It is 'kDa', not 'kD', etc.

      I am a bit puzzled by the choice of calcium source: CaCO3, up to 10 g/L. Calcium carbonate does not efficiently dissolve in water unless the pH is low. Fungi generally acidify their culture medium during growth. As such, calcium carbonate likely has a pH buffering effect. Therefore, the described effects may also be attributed to a more neutral pH of the medium, and not necessarily to an increase in calcium ions. The authors have performed RNA-Seq, but as far as I can tell the data has not been made publicly available. At least, the raw reads should be deposited in the Short Read Archive of NCBI (or a similar repository), and preferably also the expression values in GEO of NCBI (or a similar repository). P21. Very little information is provided in the M&M regarding the gene expression analysis. Provide references to all the tools, as well as the version numbers. Were any non-default parameters used? The authors claim that SSP1 CaMKK phosphorylates SNF1 AMPK (last title of the Results section). I don't see any evidence for a direct interaction between these two proteins. I will believe that they are in the same pathway, but if the authors want to claim a direct interaction then additional experiments will be required. Eg Y2H.

      Minor comments:

      Please add line numbers to the manuscript, this facilitates the review process.

      P14 "in all yeasts and filamentous fungi". I doubt that all fungi have been tested.

      P18. "in diverse yeasts and fungi". Yeasts are also fungi.

      P16. "solves dual purpose". I think this is meant: "serves a dual purpose"?

      P17, first paragraph: this seems very speculative to me, so it should probably be labeled as such.

      P21. What reference genome is used? Please cite the paper.

      Fig 1B. These are reported as volcano plots, but to me it looks like an empty graph (no data points), only a number of genes.

      Fig 1D. What do the colors on the right represent?

      On various places in the manuscript the term "three times in triplicate" is used. What is meant here, three technical replicates of each of the three biological replicates?

      P46. "We aimed to sought"

      Abstract: The sentence "Further, Ca2+-signaling" should be rewritten, because currently is seems to suggest that SSP1 downregulates the phosphor-HOG1 levels.

      Significance

      In general, the topic and results are interesting. There are a few issues that need to be addressed, however. The manuscript would benefit from some careful proofreading.

    1. I hope that they pass something because the whole nation deserves some protections, not just certain states. There are improvements that they should make on transparency. For example, if Facebook’s privacy policy says, “We use information about groups you follow to personalize ads,” as opposed to if they say, “We use the content of your communications to personalize ads,” I think most consumers would think of those as different. Some people may be more comfortable with one rather than the other.

      The desire to provide comprehensive protection for everyone in the country, not just certain states. Regarding transparency, the importance of using clear and precise language in privacy policies has been well accepted. Consumers should be able to understand how companies like Facebook use their data, and using clear, simple language in privacy policies can help achieve that goal. This example highlights the difference in perception, depending on how the information is presented. Ultimately, it is important that consumers have access to transparent and meaningful information about how their data is used so they can make informed decisions about whether to share it.

    2. I think most consumers would think of those as different. Some people may be more comfortable with one rather than the other. Right now, we don’t know which one Facebook does, because all they say is “We use your personal information.”

      The companies are very sneaky and cheat us in purpose. They write their privacy informations so long and hide it. We need to expand to read them all. Some of them do not have decline bottom. We have to agree the statement and accept to continue.

    3. I think most consumers would think of those as different. Some people may be more comfortable with one rather than the other. Right now, we don’t know which one Facebook does, because all they say is “We use your personal information.”

      It is very sneaky how companies write their privacy information and policies/contracts. The purposefully make it long and difficult to understand which is probably why it is such a struggle to make and maintain laws about privacy.

    1. we should not refer to persons in ways that may imply that they are essentially defined by something that they are, in fact, managing.

      I don't think that LGBT people are managing this, I think that they are using it to define themselves and make that almost their whole existence. It is not the same thing as having a disease.

    1. Author Response

      Reviewer #1 (Public review):

      1.0) This paper investigates the metabolic basis of a node, posterior cingulate cortex (PCC), in the default node network (DMN). They employed sophisticated MRI-PET methods to measure both BOLD and CMRglc changes (both magnitude and dynamics) during attention-demanding and working memory tasks. They found uncoupling of BOLD and CMRglc in PCC with these different tasks. The implications of these findings are poorly interpreted, with a conclusion that is purely based on other work independent of this study. Various suggestions could allow them to place some speculations in line with a stronger interpretation of their results.

      This is one of several papers in recent years investigating the metabolic underpinnings of activated (or task-positive) and deactivated (or task-negative) cortical areas in the human brain. In this study, they used BOLD fMRI and glucose PET scan to examine the metabolic distinction of the default node network (DMN), which is known to be deactivated during attention-demanding tasks, with different types of cognitively demanding tasks. Unlike the BOLD response in posteromedial DMN which is consistently negative, they found that CMRglc of the posteromedial DMN (a task-negative network) is dependent on the metabolic demands of adjacent task-positive networks like the dorsal attention network (DAN) and frontoparietal network (FPN). With attention-demanding tasks (like Tetris) the BOLD and CMRglc are both downregulated in DMN (specifically the posterior cingulate cortex, PCC, a task-negative node of DMN), but working memory induces CMRglc increase in PCC and which is decoupled from the negative BOLD response in PCC.

      We thank the reviewer for the constructive feedback and the possibility to improve our manuscript. We agree that the interpretation of the results should be strengthened to provide a stronger focus on our data. Regarding the uncoupling of BOLD and CMRGlu during working memory, we acknowledge the need to further elaborate on this topic in our discussion. These suggestions and comments have been incorporated into the revised manuscript as outlined below.

      1.1) These complicated results are the main findings, and to provide a biological basis to these data they rather surprisingly, but without their own experimental evidence, conclude that the negative BOLD and negative CMRglc in PCC during attention-demanding tasks is due to decreased glutamate signaling (which was not measured in this study) and the negative BOLD and positive CMRglc in PCC during working memory is due to increased GABAergic activity (which was not measured in this study). It is rather surprising that without measurement, a conclusion is made which would at best be considered a hypothesis to be tested. Thus, independent of these hypothesized mechanisms, they need to summarize their results based on their own measurements in this study (see 3 for a hint).

      Thank you for bringing up this point and for the insightful suggestion concerning point 3. We have now explicitly stated that the interpretation regarding glutamate and GABAergic signaling is of speculative nature as theses were not measured in the current work, moreover, we have substantially reduced this section. As such, we agree with the reviewer that this represents an interesting hypothesis to be tested in future work. For further details please see response to comments 1.3 and 1.4.

      Discussion, page 16, line 341:

      On the neurotransmitter level, one of the current hypotheses regarding BOLD deactivations proposes that CMRO2 and CBF are affected by the balance of the excitatory and inhibitory neurotransmitters, specifically GABA and glutamate (Buzsáki et al., 2007; Lauritzen et al., 2012; Sten et al., 2017). In the PCC, glutamate release prevents negative BOLD responses (Hu et al., 2013), whereas a lower glutamate/GABA ratio is associated with greater deactivation (Gu et al., 2019). As glutamate elicits proportional glucose consumption (Lundgaard et al., 2015; Zimmer et al., 2017), decreases in glutamate signaling in the pmDMN could indeed explain both, the decreased BOLD response and decreased CMRGlu during the Tetris® task. Conversely, increased GABA supports a negative BOLD response in the PCC (Hu et al., 2013), as do working memory tasks (Koush et al., 2021) and pharmacological stimulation with GABAergic benzodiazepines (Walter et al., 2016). In consequence, the observed dissociation between BOLD changes and CMRGlu during working memory could indeed result from metabolically expensive (Harris et al., 2012) GABAergic suppression of the BOLD signal (Stiernman et al., 2021). However, we need to emphasize that glutamate and GABAergic signaling was not measured in the current study, thus, the above interpretations are of speculative nature. Nonetheless, future work may test this promising hypothesis, e.g., using pharmacological alteration of GABAergic and glutamatergic signaling or optogenetic approaches modulating GABAergic interneuron activity.

      Furthermore, to maintain a more concise discussion that is closer aligned with the measured results, we have removed the following paragraph:

      Discussion, page 15, line 309:

      The associations of these metabolic demands between the DMN and task-positive networks is also reflected in their distance along a connectivity gradient, which is hierarchically organized from unimodal sensory/motor to complex associative functions and the DMN being at the end of the processing stream (Margulies et al., 2016; Smallwood et al., 2021). A corresponding decrease in pmDMN glucose metabolism was observed for tasks that activate unimodal networks and the DAN, but not for the FPN. The inverse influence of attention and control networks on the pmDMN may therefore suggest that connectivity gradients are supported by the underlying energy metabolism.

      1.2) It is mentioned that the FDG-PET scans allow quantitative CMRglc, both in terms of units of glucose use but also with high time resolution. Based on the method described, it isn't clear how this is possible. Important details of either prior work or their own work have been excluded that show how the time course of CMRglc (regardless of whether it's absolute or relative) can be compared with the BOLD time course. Furthermore, it is extremely difficult to conceive that quantitative CMRglc can be estimated without additional measurements (e.g., blood samples, etc). Significant methodological details have to be provided, which even should make their way to results given the importance of their BOLD-CMRglc coupling and decoupling in the same region.

      We thank the reviewer for this important comment and apologize for the lack of clarity. We would like to emphasize that in the current work only spatial patterns of CMRGlu and BOLD signal changes were compared, but not the time course of these signals. The manuscript was edited throughout to clarify this point.

      Introduction, page 5, line 110:

      Studies using simultaneous fPET/fMRI have shown a strong spatial correspondence between the BOLD signal changes and glucose metabolism in several task-positive networks and across various tasks requiring different levels of cognitive engagement (Hahn et al., 2020, 2016; Jamadar et al., 2019; Rischka et al., 2018; Stiernman et al., 2021; Villien et al., 2014).

      Introduction, page 5, line 123

      Specifically, it is unknown whether the observed dissociation between patterns of metabolism and BOLD changes in the DMN generalizes for complex cognitive tasks, and whether this in turn depends on the brain networks supporting the task performance and their interaction with the DMN.

      Results, page 7, line 143:

      From this dataset (DS1) we evaluated the spatial overlap of negative task responses in the cerebral metabolic rate of glucose (CMRGlu quantified with the Patlak plot) and the BOLD signal specifically in the pmDMN. […] After that, the distinct spatial activation patterns across different tasks were used to quantitatively characterize the CMRGlu response of the pmDMN in DS1.

      The method of functional PET (fPET) imaging indeed enables the evaluation of changes in glucose metabolism with a relatively high temporal resolution. That is, a conventional bolus application and subsequent quantification yield a single CMRGlu image per scan of about 60 min (typical frame length ~1-5 min) or a single SUV image from a static scan. In contrast, the constant infusion employed in fPET allows to assess baseline metabolism and changes induced by different tasks in a single scan by using a frame length currently down to 6-30 s (Rischka et al., 2018), where the latter was also used in the current study. A general description of the fPET approach is now also included in the manuscript.

      Introduction, page 5, line 99:

      In this context, functional PET (fPET) imaging represents a promising approach to investigate the dynamics of brain metabolism. fPET refers to the assessment of stimulation-induced changes in physiological processes such as glucose metabolism (Villien et al., 2014; Hahn et al., 2016) and neurotransmitter synthesis (Hahn et al., 2021) in a single scan. The temporal resolution of this approach of 6-30 s (Rischka et al., 2018) is considerably higher than that of a conventional bolus administration. This is achieved through the constant infusion of the radioligand, thereby providing free radioligand throughout the scan that is available to bind according to the actual task demands. Here, the term “functional” is used in analogy to fMRI, where paradigms are often presented in repeated blocks of stimulation, which can subsequently be assessed by the general linear model.

      Regarding the absolute quantification of CMRGlu, arterial blood samples were obtained from all subjects of DS1. These were used for absolution quantification of CMRGlu with the Patlak plot. Full details were already provided in the methods section and are now also mentioned in the results.

      Results, page 7, line 140:

      Simultaneous fPET/fMRI data and arterial blood samples were acquired from 50 healthy participants during the performance of the video game Tetris®, a challenging cognitive task requiring rapid visuo spatial processing and motor coordination (Hahn et al., 2020; Klug et al., 2022). From this dataset (DS1) we evaluated the spatial overlap of negative task responses in the cerebral metabolic rate of glucose (CMRGlu quantified with the Patlak plot) and the BOLD signal specifically in the pmDMN.

      Methods, page 19, line 399:

      For glucose metabolism, these changes are absolutely quantified in μmol/100g/min with the arterial input function and the Patlak plot.

      Methods, blood sampling, page 24, line 536:

      Before the PET/MRI scan blood glucose levels were assessed as triplicate (Gluplasma). During the PET/MRI acquisitions manual arterial blood samples were drawn at 3, 4, 5, 14, 25, 36 and 47 min after the start of the radiotracer administration (Rischka et al., 2018). From these samples whole-blood and plasma activity were measured in a gamma counter (Wizard2, Perkin Elmer). The arterial input function was obtained by linear interpolation of the manual samples to match PET frames and multiplication with the average plasma-to-whole-blood ratio.

      Methods, cerebral metabolic rate of glucose metabolism, page 25, line 561:

      Quantification was carried out with the Patlak plot (t* fixed to 15 min) and the influx constant Ki was converted to CMRGlu as CMRGlu = Ki * Gluplasma / LC * 100 with LC being the lumped constant = 0.89 (Graham et al. 2002, Wienhard 2002).

      1.3) It is surmised that the glutamatergic/GABAergic involvement of these metabolic differences in PCC is from another study, but what mechanism causes the BOLD signal to decrease in both stimuli? This is where the authors have to divulge the biophysical basis of the BOLD response. At the most basic level, the BOLD signal change (dS) can be positive or negative depending on the degree of coupling with changed blood flow (dCBF) and oxidative metabolism (dCMRO2) from resting condition. Unfortunately, neither CBF nor CMRO2 was measured in this study. In the absence of these additional measurements, the authors should at least discuss the basis of the BOLD response with regard to CBF and CMRO2. If we assume that both attention-demanding and working memory tasks decreased BOLD response in PCC in the same way, we have identical dCBF/dCMRO2 in PCC with both tasks, i.e., their results seem to suggest an alteration in aerobic glycolysis with different tasks. With attention-demanding tasks, CMRglc decreases similarly to CMRO2 decreases in PCC, whereas with working memory tasks, CMRglc increases differently from CMRO2 decreases. This suggests PCC may the oxygen to glucose index (OGI=CMRO2/CMRglc) would rise in PCC attention-demanding tasks, but fall in PCC with working memory tasks. This is obviously an implication rather than a conclusion as CBF or CMRO2 were not measured.

      1.4) Given the missing attention that gives rise to the BOLD contrast mechanism, it is almost necessary to discuss the biophysical basis of BOLD contrast and specifically how metabolic changes have been linked to both increases and decreases in neuronal activity in the past. Although this type of work has largely been conducted in animal models, it seems that this topic needs to be discussed as well.

      We would like to thank the reviewer for sharing these insightful ideas and for bringing up these aspects that indeed appear to be essential for the manuscript. Since the points 1.3. and 1.4 complement each other, we have combined them and created a shared response. To fully address the points, the following paragraphs were added to the manuscript.

      Discussion, page 15, line 310:

      Metabolic and neurophysiological considerations effects

      The distinct relationships between BOLD and CMRGlu signals that emerge during specific tasks highlight the different physiological processes contributing to neuronal activation of cognitive processing (Goyal and Snyder, 2021; Singh, 2012). While CMRGlu measured by fPET provides an absolute indicator for glucose consumption, the BOLD signal reflects deoxyhemoglobin concentration, which depends on various factors, such as cerebral blood flow (CBF), cerebral blood volume (CBV) and the cerebral metabolic rate of oxygen (CMRO2) (Goense et al., 2016). In simple terms, the BOLD signal relates to the ratio of ∆CBF/∆CMRO2. Assuming that the observed BOLD decreases during Tetris® and WM emerge from the same mechanisms, this would result in a comparable ∆CBF/∆CMRO2 in the pmDMN for both tasks. Given that these types of tasks (external attention and cognitive control) elicit a reduction in CBF in the pmDMN (Shulman 97, Zou 2011), CMRO2 also decreases albeit to a lesser extent (Raichle 2001). Therefore, the respective metabolic processes can be described by their oxygen-to-glucose index (OGI), the ratio of CMRO2/CMRGlu. Accordingly, our results suggest two distinct pathways underlying BOLD deactivations in the pmDMN that differ regarding their OGI. During Tetris® there is a BOLD deactivation with a high OGI, resulting from a larger decrease in CMRGlu than CMRO2. This metabolically inactive state is in line with electrophysiological recordings in humans (Fox et al., 2018) and in non-human primates showing a decrease of neuronal activity in the pmDMN that covaries with the degree of exteroceptive vigilance (Shmuel et al., 2006; Bentley et al., 2016; Hayden et al., 2009). Therefore, we suggest that the negative BOLD response during external tasks reflects a reduction of neuronal activity and their respective metabolic demands. On the other hand, the relatively increased CMRGlu without the corresponding surge in CMRO2 hints at another kind of BOLD deactivation with a low OGI in the pmDMN during working memory, indicating energy supply by aerobic glycolysis (Vaishnavi et al., 2010; Blazey et al., 2019). Previous work in non-human primates has indeed suggested a differential coupling of neuronal activity to hemodynamic oxygen supply in this region (Bentley et al., 2016). Furthermore, tonic suppression of PCC neuronal spiking during task performance was punctuated by positive phasic responses (Hayden et al., 2009), which could indicate differences between both tasks also at the level of electrophysiologically measured activity.

      Reviewer #2 (Public Review):

      2.0) This paper provides an important and insightful investigation into patterns of activations that emerge in external task states. The authors use state-of-the-art methods and novel analytic approaches to establish that deactivations in the default mode network during external tasks are driven by activity in brain regions that are important in the current tasks (such as the visual or dorsal attention networks). It will be important in the future to understand whether this is a symmetrical phenomenon by studying this behaviour in states that maximize activity within the default mode network and also drive reductions in networks that are not relevant to these situations.

      We thank the reviewer for the encouraging feedback and the constructive comments on our manuscript. We particularly appreciate the interest in the research and the insightful suggestions for future work.

      Reviewer #3 (Public Review):

      3.0) The authors report a study where, using multiple datasets with [18F]FDG PET bolus + continuous infusion ("functional PET") and BOLD fMRI data, they re-evaluate the metabolic and hemodynamic properties of the default mode network (DMN) in a task-evoked context, with a focus on posteromedial DMN due to its relevance for across-network integration. They show how posterior DMN is differently engaged depending on the chosen task: while visual and motor tasks lead to BOLD deactivations and glucose metabolic decrease, specifically in the dorsal posterior cingulate cortex (PCC) area, working memory tasks produce BOLD deactivations but metabolic increases, specifically in ventral PCC, as shown in their previous paper (Stiernman et al. 2021, https://doi.org/10.1073/pnas.2021913118). This aims to solve the controversies elicited by findings of both increased and decreased glucose consumption in the presence of BOLD deactivation in the DMN.

      Additionally, they show how task-evoked glucose metabolism in posterior DMN seems to be shaped by that of the corresponding task-positive networks, with a positive link with dorsal attention and a negative link with frontoparietal network metabolism. This is explored using a type of directional connectivity analysis called "metabolic connectivity mapping", drawn from their previous work (Riedl et al. 2016, https://doi.org/10.1073/pnas.1513752113; Hahn et al. 2020, https://doi.org/10.7554/eLife.52443). They go on to speculate that concomitant BOLD deactivation and reductions in glucose expense might relate to decreased glutamatergic signaling, while BOLD deactivations accompanied by increased glucose consumption might depend on increased GABAergic neuronal activity.

      This is a relevant topic because it not only shows how the DMN is flexibly engaged in different tasks but also allows us to better understand the complex relationships between BOLD fMRI and [18F]FDG PET signals, which are still not fully characterized to this day. Of course, while in resting state the situation is further complicated by the more uncertain physiological meaning of the resting BOLD signal, task-evoked states are expected to provide a more interpretable intermodal link between metabolism and hemodynamics, due to the known major changes in blood flow, blood volume, and glucose metabolism - which underlie BOLD and [18F]FDG signal changes - in response to neural activation. However, even in task states, there is not always a strong association between the two responses, as previously shown by the authors themselves (Rischka et al. 2018, https://doi.org/10.1016/j.neuroimage.2018.06.079). This is something I think the authors should stress out a little more, as they have previously done (Rischka et al. 2018, https://doi.org/10.1016/j.neuroimage.2018.06.079), both in the introduction and in reference to Figure 1, which shows clear differences between BOLD and [18F]FDG activations/deactivations (e.g., widespread negative responses in the cerebellum for [18F]FDG).

      Overall, the analyses reported in the manuscript are simple and seem mostly sound, drawing from well-established methods in PET and fMRI activation studies, with additional approaches previously developed by some of the authors themselves (e.g., "metabolic connectivity mapping", Riedl et al. 2016, https://doi.org/10.1073/pnas.1513752113). Moreover, a clear strength of the paper is the high number of subjects, at least from a PET perspective, i.e., n = 50 for the Tetris task, plus group averages of previously published data for working memory (Stiernman et al. 2021, https://doi.org/10.1073/pnas.2021913118) and motor tasks (Hahn et al. 2018, https://doi.org/10.1007/s00429-017-1558-0).

      The conclusions are in line with the results, and, though a little speculative, are potentially relevant for further exploration aimed at characterizing the neurotransmitter pathways underlying positive and negative BOLD and [18F]FDG responses. Moreover, the language is sufficiently clear to allow a proper understanding of the aims and the results, as well as the details of the analyses. As a side note, the title should probably be adjusted to "Task-evoked metabolic demands of the posteromedial default mode network are shaped by dorsal attention and frontoparietal control networks", to emphasize that the findings do not necessarily generalize to the resting state.

      In conclusion, I am overall quite positive about this manuscript, which seems to nicely position itself within the existing literature, making some additional contributions.

      We thank the reviewer for the thorough evaluation and the positive feedback on our manuscript, we appreciate the constructive and insightful suggestions. We agree that the differential spatial patterns of activation between the BOLD signal and CMRGlu response require further attention. To address this point in more detail, we have added the following information to the manuscript.

      Introduction, page 5, line 110:

      Studies using simultaneous fPET/fMRI have shown a strong spatial correspondence between the BOLD signal changes and glucose metabolism in several task-positive networks and across various tasks requiring different levels of cognitive engagement (Hahn et al., 2020, 2016; Jamadar et al., 2019; Rischka et al., 2018; Stiernman et al., 2021; Villien et al., 2014). […]. However, also regional differences in activation patterns have been observed previously between these modalities in these and previous studies (Wehrl et al., 2013). Moreover, a dissociation between BOLD changes (negative) and glucose metabolism (positive) has recently been observed even in the same region of the DMN during working memory (Stiernman et al., 2021), namely the posteromedial default mode network (pmDMN).

      Results, caption Figure 1, page 8, line 173

      White clusters represent the intersection of significant CMRGlu and BOLD signal changes, irrespective of direction. Note, that also relevant differences between both imaging parameters can be observed, such as decreased CMRGlu in the cerebellum (in both datasets), without changes in the BOLD signal.

      We appreciate the reviewer’s proposal for the title as it raises awareness that the activation patterns reflect task-specific inference.

      Title:

      Task-evoked metabolic demands of the posteromedial default mode network are shaped by dorsal attention and frontoparietal control networks

      We have limited the discussion of underlying neurotransmitter effects and explicitly mention that these are of speculative nature. For manuscript adaptation on this point, we would like to refer to points 1.1, 1.3, 1.4 that address this topic as well.

    1. Author Response

      Reviewer #1 (Public Review):

      The study tackles the topic of male harm (sexual selection favoring male reproductive strategies that incur a reduction of female fitness) from an interesting angle. The authors put emphasis on using wild-collected populations and studying them within their normal thermal range of reproductive conditions. Where previous studies have used temperature variation as a proxy for stressful environmental change, this approach should instead clarify what can be the role of male harm on female fitness in natural conditions. A minor caveat regarding this point is the fact the polygamy treatment also has a heavily male-biased sex ratio (3:1). The authors argue that this sex ratio is within the range of normal variation in that species, but it is likely that the average is still (1:1) in natural populations and using a male-biased sex ratio could magnify the intensity of male harm. This does not undermine the conclusions regarding the temperature sensitivity of sexual conflict but should be acknowledged.

      The authors find that varying temperature within a range found in natural conditions affects the reproductive interactions between males and females, particularly through male-harm mechanisms. Male harm, measured as a reduction in lifetime reproductive success (LRS) from monogamy to polygamy settings is present at 20C, stronger at 24, and absent or undetectable at 28C. Female senescence is always faster in the polygamy mating systems as compared to monogamy, but the effect appears strongest at 20C. Mating behaviors of males and females in these different settings are used to attempt to uncover underlying mechanisms of the sensitivity of male harm to temperature.

      A weakness of the manuscript in its current form is the lack of clarity about the experimental design, which makes understanding the results a long and involved procedure, even for someone who is familiar with the field. If the authors consider revising the manuscript, I suggest giving a better overview of the experimental design(s) earlier in the manuscript, perhaps supported by a diagram or flowchart. I also suggest structuring the results better to aid the reader (e.g., make clearer distinctions between results that come from the different experiments). Finally, some additional figures and statistical tests corrected for multiple testing would help get a better feel of some aspects of the dataset.

      I believe that the conclusions are generally justified and the results overall convincing. Overall, this is an impressive study with a lot of dimensions to it. Its complexity is a challenge and may require additional effort from the authors to make it easier to access. The core of the question is answered by LRS measures, but the authors have also provided a wealth of behavioral data as well as other fitness components. The manuscript could be greatly improved by putting more effort into linking the different metrics together to track down potential mechanisms for the observed variation in male-harm-induced reduction in female LRS. The discussion would also benefit from considering the female side of the sexual conflict coevolution arms race.

      We are thankful for the nice words and constructive appraisal of our work. As stated above, reviews like this are extraordinarily helpful. The reviewer mentions four main points that we have addressed:

      1. We now expand a bit on the justification to use a (3:1) male-biased sex ratio in the methods section (lines 150-155). We also acknowledge potential limitations of this design in the discussion (lines 563-571).
      2. To clarify the methods, we have placed this section before the results. This, in itself, has significantly improved the clarity of the manuscript. We have also substantially re-written the methods and results (including adding some tables) to streamline the text while providing all the necessary details, and have also included several diagrams to illustrate all our experiments (in the SM, see Figs. S1.1 to S1.5) along with a general schematic figure of the general design that we present early on in the main text (in the introduction, see Fig. 1).
      3. As suggested, we have re-run all analyses using the Benjamini-Hochberg procedure in order to correct for inflation of type I error rate due to multiple testing. We have also included in the SM a complementary set of models that also test for this via post hoc Tukey contrasts. Both these approached corroborate our initial findings, and thus contribute to strengthen our results.
      4. We now explicitly discuss the female side of things in the discussion (lines 636-647).

      Reviewer #2 (Public Review):

      Londoño-Nieto et al. investigated the influence of temperature on the form and intensity of sexual conflict in Drosophila melanogaster. They aimed to test the effect of naturally occurring temperature fluctuations on a wild population of Drosophila while disentangling pre- and postcopulatory episodes of sexual conflict. To this end, they exposed females to males under monogamy or polyandry, hence manipulating the degree of male harm experienced by females. The effect of temperature was explored by exposing these groups to 20, 24, or 28{degree sign}C. They found that female fitness suffered from male harm most at 24{degree sign}C and less at the other two temperatures. Interestingly, pre- and postcopulatory episodes of sexual conflict were affected differently by temperature. Overall, these data suggest that the relationship between sexual conflict and temperature can be strong and complex. Hence, these results can have important implications for the impact of sexual conflict on population viability, especially in light of the climate crisis.

      We want to thank the reviewer for the time invested in reading and reviewing our work. We are glad to read that the reviewer found our results interesting and considered our study to be of importance to the field.

      This paper tackles a highly relevant question using an established model organism for sexual conflict and contains a rich dataset obtained using a series of carefully planned experiments and analysed in an appropriate way. Importantly, the authors used biologically meaningful temperatures and mating treatments, which increases the relevance of the data. The main conclusions are well supported by the data. Nevertheless, the devil is in the detail, and given the way the authors frame their study (i.e. testing a natural population under naturally occurring temperature fluctuations) and their results (i.e. sexual conflict is buffered by temperature effects in the wild) there are some limitations to be considered:

      We appreciate the positive feedback! The reviewer identified potential limitations and made good suggestions that have only served to improve our manuscript considerably, for which we are very grateful. Details follow on how we have dealt with each specific comment.

      1) The authors frame their study as addressing the question of how sexual conflict reacts to naturally occurring temperature fluctuations in the wild. Nevertheless, the population used in this experiment had been kept for nearly 3 years in the laboratory prior to the experiment. Importantly, the authors ensured that the laboratory population maintained genetic diversity, by regularly crossing wild lines into it. Nevertheless, this population remained for some time in the laboratory under standardized conditions. The applied temperature fluctuations are in a biologically meaningful range (though only during the reproductive season), but it remains unclear if the applied fluctuations were in a standardized way (i.e. pre-programmed) or included random fluctuations (i.e. a more natural setting). This laboratory setup has certainly clear advantages, for example, it enables the exclusion of any effects other than the temperature on sexual conflict. Nevertheless, how these will then ultimately play out in the wild could be a different story.

      Agree. We clarify now that we meant pre-programmed fluctuations and acknowledge this limitation in the methods (lines 124-131).

      2) The authors highlight clearly that temperature fluctuations in the wild might play an important part in how sexual conflict plays out in natural populations. This very interesting and highly relevant point might lead the reader to assume that this is what was actually tested in the experiment. Nevertheless, in the experiments, different constant temperatures were applied to the flies, while only the stock population was kept at a fluctuating temperature regime. Hence, the influence of fluctuations during episodes of sexual conflict remains untested. While the present data show that sexual conflict can be modulated by temperature, the effect of naturally occurring fluctuations on the net cost of sexual conflict to a population remains unclear.

      Again, a fair point that we acknowledge in the current version (lines 571-575). “Second, our treatment temperatures were stable, designed to study how coarse-grain changes in temperature across the adult lifespan of flies may influence how sexual conflict unfolds in nature. Thus, future studies will need to encompass how fine-grained fluctuation (i.e., repeated variation of temperature across an adult’s lifespan) may affect male harm for a more comprehensive picture of temperature effects on sexual conflict in the wild”.

      3) The authors conclude that the effect of sexual conflict can be buffered by temperature in the wild. In general, I agree with this, although a more conservative way of framing this would be to say that temperature modulates or moderates sexual conflict instead of buffers it. If there really is a buffering effect of temperature in the wild remains to be tested, I believe. This will depend on how actual changes in temperature affect this dynamic (see point 2). In addition, I think another interesting open question is what the mechanism behind the observed differences might be. Are male and female interests really more aligned at different temperatures (i.e. males plastically reduce harm)? This would really buffer the harm of sexual conflict at those temperatures. Nevertheless, alternatively, males might not be perfectly adapted to manipulate the female optimally at lower or higher temperatures. This would mean that if the temperatures change, males might evolve to increase the manipulation of females, and hence the scope for sexual conflict might not change in the end under this scenario. Nevertheless, as the authors themselves state: 'An intriguing possibility is thus that SFPs are more effective at lowering female re-mating rates at warm temperatures, thereby buffering these costs.' Therefore, a temperature-dependent increase in the effectiveness of male manipulation might counterintuitively reduce sexual conflict in this species.

      We echo both points in the current version of the paper (see lines 633-655).

      4) In the end the authors argue that the climate crisis might have 'unexpected positive consequences via its effect on male harm'. Sexual conflict is indeed widespread, but it takes many different forms (as has been nicely described in the introduction of this paper). Because the studied system seems to be quite a specific example, it is questionable how far spread this phenomenon is in nature. In addition, it remains unclear how male harm will evolve in response to the climate crisis (see point 3). Finally, the relative fitness of females increased in the present experiment, as the tested range was within the reproductive optimum of the species. Nevertheless, the relative importance of the positive effect of sexual conflict on fitness outside of optimal temperatures seems questionable.

      Agree. Altogether, we have tried to tone down our conclusions regarding the implication of our results for a climate change scenario, and acknowledge all the points highlighted by the reviewer in the current version of the manuscript (see lines 563-575).

      Nonetheless, I believe these results to be of exceeding interest to the scientific community and of importance to the field. It opens up many potential research directions and adds further data to the fascinating field of sexual conflict, SFPs, and male harm in Drosophila.

      We are thrilled to read that the reviewer found our study of exceeding interest.

      Reviewer #3 (Public Review):

      In this paper, the authors explore the effects of the environment, specifically temperature, on male harm to females. Male harm is the phenomenon where males reduce female fitness in polyandrous systems, where a single female may mate with multiple males. The selection of males to increase their reproductive success in male-male competition can lead to genetic conflict that increases male fitness at the expense of female fitness. Typically, male harm has been studied in single environments under optimal conditions. However, there is an increasing focus on the effect of the environment on fitness costs of male harm to females, as a way to better understand the effect of male harm on population fitness in more realistic ecological contexts. In this paper, the authors add to these studies by exploring the effect of temperature on male harm and female fitness, using the fruit fly Drosophila melanogaster, as a model system. They find that temperature affects the impact of male harm on female fitness, with male harm having the greatest effect at 24˚C relative to 20˚C and 28˚C. The authors then go on to disentangle how temperature affects the various components of male harm that impact female fitness (e.g. harassment, ejaculate toxicity). The paper demonstrates that male harm depends on ecological context, which has implications for understanding its impact on population fitness under realistic ecological scenarios, particularly with respect to climate change.

      The strength of the paper is that it demonstrates that male harm (presented as differences in female life reproductive success between monogamous and polyandrous matings) changes with temperature. The authors dissect this general observation by showing that different aspects of precopulatory reproductive behavior, for example, male-male aggression, copulation rate, and female rejection rate, also change with temperature. Further, they demonstrate that correlates for male ejaculate quality also change with temperature, suggesting that temperature also affects postcopulatory mechanisms of male harm.

      The weakness of the paper is that the method and results section are difficult to follow, which negatively impacts the interpretation of the data. The experiments are complex and need to be for what the authors are studying. Nevertheless, the paper is written in a way that makes it challenging for the reader to fully understand how precisely the experiments were conducted. Further, the authors do not explain clearly how some of the experiments relate to the phenomenon ostensibly being assayed. For example, a more detailed explanation of why mating duration and remating latency are assays for ejaculate quality in the context of sperm competition would be very helpful in interpreting the data. Further, a clearer explanation of the statistical analyses conducted

      Thank you for the positive, detailed and constructive review. We agree with all the weaknesses laid out and we have strived to address all of them in the current version. This includes a mayor rearrangement, structuring and re-write of the methods and results section and extra statistical analyses. Please find the details below.

    1. he looming presence of climate change, as a kind of techno-social disaster that has already begun and which will inundate the next couple of centuries as somekind of overdetermining factor, no matter what we do

      I think this statement highlights the severity and urgency of the threat posed by climate change. While it may create a sense of despair or helplessness, it can also serve as a call to action for individuals and societies to take concrete steps towards reducing our impact on the planet and mitigating the impacts of climate change.

    1. Author Response

      Reviewer #1 (Public Review):

      The authors have used computational models and protein design to enhance antibody binding, which should have broad applications pending a few additional controls. The authors' new method could have a broad and immediate impact on a variety of diagnostic procedures that use antibodies as sensitivity is often an issue in these kinds of experiments and the sensitivity enhancement achieved in the two test cases is substantial. Affinity maturation is a viable approach, but it is laborious and expensive. If the catenation method is generalizable, it will open up opportunities for antibody optimization for cases where affinity maturation is either not feasible or otherwise impractical. Less clear is how this method might enhance therapeutic potency. Issues that arise when using therapeutic antibodies are often multifactorial and vary depending on the target and disease state. Many issues that occur with antibody-based therapies will not be rectified with affinity enhancement.

      We agree with the limitation.

      Reviewer #2 (Public Review):

      The paper presents an interesting design approach to having homodimeric IgGs with higher binding affinity to the antigens on a surface by fusing a weakly homodimerizing protein (a catenator) to the C-terminus of IgG. Considering the homodimeric IgGs with likely enhanced antigen binding ability and their stabilization with a reversible catenation when bound to the surface is an interesting idea. With agent-based modeling - the simulations based on Markov Chain Monte Carlo (MCMC) sampling - and proof of concept experiments, it has been possible to show the enhanced antigen binding ability of the homodimer Igs for many folds, where the weakly homodimerizing ability of the catenator is indicated to have a central role, enabling proximity effect driven catenation on the antigen bound surfaces. While the results render the enhanced binding affinity of the catenated homodimeric IgGs, the study would benefit from a more elaborated interpretation and discussions of the results.

      The following discussion is now stated in the revision (pages 19-20, in the revision); “While we demonstrated that dual catenator-fused heterodimeric IgGs can enhance binding avidity, the oligomer formation or potential intramolecular homodimerization of the catenator necessitates the development of a more robust catenator for application to conventional homodimeric IgGs. Specifically, the ideal catenator should geometrically disallow intramolecular homodimerization, exhibit fast association kinetics, and be able to withstand the standard low pH purification step. On the other hand, our demonstration indicates that this approach can be applied to bispecific antibodies employing a heterodimeric Fc.”

      One interesting base of the discussion may include how the fusion of the catenator may likely affect the binding behavior, the intrinsic binding behavior, and/or on the global structural changes, of IgGs (monomeric and homodimeric (catenated) per se beyond its proximity-driven contribution. Would it lead to a more restricted structure in the mobility in the unbound states so as to decrease the entropic cost for the binding and thus increase the binding avidity/affinity (in addition to external proximity-driven association). In other words, what would be the role of entropy in the free energy of binding, given that the enthalpic contributions remain the same? Possible effects of the length of the catenator should also in parts be related to the entropy. For example, if a longer and more flexible catenator is considered, what would the resulting observation experimentally and computationally be?

      The binding site occupancy depends on [catAb]/KD. Figure 4-figure supplement 2 shows the binding site occupancy and (KD)eff as a function of (KD)catenator. In this simulation, [catAb] was fixed (10-9 M) while KD was varied (from 10-8 to 10-6). In the figure legend and in the main text, we now explicitly state that KD was varied from 10-8 to 10-6 (page 30, in the revision). To address this comment, we set KD = 10 nM (as used for simulation in Figures 3 and 4), and varied [catAb] from 0.1 to 10 nM. The binding site occupancy and (KD)eff as a function of [catAb] are plotted for three different set values of (KD)catenator (1 μM, 10 μM and 100 μM). The new figures are now presented as Figure 4-figure supplement 3. This simulation shows that the enhancement of (KD)eff by increasing the concentration of catAb is much less dramatic than that by increasing the affinity for catenator homodimerization at [catAb] > 10 nM.

      On the other side, simple simulation approaches have a high value with a level of abstraction while still keeping the physical and biological relevance. In the simulations, i.e. in the sampling of various states, three main terms/rules to govern the behavior are implemented. One is a term favoring an increase in the ability to bind (preventing to unbinding) to the surface upon the catenation of IgGs. This may need to be substantiated for the simulations not imposing a preassumed ability to increase the binding (or decrease the unbinding) ability upon the catenation.

      We agree with the review in that the third rule favors the binding ability of catenated IgGs, because it assumes that catenated antibodies are not allowed to dissociate from the binding site. While this assumption is not exactly correct, we think that it is valid, considering the behavior of a multivalent ligand. When the IgG portion dissociates completely from the binding site, it is still anchored by the catenation arm, and thus it will rebind the same binding site immediately. This postulation agrees with the quantitative analysis showing that multivalent ligand exhibits orders of magnitude binding likelihood increase when the ligand size is comparable to the stretch length of a conjugating linker [Liese, S. & Netz, R. R., ACS Nano, 12, 4140 (2018)].

      The weakly homodimerizing state of the catenator appears as one of the important aspects of the proposed design strategy. Would it also be possible that the experimental observations may readily also imply the higher binding ability of the catenator fused IfgG without the homodimerization on the surface (due to the reduced entropic cost for the binding)? The presentation of the evidence of the homodimerization of the catenator and the catenated IgGs on the surface would strengthen the findings and discussions.

      To fully address this comment, we would need to consider the detailed molecular behavior of the IgG part, the catenator and the linker, probably using molecular dynamics simulation, which we think is outside the scope of the current work. We like to qualitatively describe what we think about the raised issues. Fused to the C-terminus of Fc, the catenator won’t affect the complementary determining region (CDR) of Fab which is located on the opposite side of the C-terminus of Fc. This notion is supported by the observation that the SDF-1α-fused antibodies exhibited association kinetics similar to those of the mother antibodies (Figure 5).

      Regarding the mobility of the structure, we presume that the fused catenator would not interact with the antibody portion and thus it would not affect the intrinsic structural mobility of the antibody.

      Since the catenator is fused to the C-terminus of Fc by a flexible linker, the homodimerization of catenator would decrease the entropy upon catenation. However, the enthalpic contribution would overcome the entropic loss, and result in negative free energy of the catenator homodimerization.

      Figure 2-figure supplement 1 (in the revision) shows the simulation for five different values of the reach length (R), which is the sum of the linker length and half of the catenator length. The simulation results show that the likelihood of catenation decreases as the linker length increases over the distance (d) between the two adjacent catAb-2Ag complexes, while it is maximum when the reach length equals d. Since the catenator length is fixed, increasing the linker length (such that R > d) will lower the catenation effect.

      Reviewer #3 (Public Review):

      The authors proposed an antibody catenation strategy by fusing a homodimeric protein (catenator) to the C-terminus of IgG heavy chain and hypothesized that the catenated IgGs would enhance their overall antigen-binding strength (avidity) compared to individual IgGs. The thermodynamic simulations supported the hypothesis and indicated that the fold enhancement in antibody-antigen binding depended on the density of the antigen. The authors tested a catenator candidate, stromal cell-derived factor 1α (SDF-1α), on two purposely weakened antibodies, Trastuzumab(N30A/H91A), a weakened variant of the clinically used anti-HER2 antibody Trastuzumab, and glCV30, the germline version of a neutralizing antibody CV30 against SARS-CoV-2. Measured by a binding assay, the catenator-fused antibodies enhanced the two weak antibody-antigen binding by hundreds and thousands of folds, largely through slowing down the dissociation of the antibody-antigen interaction. Thus, the experimental data supported the catenation strategy and provided proof-of-concept for the enhanced overall antibody-antigen binding strength. Depending on specific applications, an enhanced antibody-antigen binding strength may improve an antibody's diagnostic sensitivity or therapeutic efficacy, thus holding clinical potential.

      Thanks for the favorable comments.

    1. “Our lessons, units, and courses should be logically inferred from the results sought, not derived from the methods, books, and activities with which we are most comfortable. Curriculum should lay out the most effective ways of achieving specific results… in short, the best designs derive backward from the learnings sought.”

      I'm actually a little bit surprised that this was a revolutionary idea - or that it had to be intentionally staked out as a new school of thought in learning design, where the roles of the learning objective and assessment are so foundational. I suppose tradition and inertia play a role here - certain topics have always been taught using specific instructional activities, and those learning activities are treated as a given by instructors and designers, even if they do not always lend themselves to observable and measurable assessments. I think of the role of the essay in humanities courses - where essays are treated as the established learning activity because of academic traditions, when we may find there are more effective ways to teach and assess a topic if we worked backwards from the learning objectives to find the best way for learners to demonstrate mastery. (I suspect the essay still wins out in many cases, but it's still worth interrogating).

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank all reviewers for their comments and suggestions. The revised manuscript included new experiments they suggested and extensive text edits. Our point-by-point response is shown in bold.

      Point-by-point description of the revisions

      —----------------------------------------------------------------------------------------------------------------

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary

      In this manuscript, Blank et al. propose a link between cell-cycle dependent changes in metabolic flux and corresponding changes in TORC1 activity in yeast cells. Based on their findings, the authors propose that Bat1-dependent leucine synthesis from glucose increases as cells progress through G1 and that this activates TORC1 to drive cell cycle progression. Although the existence of cell-cycle dependent synthesis of leucine is a novel and exciting finding, several aspects of the proposed model are not sufficiently supported by experimental evidence, in particular the fact that the increase in Leu synthesis is causing the increase in TORC1 activity in late G1.

      Major comments:

      1. To show that the increase in Leu biosynthesis in S-phase is activating TOR, one would ideally want to blunt this increase in biosynthesis and assay TORC1 activity. Admittedly, this is difficult. So, instead, the authors study bat1- cells which have strongly impaired synthesis of BCAA including Leucine. The relevance of these bat1- cells to the proposed cell-cycle dependent model, however, is questionable for two reasons: 1) Although the authors state that "exogenous supplementation of BCAAs in all combinations suppressed the growth defect of bat1- cells, especially when valine was present", the spot assays in Figure 3 show visible rescues only when valine is present either alone or in combination, while supplementation of leucine or isoleucine does not seem to have any effect. Hence it appears that the bat1- phenotype is mainly due to limiting valine levels, not leucine levels. 2) The relevance of these results for understanding TORC1 regulation are questionable, since valine does not typically activate TORC1. Does addition of Leu to bat1- cells increase TORC1 activity ? RESPONSE: The reviewer’s comments were very valuable. We performed the suggested experiments (adding not only Leu but also Ile and Val) to bat1 cells and measuring phosphorylation of Rps6 (see new Figure 4D) and the DNA content of those cells (see new Figure 3C). We found that Leu weakly promotes cell cycle progression, compared to the addition of Val, which also leads to pronounced activation of TORC1 (>10-fold activation; see Figure 4D). We discuss these findings in the revised text.

      We also note, as published by others and now discussed in the text, that in WT cells, exogenous addition of Leu (or any other BCAA) does not lead to sustained activation of TORC1 (see new Figure 4D). This is not surprising. As reported by the Hall lab (see PMID: 25063813, which we now cite), the Gtr-dependent activation of TORC1 by BCAAs mentioned by the reviewer is very transient. Hence, our new data, showing sustained TORC1 activation and cell cycle effects upon Val addition in bat1 cells, is exciting. They argue that bat1 cells serve as a highly sensitized background of low TORC1 activity, enabling the display of effects that are difficult to measure in WT cells.

      TORC1 activity is known to depend on steady-state leucine concentrations in the cell rather than on leucine flux. Although the authors observe that the synthesis rate of leucine increases during G1 progression, this does not necessarily translate into increased leucine concentrations in the cell. To support the claim that the increase in TORC1 activity during G1 progression depends on leucine, the authors would need to show that, not only leucine synthesis, but also overall leucine levels in the cell increase during G1 progression.

      RESPONSE: We did this experiment and now report the data (see new Figure EV2), using the Edman degradation-based assay. We found that changes in the steady-state levels of BCAAs had a similar pattern, and those changes were most significant for valine (rising 30-40% from late G1 to G2/M). Nonetheless, we note also that the kinetics of amino acid synthesis measured by our isotope tracing experiment need not match the steady-state levels of amino acids. Steady-state levels are affected by a multitude of parameters, only one of which is the rate of synthesis, as we now discuss in detail in the manuscript.

      To test whether the increase in Leu biosynthesis in S-phase activates TORC1, a few different approaches could be tested: 1) Since leucine activates TORC1 through the Gtr proteins, the authors could test whether rendering TORC1 resistant to low leucine through expression of constitutively active Gtrs abolishes the cell-cycle dependence in TORC1 activity. 2) Leu could be added to the medium of wildtype cells in G1 to the amount necessary to cause an increase in intracellular Leu levels similar to those seen in S-phase to test whether this increases TORC1 activity.

      RESPONSE: We did the suggested experiments, which are now shown in the new Figure 5. Leucine and valine accelerated the rise in TORC1 activity in G1. However, there were no noticeable downstream consequences in the kinetics of cell cycle progression. As we discuss in the text:

      “A small acceleration of the rise in the levels of phosphorylated Rps6 was evident in both the leucine- and valine supplemented cells (Figure 5A,B). Nonetheless, there were no noticeable downstream consequences in the kinetics of cell cycle progression, in either the rate the cells increased in size or their critical size (Figure 5A; see values above the corresponding blots), consistent with the notion that TORC1 activity already is at a maximal level in these conditions…”

      In Fig 2B one sees that Leu biosynthesis peaks at 150min and then drops again. The p-RpS6 blot in Fig. 5D, however, only goes up to 140 min and shows that TORC1 activity increases up to 140 min, but it doesn't show timepoints beyond 150 min when Leu biosynthesis drops again, and hence one would expect TORC1 activity to drop. If TORC1 activity were to drop from 150min onwards, this would strengthen the correlation between Leu biosynthesis and TORC1 activity.

      RESPONSE: The reason for the drop in Figure 2 is trivial and does not affect the interpretation. As seen in Figure 1 (the experiment from which the data in Figure 2 are shown), by 180 min, the cells were entering a new cell cycle, evidenced by a reduction in cell size (Figure 1B) and in the fraction of budded cells (Figure 2B). At that point, there is a mix of mothers and daughters with very poor synchrony, making it impossible to conclude much about the drop in Leu synthesis (i.e., does it arise from the lack of new synthesis in mothers, daughters, or both?). In the experiment in Figure 5, the reviewer mentions (now those figures have moved to File S8 because we added more experiments in the figure) the experiment terminated when peak budding was reached, which was 140 min, within one cell cycle. Lastly, it is important to stress that every elutriation experiment is different. While the times are close, comparing various experiments on a time basis alone is inaccurate. Instead, the metric used in the field to compare different experiments is usually cell size, which we use in all other Figures except Figure 1 because, in that case, the experiment was a time-based, pulse-chase one.

      Minor concerns:

      1. In Figure EV4, the authors should highlight some of the metabolites that are significantly changed, in particular the BCAA. The figure is not very informative as currently presented. __RESPONSE: We have now labeled the BCAAs, and a few more metabolites as suggested (note the Figure is now EV5). __

      Fig 2 - are "expressed ratios" the best term for metabolite levels? Unlike genes, where such heat maps are often used, the metabolites are not 'expressed'. How about 'relative metabolite level' instead?

      RESPONSE: Good point. The axis now reads “relative abundance”.

      Page 8: "We also measured the MID values from the media of the same cultures used to prepare the cell extracts." Where are these data? We don't see them in File S2?

      RESPONSE: The data are in File S2 (there are many ‘sheets’ in the file). In sheets 3,4 are the MID values and the analysis from metabolites in the media.

      Fig 4B - the x-axis labeling is missing for the bat1- cells

      RESPONSE: Corrected. Note that new DNA content measurements are now shown in Figure 3C.

      Although the authors state repeatedly that they show "for the first time in any system" that TORC1 activity is dynamic in the cell cycle, similar observations have already been made before, for instance showing high mTORC1 activity in the G1/S transition in the Drosophila wing disc or low mTORC1 activity during mitosis in mammalian cells (see PMIDs 28829944, 28829945, and 31733992). The text should be amended accordingly.

      RESPONSE: Thank you. Corrected.

      There are two entries for valine in File S1/Sheet8. Why?

      RESPONSE: The reason is that they were detected in both analytical pipelines (primary metabolites and biogenic amines; primary metabolites were measured with GC-TOF MS, while biogenic amines with HILIC-QTOF MS/MS), which were combined in the Table. We did not describe it adequately in the previous version. We do now, in the Methods. We also note that the raw data from each method are shown in the corresponding supplemental files. We combined them in the Table used in the Figure for display purposes. We also note that the amino acids were also measured by another method (PTH-based HPLC). Hopefully, the new edits in the Methods clarify these points.

      Reviewer #1 (Significance (Required)):

      Significance

      Despite the well-known effects of pharmacological or genetic manipulations of TORC1/mTORC1 on cell cycle progression, whether and how mTORC1 activity itself is physiologically coupled to cell cycle progression is still an insufficiently studied aspect. Hence this study provides an interesting link between cell-cycle dependent regulation of amino acid biosynthesis and TORC1 regulation. Importantly, the results of this study rely on centrifugal elutriation to obtain cell cycle synchronization, thus ruling out potential metabolic artifacts due to pharmacological methods. The observed changes in metabolic flux are therefore likely genuine and represent the major strength of the study. The major limitation is the lack of strong evidence supporting the notion that the increase in Leu biosynthesis at late G1 or S-phase is causing the increase in TORC1 activity.

      The major advance is conceptual - that amino acid biosynthesis rates are cell-cycle dependent.

      These results will be of interest to a broad audience of people studying the cell cycle, cell growth, TORC1 activity, cell metabolism and cancer.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      This paper provides evidence that branched chain amino acid (BCAA) in the G1 phase of the cell cycle, fueled by pyruvate generated by glucose catabolism activates cell growth and allows cells to reach the critical size required for entry into S phase by activation of TORC1 signaling. Previous work had indicated that Leucine supplementation of a bat1 bat2 mutant, lacking both enzymes that catalyze BCAA from the alpha-keto acid precursors and starved on minimal medium, led to TORC1 activation. This work is significant in suggesting that BCAA synthesis from glucose is responsible for a cyclic activation of TORC1 necessary for a normal rate of cell growth in the G1 phase of the cell cycle.

      The study employs metabolic flux analysis of metabolites derived from glucose following a pulse-chase with different isotopes of glucose in synchronized early G1 cells (obtained by elutriation) throughout one cell cycle. They claim that the only compelling changes in metabolites observed as the cell cycle proceeds was a decline in pyruvate containing only one heavy 13C carbon atom and a corresponding increase in Leu (M6) with 6 heavy carbon atoms, which is interpreted to indicate Leu synthesis from pyruvate that begins in early G1 and peaks at mitosis. They show that a bat1 mutant exhibits a slow-growth phenotype that can be mitigated only by valine (although they infer similar effects for Leu and Ile that I find unconvincing) and they observed reductions in all three BCAAs in different experiments that measure steady amino acid levels in different ways (although the results are compelling only for Val). They go on to show evidence that the bat1 mutation reduces birth and mean cell size and leads to an increased proportion of G1 cells in asynchronous cultures, and they claim that bat1 cells take much longer than WT to achieve the same size found when a synchronized WT culture reaches 50% budding (although they don't show the data for this last point.) Interestingly, they find that deleting BAT1 suppresses sensitivity to the TORC1 inhibitor rapamycin (Rap), consistent with the idea that the bat1 mutation impairs TORC1 activity in the same manner as Rap and that BCAA are required to activate TORC1 in WT cells to the level that can be impaired by Rap, as summarized in the model in Fig. 5F. Consistent with this, they present evidence that the bat1 mutation reduces TORC1 signaling as judged by diminished Rps6 phosphorylation (although it was not shown that this effect could be reversed by Val addition). They also show that TORC1 signaling/Rps6-P increases as the cell cycle progresses using elutriated early G1 cells, suggesting that TORC1 activity is periodic in the cell cycle (although they don't establish this periodicity through a second cell cycle).

      General critique:

      The conclusion that BCAA synthesis from glucose is responsible for a cyclic activation of TORC1 necessary for a normal rate of cell growth in the G1 phase of the cell cycle is potentially of considerable significance. There are however a number of puzzling aspects of the data that seem to weaken this conclusion. As described in greater detail below, it is difficult to explain why only Leu is synthesized from glucose during the cell cycle, and why only Val shows a marked reduction in the bat1 mutant that appears to be responsible for the slow-growth phenotype. In addition, there are important controls lacking of showing that a Val supplement can suppress the G1 delay and reduction in TORC1 signaling in the bat1 mutant. In addition, the evidence that TORC1 activity is periodic in the cell cycle is lacking and it needs to be shown that Rps6-P levels are periodic through at least a second cell cycle.

      Major comments:

      -Why don't they observe synthesis of Ile and particularly Val in the metabolic flux experiment of Fig. 1, especially considering that only Val appears to be critically required for normal cell growth in the bat1 mutant based on the results in Fig. 3B?

      RESPONSE: We now show the actual plots and the errors of all the measurements in Figure 2 (instead of a heatmap we had shown before). Valine (M5) levels show a very similar trend to leucine (M6). The variance in the measurements was higher, though, and statistically, the valine changes were less significant. Hence, it was more appropriate to highlight the leucine changes. Lastly, the new DNA content data (Figure 3C) show an effect upon the addition of leucine, albeit less significant than that of valine addition.

      -The data in Fig. 3B do not show a convincing increase in growth of the bat1 mutant with addition of Leu and Ile; and the stimulation by Val alone seems identical to that seen with Val in combination with Leu and Ile. Thus, it appears that the slow-growth of the bat1 mutant results only from reduced Val levels, not all 3 BCAAs, which is at odds with their interpretation of the data.

      __RESPONSE. As mentioned above, the effect of valine is more pronounced than leucine's, but leucine does have consequences, best shown in the DNA content analysis (new Figure 3C). We also note that valine alone is insufficient to suppress the growth and cell cycle defects of bat1 cells. The latest data we have added (see Figures 3 and 5) are consistent with the interpretation that at least some de novo synthesis of BCAAs in the cell may be needed, explaining why exogenous BCAAs, including valine, are unable to correct the defects of bat1 cells fully. __

      -they claim to see reductions in all three BCAAs in the bat1 mutant; however, no significant reduction was found for Leu in Fig. EV3, and only Val was altered by the 1.5-fold cut-off imposed on the MS metabolomics data in Fig. EV4 (which could be appreciated only by an in-depth examination of the supplementary data in File S1-the Val, Leu, and Ile dots should be labeled in Fig. EV4). In addition, the reductions in Ala and Gly showin in Fig. EV3 were not found in the MS analysis of Fig. EV4. It needs to be acknowledged that the metabolomics data show a marked reduction in the bat1 mutant only for Valine with little or no change in Leucine levels. This result is difficult to explain with the simple models shown in Fig. 3A and 5F, which requires additional comment. The authors should acknowledge the much greater effect of the bat1 mutation on Val levels versus Leu and Ile, revealed both by measuring the levels of BCAAs in the mutant and comparing the BCAAs for rescuing the slow-growth of the mutant, and explain how this can be reconciled with the results in Fig. 2 where only Leu and not Val or Ile synthesis was detected.

      __RESPONSE. The perceived discrepancy in the steady-state measurements could easily arise from the different analytical methods used in each case. The differences are less substantial than the reviewer implies. For steady-state measurements in BAT1 vs. bat1 cells, we used the PTH-based method (which only detects amino acids) and two different MS-based pipelines (which detect various metabolites). From the MS-based analyses, the drop for all BCAAs was statistically significant. Although the magnitude of the drop was greater for valine (about 60% for valine vs. ~30% for isoleucine and leucine). Why is this a problem? __

      As for the valine changes in the isotope tracing experiments, as we mentioned above, the trend for valine (M5) was similar to that of leucine (M6) (now, hopefully, that data is shown better in Figure 2). Furthermore, as we commented above (see response to Reviewer 1) and now stated in the text, our isotope tracing experiments measure only the rate of synthesis, which need not match the steady-state abundances. The latter are affected by a multitude of variables, including the turnover of proteins and amino acids, not to mention their partition into distinct intracellular pools.

      __Lastly, please note that we have now added PTH-based measurements of amino acid levels in the cell cycle of wild type cells (new Figure EV2). As mentioned in our response to Reviewer 1, we found that changes in the steady-state levels of BCAAs had a similar pattern, and those changes were most significant for valine (rising 30-40% from late G1 to G2/M). __

      -They need to add the data indicating that the bat1 mutant requires longer than WT cells to reach the ~35 fL volume at which 50% of WT cells are budded.

      __RESPONSE: We added all that data (new Figure EV6) and discussed it better in the text. Note that our elutriation analyses allow accurate estimates of the G1 duration, which is at least 2x longer in bat1 vs. BAT1 cells. __

      -It seems important to show that Val supplementation can suppress the overabundance of G1 cells in bat1 mutant cells shown in Fig. 4C; and can restore sensitivity to Rap and Rps6-P accumulation in bat1 mutant cells (in Fig.s 5A & B).

      __RESPONSE: Excellent suggestions. We now present the requested experiments. The DNA content data are in Figure 3C, and the phospho-Rps6 data in the new Figure 4D are discussed in the text. Briefly, exogenous valine, and to a lesser extent leucine, suppressed the G1 accumulation, but not to wild type levels. Exogenous valine also substantially increased TORC1 activity (>10-fold). __

      -It seems important to show that Rps6-P will decline in M phase and increase during a second cell cycle to establish that TORC1 activity actually fluctuates in the cell cycle instead of just by reduced by the manipulations involved in collecting young G1 cells by elutriation.

      RESPONSE: The second cycle comment is not pertinent to our elutriation setup. The two-cycle approach should be used in arrest-and-release synchronizations to minimize arrest-related artifacts when cells continue to grow in size. This is why we used elutriation in the first place, as described in the text, to avoid such artifacts. In elutriations it is the first cycle, exclusively of daughter cells, that can be meaningfully scored. After that, the cells lose synchrony very fast because you have mothers (which grow in size very little) and daughters (which need to double in size until mitosis). Hence, the second cycle will be meaningless and impossible to interpret.

      Reviewer #2 (Significance (Required)):

      General Assessment:

      Strengths: Evidence for BCAA biosynthesis from glucose in the G1 phase of the cell cycle, and evidence obtained from analyzing the bat1 mutant that BCAA synthesis underlies activation of TORC1 early in the cell cycle in a manner required to achieve the critical cell size necessary for G1 to S transition.

      Weaknesses: Lack of evidence for Val biosynthesis in G1 despite evidence that Val limitation is more crucial than Leu limitation in the bat1 mutant; lack of confirmation that Val limitation underlies the delayed G1-S transition and reduced TORC1 signaling in the bat1 mutant; and lack of compelling evidence that TORC1 activity is periodic in WT cells.

      Advance: This would be the first evidence that TORC1 activity varies through the cell cycle in a manner controlled by synthesis of BCAAs

      Audience: This advance would be of great interest to a wide range of workers studying how the cell cycle is regulated and the role of TORC1 in controlling cell growth and division in normal cells and in human disease.

      My expertise: Mechanisms of metabolic regulation of gene expression at the transcriptional and translational levels in budding yeast

      **Referees cross-commenting**

      Ref. #1's major comment 1 echoes my request for clarification about whether Leu, and not just Val, is limiting growth in the bat1 mutant, and also the need to determine which BCAA supplement to bat1 cells will restore TORC1 activity (which was also requested by Ref. #3).

      I agree with this reviewer's request to provide evidence that Leu levels actually increase during G1 progression (comment #2). I also think the suggested experiments in Comment #3 are reasonable for their potential to provide stronger evidence that Leu production in the G1 phase of wild-type cells activates TORC1, as currently the argument is based on the finding of low TORC1 activation in bat1 cells (that seem to be limiting for Val vs. Leu). Comment #4 echoes similar requests made by both me and Ref. #3. Ref. #3's major comments 1 and 3 mirror two of my major comments. I wasn't convinced of the need to monitor Sch9 versus Rps6 phosphorylation as a read-out of TORC1 activity-does being a direct substrate truly matter? Regarding comment 5, I wasn't convinced of the need to include Rap-sensitive or -resistant control strains for the analysis in Fig. 5A. And regarding comment 4, while it would be interesting to examine if TORC1 regulates BCAA synthesis during cell cycle progression, this seems to be outside the scope of a demonstration that BCAA synthesis stimulates TORC1.

      Thus, it seems we all agree on certain experiments that need to be carried out, and Ref. #1 has rightly proposed a few others with the potential to strengthen the evidence that Leu production during G1 phase mediates cyclic activation of TORC1

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Blank and colleagues measure the synthesis of various metabolites from glucose during cell cycle progression and observe an increased synthesis of branched-chain amino acids (BCAA) from the early G1 to late G1 phase. Interestingly, they also found a gradual increase in TORC1 activity from the early G1 to the S phase which is proposed to be dependent on BCAA synthesis.

      Major comments:

      1. The authors show that TORC1 activity increases from the early G1 to the S phase. TORC1 activity is sensitive to short-term starvations caused during changing media or centrifugations. Hence, the concern arises regarding the increased pattern of TORC1 activity during the cell cycle. Is it really a biological phenomenon or a cellular adaptation to experimental conditions? Can authors provide more support for this observation? Can authors monitor the cell cycle for the two cell cycles to confirm that TORC1 activity shows a wavy pattern? RESPONSE: The same point was also made by Reviewer #2. As we noted in our response above, “____The second cycle comment is not pertinent to our elutriation setup. The two-cycle approach should be used in arrest-and-release synchronizations to minimize arrest-related artifacts when cells continue to grow in size. This is why we used elutriation in the first place, as described in the text, to avoid such artifacts. In elutriations it is the first cycle, exclusively of daughter cells, that can be meaningfully scored. After that, the cells lose synchrony very fast because you have mothers (which grow in size very little) and daughters (which need to double in size until mitosis). Hence, the second cycle will be meaningless and impossible to interpret.____”

      The authors use Rps6 phosphorylation as a read-out of TORC1 activity, which is not a direct substrate of TORC1. Analysis of the direct substrates of TORC1, such as phosphorylation of Sch9 will solidify the author's claim.

      RESPONSE: The reviewers discussed this point (see their comments above). We agree with the opinion that Rps6 phosphorylation accurately reports on TORC1 activity (also used in the fly experiments we now cite, as requested by Reviewer 1). For all our experiments' objectives and conclusions, it doesn't matter if the phosphorylation of Rps6 lies more downstream than Sch9 phosphorylation.

      Authors show that Bat1 lacking strain have reduced TORC1 activity. Can authors restimulate these cells with Leucin, Valine, and Isoleucine individually or in combination to identify the critical amino acid for the TORC1 activity?

      RESPONSE: Yes, that is an excellent suggestion. We show the experiment in Figure 4D (see previous response). Valine showed pronounced activation (>10-fold).

      The authors claim that increased BCAA synthesis is necessary for TORC1 activation. Since TORC1 is shown to be upstream of amino acid biosynthesis pathways, it will be interesting to check if TORC1 per se regulates BCAA synthesis during cell cycle progression. The authors could inhibit TORC1 by rapamycin treatment and monitor if the BCAA synthesis still shows cell cycle-dependent modulation.

      RESPONSE: The reviewers also discussed this point (see their comments above). We agree with the view that it is a very substantial undertaking, well beyond the scope of this work.

      In Figure 5A, the use of any rapamycin-sensitive and rapamycin-resistant strains as controls will strengthen their claim of TORC1 inhibition being epistatic to Bat1 deletion, since the rapamycin in minimal media might be less effective.

      RESPONSE: Again, the reviewers also discussed this point (see their comments above). We agree that it will not add much to the conclusions in the context of all the data we show and the existing literature.

      Minor comments:

      1. The data of metabolic labeling, especially various species M1, M2, M3, etc., of an individual metabolite is difficult to understand for the general readers. Hence, a schematic explaining various species might be helpful. RESPONSE: We added a new Figure (EV1) delineating the carbons from glucose to valine and leucine.

      Please describe the elutriation approach in more detail with media conditions and buffer conditions to understand the overall experimental setup.

      RESPONSE: We now added this information (see the second section of the Materials and Methods).

      Reviewer #3 (Significance (Required)):

      Significance:

      Overall, this study presents an interesting observation to the researchers working in TORC1 and cell cycle regulation.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      General Statement

      We thank the reviewers for a thorough review that will help us to improve the manuscript in the revision process. In our opinion, all three reviewers found the manuscript interesting, novel, and relevant for a broader readership. The reviewers suggested performing additional analyses of cell quantification from existing brain tissue or from newly generated tissue. All reviewers identified several shared concerns that we are happy to address by additional experiments and analyses to improve our manuscript. The reviewers suggested including the Control Diet + LiPR treatment group to further characterize the effects of LiPR on adult neurogenesis outside the context of the High Fat Diet. Also, the reviewers suggested including built upon the analysis of tanycytes and their proliferation. Some of these analyses will require generating new experimental animals, however, most analyses can be performed from already available brain tissue or previously collected confocal microscope images. Because we had anticipated some of the possible concerns, we have placed mice in the experiment already in February 2023. These mice are in the 4-month treatment group of Control Diet + LiPR. We will collect the brain tissue at the end of May 2023 and will analyze it in June and July 2023. In April and May 2023, we will work on analyses from existing tissue or images as described in detail below. We estimate that the suggested analyses are all feasible and should be manageable in 3 months. In fact, we are pleasantly surprised by the favorable nature of the reviews, especially from the reviewer 1 and 3, which allowed us to address around 50% of comments already as demonstrated in this revision plan (see section 3). Therefore, we are confident that we will be able to address the remaining concerns to full satisfaction of all relevant reviewers’ comments.

      Reviewer 1


      In this manuscript, Jorgensen and colleagues describe their findings on the action of a palmitoylated form of prolactin-release peptide (LiPR) on neural stem cells (NSC) in the adult mouse hypothalamus and adult mouse hippocampus. Their main conclusion is that LiPR can counteract the effects of high-fat diet (HFD) and rescue some of the adverse effects of HFD. Specifically, the authors provide evidence that: - Exposure to HFD reduces the number of presumptive adult neural stem cells (NSCs) in the adult hypothalamus, whereas exposure to LiPR reverses this trend. - The results suggest that LiPR reduces the proliferation of alpha-tanycytes and/or their progeny in the hypothalamus in the context of HFD, with Liraglutide acting similarly. In contrast, while LiPR also suppresses proliferation in the SGZ, Liraglutide works there in the opposite direction. - LiPR also helps the survival of adult-born hypothalamic neurons. - Reduction of proliferation by LiPR suggests a model where LiPR increases the number of NSCs presumably by reducing their rate of activation. - The results suggest that LiPR promotes expression of PrRP receptors in the hypothalamic neurons, suggesting that PrRP may act directly on such neurons (and tanycytes?) in vivo. - The authors also show that HFD and LiPR alter gene expression profiles of the MBH cells, with HFD, but not LiPR, inducing myelination-related genes. - Finally, they show that PrRP stimulates an increase in Ca2+ in in vitro-derived human hypothalamic neurons. - The authors conclude that LiPR may be reducing activation and proliferation of the hypothalamic stem cells and thereby preserve their pool from exhaustion, which was stimulated by HFD. The manuscript presents interesting data and is clearly written. There are several comments, mainly editorial.

      RESPONSE: We thank the reviewer for the favorable and positive assessment of our manuscript and for finding our study to be interesting to a broad audience and well written, with most comments described by the reviewer as “editiorial”. Below, we address the reviewer’s concerns in a detailed revision plan.

        • It is unclear why most of the experiments do not include the control+LiPR group. Even though the focus of the study was the action of LiPR in the context of HFD, questions remain regarding the action of LiPR per se. Is LiPR (or Liraglutide, for that matter) completely inactive on the normal diet background, with respect to neurogenesis in the hypothalamus and the hippocampus? Whether the Response is positive or negative, it would give a much better understanding of the action of LiPR - does it regulate neurogenesis in various physiological contexts, or does it only kick in with a particular type of diet? In fact, this was examined (see Supplementary figures), but only for the cells in culture and, when performed with animals, was limited to 7 and 21 days, rather than 4 months, which would have been much more informative.* RESPONSE: We thank the reviewer for this suggestion. We agree that including the Control Diet + LiPR group for the 4-month HFD group would complement the results from the 7 and 21 days. We will generate this treatment group for the 4-month HFD group and analyze the effect of LiPR on aNSC and adult-generated neurons. These mice in the 4-month treatment are in the experiment already from February 2023 and we plan to analyze their brain sections in June and July 2023.
      1. The question above is also relevant when considering the conclusions on the potential depletion of the stem cell pool (again, whether in the hypothalamus or the hippocampus), particularly at the 4-month time point. The mice are ~6 months old by that time, and neurogenesis in both regions is expected to decrease by that time. Are LiPR or Liraglutide able to suppress or exacerbate this decrease? Can they be used to mitigate this decrease when mice are on a regular diet?*

      RESPONSE: This concern will be addressed by analyzing the Control + LiPR mice for the 4-month HFD group (see our response to the point 1 above). We will analyze neural stem cells in the Hypothalamic Ventricular Zone and neural progenitors in the Median Eminence of these mice to address whether LiPR treatment changes the time-dependent decrease in both cell populations.

      • A somewhat related issue is that, in most cases, only the percentage or the density of cells are shown on the graphs, rather than the absolute numbers (at least for some cases). This sometimes complicates the comparisons; for instance, does the surface of the hypothalamus change between 2 and 6 months of age? The tanycytes' number stays, apparently, the same (e.g., Fig. 2) but the production of new neurons is supposed to fall dramatically.*

      RESPONSE: We thank the reviewer for this comment. We agree that the quantification of absolute number of cells is the preferable approach that we have used in our previous publications on subventricular (SVZ) or subgranular (SGZ) neurogenesis. However, hypothalamic adult neurogenesis is dispersed over much larger volume of tissue than neurogenesis in the SVZ or SGZ, which is confined to narrow tissue compartments. As we do not have access to a confocal microscope with stereological software, absolute quantification in entire MBH is not feasible. Nevertheless, we believe that our quantification of cell density provides an unbiased and informative approach that allowed us to compare the effects of LiPR and diet on the neurogenic process.

      • The authors write "LiPR may prevent stem cells from exhaustion, induced by HFD" - but it is not clear that HFD indeed leads to exhaustion - there is no statistically significant difference in the number of the stem cells (alpha-tanycytes) between the control and HFD or between HFD at 1, 3, or 12 weeks.*

      RESPONSE: We thank the reviewers for their insights. We adjusted the interpretation to better reflect our results. On line 442, we replaced the original statement “The lower cell activation may protect the stem cell pool from exhaustion elicited by the HFD“ with a new one, “The lower cell activation may protect the stem cell pool from exhaustion elicited by the HFD“.

      • Numerous papers show that the rate of production of new adult hypothalamic neurons (mainly those derived from beta-tanycytes) drops drastically within the first several weeks of mouse life. Does HFD accelerate, and LiPR mitigate, this decrease? Perhaps one can calculate the numbers from the graphs, but it would help if this is explained in the text of the manuscript. Also, it is not always clear whether specific experiments were performed with the zones of the hypothalamic wall that only contain alpha-tanycytes.*

      RESPONSE: Our results show that LiPR rescues the HFD-induced reduction in adult-generated hypothalamic neurons only in the context of 4-month HFD but not in the 7- and 21-day HFD. In the methods (line 877), we specify that “the Region of Interest (ROI) quantified included the MBH parenchyma with the Arcuate (Arc), DMN and Ventromedial (VMN) Nuclei and the Medial Eminence (ME)”. In the results of the revised manuscript (lines 301-303), we highlighted the areas of the ROI. Upon the request of Reviewer 3 (comment 14), we included new data on quantification of BrdU+ neurons in the Arcuate Nucleus (S.Fig.5O). This data show that 21d HFD increases the number of new neurons in ArcN, which is reversed by LiPR or Liraglutide (text added to results and discussion on lines 309-313 and 468-474, respectively). Finally, in the discussion (lines 464-488), it is stated that HFD and/or LiPR had no effect on number of new hypothalamic neurons or cells in the MBH parenchyma in the 7- and 21-day groups and this is discussed in the context of relevant literature.

      • A sharp increase in PCNA+ cells in the hippocampus at the 21-day time point, both in the control and in the HFD and HFD/LiPR groups (Fig. S2f) is a little puzzling because neither the Dcx+ nor the Ki67+ cells show this increase.*

      RESPONSE: We agree with the reviewer that this increase in the number of PCNA+ cells is puzzling. We quantified the number of PCNA+ cells twice by two different people, always getting the same result. Given that this is a minor result in a supplementary figure, we would prefer not analyzing this again, unless the reviewer would insist on it.

      • The study deals with several agents and several processes; a simple scheme that summarizes authors' conclusions might help to better understand the relationships between those agents and processes.*

      RESPONSE: We thank the reviewer for this useful suggestion. We included a summarizing schematic in the revised manuscript as the new Figure 6. We will update the schematic for the final revised manuscript, when we will incorporate the new analyses.

      ***Referee cross-commenting**

      I agree, the lack of the LiPR group complicates the interpretation of the results. I also agree that the experiments with vimentin staining, calcium increase, and even with neurospheres do not add much to the main questions that this study attempts to Response, and I'd rather see a more thorough analysis of the activation and differentiation data. I also want to reiterate that the concept of LiPR/PrRP preventing the exhaustion of the hypothalamic stem cell pool is not clear, because it is not shown that this pool does actually get exhausted under normal or HFD conditions. This latter issue again requires the LiPR-alone group. Also, as a clarification - I wrote about 1 month required to compete the revision assuming that the authors actually have the data on the Control+LipR group or at least the specimens available, mainly because the supplementary material shows results with this group, at least with the neurospheres. If this group is fully missing, then the effort will obviously take a longer time.

      Reviewer #1 (Significance (Required)):

      The provided evidence suggests, for the first time, that PrRP prevents the loss of the neural stem cells population in the adult hypothalamus that was diminished by obesity and HFD. This finding might be interesting to a broad audience.

      *

      Reviewer 2


      *The authors examine the effect of an anorexigenic drug, LiPR in the context of treatment with high fat diet (HFD) and with a special focus on hypothalamic neural stem/progenitor cells and neurogenesis. The work is mostly based on mice and a barrage of different techniques (confocal imaging, cell cultures with time lapse, gene expression...) are used. The results are interesting because they address the yet-poorly understood implication of hypothalamic neurogenesis in food intake and energy balance. The results point at complex effects at different levels (neural stem cells, neurons, division, survival...). The experimental approach is sometimes thorough in the treatment of details on the one hand, it also lacks of consistency on the other, and as a result the conclusions lack strength. There is a number of experiments that sometimes seem unrelated and this hurts the comprehension of the manuscript, specially in lieu of the complexity of the results obtained.

      *

      RESPONSE: We thank the reviewer for finding our results interesting and relevant. We will strive to improve the consistency of our results in the revised manuscript to satisfy the reviewer’s concerns.

      1. A major issue is the lack of a LiPR-only group, which would much facilitate the interpretation of the results. The effect of LiPR alone is however tested, but only in comparison with the Control in one of the in vitro experiments (S.Fig. 3) RESPONSE: We agree with the reviewer that expanding on the LiPR-only effect would facilitate the interpretation of the results (see concern 1 and 2 of reviewer 2). We want to emphasize, however, that we analyzed the HFD-independent LiPR effects not only in vitro but also in vivo by quantifying the number of BrdU+ cells and neurons in the MBH of mice exposed to 21-day HFD (S.Fig. 5 O-Q) and by including the Control Diet + LiPR in the RNAseq experiment (Fig.5C). Nevertheless, we will analyze the number of alpha tanycytes and proliferating cells for the 21-day Control Diet + LiPR treatment group. And we will generate mice treated with Control Diet + LiPR to complement the 4-month group. In this Control Diet + LiPR group, we will quantify the number of tanycytes and number of BrdU+ cells and neurons.

      2. As plotted, in Fig 1B is difficult to interpret the effect of HFD and LiPR, might be using percentage and noting the statistical differences as in the other would help. It looks like HFD has no effect compared to control on weight and only at the end LiPR could have an effect. On the other hand, after 4 months, HFD mice are clearly above the controls and it is then, albeit when weight gain has reached a plateau, that LiPR has an effect. The election of these arbitrary paradigms and their drawbacks has to be better explained.*

      RESPONSE: We thank the reviewer for the comment. We analyzed the effect of HFD and/or LiPR on the body weight for the 21-day group (Fig.1B) in the original manuscript (lines 111-115). The two-way, repeated measure ANOVA revealed no effect of the treatment on the body weight in the 7-day group, however, it revealed the effect of the duration of treatment on the body weight in the 21-day group. As suggested by the reviewer, we included the Control Diet + LiPR in the 21-day group (Fig.1B). We analyzed the data with ANOVA and found that the treatment has a statistically significant effect on the body weight, however, without any statistical difference between treatment groups (lines 112-116 in the revised manuscript). In addition, we will include the Control Diet + LiPR in the 4-month group.

      Why was the proportion of GPR10+BrdU+MAP2+ cells only assessed in control mice and no in the experimental groups if its expression in overall neurons changes? This suggests that the receptor is expressed in neurons. Interestingly, exposure to 21d HFD reduced density of GPR10, which was rescued by LiPR administration (Fig.1L). Why was this time point chosen and not the longer-term one? What is the consequence of the alterations in the potential number of GPR10, specially in relation to the administration of LiPR? This clarification is important because a 14-day treatment was chosen for the in vitro experiments in which LiPR, but not HFD, seems to have an effect on cell proliferation. Might be it would have been more useful to use a paradigm in which HFD has an effect to better compare with in vivo work and for the rationale of the work. "Besides GPR10, we co-localized neuronal cytoskeleton structures with NPFFR2 in the MBH (Fig.1O-P)..." Why were not GPR10 and NPFF2 analyzed in a similar and consistent manner ? It is confusing.

      RESPONSE: The proportion of GPR10+BrdU+Map2+ neurons was quantified to address whether new neurons express the PrRP receptor. We chose to analyze the proportion of GPR10+BrdU+Map2+ neurons at the 21d time-point because we had the most robust data for this or related time points in vitro and in vivo. We will emphasize this in the text. But we prefer not to analyze the effect of LiPR on the density or expression of GPR10 or NPFF2 for all time points. We consider this to be beyond the scope and focus of the manuscript.

      The number of GFAP+ α-tanycytes is not significantly changed by HFD therefore LiPR does not rescue, but rather increases the number of GFAP+ α-tanycytes in the 7-day setting. There are no differences among groups later, the effect is lost by 21 days, therefore there is a transient excess of GFAP+ α-tanycytes which later "disappear" in the LiPR group. The authors state that LiPR rescues the decrease in "htNSCs", but after 21 days the number of the GFAP+ α-tanycytes is the same in all groups without the need of LiPR. There is no experimental follow up (addressing proliferation and survival of these cells) and the conclusions stated in the text (results and discussion) are not really supported by the data. The in vitro experiments could be a complement, but are no substitute for the missing in vivo exploration.

      RESPONSE: We thank the reviewer for this comment. We agree that we did not correctly interpret the data. On line 158, we replaced the original statement “This suggests that short LiPR rescues HFD-induced reduction in the number of htNSCs” with a new one that reflects of date correctly, “This suggests that short LiPR increases the number of htNSCs. In our revision plan, we will quantify the number of proliferating tanycytes to complement our in vitro results.

      • The fact that cell division is "rarely found" (Rax GFAP) experiments also push for further investigation. It is difficult to see that relevance of the inclusion of the vimentin staining experiment if there is no further exploration. The effect of LiPR is only transient, in the 7-day paradigm and as the parameter evaluated is the proportion of vimentin+ tanycytes among GFAP+ tanycytes it could only be reflecting increased expression of the filament. "Nevertheless, we did not observe a statistically different change in the area occupied by Rax+ tanycytes (Fig.2H)." Why did the authors use Rax only for this experiment if "GFAP+ α-tanycytes which are considered the putative htNSCs?" What is the justification for not seeing changes in relation to the results reported in Fig 2D-F? "Because Vimentin is associated with nutrient transport in cells and with metabolic response to HFD 52-54, we quantified the proportion of GFAP+ tanycytes expressing Vimentin (Fig.2F)." It is difficult to see that relevance of the inclusion of the vimentin staining experiment if there is no further exploration. The effect of LiPR is only transient, in the 7-day paradigm and as the parameter evaluated is the proportion of vimentin+ tanycytes among GFAP+ tanicytes it could only be reflecting increased expression of the filament.*

      RESPONSE: Because Vimentin is a marker of neural stem cells and alpha tanycytes, we quantified the number of GFAP+Vimentin+ tanycytes to complement the quantification of GFAP+ alpha tanycytes. We are sorry that this was not clear, and we highlighted this connection in the revised manuscript (line 165). Because Rax is expressed in alpha tanycytes, we expected that LiPR will increase Rax in the Hypothalamic Ventricular Zone (HVZ). We agree with the reviewer that further investigation may be useful, and we will quantify the number of alpha tanycytes positive for Rax instead of determining only the volume of Rax+ tissue. We will quantify Rax+GFAP+ neural stem cells in the HVZ and Rax+GFAP+ neural progenitors (so-called beta tanycytes) in the Median Eminence to improve characterization of the cell dynamics in vivo.

      • Why there is no Ki67 experiment in the 7-day paradigm if that is the timepoint in which changes in the number or proportion of GFAP+ tanycytes are observed? PCNA was then used but only in the 21-day paradigm. What is the interpretation and relevance of these data? What are the non-htNSCs proliferating cells, whose dynamics are different from the changes in the number or proportion of htNSCs that could be potentially related to changes in mitosis? Again, I think it would be much useful for the work to explore in detail the changes in the putative htNSCs than investing in experiments that only add confusion.*

      __RESPONSE: __We apologize if the data presentation is confusing. We will include the quantification of the Ki67+ cells for the 7-day time point. In the MBH, many cell types undergo mitosis, including the oligodendrocyte precursor cells, microglia, astrocytes, and infiltrating macrophages. However, characterizing the identify of all these different cell types in response to the HFD and/or LiPR is beyond the scope of this study. To resolve whether HFD and/or LiPR influence proliferating aNSCs, we will quantify the proliferating cells in the HVZ, which will allow us to separate the proliferating aNSCs from all other proliferating cell types in the MBH.

      • The inclusion of Liraglutide + HFD, (not Liraglutide alone) only in some of the experiments is pointless if there is no direct comparison with LiPR and a timepoint is missing. In S.Fig 3, Fig. 5 and S.Fig 7 LFD (low fat diet?) is used in several occasions as in: "on reducing number of PCNA+ cells in 21d protocol (one-way ANOVA (OWA), F(2,12) = 16.66, p = 0.0003) when compared to both LFD and HFD groups". Is this the control diet?*

      RESPONSE: We apologize for the confusion caused by labelling the conditions of the Control Diet inconsistently. In some figures (e.g., Fig.2, S.Fig.3, Fig.4), we labelled the Control Diet as “Control”, whereas in some other figures (e.g., Fig.5, S.Fig.7) we labelled the Control Diet as “LFD” (Low Fat Diet). In all experiments and figures, the used Control Diet was identical. We unified the labelling of the Control Diet in all figures and in the text of the revised manuscript. Respectfully, we do not agree that including the Liraglutide data is pointless. We included the Liraglutide in the context of the HFD as a direct comparison with the HFD + LiPR group to demonstrate that the two anti-obesity compounds exert differential effects on adult neurogenesis. Such comparison has not been done before in analyzing adult neurogenesis and is valuable for better understanding of functions of these anti-obesity compounds.

      • The final experiment shows that application of hPrRP31, a variation of LiPR, causes an immediate calcium increase in human induced pluripotent stem cell-derived hypothalamic nucleus. This finding is interesting in itself because it brings light about the function of the receptor/s. It would have been very useful to test what other receptors mentioned to bind LiPR is mediating the effect. In any case, the focus of the work are the neural stem/progenitor cells responsible for neurogenesis and the changes in their properties because of HFD and LiPR, therefore I would trade these experiments for a more thorough and detailed dissection of these effects.*

      RESPONSE: We thank the reviewer for recognizing the relevance of the experiments with the hiPSC-derived neurons. As described in the comments above, we will conduct additional experiments to address the effect of LiPR on aNSCs and proliferation to more thoroughly as suggested by the reviewer.

      Minor points: __ A.__ Introduce "GLP-1RA"

      __RESPONSE: __We thank the reviewer for identifying this omission. We introduced the term in the revised manusript (line 50).

        • "HFD-induced inflammation and astrogliosis in the hypothalamus 45,46, whereas the long (4mo) protocol leads to DIO" Are these notions exclusive?* __RESPONSE: __This statement emphasized that HFD-induced inflammation and astrogliosis precede obesity. We prefer to leave the statement as it is.
        • LiPR displays no effects on astrocytes" "Displays" is not the correct term.* RESPONSE: We replaced the term “display” with the word “show” in the revised manuscript (line 342).

      ***Referee cross-commenting**

      I think we all referees agree for the most part. The main concern stated by all of us is the lack of a LiPR-alone group. The rest of the concerns are also related or complementary. In my opinion the mostly common view by the referees is reasuring.

      Reviewer #2 (Significance (Required)):

      The strengths of the work are its novelty in the field and the variety of techniques employed. The work has the potential of unveiling mechanistic insight into the regulation of neural stem/progenitor cells and neurogenesis. The main audience of this work would be the community working on this field. The lack of experiments testing that the changes observed actually participate in food intake prevent the work from being of relevance for a broader audience (food intake, energy balance, obesity...). The limitations are the descriptive nature of the work and the lack of a consistent and systematic experimental design that would allow to extract solid conclusions upon to which build upon future research.

      *

      Reviewer 3

      The work of Jörgensen et al describes the effect of a lipidized analogue of the prolactin releasing peptide (LiPR) on the mouse metabolism in response to high fat diet (HFD) and on hypothalamic and subgranular zone (SGZ) neurogenesis. They conclude that LiPR reduces body weight and improves metabolic parameters affected by HFD as well as it concomitantly stimulates neurogenesis in both niches the SGZ and the hypothalamus. The link between both effects is not demonstrated. The work is well conducted, the hypothesis is interesting and the experimental approach is adequate. The scope is wide and results are interesting, however a few aspects need to be further clarified. The manuscript is well written although the modification of some aspects would facilitate the reading such as the use of non described abbreviations for example.

      RESPONSE: We thank the reviewer for the positive assessment of our manuscript and for recognizing its novelty and importance for the research in neurogenesis, endocrinology, and metabolism. We will strive to clarify and facilitate our conclusions to improve the manuscript.

        • One concern in this study is the experimental groups. Authors analyze three groups control,HFD and HFD treated with LiPR. Authors conclude that the effects of LiPR are diet independent. However, given the results obtained by the authors on the effect of LiPR, the main question that arises in here is whether LiPR would have an effect on control mice. It seems tha a group is missing in the experimental design in which control ,mice are treated with LiPR during 7, 21 and the last two weeks of the 4 months. Author must include this information or at least argue the election of the experimental design.* RESPONSE: We thank the reviewer for this insight. We agree that including the Control Diet + LiPR in some of our analyses would improve the revised manuscript as also noted by Reviewer 2 (comment 1 and 2) and by Reviewer 2 (comment 1 and 2). In the original manuscript, we included the quantification of BrdU+ cells in the MBH for the Control Diet + LiPR in the 21-day group. To expand on these results, we will quantify the effects of LiPR on alpha tanycytes in the 21-day group. In addition, we will generate Control Diet + LiPR mice for the 4-month group to complement the HFD and HFD + LiPR data.
      1. Body weight is found reduced by LiPR as well as other metabolic parameters in mice treated with LiPR during the last two weeks of the 4 Mo HFD. However, no effects on hypothalamic or SGZ neurogenesis are not observed in this experimental group. How do authors explain this results?*

      __RESPONSE: __The 4-month group contains animals that are over 6-month-old, which display very low levels of cell proliferation and differentiation in comparison with the 7 and 21-day groups that contain mice that are 2 and 2.5 months old, respectively. It is possible that these low levels of neurogenesis did not allow us to detect any pro-neurogenic effects of LiPR. Alternatively, the low neurogenesis in older animals precludes us from detecting the adverse effects of the HFD, which are rescued by LiPR in younger animals.

      • In figure 1 I-K images are not clear and better resolution images would help.*

      RESPONSE: We provided images with higher resolution for Figure 1I-K of the revised manuscript.

      • Authors conclude that LiPR is increasing the number of NSC by reducing their activation. However, authors show an induced increase in htNSC only in mice fed HFD for 7 days and not in the 21 day fed mice or the 4 mo fed mice (fig 2 d-f). In addition, authors test for the number of cells expressing Ki67 (fig 2 L), however, the number of Ki67+ alpha tanicytes is not shown.*

      RESPONSE: We thank the reviewer for this insight. In the revised manuscript (line 158), we corrected the inaccurate statement that LiPR increased the number of aNSCs and did not rescue their number, which was also noted by Reviewer 1 (comment 5) and by Reviewer 2 (comment 4). In addition, we will quantify the number of Ki67+ cells in the Hypothalmic Ventricular Zone (HVZ), which will address whether LiPR affects proliferation of aNSCs. This concern parallels comment 6 of Reviewer 2.

      • On figure 2B it seems that is alpha 2 tanicytes that are missing in response to HFD.*

      RESPONSE: Indeed, the panel in Figure 2B shows that the HFD reduces the number of alpha tanycytes, including the alpha 2 tanycytes. This representative image supports our quantification results in Figure 2D-E.

      • Are Fig 2 A-C images representative of mice fed HFD for 7 days?*

      __RESPONSE: __Yes, the representative images in panels of Fig. 2A-C are from the 7-day group. However, the legend states that these images are from the 21-day group. This is an error that we corrected in the revised manuscript in the legend of Figure 2 (line 572). We apologize for this and thank the reviewer for double-checking.

      • By looking at figure 2B it seems like the proportion of alpha tanicytes is higher in HFD since no or very few tanicytes are observed and almost all of them are alpha tanicytes.*

      RESPONSE: Indeed, 7 days of HFD reduced the number of alpha 2 tanycytes, which occupy the ventral-lateral aspect of the 3rd ventricle. This reduction of alpha 2 tanycytes drives the lover proportion of GFAP+ alpha-tanycytes out of all GFAP+ tanycytes. We emphasized this in the text of the revised manuscript (line 435-437).

      • In fig 2 d-f, an increase in the number of GFAP+ alpha tanicytes and its proportion as well as labelled with vimentin is observed in control mice fed with normal diet for 7 days compared with mice fed normal diet for 21 days. How do authors explain this difference?*

      RESPONSE: There is no difference in the number of GFAP+ alpha tanycytes or proportion of GFAP+ alpha tanycytes between 7-day and 21-day Control Diet mice. We used the two-way, repeated measure ANOVA with the Bonferroni’s pots-hoc test and did not observe any statistical difference between these 2 quantifications for the Control Diet mice at 7 and 21 days. There is a statistical difference between 7-day and 21-day Control Diet mice in the proportion of GFAP+Vimentin+ tanycytes. This could be due to expansion of the Vimentin+ tanycytes in relatively young adult mice. Given that this is not a major point, we prefer not expanding its discussion in the manuscript.

      • In fig 2 Why are the differences in RAX, KI67 and PCNA only present in mice fed HFD for 21 days?*

      RESPONSE: We thank the reviewer for this question, which reflects a similar comment 6 of Reviewer 2. To improve consistency of the presented data, we will quantify the proliferating cells also for the 7-day time point. In addition, we will quantify the number of proliferating cells in the HVZ, which will allow us to address whether HFD and/or LiPR alter proliferation of tanycytes.

      • Authors test for adult hippocampal neurogenesis in the three groups. DO images in fig S2 correspond to the 21 day treatment group?*

      RESPONSE: Yes, the representative images in the Supplementary Figure 2 are from the 21-day group. This is stated in the figure legend.

      • On fig S2 C, it seems that in HFD fed mice treated with LiPR newly generated neuroblasts are more differentiated have authors looked at DCX+ cell morphology?*

      RESPONSE: We thank the reviewer for this observation. We have not analyzed the morphology of DCX+ cells or DCX+ neuroblasts in the SGZ. As the manuscript focuses on the hypothalamic and not hippocampal neurogenesis, we prefer not to analyze the morphology in the revised manuscript.

      • In this same figure, it seems like the number of DCX+ neuroblasts and the number of newly generated neurons is reduced in mice of the 21 d group compared to the 7 day group. Is this statistically significant?*

      RESPONSE: We used the two-way, repeated measure ANOVA with the Bonferroni’s pots-hoc test to analyze the DCX+ neuroblasts and neurons. We observed a statistically very significant effect of LiPR treatment on the number of DCX+ neuroblasts and neurons (page 10 of the original manuscript). However, the Bonferroni’s test did not reveal any difference between 7-day and 21-day treatment groups.

      • There is a large reduction in the number of DCX+ cells from control 21 d treated mice to control 4 month treated mice. Is this statistically significat? How do authors explain this dramatic reduction?*

      RESPONSE: Yes, there is statistically significant reduction in the number of DCX+ cells and DCX+ neurons in the SGZ between the 21-day and 4-month group S.Fig.2). This reduction is most likely a result of aging. The mice of the 21-day group were around 2.5 months of age when culled, whereas the 4-month group month mice were over 6.5-month-old. The decline in SGZ neurogenesis with age is well documented. Because this decrease in DCX+ cells in the SGZ is an obvious consequence of the animals’ age and because the hippocampal neurogenesis is not the primary focus of this manuscript, we prefer not to discuss this feature in the manuscript.

      • Authors do not show the effect of HFD on BrdU+ neurons in the Arcuate. However, all data need to be shown.

      *

      RESPONSE: We stated (on page 12 of the original manuscript) that in the Arcuate Nucleus of the 21-day group, there was “a statistically significant increase of BrdU+ neurons by HFD compared to Control (data not shown)”. To satisfy reviewer’s comment, we incorporated this data in the S.Fig.5 as the new panel S.Fig.5O and added the following text (lines 309-313) to the revised manuscript: “However, in the ArcN, the primary nutrient and hormone sensing neuronal nucleus of MBH 4, there was a statistically significant difference in number of BrdU+ neurons due to treatment (OWA, F(3,15) = 3.97, p = 0.0029). Exposure to 21d HFD significantly increased the number of BrdU+ neurons in the ArcN, which was reversed by co-administration of LiPR or Liraglutide (S.Fig.5O).” In addition, we adjusted the relevant discussion (lines 468-472): “Our results show that the short and intermediate exposure to HFD does not change the number of newly generated, BrdU+ cells, neurons, or astrocytes in the MBH parenchyma, however, it increases the number of BrdU+ neurons in the primary sensing ArcN, which is reversed by the con-current administration of LiPR or Liraglutide” and (lines 474-476): “In addition, our results show that while LiPR does not change the number of new cells in the MBH parenchyma, it can rescue the increased production of new neurons in the ArcN in the context of the intermediate HFD exposure.”

      *Reviewer #3 (Significance (Required)):

      In general the manuscript includes a great amount of work to demonstrate the effect of LiPR on neurogenesis (hippocampal and hypothalamic). The scope is wide, and the hypothesis is really interesting. Authors may need to solve some issues in order to completely demonstrate their claims and conclusions, but once the work is done, it will be very valuable to understand the effect of pharmacological agents used in the field of endocrinology to treat metabolic disorders such as type 2 diabetes di type 2 diabetes. So far, no studies have been done in which the effect of this molecules have been described on SGZ and hypothalamic neurogenesis. Both the field of endocrinology and metabolism as well as the field of adult neurogenesis may benefit of a study of this type.*

    1. One artform we didn’t look at in class but I have found myself interested in lately is that of bonsai. Did you know that any tree can be bonsai? It isn’t a specific type of tree! Bonsai again involves the idea of finding beauty in the natural world. In this case bonsai trees are also a practice in mindful attentiveness as it requires one to trim and shape a tree. One must have a vision for the tree and patiently cut and shape the branches so that it conforms to that vision. In most instances a bonsai should have a wide base with large roots that taper as it goes up. The branches should form a triangular shape. By limiting the space of the tree to grow i.e a small pot, the tree will stay its miniature size.

      I did not realize any tree can be Bonsai, I thought that like Oak, Cherry, Vine, Birch, etc Bonsai was a tree where the Bonsai was some sort of etymology or signified what the tree was -- that is interesting, as this may be confirming the ideas earlier stated with the two ideas of Wabi and Sabi along with holding Beauty in the moment, the fact that the Bonsai can be any tree makes it's have the same fluidity and identity of Mono No Aware philosophy I think/believe

    1. As an interpretive bias, technological determinism is often an inexplicit, taken-for-granted assumption which is assumed to be 'self-evident'. Persuasive writers can make it seem like 'natural' common sense: it is presented as an unproblematic 'given'. The assumptions of technological determinism can usually be easily in spotted frequent references to the 'impact' of technological 'revolutions' which 'led to' or 'brought about', 'inevitable', 'far-reaching', 'effects', or 'consequences' or assertions about what 'will be' happening 'sooner than we think' 'whether we like it or not'. This sort of language gives such writing an animated, visionary, prophetic tone which many people find inspiring and convincing.

      The statement highlights how technological determinism often operates as an implicit, unquestioned assumption presented as "self-evident" or "natural" common sense. Writers can use persuasive language to portray technological determinism as an unproblematic given, making sweeping assertions about the impact and consequences of technology. Such language can create an animated, visionary, or prophetic tone that may be appealing and convincing to many readers. However, it is important to critically examine the assumptions underlying technological determinism and recognize that the relationship between technology and society is complex, multifaceted, and shaped by human agency, values, and social dynamics. Taking a nuanced approach can help avoid deterministic thinking and promote a more thoughtful understanding of the role of technology in society.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We would like to thank all reviewers for their time and effort invested into reviewing our manuscript.

      Please find our responses to your comments, criticisms and suggestions below in blue.

      __Reviewer #1 (Evidence, reproducibility and clarity (Required)): __

      Summary:

      The manuscript by Vishwanatha et al. presents findings on the fission yeast transcription factor Cbf11, which is involved in regulating lipid synthesis. Changes in lipid metabolism often have detrimental effects on nuclear division (evidenced by the high percentage of cut phenotypes among strains with altered lipid content). Here the authors show that cbf11 deletion strains produce additional phenotypes such as changes to cohesion dynamics and altered chromatin modification within centromeric regions, in turn perhaps affecting microtubule attachment and proper chromosome distributions. This hypothesis is supported by the authors' finding of epistatic effects between cbf11 and cohesin loading and unloading.

      Major comments:

      While the evidence presented supports the hypothesis of altered cohesin loading as a major driver of observed mitotic defects, changes in the NE surface area are likely to also contribute to the phenotypes even in pre-anaphase stages.

      • This is an interesting notion. We are only aware of NE overproduction and nuclear “flares” observed upon the Lipin phosphatase dysregulation (PMID 23873576).

      • However, in our case we rather expect NE membrane shortage, not overproduction. Accordingly, we do see that the nuclear cross section area (thus likely also NE surface area) is smaller in cbf11KO compared to WT (see boxplots below). Is this what you are referring to? We are not sure how this would affect the pre-anaphase stages of mitosis.

      Did the authors test any double deletions with regulators involved in decreasing lipid content (e.g. spo7, nem1, ned1) to counteract the role of Cbf11? This could be useful in assessing the relative contribution of cohesion dynamics and histone modifications.

      • We previously published (PMID: 27687771) that cut6/ACC overexpression can indeed partially suppress the cut phenotype in the cbf11KO background. So lipid metabolism does play a role and does contribute to mitotic fidelity. In the current manuscript, we are showing that other factors contribute as well and that defects arise already prior to anaphase, which is not consistent with the simple notion of shortage of membrane building blocks during anaphase. We appreciate your suggestion on testing the relative contributions of these various factors to mitotic fidelity, but we have not tested any of the suggested double mutants.

      A possible role of physical constraints dictated by the NE was already mentioned by the authors in the context of spindle bending and decreased elongation rates and some preliminary experimental data on this would be appreciated. Generation of strains, acquisition of some timelapses, and quantification of spindle elongation rate/buckling frequency should be feasible in a reasonable time frame.

      • Assaying spindle parameters in Lipin-related mutants would indeed be interesting, but again, these are anaphase phenotypes. We are not sure how this is relevant for the pre-anaphase findings we report? Also, we unfortunately no longer have the personnel and capacity to carry out the suggested experiments.

      The authors report mRNA levels of the centromere flanking genes per1 and sdh1 to be increased by 1.5x and decreased by 2x in comparison to WT. Could the authors elaborate on whether this is an expected trend? Kaufmann et al., 2010 reported low transcription of per1 when the surrounding regions are predominantly acetylated. Fig. 4A suggests a slight increase of H3K9ac at per1 and a decrease of transcription would be conceivable.

      • We do not have any particular expectations regarding the expression levels of per1 and sdh1 in our system. We simply note that their expression changes in cbf11KO (in different directions) and this is accompanied by changes in H3K9 acetylation patterns.

      • The increased histone acetylation at the per1 locus that you mention (Kaufmann et al., 2010) was only shown for H4K12ac, while we measured H3K9ac (these marks are deposited by different enzymes). The authors actually report that “The levels of histone H3 at per1 did not change significantly between the two growth conditions and strains”, so we do not think that paper is relevant for our study.

      Fig. 3B indicates a catastrophic mitosis percentage of roughly 9.5% in cbf11∆ while in Fig. 1C 4% of all cells, or ˜31% of all mitotic events, is noted as abnormal. Could the authors clarify this discrepancy? Since Fig. 1 utilises time course data of 333 cells (please specify the number of analysed cells also in the legend), would the authors expect this data to be more trustworthy when compared to images of fixed cells? What were the criteria to assign divisions as catastrophic in fixed cells and which features were utilised to identify the 400 cells as mitotic?

      • We typically do see higher proportions of cut cells in fixed samples than in live-cell imaging. We believe this has to do with the different fluorescence readouts for live vs fixed cells. We have added the following explanations to the methods:

      “Please note that the observed frequencies of mitotic defects are not directly comparable between live and fixed cells. Following catastrophic mitosis, the dead cells rapidly lose histone-GFP fluorescence (imaging of live cells), but their DNA can still be visualized with DAPI for a much longer period (imaging of fixed cells), resulting in higher apparent defect frequencies in fixed cells.”

      • Importantly, we always compared cbf11KO to WT grown and processed under the same conditions, and that is how we determined the significance of any defects.

      • Mitotic defects were classified based on nuclear morphology both in live cells (histone signal) and in fixed cells (DAPI): Cells having the cut phenotype, or mis-segregated nucleus = 2 nuclei of different sizes, or septated cells with only one daughter cell having a nucleus, respectively.

      • We have analyzed images of at least 400 cells *in total* from asynchronous populations (interphase + mitotic >= 400). We have modified the figure legend to make this fact more clear. In our experience, this is the standard way of reporting the frequency of mitotic defects in asynchronous yeast cell populations.

      • We have specified the number of cells analyzed in Fig. 1C.

      Minor comments:

      Previous literature is, to the best of our knowledge, sufficiently referenced. The text is largely clear (some exceptions within the methods section will be elaborated on below). The figures, however, would benefit from graph titles and some minor formatting changes.

      • Figures:

      o Fig. 1: Specify the number of cells analysed in C within the legend as well. For B, please use colourblind-friendly schemes - especially since images are shown as merges only. The example of the "cut" phenotype appears small and crowded by surrounding cells. Especially the latter might affect mitotic fidelity. Under the assumption that this did not affect quantifications (WT seem fine) a less crowded cell would present a nicer example.

      • We have changed Fig. 1 as requested.

      o Fig. 3: Images shown in A add little benefit in their current form. What is the takeaway for the reader?

      • We hope that the reader gets concrete information on cellular and nuclear morphology of the investigated strains, which would be otherwise difficult to reproduce by textual description.

      Indicating that images represent DAPI staining and pointing out cells of interest with arrows/symbols would be helpful.

      • Done.

      The example shown for cbf11 appears to be dimmer in comparison and cell morphology is hard to interpret.

      • The cbf11KO cells stain fainter with DAPI than cells of other strains. We do not know why. To increase the clarity of the image, we have now adjusted the brightness and contrast of the cbf11KO panel (and indicated this adjustment in the figure legend).

      C feels misplaced in this figure and a title could improve readability.

      • We have added a title and moved the panel to Fig. 4 (4D).

      o Fig. 4: Graph titles needed, figure might work better in portrait

      • We have added the required graph titles.

      • We have recreated all ChIP-seq related figures to incorporate new data and to (hopefully) better highlight the differences between genotypes.

      • Text:

      o Mention median duration of mitosis in cbf11∆ (Fig. 2E) in text since WT is already noted;

      • Done.

      o Discussion, third paragraph: "TBZ [REF] and are prone to chromosome loss [...]". I assume this referred to minichromosome loss or have changes in ploidy/chromosome segregation been quantified?

      • Changes in ploidy were indeed not quantified. We have changed the wording to “__mini__chromosome loss”. But please note that the Ch16 minichromosome is derived from regular Chromosome III and is a real chromosome, albeit a small one.

      o Methods, Microscopy and image analysis:

      How were fixed cells imaged (glass bottom dishes, plated on lectin, mounted on slides)?

      Specify the CellR as widefield and provide details of the objective used (immersion and NA)

      • We have added the following information to the relevant Methods section:

      “Cells were applied on glass slides coated with soybean lectin, covered with a glass cover slip, and imaged using the 60X objective of the Olympus CellR widefield microscope with oil immersion (NA 1.4)”

      Elaborate on "manual evaluation of microscopic images"

      • We have extended the description of cell scoring:

      “The frequency of catastrophic mitosis occurrence was determined by manual evaluation of microscopic images using the counter function of ImageJ software, version 1.52p (Schneider et al., 2012). At least 400 cells from the asynchronous populations were analyzed per sample and mitotic defects were scored based on nuclear morphology and septum presence/position. ”

      For live cell microscopy, what was the estimated final density of cells within the 5 µl resuspension?

      • Our estimate is 4-8 x 10^6 cells in 5 ul. We have added this information into the Methods.

      What is meant by measuring the maximum section of plotted profiles? Is this the maximum distance of Hht1 signals within the entire time-lapse?

      • We have changed the description:

      “The nuclear distance was measured by using Hht2–GFP signals and converting the green channel images to binary, measuring the maximum distance between the Hht2-GFP signals using plot profile function in imageJ.”

      Was spindle length quantified the same way?

      • We have added the description:

      “Spindle length was quantified by drawing a line along the length of the spindle (using mCherry-Atb2 signals) at each timepoint and measuring the length of the line using imageJ.”

      Methods, ChIP-qPCR:

      It is not clear which strains were used, this can only be guessed by the use of a GFP antibody suggesting GFP tagged chromatin to be precipitated. For people with expertise outside of ChIP assays, this should be specified

      • We have listed the used strains in the ChIP-qPCR methods section.

      Reviewer #1 (Significance (Required)):

      Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      This manuscript presents a novel role for a transcription factor, one typically implicated in lipid metabolism, in chromatin modification and cohesin dynamics, with the possibility of this representing a more conserved process across ascomycetes. The mechanism of cbf11 regulation remains to be determined.

      Place the work in the context of the existing literature (provide references, where appropriate).

      This work helps link two bodies of work related to cell division that are usually considered in isolation, the regulation of lipid dynamics and the control of chromatin dynamics and cohesion. Some comparisons to phenotypes in closely related species would have helped provide a broader context (such as Yam et al., 2011, where the spindle morphologies in S. japonicus and response to cerulenin treatment might be of relevance to the work presented here).

      • We now briefly discuss the semi-open mitosis of Sch. japonicus and the Yam et al. 2011 paper at the beginning of the Discussion.

      State what audience might be interested in and influenced by the reported findings.

      Molecular and cellular biologists with interests in nuclear remodelling, lipid metabolism, kinetochore assembly.

      Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Fission yeast biology, nuclear remodelling, microscopy. We are not qualified to make in-depth comments on the soundness of ChIP-Seq and ChIP-qPCR experiments.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      This manuscript describes detailed mechanisms by which the cbf11 deletion showed the phenotype. They found that the cbf11 deletion altered pericentromeric chromatin states such as the level of cohesin and hypermethylation.

      In general, their results are interesting and provide important insights into the relationship between lipid metabolism and chromosome segregation. The presented data are valuable for the community, but the authors should carefully re-assess their data.

      Major comments:

      1. Statistical analyses in some of the Fig.3B, 3C, 4B and S2 seem to be somewhat weird because p-values are too small for such a small number of experiments (three independent experiments) with large standard deviations. Please show all the data points in Fig. 2C-E, and provide raw values as a supplementary table for assessment of the data.

      2. We now show individual data points for all barplots and boxplots and provide all source numerical data as supplementary tables. The details of the used statistical tests are given in the respective figure legends.

      3. Pages 5-6: As for Fig. 4, the data is difficult to interpret because the trends of the ChIP-seq pattern of H3K9me2 between replicates look different: replicate 2 shows an increase of H3K9me2 signal, while replicate 1 shows almost no difference or weak if any. In such a case, the authors should repeat ChIP-seq one more time and confirm hypermethylation at these regions or confirm it by ChIP-qPCR.

      4. We do not agree with this statement. It is true that the exact histone modification patterns are not identical between the two replicates, but this is likely due to the differences in chromatin extract preparation in replicate 1 vs replicate 2 (see Methods). Importantly, both replicates show pronounced differences in H3K9me2 patterns between WT and cbf11KO. We have changed the visualization style to better highlight the differences between WT and mutant (Fig. 4A, Fig. S2B, S3)).

      5. Also, we have added one more biological replicate for the H3K9me2 ChIP-seq (Fig. S3) and performed the H3K9me2 ChIP-seq also in the Pcut6MUT strain with ~50% decreased expression of the cut6 gene (Cut6/ACC is the rate-limiting enzyme of fatty acid synthesis; cut6 is target of Cbf11) as 3 biological replicates (Fig. 4A and Fig. S3). Importantly, all replicates of both mutant strains show hypermethylated regions in the centromeres compared to WT.

      Assuming that the pericentromeric regions are hypermethylated by cbf11 deletion, it is still unclear why the transcription from only dh, but not dg, regions increased although their ChIP-seq data indicated both dh/dg regions were hypermethylated. A similar question arises to the expression of per1 and sdh1. Both K9Ac and K9me2 modifications seem to unchange at both per1 and sdh1 loci, whereas the expression levels of these loci changed in the opposite direction. These results suggest that the transcription levels of the centromeric region are independent of their histone modification states.

      • We do not know why dh expression differs from dg. But note that these are multi-copy repeats and it is very difficult to study individual copies separately. Our expression data, and partly also the ChIP-seq data represent “average” values across all the dh and dg copies present in the genome.

      • Importantly, Figure 4A (and Fig. S2B, S3) show a large piece of the fission yeast chromosome (~57 kbp) and this scale does not allow making informed judgements about the state of histone modifications at a particular promoter locus.

      • When we zoom in, we do see increased and decreased H3K9ac around the TSS of per1 and sdh1, respectively (2 replicates shown).

      • A key question of this study is to understand the relationship between lipid metabolism and chromosome structures. However, the results presented are not enough to address this question. I request to distinguish whether the defects on pericentromeric regions are mediated by lipid metabolism or direct effect by cbf11 deletion. Cbf11 is a transcription factor and can directly bind to DNA, thereby there is a possibility that Cbf11 directly modulates the pericentromeric chromatin state without regulating lipid metabolism. This question can probably be addressed. As the authors have shown in their previous study (Prevorovsky et al., 2016), overexpression of cut6, which encodes acetyl coenzyme A carboxylase and is a target of cbf11, can bypass nuclear defects. If the overexpression of cut6 restores alteration on pericentromeric regions such as cohesin enrichment and hypermethylation, it suggests the defects are a secondary effect of the decrease of phospholipid biosynthesis.

      • We agree that any rescue effects can be direct or indirect. And distinguishing between these two alternatives is unfortunately not straightforward.

      • Our Cbf11 ChIP-seq data do not show Cbf11 binding to centromeres (PMID 19101542), suggesting that any impact of Cbf11 on centromeric chromatin is most likely indirect and mediated by some other, downstream, players.

      • Instead of assaying cut6OE, we now show data that decreased cut6/ACC (a target of Cbf11) expression also leads to changes in histone methylation, similar to cbf11KO (Fig. 4A, Fig. S3). This suggests that lipid metabolism indeed can affect chromatin state (and the chromatin defects in cbf11KO are likely also lipid-related).

      • We have recently shown (Princová et al., 2023, PMID: 36626368) that decreased fatty acid synthesis leads to changes in acetylation and expression of specific stress-response genes in S. pombe, and the whole process involves the histone acetyltransferases Gcn5 and Mst1. Therefore, instead of implicating membrane phospholipids, we rather suggest that lipid metabolism can affect chromatin acetylation/methylation and structure via HATs, potentially through acetyl-CoA, the common substrate of both FA synthesis and HATs. We now mention the Princová et al., 2023 paper in the Discussion section.

      Minor comments:

      1. Figure 3C: The legend says, "Values represent means + SD from 3 independent experiments". It meant "means {plus minus} SD"?

      2. Corrected. Thank you for spotting this.

      3. The relationship between phospholipid synthesis and mitotic fidelity is now discussed in the bioRxiv paper (https://doi.org/10.1101/2022.06.01.494365). It would be nice to discuss this paper.

      4. Thank you for pointing out this reference. We now briefly mention this paper as a note that dysregulation of membrane phospholipid synthesis leads to mitotic phenotypes similar to cbf11KO.

      Reviewer #2 (Significance (Required)):

      Faithful chromosome segregation into daughter cells is crucial for cell proliferation. The authors previously reported that the deletion of cbf11, a transcription factor that regulates lipid metabolism genes, causes "cut (cell untimely torn)" phenotype (Prevorovsky et al., 2015; Prevorovsky et al., 2016). In this report, they examined detailed mechanisms by which the cbf11 deletion showed the phenotype, and found that the cbf11 deletion altered pericentromeric chromatin states such as the level of cohesin and hypermethylation. In general, their results are interesting and provide important insights into the relationship between lipid metabolism and chromosome segregation. The presented data are valuable for the community of basic science in the fields of chromosome biology and cell biology.

      We are cell biologists working on chromosomes and the cell nucleus.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The Vishwanatha et al. manuscript examined the nature of the mitotic defect in cbf11 deletion cells. cbf11+ encodes a CSL transcription factor that regulates lipid metabolism genes in S. pombe. Loss of cbf11+ was previously shown to have a "cut" phenotype presumably due in part to aberrant regulation of its target gene cut6+ which encodes-acetyl CoA/biotin carboxylase involved in fatty acid biosynthesis (Zach et al. 2018). The authors hypothesized that the mitotic defect exhibited as chromosome missegregation in cbf11 deletion cells may be caused by alterations in cohesin occupancy and H3K9 methylation in centromeres. Cohesin occupancy was slightly higher in centromeric dh and dg repeats in the cbf11 mutant and loss of the cohesin-loader gene wpl1+ appeared to suppress the mitotic defect. The authors also showed by ChIP-Seq that H3K9 methylation was higher in the centromeric regions, as well as increased minichromosomal loss in the cbf11 mutant.

      The discovery of increased cohesin occupancy and H3K9 hypermethylation in the centromeric regions of cbf11 deletion cells is novel and interesting. However, the main deficiency of the manuscript is that this discovery is underdeveloped. For example, the evidence linking the mitotic defect phenotype to these two processes was not well supported.

      • We believe that the links have already been well established in the literature. The integrity of centromeric heterochromatin (H3K9me2) is known to be required for mitotic fidelity (eg. Clr4/HMT and Clr6/HDAC mutants with H3K9me2 deficiency have high minichromosome loss and/or show lagging chromosomes during mitosis - PMID: 19556509, PMID: 8937982, PMID: 9755190). Moreover, we stress the known interconnections and provide relevant citations in the Discussion:

      “It is also important to note that heterochromatin, kinetochore function, cohesin occupancy, and gene expression are all interconnected and actually interdependent (Bernard et al., 2001; Folco et al., 2019, 5; Grewal and Jia, 2007; Gullerova and Proudfoot, 2008; Nonaka et al., 2002; Volpe et al., 2002)”

      • We show in the manuscript altered cohesin occupancy in cbf11KO and show that mutations in cohesin loading factors do affect mitotic fidelity of cbf11KO. While we do agree that this connection can be developed further, we believe this is beyond the scope of our current project.

      Moreover, there was no investigation in whether/how Cbf11 regulates cohesin occupancy or H3K9 methylation at the centromeres.

      • This is true. But again, we believe this is beyond the scope of our current project.

      Finally, the title and abstract provided an impression that lipid metabolism may influence cohesin occupancy and histone H3 hypermethylation at the centromeres, but this was not directly studied in the manuscript.

      • We now provide H3K9me2 ChIP-seq data on the Pcut6MUT mutant deficient in fatty acid synthesis to show that lipid metabolism indeed can affect histone methylation at the centromeres (Fig. 4A, Fig. S3).

      Centromeres are regions where sister chromatid cohesion is abolished last in mitosis. The observed higher levels of cohesin occupancy in the centromeric dh and dg repeats of cbf11 deletion cells could be the cause of chromosome missegregation, presumably because there is a delay or hinderance of cohesin removal from sister chromatids in mitosis. However, cohesin occupancy was carry out in asynchronous wild type and cbf11 deletion cultures, so it is unknown whether there is a delay of cohesion abolishment in mitosis. A cdc25-22 block and release experiment could better address this hypothesis.

      • We acknowledge these limitations of our findings regarding cohesin occupancy in the paper:

      “ Notably, centromeres are the regions where sister chromatin cohesion is abolished last during mitosis (Peters et al., 2008). Since cbf11Δ cells show altered cell-cycle and pre-anaphase mitotic duration compared to WT (Fig. 2), the observed difference in cohesin occupancy might merely reflect these changes in the timing of cell cycle progression. Alternatively, altered cohesin dynamics could play a role in the cbf11Δ mitotic defects.”

      • We agree the issue could be addressed better using synchronous cell populations. However, the cdc25 or cdc10 block-release does not work well in cbf11KO (PMID: 27687771), and we currently do not have the capacity to perform less disruptive forms of cell cycle synchronization.

      The observation that the spindle assembly checkpoint did not influence the mitotic catastrophe phenotype of cbf11 deletion cells suggests that the chromosome missegregation may not be mediated by defects in cohesin dynamics. How does Cbf11 influence cohesin dynamics in mitosis?

      • There are clearly multiple contributors to the mitotic defects observed in the cbf11KO strain and we state this explicitly throughout the manuscript.

      • We agree that it would be interesting in future to know more details about the link between Cbf11 and cohesin, but this is beyond the scope of our current project.

      Does Cbf11 regulate transcription of cohesin genes or indirectly through defects in the centromere or condensins?

      • Expression levels of cohesin and condensin genes are not affected by deletion of cbf11 (PMID: 26366556). We now mention these findings in the Results section.

      There was no direct evidence that H3K9 hypermethylation at the centromeres contributes to the mitotic catastrophe phenotype of cbf11 deletion cells.

      • This is true. However, the importance of H3K9me2 for mitotic fidelity has already been established in the literature (as we mention above).

      It is also not clear whether Cbf11 directly or indirectly influences histone methylation at the centromeres of affect centromere function.

      • When the Cbf11 protein is missing, centromeric histone methylation is different from normal (WT), and centromere function is not normal either - dh repeats are less expressed, minichromosome derived from ChrIII (so has a normal centromere) is 9x more frequently lost. So Cbf11 does affect these processes. The question remains, whether Cbf11 does this directly or indirectly. We favor the indirect route, as we have recently shown that H3K9 acetylation or methylation can be affected by shifting the balance between fatty acid synthesis (which is regulated by Cbf11) and histone acetyltransferase activity. We now mention these findings in the Discussion (Princová et al., 2023).

      Based on a substantial number of protein-protein interactions of Cbf11 and gene products that affect chromatin function/silencing at the centromeres from the Pancaldi et al. 2012 study (e.g. HIR complex, Hrp1-Hrp3, Cnp1, Ino80 complex), I am surprised that these candidates were not mentioned in this study or investigated.

      • Unfortunately, no DNase treatment was used during the affinity purification of Cbf11 in the study you mention. Therefore, the list of potential interactors is likely contaminated by irrelevant, DNA-mediated interactions with proteins sitting at nearby loci. This is why we have not pursued these candidates.

      Also, it would be more comprehensive to examine defects in transcriptional silencing in the centromeric regions using an ade6+ or ura4+/FOA marker system rather than measuring expression of per1+ and sdh1+.

      • We agree. We actually tried the ura4/FOA reporter system, but had problems constructing the reporter strains in the cbf11KO background. The resulting clones showed variable levels of FOA sensitivity (see figure of clones OC5-9 below), so we could not get a conclusive answer from this experiment and resorted to measuring the expression of pericentromeric genes.

      Figure 1A shows that the "cut" and nuclear displacement phenotypes are independent. However, cut mutants can also generate a nuclear displacement phenotype [Samejima et al. (1993) J. Cell Sci. 105: 135-143]. Therefore, I am not sure whether the latter phenotype can be treated as entirely independent from "cut" mutants.

      • We have made clarifications to Fig. 1A accordingly.

      Reviewer #3 (Significance (Required)):

      The discovery of increased cohesin occupancy and H3K9 hypermethylation in the

      centromeric regions of cbf11 deletion cells is novel and interesting. However, the main deficiency of the manuscript is that this discovery is underdeveloped.

      The results of this manuscript would be of considerable interest in the area of cell cycle research, transcription and chromatin structure and function.

      Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      Summary

      In this paper Vishwanatha et al. analyze the mitotic phenotypes of cells lacking a regulator of lipid metabolism Cbf11. They propose that sister chromatid cohesion abnormalities and altered chromatin marks may contribute to the increased incidence of catastrophic mitosis. Additional experiments are required to improve the study and strengthen the authors' conclusions.

      Major Comments

      Both histone and alpha-tubulin tagging are known to aggravate mitotic errors in S. pombe. Before using these markers for live imaging, the authors should quantitate mitotic phenotypes in untagged cbf11∆ cells, as compared to the wild type. Using DAPI and Calcofluor staining (and ideally, also visualizing microtubules using anti- alpha-tubulin antibodies) the authors should measure the percentage of cells in mitosis and the percentage of cells that are going, or just went, through catastrophic mitosis, in asynchronous early-mid-exponential cell populations.

      • We agree that tagging can affect protein function in numerous ways.

      • The tagged versions of tubulin (mCherry-Atb2) and H3 (Hht2-GFP) used in our paper have been obtained from Phong Tran’s lab. These tagged alleles had been published (Nature Communications, PMID: 26031557) and used successfully to monitor mitotic defects including chromosome segregation errors and the cut phenotype.

      • The analyses of mitotic and septation defects of asynchronous untagged cbf11KO cells that you suggest (except for the spindle visualization) were already done by us (PMID: 19101542, PMID: 26366556) and are in agreement with our present study. In brief, we showed that cbf11KO populations contain ~10-30% of cells with mitotic defects (eg. cut), depending on the cultivation conditions. They also show septation defects and altered cell morphology and shorter cell length.

      In analyzing the dynamic of nuclear division, the authors claim that the interval between spindle formation and anaphase onset is "longer" and "more variable" in cbf11∆ cells compared to WT cells. The authors should provide proper statistical analysis of both differences to show that these differences are significant.

      • We now show the required data and statistical testing as Fig. 2H.

      The same goes for the authors' claim that mitotic duration is "more variable" in cbf11∆ cells compared to WT cells.

      • The spread of values for both WT and cbf11KO is given in Fig. 2G.

      As mentioned above, alternative estimates of possible perturbations of mitotic dynamics could be obtained by measuring the percentage of cells in different mitotic phases in asynchronous untagged cell populations, in order to avoid possible artifacts given by tagging histones and alpha-tubulin.

      • As you mention above, to estimate their cell cycle stage, untagged cells would need to be fixed and stained to visualize the nucleus and septum. However, using fixed cbf11KO cells is not optimal for this purpose. cbf11KO have septation and cell separation defects (PMID: 19101542, PMID: 26366556). This results in increased numbers of cells having a (persistent) septum in the asynchronous population, which obscures any estimates of cell cycle stages, and this is why we observed live cells during a timecourse.

      The fact that inactivation of SAC does not change the incidence of catastrophic mitoses shows that SAC is not involved and that there are likely no problems with kinetochore-microtubule attachments. Therefore, the authors' statement "These results suggest that SAC activity only plays a minor role (if any) in the mitotic defects observed in cbf11Δ cells" should be changed.

      • We have changed the sentence to:

      “These results suggest that SAC activity only plays a minor role (if any) in the mitotic defects observed in cbf11Δ cells, or that the defects are not caused by problems with kinetochore-microtubule attachment.”

      Also, the authors' statement in the conclusion that "This indicates that proper microtubule attachment to kinetochores might be compromised and takes longer to achieve in cbf11Δ cells, possibly triggering the SAC" should be changed accordingly or further proof should be provided.

      • This is probably a misunderstanding. We do not conclude that failed microtubule attachment to kinetochores is surely the cause of mitotic defects in cbf11KO. We merely describe our reasoning about structuring the project during its execution. We have rephrased the problematic sentence to improve clarity.

      • We already state in the Discussion that the mitotic defects of cbf11KO may be caused by something completely different from microtubule attachment.

      As pointed out by the authors, cohesion occupancy is affected by the cell cycle phases duration. Therefore, the authors should correct their data (Fig.3C) for the different duration of mitosis or measure cohesion occupancy in mitotically synchronized populations. If this is not possible, I suggest removing this piece of data altogether.

      • We agree (and acknowledge in the paper) that the measurement of cohesin occupancy can be affected by duration of mitotic phases. However we do not see a straightforward way of normalizing for mitotic duration, as cohesin occupancy changes differentially at particular chromosomal loci.

      • The suggested experiment of measuring cohesin occupancy in synchronized mitotic cells would likely help. However, as mentioned in our response to Reviewer 3 above, the cdc25 or cdc10 block-release does not work well in cbf11KO (PMID: 27687771), and the heat shock or drugs (eg. spindle poisons) would introduce confounding issues themselves. Unfortunately, we currently do not have the capacity to perform less disruptive forms of cell cycle synchronization.

      • Since we show that mutations in cohesin loading factors can rescue mitotic fidelity of cbf11KO cells (Fig. 3B), we consider the data shown in Fig. 3C relevant. Therefore, we opt to keep Fig. 3C in the paper, and we do point out the potential limitations of these results in the Results section.

      In Fig. 3A it is not clear what the authors mean by "morphological" differences between WT and cbf11∆ cells or between cbf11∆ cells and cbf11∆wpl1∆ cells. The authors should provide clearer images and indicate for each image which cells show morphological defects as an example.

      • We now use arrows to highlight cells with nuclear defects in Fig. 3A.

      • We now state examples of the cbf11KO-associated morphological defects in the text, together with a reference to the paper describing these defects in detail.

      In Fig. 3A many cells in single or double cbf11∆ mutants show increased size typical of diploid cells. The authors should perform flow cytometry to test for possible diploidization in their mutants, as that would clearly affect any conclusions on mitotic defects rescue or enhancement.

      • We previously published that cbf11KO cells show increased tendency for spontaneous diploidization (PMID: 19101542). When constructing cbf11KO strains, we always take care (including flow cytometry tests of DNA content) to exclude purely diploid clones, but the process of spurious diploidization is continuous and there are always diploid cells present in the cbf11KO culture.

      • We mention diploidization as a possible mitotic outcome in cbf11KO cells in the first section of the Results.

      As correctly pointed out by the authors, it is not clear if the increase in mitotic defects in cbf11∆ cells is entirely due to the perturbed lipid metabolism or to other factors being affected by Cbf11. A possible approach to prove this point, as suggested by the authors too, would be to test if the mitotic defects identified in cbf11∆ are common to other mutants of lipid metabolism that also show an increase in catastrophic mitotic events.

      • We now show ChIP-seq data showing that centromeric H3K9 shows aberrant methylation patterns also in a hypomorphic cut6/ACC mutant (Pcut6MUT) (Fig. 4A, Fig. S3).

      • We previously showed that the Pcut6MUT mutation predisposes fission yeast cells to catastrophic mitosis, and the defects manifest when Cut6 function is further weakened by limiting the supply of biotin (cofactor of Cut6) (PMID: 27687771).

      Also, the authors' statement in the conclusion: "we have demonstrated several novel factors, not directly related to lipid metabolism, that affect mitotic fidelity in cells with perturbed lipid homeostasis" should be modified as it was not proven that these effects are not due to altered lipid metabolism.

      • We agree that “it was not proven that these effects are not due to altered lipid metabolism”. However, the emphasis here is on the word “directly”. H3K9me2 and cohesin dynamics are not directly related to the metabolism of lipids. We have changed the phrasing to improve clarity.

      Minor comments

      The initial distinction (Fig. 1A) between "cut" and "nuclear displacement" phenotypes is somewhat confusing, especially since the authors are not investigating the different outcomes of a catastrophic mitosis. The two outcomes should be grouped together under the definition of "catastrophic mitosis" as it is done in the rest of the paper.

      • We have changed Fig. 1A accordingly.

      I do not think I understand the statement that "SAC abolition might actually suppress the mitotic defects of the cbf11∆ cells". The lack of SAC might aggravate defects in kinetochore-microtubule attachment or other aspects of spindle assembly. If the authors know of specific examples where the deletion of mad2 or the genes encoding other SAC components rescued the mitotic defects, they should cite those papers. Either way, this point needs clarification.

      • We already provide an example in the Discussion:

      “Intriguingly, SAC inactivation has been shown to suppress the temperature sensitivity of the cut9-665 APC/C mutant, which is also prone to catastrophic mitosis (Elmore et al., 2014)”

      • We have now included this reference and explanation also at the point in the text that you are referring to.

      Brightfield images in Fig. 1 would be clearer without the overlap of the fluorescence channels. The authors could also change the contrast of the images to highlight the septum.

      • We have changed Fig. 1B as requested.

      The length of spindle (shown in Fig. S1) is a more informative measurement for mitotic dynamics and should be used instead of the "nuclear distance" presented in Fig. 2.

      • This might be true for a successful mitosis. But in case of defects (such as spindle detachment from the chromosomes, regressive merger of the daughter nuclei), these parameters become partially uncoupled and both are informative. We have therefore included the data from Fig. S1 in new Fig. 2C-D.

      Generally, the authors could improve the data visualization by including in all the plots the single data points distribution along with the mean/median and error bars like it was done in Fig.2 C,D,E.

      • Done.

      Reviewer #4 (Significance (Required)):

      The paper expands the knowledge on Cbf11, a still poorly characterized regulator of lipid metabolism. The idea that in addition to nuclear membrane limitation, perturbations of lipid metabolism might cause mitotic chromosome dynamics defects (for instance, through changing the protein acetylation levels), is interesting, but the authors should strengthen their conclusions by performing controls and further experiments.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      Summary

      In this paper Vishwanatha et al. analyze the mitotic phenotypes of cells lacking a regulator of lipid metabolism Cbf11. They propose that sister chromatid cohesion abnormalities and altered chromatin marks may contribute to the increased incidence of catastrophic mitosis. Additional experiments are required to improve the study and strengthen the authors' conclusions.

      Major Comments

      Both histone and alpha-tubulin tagging are known to aggravate mitotic errors in S. pombe. Before using these markers for live imaging, the authors should quantitate mitotic phenotypes in untagged cbf11∆ cells, as compared to the wild type. Using DAPI and Calcofluor staining (and ideally, also visualizing microtubules using anti- alpha-tubulin antibodies) the authors should measure the percentage of cells in mitosis and the percentage of cells that are going, or just went, through catastrophic mitosis, in asynchronous early-mid-exponential cell populations.

      In analyzing the dynamic of nuclear division, the authors claim that the interval between spindle formation and anaphase onset is "longer" and "more variable" in cbf11∆ cells compared to WT cells. The authors should provide proper statistical analysis of both differences to show that these differences are significant. The same goes for the authors' claim that mitotic duration is "more variable" in cbf11∆ cells compared to WT cells. As mentioned above, alternative estimates of possible perturbations of mitotic dynamics could be obtained by measuring the percentage of cells in different mitotic phases in asynchronous untagged cell populations, in order to avoid possible artifacts given by tagging histones and alpha-tubulin.

      The fact that inactivation of SAC does not change the incidence of catastrophic mitoses shows that SAC is not involved and that there are likely no problems with kinetochore-microtubule attachments. Therefore, the authors' statement "These results suggest that SAC activity only plays a minor role (if any) in the mitotic defects observed in cbf11Δ cells" should be changed. Also, the authors' statement in the conclusion that "This indicates that proper microtubule attachment to kinetochores might be compromised and takes longer to achieve in cbf11Δ cells, possibly triggering the SAC" should be changed accordingly or further proof should be provided.

      As pointed out by the authors, cohesion occupancy is affected by the cell cycle phases duration. Therefore, the authors should correct their data (Fig.3C) for the different duration of mitosis or measure cohesion occupancy in mitotically synchronized populations. If this is not possible, I suggest removing this piece of data altogether.

      In Fig. 3A it is not clear what the authors mean by "morphological" differences between WT and cbf11∆ cells or between cbf11∆ cells and cbf11∆wpl1∆ cells. The authors should provide clearer images and indicate for each image which cells show morphological defects as an example.

      In Fig. 3A many cells in single or double cbf11∆ mutants show increased size typical of diploid cells. The authors should perform flow cytometry to test for possible diploidization in their mutants, as that would clearly affect any conclusions on mitotic defects rescue or enhancement.

      As correctly pointed out by the authors, it is not clear if the increase in mitotic defects in cbf11∆ cells is entirely due to the perturbed lipid metabolism or to other factors being affected by Cbf11. A possible approach to prove this point, as suggested by the authors too, would be to test if the mitotic defects identified in cbf11∆ are common to other mutants of lipid metabolism that also show an increase in catastrophic mitotic events. Also, the authors' statement in the conclusion: "we have demonstrated several novel factors, not directly related to lipid metabolism, that affect mitotic fidelity in cells with perturbed lipid homeostasis" should be modified as it was not proven that these effects are not due to altered lipid metabolism.

      Minor comments

      The initial distinction (Fig. 1A) between "cut" and "nuclear displacement" phenotypes is somewhat confusing, especially since the authors are not investigating the different outcomes of a catastrophic mitosis. The two outcomes should be grouped together under the definition of "catastrophic mitosis" as it is done in the rest of the paper.

      I do not think I understand the statement that "SAC abolition might actually suppress the mitotic defects of the cbf11∆ cells". The lack of SAC might aggravate defects in kinetochore-microtubule attachment or other aspects of spindle assembly. If the authors know of specific examples where the deletion of mad2 or the genes encoding other SAC components rescued the mitotic defects, they should cite those papers. Either way, this point needs clarification.

      Brightfield images in Fig. 1 would be clearer without the overlap of the fluorescence channels. The authors could also change the contrast of the images to highlight the septum.

      The length of spindle (shown in Fig. S1) is a more informative measurement for mitotic dynamics and should be used instead of the "nuclear distance" presented in Fig. 2.

      Generally, the authors could improve the data visualization by including in all the plots the single data points distribution along with the mean/median and error bars like it was done in Fig.2 C,D,E.

      Significance

      The paper expands the knowledge on Cbf11, a still poorly characterized regulator of lipid metabolism. The idea that in addition to nuclear membrane limitation, perturbations of lipid metabolism might cause mitotic chromosome dynamics defects (for instance, through changing the protein acetylation levels), is interesting, but the authors should strengthen their conclusions by performing controls and further experiments.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      In this paper, the authors present convincing experimental proof on why the BH3-only protein PUMA resists displacement by BH3-mimetics, while others such as tBID do not. Using a SMAC-mCherry based MOMP assay on isolated mitochondria, FRET in the presence of liposomes with a phospholipid composition similar to that of mitochondria as well as quantitative fast fluorescence lifetime imaging microscopy (F__�__rster resonance energy transfer - qF3) they show that the C-terminal region of PUMA (CTS), together with its BH3-domain, effectively "double-bolt" locks its interaction with BCL-XL and BCL-2 to resist displacement by the BCL-XL-specific BH3-mimetic A-1155463 or the BCL-2/BCL-XL inhibitor ABT-263 and AZD-4320. Although a similar mechanism has previously been published for BIM, the novel C-terminal binding sequence in PUMA is unrelated to that in the CTS of BIM and functions independent of PUMA binding to membranes. First, in contrast to BIM, PUMA contains multiple prolines and charged residues, and an unusually short span of hydrophobic amino acids, secondly, full length PUMA was more resistant to BH3-mimetic displacement than a PUMA mutant lacking the CTS (PUMA-d26) even in solution suggesting that the CTS of PUMA contributes to BH3-mimetic resistance even in the absence of membranes.<br /> The second, quite unexpected finding of this paper is that, in contrast to previous publications, the CTS of PUMA does not target the protein to mitochondria but to the ER. The authors show this by FLIM-FRET imaging and confocal microscopy, and they created mutants to identify the CTS residues (I175 and P180) that mediate binding to ER membranes.

      The authors did an excellent job to show the mechanism of displacement resistance of PUMA from BCL-2 survival factors from different angles (in vitro, on isolated mitochondria, liposomes and inside living cells), generating respective BH3 and CTS mutants and also domain swaps with other BH3-only proteins such as tBID. Also, the unexpected finding that PUMA primarily localizes to the ER has been extensively scrutinized and the data presented are convincing.

      Response:

      We appreciate the favourable comments and that the reviewer found the data presented convincing.

      Major comments:

      I have only three questions which I like the authors to address before this MS can be published.

      1) How can PUMA perform its pro-apoptotic action on MOMP from its site on the ER? Does PUMA eventually localize to MAMs (mitochondrial/ER contact sites)? Is it possible to co-IP PUMA with BCL-XL or BCL-2 from ER membranes or show such an interaction inside cells with PLA?

      Response:

      The reviewer raises an important point. One of the main conclusions from this paper is that the primary localization of exogenously expressed PUMA is at the ER. Our intent was to highlight the inherent specificity of the PUMA CTS sequence. However, we agree that identifying the localization of PUMA-BCL-XL complexes would add significantly to the manuscript. We carefully considered using co-IP or a proximity ligation assay (PLA) in order to investigate the localization of PUMA-BCL-XL complexes. In our experience the use of co-IP is very difficult to interpret due to the well characterized detergent-induced artifacts previously shown for BCL-2 family protein interactions (PMID: 9553144, PMID: 33794146). Moreover, PLAs are a proximity assay with a detection range of ~>20nm, and are difficult to quantify beyond enumerating frequency (ie counting spots). In contrast, the detection of FRET by fluorescence lifetime imaging microscopy (FLIM) is very sensitive to distance with a maximum that is <10nm, and the results can be interpreted quantitatively as apparent dissociation constants (manuscript Figures 2-3). Therefore we elected to use FLIM-FRET to address this question. We examined PUMA-specific interactions with BCL-XL at the ER and mitochondria by differentially segmenting the FLIM-FRET image data based on the signal from a mCherry-fused landmark expressed at the ER (mCherry-Cb5) or mitochondria (mCherry-ActA). This approach has similar spatial resolution to PLA yet retains more rigorous requirement for proximity and the quantitative interpretability of FLIM-FRET.

      For these experiments we used a recently described the method of mitochondrial image segmentation using hyperspectral image data collected during FLIM-FRET imaging (Osterlund et al., 2023). In this approach, a watershed segmentation algorithm was used to identify mitochondria areas from mCherry-ActA images collected simultaneously with the FLIM data. The ER was identified in separate samples using the same approach with mCherry-Cb5 image data. Simultaneous collection of the images ensures that the data are not affected by movement within the cells. Example images showing the segmentation results for each organelle have been added to the manuscript as Figure 4 - Figure Supplement 2A.

      The results of this FLIM-FRET experiment described in the text lines 581-598, revealed that VPUMA interacts with CBCL-XL within both ER and mitochondria-segmented ROIs (new Figure 4 - Figure Supplement 2B). These results can be explained by the fact that VPUMA is targeted to the ER, and BCL-XL is known to localize to the ER and mitochondria when bound to BH3 proteins in cells (Kale et al., 2018, PMID: 29149100). This result is similar to what we reported for BIK, another ER-localized BH3 protein that exerts its pro-apoptotic function from ER membranes (PMID: 11884414 and PMID: 15809295). Our recent data for ER localized BIK binding to mitochondria-targeted BCL-XL (Osterlund et al., 2023), suggests that, as the referee suggested, binding to occurs via a membrane-spanning interaction at MAMs (ER-mitochondia contact sites) and/or via relocalization of BIK and/or BCL-XL in response to their co-expression (Osterlund et al., 2023). Consistent with these interpretations, when expression of endogenous PUMA was upregulated in response to stress (Figure 4- figure supplement 3A-B), the amount of PUMA increased at both ER and mitochondria (Figure 4- figure supplement 3C). We have presented this data and interpretation on lines 599-621 and discussed the localization results and the similarity to BIK in the manuscript discussion, lines 1029-1035.

      2) Since PUMA seems to be "double-bolt" locked to BCL-2 or BCL-XL via its BH3-domain and CTS, how can it act as a pro-apoptotic inducer? Is its main function to act as an inhibitor of BCL-2 and BCL-XL rather than a direct BAX/BAK activator? And if it acts as a BAX/BAK activator, how can it be released from BCL-2/ BCL-XL, for example by another BH3-only protein which is induced by apoptosis stimulation? Or would in this case PUMA remain bound to BCL-2/ BCL-XL in order to activate BAX/BAK (which would be a kind of new activation mechanism)?

      Response:

      We appreciate the reviewers queries and have clarified the text to indicate that our interpretation is that by binding to BCL-XL, PUMA releases active BAX that is sequestered by BCL-XL (as shown in Figure 1A for purified proteins). Double bolt locking increases both affinity and avidity of PUMA for BCL-XL enabling competition to favor PUMA binding and displacement of sequestered BAX. To further address the reviewers point we added two additional experiments now shown in figure supplements to Figure 1. The data shown in new Figure 1 – figure supplement 1A (described on lines 182-191 of the revised manuscript) demontrates that PUMA kills HCT116 and BMK cells but not HEK293 cells. New Figure 1 – figure supplement 1B shows that inhibition of BCL-XL and MCL-1 using BH3 mimetics is sufficient to kill HCT116 and BMK cells while HEK293 cells are not killed by even high concentrations of these BH3 mimetics. To kill HEK293 cells requires activation of BAX (described on lines 191-201). Together this data indicates that the primary pro-apoptotic function of PUMA is inhbiting BCL-XL and MCL-1 rather than by activating BAX. This data fits very well with PUMA double-bolt locking resulting in very tight binding of PUMA to BCL-XL and likely MCL-1 as the primary mode of PUMA mediated induction of cell death, at least in the three cell lines investigated here. The importance and role of PUMA mediated BAX activation is an interesting area of active investigation that is beyond the purview of the current paper.

      3) Is PUMA still bound to the ER when it is transcriptionally induced by genotoxic stress. In this case, the extra amount of PUMA produced is supposed to directly activate BAX/BAK. Does it do this on the ER or on mitochondria?

      Response:

      The referee raises a very interesting point.

      Interestingly, Zheng et al., 2022 highlighted a P53-dependent death response to genotoxic stress, which results in the extension of peripheral, tubular ER and promotes the formation of ER-mitochondria contact sites (PMID: 30030520). Furthermore, PUMA is transcriptionally activated by P53 (PMID: 17360476). Therefore, we hypothesized the induction of PUMA would increase the fraction of PUMA at ER membranes and MAMs. As the latter resemble mitochondria in micrographs of cells we anticipated an increase in apparent mitochondrial localization. To address this question experimentally, we treated MCF-7 cells with genotoxic stress and ER stressors and tracked the expression of endogenous PUMA by immunofluorescence. The results are described in the manuscript (line 603-613, page 28) and shown in Figure 4 figure supplement 3.

      The immunofluorescence data confirmed that PUMA protein levels increase after genotoxic stress, as expected (Reference 39, 40 in the manuscript) and to a lessor but still significant extent after ER stress (Figure 4 figure supplements 3A and B). In response to stress the amount of PUMA increased at both ER and mitochondria, however, in unstressed cells the endogenous Puma co-localized more to the mitochondria than to the ER (Figure 4- figure supplement 3C). This suggests that similar to BIK localization of PUMA is dynamic. In particular, the abundance and localization of PUMA binding partners such as BCL-XL also affects PUMA localization (the new data are described on pages 27-28, Lines 591-621). As described above, the extra PUMA induced by genotoxic stress can indirectly activate BAX by binding BCL-XL and displacing sequestered activated BAX. Our FLIM-FRET data suggest PUMA can bind BCL-XL at both the mitochondria and the ER. Moreover, given the expansion of ER-mitochondrial contact sites that occurs during stress we cannot rule out the possibility that ER-localized PUMA can inhibit mitochondria-localized anti-apoptotic proteins (both BCL-XL and MCL-1) at the ER (for BCL-XL)and MAMs for both proteins.

      Reviewer #1 (Significance):

      Very significant contribution to the field. Quite novel

      Reviewer #2 (Evidence, reproducibility and clarity):

      This study by Pemberton and colleagues investigates interactions of pro-apoptotic PUMA with anti-apoptotic BCL-2 proteins, employing a variety of BH3-mimetics. The authors demonstrate that the PUMA/aa BCL-2 interactions are mediated not only via BH3-domain/groove interactions, but also dependent on a C-terminal sequence of PUMA. This mirrors (with distinct differences) what the authors have previously reported for BIM. They then, reveal that unexpectedly PUMA is often localising to the ER (as opposed to mitochondria), though this localisation is not important for the resistance of PUMA/BCL-2 complexes to BH3-mimetic treatment, authors speculate that ER localised PUMA may have a day job.

      In my opinion, the study is important for several reasons, not least it strongly argues that BH3-mimetics are not optimal (in themselves) to promote apoptosis dependent on PUMA, and that approaches to disrupt the "double-lock" mechanisms should be sought - this has clear clinical importance, but equally important is it adds a new layer of complexity to how BCL-2 family members "work", how the double-lock mechanism is overcome in physiological apoptosis remains an open question, for instance. The data support the authors' conclusions, I have a few points that could be addressed.

      Response:

      The positive comments from the reviewer are greatly appreciated.

      1 - The authors data in cells is consistent with a membrane recruitment effect of the PUMA CTS making a contribution to the resistance of PUMA/aa BCL-2 complexes to BH3-mimetics. What I found really intriguing, is that the CTS also influences affinity in the absence of membranes (Figure 1) - could the authors speculate why they think CTS may be affecting PUMA/aaBCL-2 binding in the absence of membranes ?

      Response:

      We agree with the reviewer that membrane binding contributes to BH3 mimetic resistant binding of PUMA to BCL-XL consistent with elegant data presented previously (Pécot et al., 2016; PMID: 28009301). However, we show in Figure 5D that mutants of VPUMA-d26 with restored membrane binding (VPUMA-d26-ER1 and VPUMA-d26-ER2) remain sensitive to BH3-mimetic displacement, indicating that membrane binding alone is not sufficient to confer resistance to BH3-mimetics. Furthermore, as the reviewer pointed out BH3 mimetic resistant binding is observed in the absence of membranes (Figure 1).

      The data using purified proteins strongly suggests that the CTS of PUMA binds to BCL-XL and is directly involved in the protein-protein interaction. The fact that PUMA with the C-terminal fusion to the fluorescent protein Venus (PUMAV) still localizes to membranes in live cells (Figure 4 D,E) suggests that the C-terminus of PUMA does not span the membrane bilayer. Instead, we hypothesize that the C-terminus of PUMA binds peripherally to the membrane making it available to physically contribute to a protein interaction with anti-apoptotic proteins. This interpretation is consistent with the low hydrophobicity and high proline content (6 of 28 residues) of the amino acid sequence of the PUMA CTS as shown in Figure 6 and compared to the transmembrane tail anchor sequences of other proteins, including the BH3-protein BIK, in Figure 5 supplement 1. Binding of Bcl-XL by both the BH3 region and CTS of PUMA would increase both the affinity and avidity of the interaction. The presentation of this data has been revised to add clarity on pages Page 8, lines 215-223 and in the discussion (Lines 988-997 and 1044-1050).

      2 - A minor point for clarification, are the mitochondria used in Fig 1A from BAX/BAK DKO cells ? - I had presumed so given exogenous BAX was added, but didn't note this in the text.

      We indeed use mitochondria from BAX/BAK DKO cells and exogenous recombinant BAX in Figure 1A. This has now been added to the text on lines 166-180.

      Reviewer #2 (Significance):

      detailed in report above

      Reviewer #3 (Evidence, reproducibility and clarity):

      In this paper, Pemberton et al show that PUMA resists BH3-mimetic mediated displacement from BCL-XL via a novel binding site within its C-terminus of PUMA termed CTS (the last 26aa). Interestingly, the CTS of PUMA directs the protein to the ER membrane and residues I175 and P180 within the CTS are required for both ER localization and BH3-mimetic resistance.

      Specific comments:<br /> 1 - BH3-mimetics kill cells by displacing sequestered pro-apoptotic proteins to initiate apoptosis. However, PUMA resists BH3-mimetic mediated displacement, and PUMA-d26 and PUMA I175A/P180A (CTS) do not. Thus, are these mutants sensitive to BH3-mimetics cell killing? In other words, do BH3-mimetics kill PUMA-/- cells that express either PUMA-d26 or PUMA I175A/P180A but not PUMA-/- cells that express wild type PUMA?

      Response:

      The reviewer raises a very interesting question that unfortunately we have been unable to address unambiguously. To answer this question requires separating the effects of PUMA on anti-apoptosis proteins and on activation of BAX and BAK as exogenous expression of express either PUMA-d26 or PUMA I175A/P180A is sufficient to kill PUMA-/- cells without the addition of a BH3 mimetic. To date we have been unable to identify mutants that inhibit anti-apoptotic proteins but that do not activate BAX and BAK as both PUMA-d26 and PUMA I175A/P180A have impaired BAX-activation function. This is additionally complicated by PUMA mediated inhibition of MCL-1, BCL-2 and BCL-W. Further, it isn’t possible to separate the function(s) using BAX/BAK knock-out cells because then PUMA induced cell death is completely abrogated. Understanding the direct activation of BAX by PUMA is an area of current investigation that is out of the scope of this paper as here we are focused on the interaction(s) of PUMA with anti-apoptotic proteins.

      2 - The authors elegantly demonstrate using microscopic analysis that over expressed PUMA mostly localizes to the ER membrane. Since this is a major conclusion in the paper which is different than previously reported, the authors should confirm these findings using sub-cellular fractions followed by Western blot analysis. They should demonstrate that endogenous and over-expressed PUMA are mainly localized to the ER membrane and that the PUMA-d26 and PUMA I175A/P180A are mainly localized to the cytoplasm.

      Response:

      We appreciate that the reviewer found the microscopic analysis convincing. We also tested the idea of sub-cellular fractionation proposed by the reviewer.However, we have found it to be very difficult to separate mitochondria and MAMs. To address the question raised we instead performed new co-localization experiments,in addition to those reported for PUMA-d26 and the point mutants in Figure 6 (images in Figure 6 - figure supplement 3). The new experiments areforendogenous PUMA at steady state and with increased expressed in response to stress. These immunofluorescence experiments are reported in Figure 4 -figure supplements 3. We also added FLIM-FRET experiments in which ROIs were derived from areas of the cell enriched in either ER or mitochondria(Figure 4 - figure supplement 2). The results of these experiments indicate that PUMA localization is dynamic and are described in detail above in response to reviewer 1 question 3 and in the manuscript from line 579 to 621 and discussed on lines 1029-1036.

      Reviewer #3 (Significance):

      The advance in this paper is significant and the paper should be published once the specific comments are adequately addressed

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      RC-2022-01805

      We thank all reviewers for their careful analysis of our manuscript, constructive suggestions and support of our work.

      Reviewer 1

      The authors show that proximity of early mouse embryo blastomere chromosomes to the cell cortex activates the Polar Body Extrusion pathway to generate cell fragments. The authors use live cell imaging in control and Myo1C and dynein knockdown embryos to document accumulation of actin and myosin near chromosomes that come in close proximity to the cell cortex, which correlates with the increased fragmentation of the mutant blastomeres. The live imaging data are nicely presented and the results are well quantified. I have two major comments, and some minor comments on clarity, for the authors consideration in revising the manuscript.

      Major comment:

      1. The authors imply that Myo1 and dynein knockdowns result in an increase in the number of cells where chromosomes come in close proximity to the cell cortex. Apparently the spindle anchoring defects are meant to indicate that such defects are responsible for the increased frequency of abnormal chromosome proximity to the cortex. But the authors never actually document whether chromosomes in fact do come into proximity to the cortex more often in the mutant than in control embryos. The authors should clarify if they think the spindle anchoring defect does result in abnormal chromosome distributions. Can the authors somehow quantify a defect in overall chromosome positioning in mutant vs control blastomeres? Presumably the movies the authors already have could be used to provide such quantification?

      We thank the reviewer for this opportunity to correct our previous assumptions. Following the reviewer’s suggestion, we tracked the distance between the cell surface and the center of the chromosomes cluster throughout mitosis. We found little difference in this distance between control and Myo1cKO embryos (Fig S3a), unlike what we had initially implied. This distance seemed more variable in Myo1cKo embryos than in control ones, suggesting that chromosome movements may be more erratic but analysis of this variation for individual cells did not show consistent differences between control and Myo1cKO embryos either (Fig S3b). Therefore, we cannot explain the increased signaling with differences in proximity of the chromosomes to the cortex during mitosis.

      Instead, as already hinted in our initial manuscript as an additional factor, we find that signaling from chromosomes to the cortex can occur for an extended time in embryos with impaired spindle anchoring.

      We had already measured that mitotic spindles persisted for a longer time in Myo1cKO embryos than in control ones (Fig 2b), as well as in ciliobrevin treated embryos as compared to DMSO treated ones (Fig S2b). To strengthen this data, we performed additional experiments in which we injected mRNA encoding fluorescent lamin-associated protein 2b (Lap2b-GFP) to track the breakdown and reassembly of the nuclear envelope. Consistent with the mitotic spindle persisting for a longer time in Myo1cKO embryos than in control ones, it generally takes more time for Myo1cKO embryos to reassemble their nuclear envelope than for control embryos (50 min vs 70 min, n = 8 control and 15 Myo1cKO embryos, p = 0.0161, Fig S3c-d, Movie 5). Taken together, the nuclear envelope and spindle data indicate that, although chromosomes are not closer to the cortex in Myo1cKO embryos than in control ones, they spend more time outside of the nucleus. This should give chromosomes extended opportunities to signal to the cortex and explains how difficulties with chromosome separation can lead to the hyper-activation of the polar body extrusion pathway.

      We have revised our manuscript accordingly.

      Near the end of the paper, the authors discuss how cell with bent/un-anchored spindles are more prone to fragmentation, referring to Figure 2. But Figure 2 does not document a correlation between blastomeres with bent spindles and increased fragmentation. Rather it shows an increase in bent spindles and in fragmentation in mutant vs control, but does show that they occur together. The authors should more accurately describe their results or provide such a correlation with additional data.

      We thank the referee for pointing out this missing information.

      To support our conclusions, we now provide additional analyses of mitosis duration in non-fragmenting and fragmenting cells from Myo1cKO embryos. When cells fragment, their mitosis is consistently longer, as measured from the persistence of the mitotic spindle, than when not fragmenting (Fig 2c). This provides a direct correlation between spindle defects and fragmentation.

      We now present these analyses in the revised manuscript.

      Finally, in describing the data in Figure 3, the authors refer to persistence of the spindle and bending of the spindle as indicating problems with anchoring. It is not clear to me how either spindle persistence or bending relate to anchoring. The authors should explain how they are related if they are, and it would be better if the authors could document spindle displacement relative to the cell center or cortex to make their point more directly that anchoring is defective.

      We apologize for not making this clearer in our initial manuscript. As others noted before (Kotak et al, 2012; Mangon et al, 2021), poorly anchored spindles show larger displacements or rotations during mitosis. Spindle persistence and bending may not be directly related to spindle anchoring defects but could reflect broader issues with spindle assembly and function caused by spindle anchoring defects. Since a previous in vitro study had identified that Myo1cKO is important for spindle anchoring (Mangon et al 2021) and that ciliobrevin, known for compromising spindle anchoring, phenocopied these aspects, we had initially focused on anchoring defects in our conclusions. We still stand by our conclusion that our data suggest spindle anchoring defects. Nevertheless, we agree that our observations report more general spindle defects and that anchoring may be only one of the defective aspects. Instead of “spindle anchoring defects”, we now simply mention “spindle defects” unless specifically discussing spindle straightness and rotation.

      Minor comment.

      The authors document in Figure 3 that Myo1C KO blastomeres have an enhanced response, with more myosin accumulating at the cortex in response to chromosomes. Why does knocking out one non-muscle myosin lead to enhanced accumulation of another? The authors note this effect but provide no discussion as to how it occurs. Some clarification might be helpful.

      In our manuscript, we report that chromosome proximity to the cortex is associated with Cdc42 activation, which leads to cortical actin recruitment (Fig 4a-d). We also observe that non-muscle myosin II (Myh9) is recruited to the cortex when chromosomes come near (Fig 3d-f). Importantly, these phenomena occur in control embryos as well and not only in Myo1cKO embryos.

      We propose that this recruitment is further increased in Myo1cKO embryos (Fig 3f) because chromosomes spend more time outside of the nuclear envelope (Fig 2). This leads to fragmentation and is not specific to Myo1cKO since the same occurs after ciliobrevin treatment (Fig S2).

      The authors provide a significant advance in our understanding of why early mammalian embryos, especially early human embryos, are so prone to fragmentation. Their data strongly support their conclusion that increased proximity of chromosomes to the cortex does lead to activation of the PBE response, which is an interesting and well documented finding. However, unless the authors can address my major comments and provide more direct evidence for increased displacement of chromosomes being responsible for increased fragmentation, they should revise their manuscript to acknowledge that they have not directly quantified chromosome positioning and thus do not conclusively document that it is responsible for increased fragmentation in the mutant oocytes.

      We thank the reviewer for their thorough analysis of our data and for giving us the opportunity to correct some of the aspects of our study.

      Reviewer 2

      The manuscript "Ectopic activation of the polar body extrusion pathway triggers cell fragmentation in preimplantation embryos" by Pelzer and colleagues is focused on mechanism of cell fragmentation in early preimplantation embryos. This is an important issue, since fragmentation, with subsequent cell loss, has significant impact on early development of human embryos in vitro.

      To study the cell fragmentation within the embryo, authors used mouse model system. However, since during the mouse preimplantation development blastomere fragmentation is less frequent than in human embryos, they used knockout of unconventional myosin-Ic to induce fragmentation of embryonic blastomeres with higher frequency and a similar morphology, known from human embryos.

      Using their Myo1c KO, authors confirmed previous observation that reduction of myosin-Ic impairs spindle anchoring and they further show that the defects in spindle anchoring are linked to cell fragmentation. And that similar defects could be induced by chemical inhibition of dynein. Importantly, the defects in anchoring, causing aberrant spindle movements, bring spindle and chromosomal DNA to the proximity of the cell cortex. This induces local changes in concentration and organization of actin and myosin IIA and leads into fragmentation. Authors show that this pathway shares similarity with mechanism of polar body extrusion (PBE) during meiosis, namely that it requires active Cdc42-mediated actin polymerization or Ect2 signaling. And also, that important role in cell fragmentation is played by cell surface tension. Based on their results, authors propose that cell fragmentation within the embryo is triggered either by hyperactivation of PBE pathway in cells with normal surface tension, or by PBE pathway activation in cells with higher contractility.

      This manuscript brings important information about mechanism, which might contribute to the high incidence of blastomere fragmentation in human embryos. I have not identified any important issues with experimental work or conclusions and therefore I recommend this paper for publication. The results from the mouse model system however need to be verified by further studies in human or similar embryos, which naturally exhibit higher fragmentation.

      We thank the reviewer for their careful examination of our manuscript and data.

      We agree that it would be important to verify the validity of our findings in other species. We have considered performing experiments with human embryos.

      Ideally, we would need embryos in their early cleavage stages (zygote to 4-cell stages) to be able study fragmentation without perturbing morphogenetic movements, which begin at the 8-cell stage. Such early embryos are particularly rare, which further requires careful experimental design.

      Ideally, such carefully designed experiment would not cause additional fragmentation (as we have mostly done in the present study) but rather reduce this deleterious process. In light of our experiments shown in Fig 4c-d, inhibiting Cdc42 would be a good way to reduce polar body extrusion signaling. Injection of DNCdc42 mRNA would be embryo-consuming to setup. We tried a Cdc42 chemical inhibitor on mouse embryos with unreliable results. Therefore, we do not yet feel confident in using precious human embryos with our currently available options.

      Another complication is administrative since this project was funded by the ERC, which does not allow experimentation with human embryos.

      As for studying the phenomenon in species other than mouse or human, we currently have limited access to other mammalian species. Generally, other mammalian embryos are less well characterized and, in particular, the species-specific fragmentation behavior would need to be characterized before initiating any attempt to reduce it.

      We hope that the reviewers will agree that the current manuscript, describing and dissecting a previously unknown mechanism, makes sufficient advances to be published without the need to assess its evolutionary conservation.

      This study revealed important mechanism, which might be responsible for inducing fragmentation of blastomeres in early preimplantation embryos. Authors use mouse knockout model system and therefore the results should be verified in other species, in which the embryos show higher fragmentation naturally. The manuscript provides evidence that pathway, leading into PBE in oocytes, remains operational also in embryos and might contribute to blastomere fragmentation in case when spindle loses anchoring to the membrane. The results of this manuscript should be of interest not only to the researchers in reproduction, but also to the general audience.

      Reviewer 3

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The manuscript discussed interesting and relevant topics in which the Authors addressed the effects of mouse Myo1C knock out on cell fragmentation and spindle anchoring defects. The authors found that fragmentation occurs in mitosis after ectopic activation of actomyosin contractility by signals emanating from DNA.

      Reviewer #3 (Significance (Required)):

      This is an excellent report dealing with significant technical methodologies. I find no fault in the methods, data analysis, or conclusions. I only have two comments. First, the authors should expand on the previous findings about the of the role of Myo1c during early preimplantation development. Second, the discussion should be expanded to compare the results of this study with those of previous/related studies (e.g., other factors involve in fragmentation and spindle anchoring). Finally, I was not able to open movie#2 and movie#8 so they may need to be re-uploaded.

      We thank the reviewer for their careful assessment of our study.

      We apologize for not discussing enough the previous research on Myo1c. To our knowledge, there is only one previous study reporting the effect of a point mutation on Myo1c on mouse ear physiology (Stauffer et al 2005). This is the first study on the role of Myo1c during mouse development. At this point, we would like to stress that our study, while partially based on the KO of Myo1c, is about cell fragmentation, which we induce experimentally in three independent ways: Myo1c KO, ciliobrevin treatment or Ect2 overexpression.

      Regarding fragmentation, to our knowledge there is simply no convincing mechanism to explain this phenomenon. One study proposed that membrane threads connecting the cell surface to the zona pellucida could pull on cells and promote fragmentation (Derick et al 2017). However, fragmentation also occurs without zona pellucida, and hence without threads pulling on cells’ surfaces (Yumoto et al 2020). Other than that, fragmentation was associated with mitosis and general cytoskeleton defects, without no clear mechanism (Alikani 1999, Fujimoto et al 2011, Daughtry et al 2019).

      We have now expanded these discussions.

    1. General comments:

      This study carefully delineates the role of magnesium in cell division versus cell elongation. The results are really important specifically for rod-shaped bacteria and also an important contribution to the broader field of understanding cell shape. Specifically, I love that they are distinguishing between labile and non-labile intracellular magnesium pools, as well as extracellular magnesium! These three pools are really challenging to separate but I commend them on engaging with this topic and using it to provide alternative explanations for their observations!

      A major contribution to prior findings on the effects of magnesium is the author’s ability to visualize the number of septa in the elongating cells in the absence of magnesium. This is novel information and I think the field will benefit from the microscopy data shown here.

      I completely agree with the authors that we need to be more careful when using rich media such as LB. It is particularly sad that we may be missing really interesting biology because of that! It’s worth moving away from such media or at least being more careful about batch to batch variability. Batch to batch variability is not as well appreciated in microbiology as it is for growing other cell types (for example, mammalian cells and insect cells).

      For me, the most exciting finding was that a large part of the cell length changes within the first 10min after adding magnesium. The authors do speculate in the discussion that this is likely happening because of biophysical or enzymatic effects, and I hope they explore this further in the future!

      I love how the paper reads like a novel! Congratulations on a very well-written paper!

      Kudos to the authors for providing many alternative explanations for their results. It demonstrates critical thinking and an open-mind to finding the truth.

      Specific comments:

      Figure 2C → please include indication of statistical significance

      Figure 3C → please include indication of statistical significance

      Figure 6A → please include indication of statistical significance

      Figure 8B → please include indication of statistical significance

      Figure S1B → please include indication of statistical significance

      Figure S3B → please include indication of statistical significance

      For your overexpression experiments, do the overexpressed proteins have a tag? It would be helpful to have Western blot data showing that the particular proteins are actually being overexpressed. I think the phenotypes that you observe are very compelling so I don’t doubt the conclusions. Western blot data would just provide some additional confirmation that you are actually achieving overexpression of UppS, MraY, and BcrC.

      Questions:

      Based on your data, there are definitely differences in gene expression when you compare cells grown in media with and without magnesium. Because the majority in cell length increase occurs in such a short time though (the first 10min), I was wondering if you think that some or most of it is not due to gene expression? Do you have any hypotheses what is most likely to be affected by magnesium? Do you think if the membrane may be affected?

      Why do you think less magnesium activates this program of less division and more elongation? Additionally why is abundant magnesium activating a program of increased cell division and less elongation? Do you think there is some evolutionary advantage, especially considering how important magnesium is for ATP production?

      Related to this previous question, I also wonder if this magnesium-dependent phenotype would extend to other unicellular organisms, may be protists or algae? That would be a really exciting direction to explore!

      Regarding the zinc and manganese experiments, why do you think they lead to additional phenotypes compared to magnesium? Do you have any hypotheses?

      Regarding your results that Lipid I availability may be a major a problem for the cell division in the absence of magnesium, do you think that is due to effects magnesium has on the enzymes directly, or do you think magnesium affects the substrate availability/conformation by coordinating the phosphate groups? Or something else, may be membrane conformation?

    1. Author Response

      Reviewer #1 (Public Review):

      This study demonstrates that Chinmo promotes larval development as part of the metamorphic gene network (MGN), in part by regulating Br-C expression in some tissues (exemplified in the wing disc) and in a Br-C independent manner in other tissues such as the salivary gland. I have included below the following comments on the submitted version of this manuscript:

      1) The authors have shown experimentally that Chinmo regulates Br-C expression in the wing disc but not the larval salivary gland. Based on this, they posit that Chinmo promotes larval development in a Br-C-dependent manner in imaginal tissues and a Br-C-independent manner in other larval tissues. This generalization of Chinmo's role in development would be more compelling if the relationship between Chinmo and Br-C were explored in other examples of imaginal/larval tissues.

      We agree with the referee that confirmation of our observations in other tissues might help to generalize Chinmo’s role. To this aim, we have analyzed the role of chinmo in an additional larval, the larval tracheal system, and imaginal tissue, the eye disc. Consistent with the results reported in the manuscript, we found that the mode of action of Chinmo is conserved, as depletion of Br-C in the eye disc is able to rescue the lack of chinmo, whereas in the tracheal system it is not. We included this new information in the main text and in new SFigures 1 and 3.

      2) Chinmo, Br-C, and E93 have all been shown to be EcR-regulated in larval tissues, including the brain and wing disc (as in Zhou et al. 2006, Dev Cell; Narbonne-Reveau and Maurange 2019, PLOS Biology; Uyeharu et al. 2017, ). It would be interesting (and I believe relevant to this study) to know whether the roles of these factors in their respective developmental stages are EcR-dependent and whether their regulation by EcR (or lack thereof) depends on whether the tissue is larval or imaginal.

      Although the relevance of EcR on the regulation of the genes that conform the metamorphic gene network has been already established, a different response of EcR-mediated signalling of these genes in larval and imaginal tissues is still not properly addressed. Finding this possible different output of the EcR signalling would be very interesting. However, we think that this is out of scope of this report as the main aim of this study was to determine the main role of the temporal genes during development and their repressive interactions.

      3) In the chinmo qPCR analysis shown in Fig1A, whether animals were sex-matched or controlled was not indicated. Since Chinmo has a published role in regulating sexual identity (Ma et al. 2014, Dev Cell; Grmai et al. 2018, PLOS Genetics), and since growth/body size is known to be a sexually dimorphic trait (Rideout et al. 2015, PLOS Genetics), it seems important to establish whether the requirement of Chinmo for larval development and/or growth. I recommend either 1) controlling for sex by repeating qPCRs in Fig 1A in either males or females, or 2) reporting male/female chinmo levels at each stage side-by-side.

      As the referee pointed out, chinmo has been related to sexual identity raising the possibility of a different effect of chinmo in growth of males and females during development. However, several observations discard this option. First of all, the role of chinmo in sexual identity has been only reported in adult testis and specifically in cyst stem cells. In fact, specific mutations of chinmo that only affects the expression of chinmo in testis, do not affect testis formation but its maturation, suggesting a role of chinmo in sex determination specifically in the testis cyst stem cells (Ma et al. 2014, Dev Cell; Grmai et al. 2018, PLOS Genetics). Second, it has been described a sex dependent growth rate during larval development (Rideout et al. 2015, PLOS Genetics; Sawala A. and Gould AP, PLoS Biol, 2017). However, the main difference in growth rate between males and females is found in L3 larvae (Sawala A. and Gould AP, PLoS Biol, 2017), when the expression of chinmo strongly declines in both males and females, indicating that chinmo impact on sex dimorphism during larval development might be at least, limited.

      Thus, considering that, based on our results, chinmo exerts its main role in larval tissue growth during L1 and L2 stages and that body growth is practically identical in male and female during these stages (Sawala A. and Gould AP, PLoS Biol, 2017), we can assume that chinmo might not contribute to sexual body size dimorphism.

      Nevertheless, we would like to clarify that we have performed the measurements of chinmo expression always in females, when sex identification was possible, namely in L3 larvae. L1 and L2 larvae qPCRs were not sex-discriminated as sex identification was not possible in our conditions.

      4) In Fig2E, the authors show that salivary gland secretion (sgs) genes are repressed in salivary glands lacking chinmo. Sgs genes are expressed during late larval stages as the animal prepares to pupate. Thus, based on the proposed model where Chinmo promotes larval development and represses the larval-to-pupal transition, one might expect that larval salivary glands lacking chinmo would express higher than normal levels of sgs genes. This expectation directly opposes the observed result - it would be helpful to speculate on this in the interpretation of results.

      This is an interesting observation. As Sgs genes are regulated by Br-C (Duan et al. Cell Reports 2020), precocious expression of this transcription factor in chinmo depleted animals might result in an early activation of those genes. Interestingly, we were not able to detect any Sgs genes expression in chinmo depleted salivary glands. We think that this is due to the fact that in absence of chinmo, this organ does not properly develop and mature, and therefore it is unable to express Sgs genes. Proof of that is that the double knockdown of Br-C and chinmo shows the same dramatically low levels of those genes. Altogether, these results strongly suggest that SGs lacking chinmo expression are unable to grow and synthesise Sgs proteins, even in the premature presence of Br-C. We discussed this point in the main text of the edited Ms. Please also see the response to referee 2.

      Reviewer #2 (Public Review):

      The evolution and control of the three-part life history of holometabolous insects have been controversial issues for over a century. While the functioning of broad as a master gene controlling the pupal stage and of E93 as a master gene for the adult stage has been known for about a decade or more, chinmo has only recently been proposed as being the master gene responsible for maintaining the larval stage (Truman & Riddiford, 2022). While the former paper focused on the embryonic and early larval function of Chinmo, this paper explores its metamorphic effects and defines the roles of Broad and E93 in the phenotypes produced by manipulations of Chinmo expression.

      Overall, the paper is well presented but in places, readers would be helped if the authors were more explicit about the logic and details of their manipulations. There are a couple of conceptual issues that the authors should address.

      The role of Broad in larval tissues:

      One intriguing issue relates to the relationship of Chinmo to Broad and E93 in larval versus imaginal tissues prior to metamorphosis. The knock-down of chinmo in imaginal discs results in severe suppression of growth and the lack of metamorphic patterning genes such as cut and wingless. Normal growth and patterning are reestablished though, if broad is also knocked-down, supporting the notion that the effects of the lack of Chinmo are mediated through the premature expression of Broad.

      In the salivary glands, by contrast, chinmo knock-down suppresses growth, and this growth suppression is not reversed by simultaneous broad knockdown. They properly conclude that the role of Chinmo in supporting the growth of larval tissues does not involve Broad, but their data on the expression of salivary gland proteins suggest that Broad still plays some role in Chinmo function in salivary glands. Fig. 5E shows the levels of various salivary glue proteins in the glands of Chinmo knock-down larvae. The levels are reduced, as expected by the lack of salivary gland growth, but a significant finding is that they are there at all! The Costantino et al. (2008) paper shows that these genes are only induced in the mid-L3. Ecdysone, acting through Broad isoforms, is necessary for their appearance and these SGS genes can be induced in the L1 and L2 stages by ectopic expression of some Broad isoforms. Their low levels in Fig 5, would be due to the small size of the gland, but the gland's premature expression of Broad likely causes their induction. In larval cells, then, Chinmo may feed into two parallel pathways, one that does not involve broad and regulates growth and the other, utilizing Broad, regulating premetamorphic changes.

      It would be useful to look at early larval salivary gland proteins such as ng-1 to -3 that are expressed in salivary glands before the critical weight. Also, it would be interesting if the appearance of the SGS proteins after chinmo knock-down (Fig 5E) is abolished by simultaneous knock-down of broad.

      This is an interesting observation. We think that the main problem has derived from the way we presented the data. Our results showed that depletion of chinmo in the SGs dramatically impairs the induction of Sgs gene expression, even with the premature presence of Br-C, which has been shown to be responsible for Sgs expression (Duan et al. Cell Reports 2020). The confusion might come from the way we presented the level of expression of those genes. In fact, the levels of Sgs in both chinmoRNAi and chinmoRNAi/Br-CRNAi SGs were virtually undetectable, suggesting that chinmo in the SG is not only required for Br-C repression but also for proper development of the gland. We believe that based on the fact that the very low levels of expression of Sgs genes in chinmo depleted SGs are still detected in the double knockdown chinmoRNAi/Br-CRNAi. Dramatically reduced expression of the early larval SGs ng1-3 genes in chinmoRNAi and double knockdown chinmoRNAi/Br-CRNAi supports this statement. Altogether these results suggest that Br-C is necessary but not sufficient for the expression of those specific SGs genes. We have changed the plots in Figure 2 and 3 to clarify this point and added the levels of expression of ng1-3.

      Role of Chinmo and Broad in Hemimetabolous insects:

      In the conclusion of their comparative studies on the cockroach (line 342), the authors state that Broad exerts no role in the development of hemimetabolous insects. However, this conclusion is not consistent with the literature. The first study of broad knockdown in a hemimetabolous insect was in the milkweed bug Oncopeltus fasciatus by Erezyilmaz et al. (2006). Surprisingly to Erezyilmaz et al., broad knock-down in early-stage nymphs did not cause premature metamorphosis. However, Broad expression was essential for tissues of the wing pads and dorsal thorax to undergo morphogenetic growth (rather than simple isomorphic growth), and for stage-specific changes in coloration through the nymphal series (but not for the nymph to adult color change). A similar function for Broad on wing growth during the later nymphal stages was later shown in Blattella (Fernandez-Nicolas et al., 2022; Huang et al., 2013). The wing- and genital pads represent "imaginal" tissues in the nymph and the need for Broad in these tissues are the same as seen in imaginal discs as the latter shift from isomorphic growth to morphogenesis at the critical weight checkpoint in the L3. This would suggest that important roles for Broad and E93 are already established in the hemimetabolous insects with E93 controlling the shift from immature (nymphal) to adult phenotypes and Broad controlling the premetamorphic growth of imaginal tissues in early-stage nymphs. Chinmo might then be needed to keep both in check.

      We are sorry for not having dealt with these observations in the submitted manuscript. We have taken them into consideration in the new version to discuss about the role of Br-C in the transition from hemimetabolous to holometabolous.

    1. Proctor, as though a secret arrow had pained his heart: Aye. Trying to grin it away – to Hale: You see, sir, between the two of us we do know them all. Hale only looks at Proctor, deep in his attempt to define this man, Proctor grows more uneasy. I think it be a small fault.

      Proctor's defensive reaction, "I think it be a small fault" has a dramatic irony of its own since, as an audience, we cannot help thinking that although forgetting the commandment may be an excusable error, committing the sin itself is a far more grievous matter; one which has brought disaster not merely to the Proctors but to Salem as a whole.

    1. Author Response

      Reviewer #1 (Public Review):

      This paper presents a thorough biochemical characterization of inferred ancestral versions of the Dicer helicase function. Probably the most significant finding is that the deepest ancestral protein reconstructed (AncD1D2) has significant double-stranded RNA-stimulated ATPase activity that was lost later, along the vertebrate lineage. These results strongly suggest that the previously known differences in ATPase activity between extant vertebrates and, for example, extant arthropods is due to loss of the ATPase activity over evolutionary time as opposed to gains in specific lineages. Based on their analysis, the authors also "restore" ATPase function in the vertebrate dicer, but they did so by making many (over 40) mutations in the vertebrate protein, and it is not clear which of these many mutations is required for the restoration of the activity. Thus, it is difficult to discern how the results of this experiment relate to the evolutionary history.

      We completely agree with this reviewer's assessment of our paper. Our Michaelis-Menten analyses raised the intriguing idea that loss of ATPase activity in the helicase domain of the vertebrate ancestor may indicate loss of the ability to couple dsRNA binding to formation of the active conformation. Our rescue experiments support this idea, albeit in future studies we hope to create an active ancestor with fewer amino acid changes. While the rescue experiments validate what these analyses told us, as the reviewer suggests, they do not themselves inform on the evolutionary history.

      A criticism of the paper is the authors' tendency (probably unconscious) to ascribe a purposefulness to evolution. For example, in the introduction, "We speculate that the unique role of the RLR's in the interferon signaling pathway in vertebrates...created an incentive to jettison an active helicase in vertebrates." Although this sentence is clearly labelled as speculation and "incentive" is clearly a metaphor, the implication is that evolution somehow has forethought. (There are other instances of this notion in the paper, for example, in the last line of the abstract). The author's statement also implies that the developing interferon system somehow caused the loss of active helicase, but it seems equally plausible that the helicase function was lost before the interferon system co-opted it.

      We agree with the stated critiques and have rephrased language that suggests that evolution is an active force. In addition to changing the last line of the abstract (page 2, line 35), and removing the quoted sentence from the Introduction, we have included a more nuanced discussion of the order of evolutionary events that may have preceded or followed the loss of helicase function in Dicer (page 18, lines 418-430)

      Reviewer #2 (Public Review):

      The manuscript by Aderounmu presents an interesting attempt to reconstruct evolution of the function of the helicase domain in ancestral Dicers, RNase III enzymes producing siRNAs from long double-stranded RNA and microRNAs from small hairpin precursors. The helicase has a role in long dsRNA recognition and processing and this function could have an antiviral role. Authors show on reconstructed ancestral Dicer variants that the helicase was losing dsRNA binding affinity and ATPase activity during evolution of the lineage leading to vertebrates while an early divergent Dicer-2 variant in Arthropods retained high activity and seemed better adapted for blunt ended long dsRNA, which would be consistent with antiviral function.

      The work is consistent with apparent adaptation of vertebrate Dicers for miRNA biogenesis and two known modes of substrate loading: "bottom up" dsRNA threading through the helicase domain where the helicase domain recognizes the end of dsRNA and feeds it into the enzyme and "top-down" where the substrate is first anchored in the PAZ domain before it locks into the enzyme. Some extant Dicer variants are known to be adapted for just one of these two modes while Dicer in C. elegans exemplifies an "ambidextrous" variant. The reconstruction of the helicase domain complex enabled authors to test how well would be ancestral helicases supporting the "bottom up" feeding of long dsRNA and whether the helicase would be distinguishing blunt-end dsRNA and 3' 2 nucleotide overhang. Although the reconstruction of an ancestral protein from highly divergent extant sequences yields just a hypothetical ancestor, which cannot be validated, the work provides remarkable data for interpreting evolutionary history of the helicase domain and RNA silencing in more general. While it is not surprising that the ancestral helicase was a functional ATPase stimulated by dsRNA, particularly new and interesting are data that the decline of the helicase function started already at the level of the common deuterostome ancestor and the helicase was essentially dead in the vertebrate ancestor. It has been reported two decades ago that human Dicer carries a helicase, which has highly conserved critical residues in the ATPase domain but it is non-functional (10.1093/emboj/cdf582). Recently published mouse mutants showed that these highly conserved residues are not important in vivo (10.1016/j.molcel.2022.10.010). Aderounmu et al. now suggest that Dicer carried this dead ATPase with conserved residues for over 500 million years of vertebrate evolution.

      I do not have any major comments to the biochemical analyses and while I think that the ancestral protein reconstruction could yield hypothetical sequences, which did not exist, I think they represent reasonable reconstructions, which yielded data worth of interpretations. My major criticism of the work concerns clarity for the readership and interpretations of some results where I wish authors would clarify/revise the text. The following three examples are particularly significant:

      1) It should be explained to which common ancestor during metazoan evolution belongs the ancestral helicase AncD1D2 or at least what that sequence might represent in terms of common ancestry during metazoan evolution.

      We thank the reviewer for bringing this issue to our attention, and we have now included a brief discussion of the complexity in identifying AncD1D2’s exact position in metazoan evolution (page 6, lines 124-134). Our maximum likelihood phylogeny is constructed from Dicer’s helicase and DUF283 subdomains which evidently do not contain enough phylogenetic signal to resolve the finer details of early metazoan evolutionary events surrounding the divergence of non-bilaterians: Porifera, Ctenophora, Cnidaria and Placozoa. In our tree, Cnidaria even diverges later than the Nematode bilaterian branch reflecting the fact that our reported phylogeny does not match consensus species relationships, especially in the invertebrate clades. This means we cannot pinpoint AncD1D2’s exact position with certainty. While we do not intend to overinterpret the evolutionary trends from these hypothetical ancestral constructs, we believe the functional differences in biochemical activity are meaningful and correspond to big-picture changes over evolutionary time. AncD1D2 thus corresponds to some early metazoan ancestor that existed before the divergence of bilaterians from non-bilaterians. In support of this interpretation, when the phylogeny is constrained such that the bilaterian branches match the consensus species tree (Figure 1-figure supplement 2A) we observe that AncD1D2 is ancestral to the bilaterian ancestor, AncD1BILAT (now labeled on the figure), but retains 95% identity to the version of AncD1D2 constructed from the maximum likelihood phylogeny (Figure 1-figure supplement 3B).

      2) This is linked to the first point - authors work with phylogenetic trees reconstructed from a single protein sequence, which are not well aligned with predicted early metazoan divergence (https://doi.org/10.1098/rstb.2015.0036). While their sequence-based trees show early branching of Dicer-2 as if the two Dicers existed in the common ancestor of almost all animals (except of Placozoa), I do not think there is sufficient support for such a statement, especially since antiviral RNAi-dedicated Dicers evolve faster and Dicer-2 is restricted to a few distant taxonomic group, which might be better explained by independent duplications of ambidextrous ancestral Dicers. I would appreciate if authors would discuss this issue in more detail and make readers more aware of the complexity of the problem.

      We agree with the reviewer that in our initial submission we did not properly address the incongruence between our maximum likelihood phylogeny and the consensus species tree of life. We have now addressed this by revisions that discuss the difficulty in using a single gene or protein to accurately date ancient evolutionary events, especially in the case of Dicer, a protein whose evolutionary history is littered with multiple duplication events (page 6, lines 124-147, beginning with “Importantly, we observed multiple instances…”; page 16, lines 365-371, sentence beginning with “Uncertainty in the single gene or protein phylogeny…”). Our assumption that an early gene duplication produced the arthropod Dicer-2 clade is consistent with previous Dicer phylogenies that have been constructed with maximum likelihood algorithms with different parameters (https://doi.org/10.1371/journal.pone.0095350, https://doi.org/10.1093/molbev/msx187, https://doi.org/10.1093/molbev/mss263) using full length Dicer sequences with different taxon sampling depths and tree construction parameters. Removing other fast evolving taxa with long branch lengths from the sequence alignment still resulted in arthropod Dicer-2 branching out early in metazoan phylogeny (https://doi.org/10.1093/molbev/mss263).

      In analyses not included in our manuscript, we also independently constructed trees using full-length metazoan Dicers, helicase and DUF-283 subdomains using both RAXML-NG and MrBayes. We tried different taxon sampling depths and tried rooting the tree using either a non-bilaterian outgroup or a fungal outgroup and also tried breaking up potential long-branch attraction with deep taxon sampling. In every iteration, the arthropod Dicer-2 clade diverged early in animal evolution at some point before or during non-bilaterian evolution. We recognize that all these efforts are still prone to long-branch attraction that may cause the rapidly evolving Dicer-2 clade to artificially cluster with distant outgroups, but so far, the only evidence to support an arthropod-specific duplication event is parsimony. This parsimony model is plausible and one might expect a recently duplicated arthropod Dicer-2 to cluster closely with nematode Dicer-1, another antiviral Dicer that would have descended from a common ecdysozoan ancestor but this is not the case. The nematode HEL-DUF clade does get attracted to non-bilaterian Cnidaria clade in our ML tree, but unlike the arthropod Dicer-2 clade, this position varied depending on the parameters of phylogenetic analysis, and so we cannot conclude that arthropod Dicer-2’s position is due to long branch attraction. More sophisticated phylogenetic and statistical tools are needed to answer this question definitively, so we decided to proceed with the highest scoring maximum-likelihood phylogeny generated by our analysis.

      While we have now included a short discussion on the nature of this uncertainty in the revised manuscript (page 6, line 124., page 16, lines 365-371), we have excluded these additional details (paragraph above) from the main text in an attempt to prioritize readability for the generalist reader, and we hope that more specialized readers will find this discussion in the public comments helpful.

      3) Authors should take more into the account existing literature and data when hypothesizing about sequences of events. Some decline of the helicase activity is apparent in AncD1DEUT suggesting that it initiated between AncD1D2 and AncD1DEUT. This implies that a) antiviral role of Dicer was becoming redundant with other cellular protein sensors by then and b) Dicer was already becoming adapted for miRNA biogenesis, which further progressed in the lineage leading to vertebrates to the unique top-down loading with the distinct pre-dicing state where the helicase forms a rigid arm. Authors even cite Qiao et al. (https://doi.org/10.1016/j.dci.2021.103997) who report primitive interferon-like system in molluscs - this places the ancestry of the interferon response upstream of AncD1DEUT and suggests that this ancestral protein-based system was taking over antiviral role of Dicer much earlier. In fact, a bit weaker performance of AncD1LOPH/DEUT combined with the aforementioned interferon-like system and massive miRNA expansion in extant molluscs (10.1126/sciadv.add9938) suggests that molluscs possibly followed a convergent path like mammals. While I am missing this kind of discussion in the manuscript, I think that the model where "interferon appears ..." in AncD1VERT (Fig. 6) is incorrect and misleading.

      This comment is similar to others, including point 3 of Essential revisions, and we have revised our model in Figure 6 accordingly. We agree with the reviewer that we did not sufficiently explore the significance of the decline in Dicer helicase function between AncD1D2 and AncD1DEUT. In addition to the changes noted in point 3 of Essential revisions, we have corrected this by adding or modifying sentences in the Results (page 9, sentence beginning on line 197 “This reduction in ATP hydrolysis efficiency prior to deuterostome divergence may have coincided with…”, and page 11, sentence beginning on line 247 “One possibility is that between AncD1D2 and the deuterostome ancestor…”).

      We did not intend to suggest that this loss of Dicer helicase function was unique to vertebrates, but we focused on the deuterostome-to-vertebrate transition for the following reasons:

      a) The mollusk clade in our analysis is incongruent with its expected species position as a protostome. In our tree it clusters with deuterostomes instead. On one hand, this is probably an artefact of incomplete lineage sorting or long branch attraction. On the other hand, it is possible that this clade’s position is an underlying signal of the convergent evolution proposed by the reviewer. In support of the latter, some extant mollusk Dicer helicases (ACCESSION: XP_014781474, ACCESSION: XP_022331683) show a loss of amino acid conservation in Dicer’s ATPase motifs implying that extant mollusks have also lost Dicer helicase function like vertebrates. However, this is in contrast to vertebrate Dicer helicase where loss of function exists, but ATPase motifs remain conserved. We do not discuss this in the paper because the evidence remains inconclusive until extant mollusk Dicers can be functionally characterized, similar to Human Dicer and Drosophila Dicer-1, to determine that they are truly specialized for miRNA processing to the detriment of helicase function.

      b) Caenorhabditis elegans Dicer is an example of an ambidextrous Dicer, that processes both miRNAs, with the top-down mechanism, and viral dsRNAs, with the bottom-up mechanism. Recently, work has been published that suggests that C. elegans also possesses a protein-based innate immune defense mechanism, but instead of competing with the RNA interference mechanism, both mechanisms seem to work in concert and even share a protein in both pathways: DRH-1, a RIG-I-Like receptor homolog (https://doi.org/10.1128/JVI.01173-19). Furthermore, a protein-based pathway has also been reported in Drosophila and in this scenario Drosophila Dicer-2 is the dsRNA sensor that is common to both pathways (https://doi.org/10.1371/journal.pntd.0002823). This collaboration observed in ecdysozoan invertebrates is different from the competition that has been well established in vertebrates. More data is needed to understand whether a model of competition or collaboration exists in lophotrochozoan invertebrates like mollusks.

    1. Author Response

      Reviewer #1 (Public Review):

      The authors provide evidence for chromatin, which in Drosophila muscle cells is peripherally localized in the nucleus, whereas the central region is depleted of chromatin, and is organised such that RNA polymerase II (RNAp) is surrounding dense regions of chromatin. The authors theoretically study the formation of these regions by describing chromatin as a multi-block copolymer, where the blocks correspond to active and inactive chromatin regions. These regions are assumed to phase separately and to have different solvability. The solvability of the active region is regulated by binding RNAp. The authors study the core-shell organization in a layered geometry by analyzing the various contributions to free energy. In this way, they in particular obtain the dependence of the shell-layer thickness, which is described as a polymer brush. From these results, they infer chromatin organization in spherical coreshell chromatin domains and compare these results to Brownian dynamics simulations.

      The work is well done and even though it uses standard methods for studying block copolymers and polymer brushes obtains interesting information about local chromatin organization. These findings should be of great interest to researchers in the field of chromatin organization and in general to everybody interested in understanding the physical principles of biological organization.

      The work has two main weaknesses: The experimental evidence for RNAp and chromatin microorganization is weak as only one example is shown. It remains unclear whether the observed organization pattern is common or not. Also, no data is shown concerning the dependence of the extensions of the active and inactive phases on parameters, for example, solvent properties or transcriptional activity. Second, some parts could prove difficult for biologists to assess. For example, the expression for the brush-free energy should be explained in more detail and notions like that of 'mushrooms' need to be introduced. As a second example, biologists might benefit from a better explanation of the concept of a theta solvent and its relevance.

      We thank Reviewer #1 for the positive review and critical feedback. Below we answer the points raised in the last paragraph of its review.

      In the original version of the manuscript we only showed a representative image of nuclei of muscle cells in an intact, live Drosophila larvae. Notably, this organization is representative of many nuclei analyzed in muscle tissue. In the revised version we show that in a distinct tissue, e.g. salivary gland epithelium of live Drosophila larvae, RNA Pol II distribution is similarly facing the nucleoplasm, although chromatin condensation differs due to higher DNA ploidy. The new images were added as Supplement information (Fig A1). Since these representative images are the main motivation behind our theoretical analysis, we think that including them will help the reader in understanding the relevance of our minimal model. The effect of different biological perturbations, such as changes in the repressive marks and how these change the core-shell structure require extensive experiments that are outside the scope of the present paper. We also note, that in live organisms (not just live cells) such as those studied here, one can only reliably use genetic perturbations; solvent quality is regulated by the organism and cannot be controlled as in synthetic polymer experiments. Our main focus in the present paper is to highlight an area that has been relatively unexplored by the chromatin organization community, which is how changes in concentrations binding-partners of chromatin may have a strong effect in nuclear architecture.

      We have also improved the explanation of the physical concepts for biologists. We added a more thorough explanation of the concept of a polymer brush and explained more clearly what the concept of theta solvent in terms of the scaling properties of a polymer in solution. We quote these revisions below.

      Reviewer #2 (Public Review):

      This work formulates a detailed theoretical polymer physics model intended to explain the observed morphology of chromatin in the Drosophila cell nucleus. The model is examined in detail by both analytical calculation and computer simulation. The central premise of the suggested theory is that it is again based on equilibrium statistical mechanics. Within this paradigm, authors explore the model that views chromatin fiber as a block copolymer and, most importantly, describes the role of RNA polymerase as it interacts with one of the copolymer blocks and regulates its effective solvent quality. Blocks are assumed to be fixed on the time scale of interest by, e.g., different levels of acetylation or methylation. RNA polymerase is supposed to interact only with one of the chromatin blocks, called active, and assumed interaction is quite peculiar. Namely, RNA polymerase complex may absorb on chromatin fiber and, the model assumes, the fiber decorated with absorbed RNA polymerase molecules is less sticky to itself, or more repulsive than the fiber itself. This peculiar assumption allows authors to make interesting predictions about how proteins can regulate the genome folding architecture.

      We thank the reviewer for the positive and critical feedback. We agree that our assumption of changes in the effective solvent stemming from protein complexes binding to chromatin is at the core of our analysis and we justify it further below.

      STRENGTH

      The work includes a rather detailed theoretical description of the model and its equilibrium statistical mechanics. As both analytical theory and accompanying simulation indicate, the assumptions put forward in formulating the model do indeed produce the desired morphology, with isolated regions ("micelles") of core inactive chromatin surrounded by the less dense shell region in which RNA polymerization may potentially take place. Having such a detailed theory is potentially beneficial for the field and opens up avenues for further exploration.

      We thank the referee for appreciating the potential benefit of our minimal theory of solvent-quality regulation by binding processes.

      WEAKNESS

      The underlying assumption about the interaction of RNA polymerase complex with the fiber, although important and organic for the model, does not seem easy to justify from a molecular standpoint, especially thinking of the charges and electrostatic interactions.

      We visualize that the binding of RNA Pol II (mediated by different transcription factors) to chromatin is also associated with larger protein complexes that may contain hydrophobic and hydrophilic components, such as pre-initiation complexes. Some regions of these complexes might associate directly with chromatin due to positive charges on the surface of the Pol II complex , whereas the hydrophilic negative regions may be directed towards the solvent. Our theory is typical of the approach used in polymer physics where coarse-grained interactions are considered. While the origin of hydrophilic interactions lies in electrostatics, such interactions are highly screened in cells (typically 200 mM concentration of salts) and can be considered as short-ranged and competitive with hydrophobic interactions. Chromatin in solution is known to condense (see Gibson, et. al., Cell 2019 and Strickfaden, et. al., Cell 2020) and even phase separate from the nucleoplasm (see Amiad-Pavlov, et. al., Science Advances, 2021); this can arise either from hydrophobic interactions of the histone tails or from opposite charge attraction of the histones and linker DNA. In our model, this competes with the binding of protein complexes which then disrupt the self-attraction of chromatin. Previous work has shown that RNA Pol II associating with chromatin (in the absence of transcription) prevents the coarsening of dense chromatin domains (see Hilbert, et. al. Nat. Comm. 2021), which agrees with our modeling of protein complexes that bind to chromatin and interfere with its condensation; in addition, the binding of Pol-II and all its binding partners that form the pre-initiation complex (see Hahn, Nat. Struct. & Mol. Biol. 2004, 11) will result in effective, steric repulsion between different active and Pol II bound chromatin domains. Another interesting observation is that most of the surface of RNA Polymerase II is negatively charged with a few positively charged patches with which it specifically interacts with DNA while others serve as exit paths of RNA (see Cramer, et. al., Science, 2001.). We agree that a more thorough analysis of the molecular interactions between what we name protein complexes and chromatin is interesting, but it is out of the scope of our paper that uses a coarsegrained, polymer physics approach. This approach also allows our model to be to be predictive as to the physical organization and growth of the domains, independent of those molecular details that are as yet unknown.

      Reviewer #3 (Public Review):

      This theoretical study provides a theoretical explanation for a puzzling question arising from recent experiments: How can chromosomes behave like polymers collapsed in a poor solvent but also contain "open" active chromatin sections? The authors propose that the binding of proteins (e.g. RNAP's) to the active sections can effectively change the solvent quality for these sections and thus open them. They suggest further that chromosomes show micellar structures with inactive blocks forming the cores of the micelles. Protein binding causes swelling of the micellar shells which affects the whole chromosome structure by changing the total number of micelles. This theory fits well to live imaging data of chromatin in Drosophila larvae, like the one shown in the striking Figure 1.

      The manuscript is written very clearly.

      My only suggestion is that the authors, in both the theory and simulation parts, are more explicit about how the interactions between the various components are modeled. From what I could see, in the theory part, one needs to look closely at Eq. 5 to understand how the influence of the binding of proteins affects the interaction between active monomers, and in the simulation part, one needs to go to the appendix to learn that interaction strengths between monomers within the active blocks and monomers within the inactive blocks have different values. The latter is crucial to understand the micellar structure shown at the top of Fig. 5A.

      We thank the reviewer for his positive response. We have explained Eq. 5 more carefully now and included other explanatory remarks throughout the text. We also explained more clearly the interactions considered in the simulations. Below we answer point by point and add quotes from the revised manuscript.

    1. We further evaluated the pipeline with a genome containing simulated HGT regions. Since our78HGT identification pipeline has two main steps, sequence composition-based filtering step and79genome comparison step. The evaluation was done for the two steps (Figure S3, Table S1). While80top 1% fragments were input to the pipeline, 20.6% correct results would be identified after81sequence composition-based filtering and 14.3% correct results identified after genome comparison.82When the percentage of fragments input was up to 50%, 83.4% and 77.7% correct results were83identified after two steps respectively. It can be seen that the precision of prediction was higher than8460% for all cases. This indicated that we may have underestimated the number of HGTs (low recall85rate) but majority of the identified HGTs were highly reliable.

      This paragraph was a bit confusing to follow but I think I got the gist of it after a few passes through! I'm curious if you thought about controlling for natural variation in 4mer frequency throughout the genome, as some other methods have found that this helps reduce off target predictions (reviewed in https://doi.org/10.1371/journal.pcbi.1004095). It may not be necessary since you do a second step after the initial screen, but I was just curious if that was something you thought about putting in place, and if so, why you decided against it

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their constructive feedback on our manuscript. They did a very comprehensive and helpful job of laying out some key areas that could be improved. We were heartened by the fact that there was a fair amount of overlap between the two reviewers, and that comments were largely addressable without further experimentation.

      Below, we provide a summary of how we have attempted to address the comments and concerns from both reviewers. We also provide the rationale and action items for our responses. Overlapping comments from both reviewers have been consolidated and responded to together.

      Comment 1 (Reviewer #1, Minor Comment 1 & Reviewer #2, Significance)

      Both reviewers raised concerns about our choice to focus on essential genes in our CRISPRi screen, which could potentially underestimate the role of non-essential factors contributing to Tae1 sensitivity or resistance.

      Rationale: We agree with the reviewers that including non-essential genes could provide additional insights into the roles of non-essential factors in Tae1 sensitivity and resistance. We believe our focus on essential genes contributes a unique perspective to the field, as there already exists a body of work that interrogates non-essential genes in this space. Here are some citations that represent this body. We will highlight these better in the manuscript.

      Lin, H.-H.; Yu, M.; Sriramoju, M. K.; Hsu, S.-T. D.; Liu, C.-T.; Lai, E.-M. A High-Throughput Interbacterial Competition Screen Identifies ClpAP in Enhancing Recipient Susceptibility to Type VI Secretion System-Mediated Attack by Agrobacterium Tumefaciens. Front Microbiol 2020, 10, 3077. https://doi.org/10.3389/fmicb.2019.03077.

      Hersch, S. J.; Sejuty, R. T.; Manera, K.; Dong, T. G. High Throughput Identification of Genes Conferring Resistance or Sensitivity to Toxic Effectors Delivered by the Type VI Secretion System; preprint; Microbiology, 2021. https://doi.org/10.1101/2021.10.06.463450.

      Additionally, our screen was experimentally optimized for essential genes using our approach. The knockdown strategy is useful specifically for essential genes because E.coli is phenotypically very sensitive to essential gene perturbations (see more here: https://doi.org/10.1128/mBio.02561-21). While it would have been ideal to include non-essential genes too, doing so would require a different additional optimization that we believe would have diluted our bandwidth for this study. We do thank the reviewers for recognizing how much effort went into this!

      We do acknowledge this is a limitation and want to make sure the readership is aware of that. Ideally, one could do more rigorous side-by-side comparisons between studies if the approaches, set-up, and assays are the same. Unfortunately, due to differences in experimental set-up, we could not directly compare with the non-essential screens. We hope others will pick up where we left off. Here are some action items we can take to increase the odds of that:

      In the Introduction, we will mention other studies and highlight the need to investigate essential genes side-by-side with non-essential. (Lines 64-7) In the Discussion, we will add a sentence that acknowledges the importance of exploring non-essential genes for a more comprehensive understanding of Tae1 sensitivity and resistance. (Lines 484-5)

      Comment 2 (Reviewer #1, Minor Comment 5 & Reviewer #2, Major Comment)

      Both reviewers mentioned that the dormancy state in msbA-KD cells is not well characterized and its relationship with Tae1 resistance is not convincingly shown.

      Rationale: We agree that our manuscript does not clearly pin down whether Tae1 resistance is linked to a true dormancy state. There are some intriguing similarities between what we observe and what is classically known as “dormancy” or “persistence”, which have specific definitions. Although we don’t yet have a concrete reason to think it’s NOT those states, we also don’t have sufficient data to point to it clearly being the same at a mechanistic or cellular level. This is merely a hypothesis that our work suggests. We would love to see others follow up on this, as we suspect there are overlaps and potentially additional cellular states that have yet to be clearly defined in this field of bacterial physiology.

      Here is how we propose to address this concern:

      We simplified our language to be more descriptive and less loaded in terms of nomenclature around dormancy or persistence. Namely, we are referring to the cells in a more descriptive way with “slowed growth.” This allows us to clearly describe what we observe without attempting to ascribe mechanism or anything beyond that. It doesn’t fundamentally change the overarching interpretation of our study. (Lines 444, 490,497-9) In the Discussion, we will add text emphasizing the need for follow-up studies to fully address whether there is indeed a connection between Tae1 resistance and slowed growth. (Lines 491-3)

      Comment 3 (Reviewer #2, Major Comment)

      The reviewer asks if the degradation of the sugar backbone is also required for lysis or if it is just the crosslinking step that is important.

      Rationale: This is an astute point. We acknowledge that the degradation of the sugar backbone may play a role in lysis, and it’s predicted that this may be why the Pae H1-T6SS delivers a second PG-degrading toxin (Tge1), a muramidase that targets the sugar backbone. The most parsimonious conclusion from past studies by us and others is that Tae1 is critical for lysis, but not sufficient in the absence of any backbone-targeting enzyme. Indeed, many T6SS-encoding bacterial species also encode >1 type of PG-degrading enzyme, which may speak precisely to the reviewer’s point. However, it should also be noted that there may be endogenous enzymes with activities that can be leveraged alongside these toxins for the same effect.

      Action items:

      In the Discussion, we will add a sentence addressing the potential role of sugar backbone degradation in the lysis process and the need for future research on this topic. (Lines 524-6)

      Comment 4 (Reviewer #1, Minor Comment 2)

      The reviewer asks why lptC-KD leads to sensitivity to Tae1, while msbA-KD leads to resistance, considering both genes are implicated in LPS export.

      Rationale: We appreciate the reviewer's careful attention to the underlying biology. They are absolutely correct in pointing this difference out. Our interpretation is that the different phenotypes may indicate that although the LPS biosynthesis superpathway intersects with PG synthesis, lptC and msbA may intersect with PG synthesis in distinct ways. We can address this concern through the following:

      We will add a sentence in the Discussion section providing our interpretation of the different phenotypes observed for lptC-KD and msbA-KD. (Lines 508-13)

      Comment 5 (Reviewer #1, Minor Comment 4)

      The reviewer notes that the contribution of msbA to Tae1 resistance appears minor based on the results in Figure 3d.

      Rationale: There are actually two aspects to this concern, which we note below. We found it difficult to fully capture it in the manuscript, but our thoughts are as follows.

      (1) Technical viewpoint:

      Bacterial competition experiments are inherently noisy. The quantitative read-out is easily impacted by a number of parameters, including cellular density, input ratio between competitor cell types, growth stage, and possibly other environmental factors that are difficult to predict. In general, our view is that we should avoid over-indexing on the degree of the phenotype, focusing more on the direction of the phenotype (loss of statistically-significant Tae1 sensitivity) and the fact that it is reproducible in our hands. Furthermore, our argument is bolstered by clear validation of the loss of Tae1 sensitivity through orthogonal lysis assays (Fig. 4a-c).

      (2) Biological viewpoint

      It is challenging to isolate the specific interaction between Tae1 and individual genetic determinants, as we think it’s a complex system with multiple factors simultaneously at play. It is crucial to acknowledge that the unique contribution of Tae1 is only a part of the T6SS. There may be other compensatory actions that influence the outcomes observed, such as upregulation of non-Tae1 toxins, regulation of system activation/firing, timing and location of T6S injections, etc. We think these are exciting possibilities and that more groups should delve into the context-dependent dynamics of the system. Although outside the scope of our manuscript, we would be open to suggestions for how we can further emphasize this point.

      Comment 6 (Reviewer #2, Minor Comment)

      The reviewer recommends that we discuss whether our findings are specific to Tae1 or if they can be extrapolated to other toxins.

      Rationale: We understand the reviewer's interest in understanding the broader implications of our findings. Although our study focuses specifically on Tae1, we believe that our findings may provide insights into the mechanisms of sensitivity and resistance to other toxins that target the cell wall. However, experimentally investigating this would fall outside the scope of our current manuscript.

      Additional Minor Revisions

      Table 1: I would label MsbA and LptC as "LPS transport" and not "LPS synthesis" (Reviewer 1) Rationale: We agree that using “LPS transport” to describe the gene functions for lptC and msbA is more specific to their functions.

      Table 1 was updated to change the “pathway/process” categorizations for lptC and msbA from “LPS synthesis” to “LPS transport”. In line with this comment, we also changed the pathway/process categorization for murJ (Lipid II flippase) to “PG transport”. Figure 3 legend: "...deformed membranes .........are demarcated in (g) and (h)" (Reviewer 1) We thank the reviewer for pointing out the missing text in this figure legend.

      We corrected the error by adding the missing text back in Figure 3. Line 339-341: Supp. Fig. 9 should be Supp. Fig. 8 (Reviewer 1) Referenced Supp. Fig. was corrected. * Second, (L422-425) the authors conclude that their data demonstrate a "reactive crosstalk between LPS and PG synthesis". I disagree. There is no information in the paper that this is the case. The authors can only suggest that cross talk may occur. (Reviewer 2) We agree. Line 421-2: replaced “demonstrate” with “suggest” to soften the argument. *

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      This study reports the finding that lipopolysaccharide integrity modulates bacterial sensitivity to a Type-6-secreted bacterial toxin. The authors used the Tae1 amidase produced by the P. aeruginosa T6SS and Escherichia coli bacteria as prey cells as a model system to test the effect of knockdowns in essential gene expression of the prey. This was accomplished by constructing a library of knockdown (KD) genes based on Crispr/Cas9 and selecting for those targets where E. coli prey is not killed. The screen revealed, as expected, that KD genes encoding cell wall synthesis assembly (and bamA, involved in OM protein assembly) enhanced the sensitivity to Tae1. In contrast, KD targets in genes involved in lipid metabolism and lipopolysaccharide synthesis conferred resistant to the amidase toxin. The authors hypothesized that non-PG components of the cell envelope may shape Tae1 toxicity and undertook a more detailed analysis of the effects of knocking down one of these genes, msbA, using a various biochemical and imaging approaches. The MsbA protein is an ATPase permease that plays an essential role in flipping newly synthesized lipid A across the bacterial inner membrane. The authors show that resistance to Tae1 in msbA-KD is independent of cell wall hydrolysis (meaning that the Tae1 remains active), PG synthesis is suppressed (despite PG is still Tae1 sensitive), and that protein synthesis and growth is suppressed. This latter observation suggests that the E. coli prey enters a persistent (dormant) state that protects it from Tae1 toxicity. The authors conclude that Tae1 susceptibility in vivo is determined by cross talk between essential cell envelope pathways and the general growth state of the cell.

      Major comments:

      This is a nice study unravelling cellular off target factors that affect the killing in vivo by a T6SS toxin. In that sense the study is novel since the interplay of T6SS effectors in the context of the physiological state of the prey cell has not been directly investigated. so this study adds new information to the literature in the field.

      I have several comments concerning the interpretation of the results.

      First, it is interesting that Tae1, being an amidase, can be the sole responsible for PG degradation. The enzyme cleaved the peptide bridges but has no effect on the PG backbone. The study was not designed to pick up autolysins (since only essential genes were targeted) but one would assume that degradation of the sugar backbone must also be required for lysis.

      Second, (L422-425) the authors conclude that their data demonstrate a "reactive crosstalk between LPS and PG synthesis". I disagree. There is no information in the paper that this is the case. The authors can only suggest that cross talk may occur.

      Third, Tae1 maximal effect is present when new PG is made, which also begs the question about the location of this protein in the PG mesh. Like B-lactam and other PG-active antibiotics, the effect of Tae1 requires active cell growth. This is also consistent with the authors' finding that the msbA-KD bacterial cells enter a state of dormancy or persistence, which will make them capable of overcoming Tae1 toxicity.

      Fourth, an important outcome of protein synthesis inhibition and PG synthesis is increased oxidation and lipid peroxidation. This could also influence the results obtained in this study. It would be consistent with the other targets observed, which compromise lipid metabolism and membrane trafficking and secretion.

      Referees Cross-commenting

      Based on my own review and that of Reviewer 1, I think we both agree that there are 2 major limitations in this work: (i) the KD library only targets essential genes and this would potentially miss non-essential genes that when targeted for mutated could lead to synthetic lethal phenotypes that could be more revaling than a general defect protein synthesis, etc. and (ii) the dormancy state is not well characterized.

      Despite these points the study is very nicely done with a huge amount of work.

      Significance

      This is an important study addressing experimentally the complexities of bacteria-bacteria interactions in the context of predator-prey interplay. The T6SS effectors affecting PG appear to have the same characteristics as known antibiotics and bacteria use similar strategies to protect themselves from PG attack. This is not only to increase growth as an escape approach but also to reduce it to a point in which the target cell cannot be effectively killed despite the presence of the toxin.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors would like to thank the reviewers for their valuable comments and suggestions. We have carefully considered all of the points raised and revised our manuscript accordingly. In the rebuttal letter below, we have extensively discussed all the different concerns and adjustments we made to our work. In what follows the reviewers’ comments are in blue and the authors’ responses are in black. The additions and changes to the main and supplementary text of the manuscript are highlighted in yellow.

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)): *

      *In their paper entitled "CD38 promotes hematopoietic stem cell dormancy via c-Fos", Ibneeva et al., present a set of data predominantly from mouse HSCs where they explore the cell cycle kinetics and self-renewal capacity of LT-HSCs expressing (or not) CD38. They perform a series of sophisticated in vitro and in vivo experiments, including transplantations and single cell cultures and arrive at the conclusion that CD38 can fractionate LT-HSCs that are more deeply quiescent. Overall, it is an interesting question and would be of interest to experimental hematologists. That said, I had a number of issues that concerned me throughout the manuscript with regard to the robustness of the conclusions around CD38 and I have tried to detail these below.

      Major concerns: *

      *1) Novelty - It was unclear what the relationship of this CD38+ fraction had with other "segregators" of LT-HSCs - e.g., how does it compare with the Sca1 fractionation of Wilson et al, Cell Stem Cell 2015 or Gprc5c of Cabezas-Wallschied Cell 2017? Even if CD38 fractionated LT-HSCs, it was unclear what it would give beyond these two molecules (especially re: Sca-1 which is also a cell surface marker). *

      Response:

      We agree with the reviewer that further elaboration of this point with additional data would be helpful. We compared the expression of Sca-1 in the population of LT-HSCs (Lin- Kit+ Sca-1+ CD48- CD150+ CD34- CD201+) based on the gating strategy from the paper Wilson et al, Cell Stem Cell 2015. We found that all LT-HSCs (independent of CD38 expression) express Sca-1 at a high level and can be quantified as Sca-1hi (we have added these data in Fig. S2A). Thus, CD38 subfractionates LT-HSCs, and considering that we have shown that CD38+ are more quiescent (Fig. 3) and have higher repopulation capacity compared with CD38- LT-HSCs (Fig. 2E-G), we conclude that CD38 should be used in addition to Sca-1 to define dormant LT-HSCs.

      We found that CD38+ dormant HSCs expressed Gprc5c mRNA at higher levels than CD38- LT-HSCs (Fig. 5D). Therefore, we cannot exclude that CD38+ and Gprc5c+ identify the same population of dormant HSCs. However, Cabezas-Wallscheid Cell 2017 used the reporter Gprc5c-EGFP mouse strain, which is not widely available. In contrast, we propose to use readily available antibodies against CD38 for efficient isolation of dormant HSCs. Moreover, to define CD38+ dormant HSCs, researchers do not need to use the CD38KO mice as a negative control, it would be sufficient to use total bone marrow cells to identify the CD38+ population for gating dHSCs (we have added this information to Fig. S2C and in the text: line 119-121: “We demonstrated that total bone marrow cells can be used to define the CD38+ fraction in the absence of CD38 knock-out mice (CD38KO) (Fig. S2C), providing the possibility of an internal positive control for easy identification of CD38+ cells”.

      *Claims of CD38+ superiority in transplantation - I was surprised with the claim of CD38 negative cells being a less functional HSC when they are clearly still very strong in secondary transplantation assays. Both 38+ and 38- cells strongly repopulate secondary animals and only 5 mice were shown in the Figure. The legend suggests another experiment was undertaken, but these data are not presented. Did they substantially differ in their chimerism in primary and secondary animals? Was the magnitude of difference between the two fractions similar in both experiments? Is there a reason that the data could not be plotted on the same graph?

      *

      We have added the data from the second experiment to the graphs and changed the figure legend accordingly (Fig. 2D-H), now for primary transplantation n=8, for secondary transplantation n=6 vs 7. These data show the same trend of higher repopulation capacity of CD38+ LT-HSCs compared to CD38- LT-HSCs, although with the larger magnitude of difference in primary transplantation. We agree with the reviewer that CD38- LT-HSCs strongly repopulate secondary animals. However, the higher percentage of chimerism in peripheral blood and bone marrow for CD38+ LT-HSC progeny indicates their superior repopulation and self-renewal capacity compared to CD38- counterparts.

      Also, the typical experiment to establish a quantitative difference in HSC production would be a limiting dilution analysis with a much larger number of recipient animals - without such data it is difficult to ascertain how different the two fractions really are.

      While we appreciate the reviewer's suggestion to include additional data on the amount of repopulating HSCs, we respectfully disagree as we believe that this information is beyond the scope of the current study, which only aims to assess the functional superiority of CD38+ LT-HSCs over CD38- LT-HSCs in side-by-side comparisons. Assessment of donor-derived cells’ frequency in peripheral blood and bone marrow relative to the frequency of competitors after transplantation of the same amount of HSCs (so-called chimerism level) is a widely accepted assay in the field to demonstrate the difference in the functionality between two HSC fractions (Sanjuan-Pla et al., Nature 2013; Gekas C and Graf T, Blood 2015; Bernitz J.M et al. Cell 2016; and others, including papers cited by the reviewer: Wilson et al., Cell Stem Cell 2015 and Cabezas-Wallscheid et al., Cell 2017). A limiting dilution experiment will provide more detailed characteristics of two HSC fractions, namely the quantitative difference (how many cells from the sorted population can repopulate). However, this experiment will not significantly change our conclusion that the CD38+ LT-HSC fraction is superior in repopulation and self-renewal capacity compared to the CD38- LT-HSC fraction, as sufficiently demonstrated in Fig. 2E-G.

      Furthermore the claim that CD38- HSCs do not ever produce CD38+ cells is a bit premature with so few mice and confusingly presented data (e.g., Fig 2I is 5 pooled mice in a single histogram plot - were these concatenated flow files? If so, how were they normalised? Did the other experiment look the same? And were all CD38+ HSCs capable of giving rise to both CD38+ and CD38- cells or was it a subfraction of mice/samples?).

      The plot provided in Fig. 2I is a FACS analysis of pooled cells from mice transplanted with CD38+ or CD38- LT-HSCs (we added a detailed explanation in figure legend 2, lines 701-703). We provided data from the second experiment in Fig. S2G. All CD38+ LT-HSCs could give rise to both CD38+ and CD38- HSC; we added data in Fig. S2H.

      Cell Cycle status differences and grades of quiescence - Ki67 and DAPI are really quite tricky for discerning G0 versus G1 and no flow cytometry plots are provided for the reader to assess how this has been done. Could another technique (e.g., Hoechst/Pyronin) be used to confirm the results? Perhaps more concerning is the variability of the assay in the authors own hands. If I am interpreting things correctly, the plots in 3G, 3H and 3I in the platelet depletion, pIpC and 5FU experiments are >10% higher in the CD38- control arm than the data in 3A which make me worried about the robustness of the cell cycle assay to distinguish G0 from G1.

      Ki67 and DAPI staining is a widely accepted technique for distinguishing G0 from G1. We provide flow cytometry plots in Fig. S2F (original figures, S3B - updated figures), which the referee may have overlooked. We added a reference to the Fig. S3B to figure legend 3 to make it more transparent for the readers. We would like to clarify the reviewer’s concern regarding the slightly different frequency of CD38- cells in the G0 phase of the cell cycle at steady state in Fig. 3A (original figures). Fig. 3A compares the cell cycle stages between CD38- and CD38+ HSCs, while Fig. 3B compares the same parameters for CD38- vs CD38+ LT-HSCs, which are enriched for quiescent HSCs by using additional surface markers. Therefore, it is correct to compare the data for LT-HSCs under stress (Fig. 3G-I, original figures) with the data for LT-HSCs at steady state in figure 3B (original figures). To make it less confusing for the reader, since the entire Figure 3 is devoted to LT-HSCs, we have moved Figure 3A to the supplementary Figures (Fig. S3A).

      All experiments for Fig. S3A&3A, 3F, 3G, and 3H (updated figures), were performed separately, and we did not compare mice from different experiments to avoid differences due to technical details. However, the groups of mice for each specific treatment (ctrl vs. treatment at different time points) were analyzed on the same day, using the same amount of cells, the same master mix of antibodies, and the same FACS machine and settings to compare ctrl vs. treated mice (we added this information in the Materials and Methods section, lines 388-391). In addition, we performed a BrdU incorporation assay and label retention assay using H2B-GFP mice, which support our finding that CD38+ LT-HSCs are more quiescent than CD38- cells in the steady state.

      Minor points: Figure 3I was really confusing - it says it is the gating strategy for GFP retaining LT-HSCs, but only shows GFP versus cKit

      We reformulated the figure legend for 3D: “Representative plot defining GFP+ cells in LT-HSCs.”

      Figure 4B suggests that only 40% of CD38+ cells divide in the first 3 days - are there survival differences or are the cells sat there as single cells? It would be important to carry these further to see if cells eventually divide.

      This is a relevant and crucial point addressed by the reviewer. We did not find any significant difference in the survival of cells. We have added this data to the supplementary data - Fig. S4Q-R.

      Reviewer #1 (Significance (Required)):

      I believe the study will be of interest to specialist readers in the HSC field, especially those working on quiescence and G0 exit. At present, I think the conclusion of a true subfractionation is a bit premature, but there are pieces of data that do look exciting and warrant further investigation. It was a little unclear how this would advance beyond Sca-1 or Gprc5c fractionation for finding more primitive HSCs, but having cleaner markers is always a useful advance for the field.

      We thank the reviewer for his/her positive evaluation of our study. In our work, we compared several functional aspects of CD38+ and CD38- LT-HSCs:

      1. We used four techniques (Ki67 and DAPI staining, BrdU incorporation assay, label retention assay, single-cell division tracing assay) and showed that CD38+ LT-HSCs are more quiescent than CD38- cells.
      2. We performed a serial transplantation assay and found that although CD38- LT-HSCs have the long-term repopulation capacity, they repopulate significantly less effectively than CD38+ LT-HSCs.
      3. We used a combination of surface markers (Lin- Kit+ Sca-1+ CD48- CD150+ CD34- CD201+) to define LT-HSCs; all of which belong to the Sca-1hi population according to Wilson et al, 2015. We further separated Sca-1hi LT-HSCs into CD38+ and CD38- cells and found that they differ in the repopulation capacity and quiescence in steady state and upon hematological stress. We conclude that CD38 surface staining should be used on top of Sca-1 to sort dormant LT-HSCs.
      4. We found that CD38+ dormant LT-HSCs differ from CD38- cells in gene expression and response to CD38 and c-Fos inhibitors. CD38+ LT-HSCs are characterized by higher cytoplasmic Ca2+ and cell cycle inhibitor p57 levels than CD38- LT-HSCs. Thus, we demonstrated that CD38 is not only a marker but also has a functional role in mediating HSC dormancy. We discovered that CD38/cADPR/Ca2+/c-Fos/p57 axis regulates CD38+ HSC dormancy. Taken together, our findings demonstrate that CD38+ LT-HSCs have superior properties compared to CD38- LT-HSCs and can be classified as dHSCs, providing a simple approach for their isolation and further study. Moreover, we uncovered the CD38-mediated molecular mechanism regulating HSCs dormancy.

      Regarding my own expertise - I have spent ~20 years in the field undertaking single cell assays of normal and malignant mouse and human HSCs, including many of the core functional assays described in this paper and consider myself very familiar with the topic area.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Although the experiments were well done and supported their testing hypothesis, but the overall novelty of the whole work is not that strong and this is because:

      -the use of CD38 to identify/select and to test mouse LT-HSCs' function in vivo (although not commonly used nowadays) was demonstrated a few times more than 20 years ago (Randall, et al., 1996: PMID: 8639761 and Tajima et al., 2001; PMID: 11313250); in fact, the authors didn't even reference/acknowledge these papers which they should have done so; hence, most of the results in Fig.2 were already known (despite this current work gave a more detailed/better analysis);

      We agree with the reviewer that the previous findings using CD38 to separate HSPCs should be appreciated; however, we would like to point out that while the studies by Randall, et al., 1996: PMID: 8639761 and Tajima et al., 2001; PMID: 11313250 employ only 3 markers to discriminate HSPC (Lin- Sca-1+ Kit+), in our study, we performed for the first time a very detailed characterization of CD38+ cells using surface markers that were not available 20 years ago. We analyzed not only the HSC compartment but also different populations of multipotent progenitors. Modern surface marker combinations for the LT-HSC isolation allow us to show that both populations: CD38- and CD38+, can be classified as LT-HSCs in contrast to the data of Randall et al, where the authors did not find any long-term repopulating activity in the CD38- KLS compartment. Moreover, we showed the hierarchical relationships between these two populations. We appreciate the previous findings and recommendations of the reviewer, and have added citations (Randall, et al., 1996: PMID: 8639761 and Tajima et al., 2001; PMID: 11313250) and comment in the discussion section, lines 267-271:

      In contrast to previous studies reporting that only CD38+ HSPC compartment from adult mice contains LT-HSCs (42, 43), in our study we demonstrated using modern surface marker combinations for the isolation of LT-HSCs that while both populations: CD38- and CD38+, can be classified as LT-HSCs, only CD38+ LT-HSCs display characteristics of dormant HSCs (4).’’

      -it is known the generic roles of CD38 in producing cADPR, ADPR, etc and these can induce Ca2+ oscillation in cells; despite that, it was nicely demonstrated here that in mouse HSCs cADPR was the main signalling mediator;

      We thank the reviewer for pointing this out; indeed, it has not been shown before how Ca2+ is regulated in HSCs.

      the roles of cADPR in human CD34+ were demonstrated (Podesta et al., 2023; PMID: 12475890: when CD34+ HSPCs were primed in vitro with cADPR it resulted in enhanced short-term while maintaining long-term (secondary transplant) engraftment in NOD/SCID mice, probably (mechanisms were not determined at that time) inducing cycling/expansion of human CD34+CD38+ progenitors while inhibiting cycling (hence, better long-term maintenance) of CD34+CD38- HSPCs); on this note; the data presented in Fig.4 K and S5 should be eliminated as it adds little to their story and it can be quite confusing when comparing to mouse data unless the authors wish to explore in a more detailed way the human part.

      We appreciate the reviewer’s valuable suggestion. However, we respectfully disagree with their interpretation because we do not believe that the technical aspects of the cited paper (Podesta et al., 2003; PMID: 12475890) are robust enough to support their conclusions. Podesta et al. concluded that in vivo and in vitro treatment with a high dose of cADPR (25-fold higher than the physiological dose, according to the authors' estimation) stimulates the expansion of HSC and progenitor cells. At the same time, they did not use any surface markers to define populations and studied total mononuclear cord blood cells, so no conclusions can be drawn regarding CD34+ CD38+ and CD34+ CD38- dynamics. Unfortunately, we cannot confirm the reliability of the HSC engraftment data presented by Podesta et al. This is because they did not analyze the chimerism of human cells in peripheral blood and bone marrow for sixteen weeks post-transplantation, which is considered a standard time period for assessing long-term engraftment of human HSCs in the field (Brehm M.A. et al., Blood 2012, Cosgun K.N. et al., Cell Stem Cell 2014, Takagi S. et al. Blood 2012). Instead, they counted only some CD34+ cells at three and eleven weeks after transplantation. Therefore, the role of cADPR in the regulation of human HSC quiescence remained unknown.

      In our original study, we showed that blocking the CD38 ecto-enzymatic activity stimulated both human HSC and mouse HSCs to exit from the G0 phase of the cell cycle. The role of CD38 enzymatic activity can be conservative for mice and humans and needs to be further investigated in future studies on human HSCs. For this reason, we decided to keep Fig. 4K and S6 in the paper.

      -Ca2+ induction in cells can induce c-fos expression (as in an early response gene); in many cell types hence, it was not a surprising finding;

      We agree with the reviewer that it has been shown previously that Ca2+ induction in cells could induce c-fos expression (as an early response gene to stress). However, we have shown for the first time that Ca2+ regulates c-Fos expression in LT-HSCs under steady-state conditions.

      -c-fos was demonstrated to suppress cell cycle entry of dormant hematopoietic stem cells (Okada et al., 1999: PMID: 9920830).

      In the cited publication (Okada et al., 1999: PMID: 9920830) the authors have only analyzed the in vitro proliferation and colony formation of Lin- Sca-1+ cells in the IFNα/β inducible c-Fos overexpression model. This population mainly contains progenitor cells and only 0.004% of dormant LT-HSCs (please find below an estimation of LT-HSC frequency). Therefore, the role of c-Fos in the regulation of dormant HSC cell cycle entry remained unexplored.

      It would be useful to do ChIP-seq to determine to confirm that c-fos regulates p57 expression.

      We have shown that inhibition of c-Fos transcriptional activity inhibits p57 expression (Fig. 6G). ChIP–seq with antibody against c-Fos will answer whether c-Fos directly activates the expression of p57. However, we can only isolate 200-300 CD38+ LT-HSCs from all bones of one mouse. Unfortunately, the ChIP-seq with such an amount of cells is technically very difficult, which explains the absence of publications using ChIP-seq for studying transcription factors in LT-HSCs. We added in the Discussion section that we couldn’t exclude indirect regulation of p57 expression by c-Fos, lines 307-308:” In contrast, although we couldn’t exclude indirect regulation of p57kip2 expression by c-Fos, our data clearly reveal that inhibiting the interaction between c-Fos and DNA in dHSCs reduced protein levels of the cell cycle inhibitor p57kip2 and stimulated cell cycle entry.”

      So overall, many of the findings were already out there and the authors gathered many of the pieces of the puzzle and put them together (and demonstrated) in a nice and well-thought manner. This work does add useful information to the scientific community but unfortunately is not ground-breaking. It may contribute to other fields beyond hematopoiesis where CD38 function may play a role.

      Thank you very much for the positive review of our work. As mentioned by the reviewer, CD38 is expressed by other normal (lymphocytes, Kupffer cells (Tarrago M.G. et al., Cell Metabolism 2018)) and cancer cells, e.g. hematological malignancies, lung cancer, prostate cancer (Hogan K.A. et al. Frontiers in Immunology, 2019),) but has not been studied in the context of quiescence regulation. Currently, anti-CD38 monoclonal antibodies are used to treat malignancies (Daratumumab) by mediating cytotoxicity (Lokhorst H.M et al., N. Engl. J. Med, 2015). However, the inhibition of CD38 enzymatic activity has not been used broadly. Therefore, our study can be groundbreaking and open new directions in anti-cancer therapy.

      Reviewer #2 (Significance (Required)):

      In this manuscript, the authors investigated the potential roles of CD38 (mainly) in mouse HSCs quiescent; the authors dissected the potential molecular mechanism by which this occurred, and it was via CD38/cADPR/Ca2+/cFos/p57Kip2. The authors used a combination of transplantation assays to test the importance of CD38 in vivo, followed by a series of simple in vitro experiments (mainly using pharmacological means) to dissect the molecular mechanisms. The manuscript is well-written/explained and the data presented is solid. There are no major issues in terms of reproducibility and clarity in this work.

      We would like to thank the reviewer again for the detailed positive feedback.

    1. Reviewer #1 (Public Review):

      The authors evaluate a number of stochastic algorithms for the generation of wiring diagrams between neurons by comparing their results to tentative connectivity measured in cell cultures derived from embryonic rodent cortices. They find the best match for algorithms that include a term of homophily, i.e. preference for connections between pairs that connect to an overlapping set of neurons. The trend becomes stronger, the older the culture is (more days in vitro).

      From there, they branch off to a set of related results: First, that connectivity states reached by the optimal algorithm along the way are similar to connectivity in younger cultures (fewer days in vitro). Second, that connectivity in a more densely packed network (higher plating density) differs only in terms of shorter-range connectivity and even higher clustering, while other topological parameters are conserved. Third, blocking inhibition results in more unstructured functional connectivity. Fourth, results can be replicated to some degree in cultures of human neurons, but it depends on the type of cell.

      The culturing and recording methods are strong and impressive. The connectivity derivation methods use established algorithms but come with one important caveat, in that they are purely based on correlation, which can lead to the addition of non-structurally present edges. While this focus on "functional connectivity" is an established method, it is important to consider how this affects the main results. One main way in which functional connectivity is likely to differ from the structural one is the presence of edges between neurons sharing common innervation, as this is likely to synchronize their spiking. As they share innervation from the same set of neurons, this type of edge is placed in accordance with a homophilic principle. In other words, this is not merely an algorithmic inaccuracy, but a potential bias directly related to the main point of the manuscript. This is not invalidating the main point, which the authors clearly state to be about the correlational, functional connectivity (and using that is established in the field). But it becomes relevant when in conclusion the functional connectivity is implicitly or explicitly equated with the structural one. Specifically, considering a long-range connection to be more costly implies an actual, structural connection to be present. Speculating that the algorithm reveals developmental principles of network formation implies that it is the actual axons and synapses forming and developing. The term "wiring" also implies structural rather than functional connectivity. One should carefully consider what the distinction means for conclusions and interpretation of results.

      The main finding is that out of 13 tested algorithms to model the measured functional connectivity, one based on homophilic attachment works best, recreating with a simple principle the distributions of various topological parameters.<br /> First, I want to clear up a potential misunderstanding caused by the naming the authors chose for the four groups of generative algorithms: While the ones labelled "clustering" are based on the clustering coefficient, they do not necessarily lead to a large value of that measure nor are they really based on the idea that connectivity is clustered. Instead, the "homophilic" ones are a form of maximizing the measure (but balanced by the distance term). To be clear, their naming is not wrong, nor needs to be changed, but it can lead to misunderstandings that I wanted to clear up. Also, this means that the principle of "homophilic wiring" is a confirmation of previous findings that neuronal connectivity features increased values of the clustering coefficient. What is novel is the valuable finding that the principle also leads to matching other topological network parameters.

      The main finding is based on essentially fitting a network generation algorithm by minimizing an energy function. As such, we must consider the possibility of overfitting. Here the authors provide additional validation by using measures that were not considered in the fitting (Fig 5, to a lesser degree Fig 3e), increasing the strength of the results. Also, for a given generative algorithm, only 2 wiring parameters were optimized. However, with respect to this, I was left with the impression that a different set of them was optimized for every single in-vitro network (e.g. n=6 sets for the sparse PC networks; though this was not precisely explained, I base this on the presence of distributions of wiring parameters in Fig 6c). The results would be stronger if a single set could be found for a given type of cell culture, especially if we are supposed to consider the main finding to be a universal wiring principle. At least report and discuss their variability.

      Next, the strength of the finding depends on the strengths of the alternatives considered. Here, the authors selected a reasonably high number of twelve alternatives. The "degree" family places connections between nodes that are already highly connected, implementing a form of rich-club principle, which has been repeatedly found in brain networks. However, I do not understand the motivation for the "clustering" family. As mentioned above, they do not serve to increase the measure of the clustering coefficient, as the pair is likely not part of the same cluster. As inspiration, "Collective dynamics of 'small-world' networks" is cited, but I do not see the relation to the algorithm or results presented in that study. A clearly explained motivation for the alternatives (and maybe for the individual algorithms, not just the larger families) would strengthen the result. 

      Related to the interpretation of results, as they are presented in Fig3a, bottom left: What data points exactly go into each colored box? Specifically, into the purple box? What exactly is meant by "top performing networks across the main categories" mean? Compared with Supp Fig S4, it seems as if the authors do not select the best model out of a family and instead pool the various models that are part of the same family, albeit each with their optimized gamma and eta. Otherwise, the purple box at DIV14 in Fig3 would be identical to "degree average" at DIV14 in S4. If true, I find this problematic, as visually, the performance of one family is made to look weaker by including weak-performing models in it. I am sure one could formulate a weak-performing homophily-based rule that drives the red box up. If such pooling is done for the statistical tests in Supp Tables 3-7, this is outright misleading! (for some cases "degree average" seems not significantly worse than the homophily rules).

      The next finding is related to the development of connectivity over the days in vitro. Here, the authors compare the connectivity states the network model goes through as the algorithm builds it up, to connectivity in-vitro in younger cultures. They find comparable trajectories for two global topological parameters. <br /> Here, once again it is a strength that the authors considered additional parameters outside the ones used in fitting. However, it should be noted that the values for "global efficiency" at DIV14 (the very network that was optimized!) are clearly below the biological values plotted, weakening the generality of the previous result. This is never discussed in the text.

      The conclusion of the authors in this part derives from values of modularity decreasing over time in both model and data, and global efficiency increasing. The main impact of "time" in this context is the addition of more connections, and increasing edge density. And there is a known dependency between edge density and the bounds of global efficiency. I am not convinced the result is meaningful for the conclusion in this state. If one were to work backwards from the DIV14 model, randomly removing connections (with uniform probabilities): Would the resulting trajectory match DIV12, DIV10, and DIV7 equally well? If so, the trajectory resulting from the "matching" algorithm is not meaningful.

      Further, the conclusion of the authors implies that connections in the cultures are formed as in the algorithm: one after another over time without pruning. This could be simply tested: How stable are individual connections in vitro over time (between DIV)? 

      The next finding is that at higher densities, the connections formed by the neurons still have very comparable structures, only differing in clustering and range; and that the same generative algorithm is optimal for modelling them. I think in its current state, the correlation analysis in Fig. 4a supports this conclusion only partially: Most of these correlations are not surprising. Shortest path lengths feature heavily in the calculation of small worldness and efficiency (in one case admittedly the inverse). Also for example network density has known relations with other measures. The analysis would be stronger if that was taken into account, for example showing how correlations deviate from the ones expected in an Erdos-Renyi-type network of equal sizes.

      Yet, overall the results are supported by the depicted data and model fits in Supp. Fig S7. With the caveat that some of the numerical values depicted seem off: <br /> What are the units for efficiency? Why do they take values up to 2000? Should be < 1 as in 4b. Also, what is "strength"? I assume it's supposed to be the value of STTC, but that's not supposed to be >1. Is it the sum over the edges? But at a total degree of around 40, this would imply an average STTC almost three times higher than what's reported in Fig 1i. Also, why is the degree around 40, but between 1000 and 1500 in Fig S2? <br /> Finally, it should be mentioned that "degree average" seems (from the boxplot) to work equally well.

      Further, the conclusion of the "matching" algorithm equally fitting both cases would be stronger if we were informed about the wiring parameters (η and γ) resulting in both cases. That way we could understand: Is it the same algorithm fitting both cases or very different variants of the same? It is especially crucial here, because the η and γ parameters determine the interplay between the distance- and topology-dependent terms, and this is the one case where a very different set of pairwise distances (due to higher density) are tested. Does it really generalize to these new conditions?

      Conversely, the results relating to GABAa blocking show a case where the distances are comparable, but the topology of functional connectivity is very different. (Here again, the contrast between structural and functional connectivity could be made a bit clearer. How is correlational detection of connections affected by "bursty" activity?) The reduction in tentative inhibition following the application of the block is convincing.

      The main finding is that despite of very different connectivities, the "matching" algorithm still holds best. This is adequately supported by applying the previous analyses to this case as well. <br /> The authors then interpret the differences between blocked and control by inspection of the η and γ parameters, finding that the relative impact of the distance-based term is likely reduced, as a lower (less negative) exponent would lead to more equal values for different distances. This is a good example of inspecting the internals of a generative algorithm to understand the modeled system and is confirmed by longer edge lengths in Supp Fig. S12C.

      The authors further inspect the wiring probabilities used internally at each step of the algorithm and compare across conditions. They conclude from differences in the distribution of P_ij values that the GABAa-blocked network had a "more random" topology with "less specific" wiring. This is the opposite of the conclusion I would draw, given the depicted data. This may be partially because the authors do not clearly define their concept of "random" vs. "specific". I understand it to be the following: At each time step, one unconnected pair is randomly picked and connected, with probabilities proportional to P_ij, as in Akarca et al., 2021; "randomness" then refers to the entropy of that process. In that case, the "most random" or highest entropy case is given by uniform P_ij values, which would be depicted as a delta peak at 1 / n_pairs in the present plot. A flatter distribution would indicate more randomness if it was the distribution of P_ij over pairs of neurons (x-axis: pairs; y-axis P_ij). The conclusion should be clarified by the use of a mathematical definition and supported by data using that definition.

      Next, the methods are repeated for various cultures of human neurons. I have no specific observations there.

      In summary, while I think the most important methods are sound, and the main conclusions (reflected in the title of the paper) are supported, the analysis of more specific cases (everything from Fig 3e onwards, except for Fig 5) requires more work as in the current state their conclusions are not adequately supported.

    1. His faith in computers and quantitative data was legendary, his famous quote he said to [55:25.380 --> 55:26.380]  it. [55:26.380 --> 55:28.180]  It might have been Ellsberg that he said this to actually someone was saying that we're [55:28.180 --> 55:31.140]  losing the war in Vietnam and he said, where is your data? [55:31.140 --> 55:35.860]  Don't get me poetry, give me something I can put in the computer.

      Although computers provided nations with the ability to better themselves, negatively this is not the case for many. As the current nature of computers, are heavily used as echo chambers for peoples biases, reinforcing political polarization. This problem is highlighted by Vannevar Bush in As we may think Article.

    1. En 1945 publica un artículo llamado «As we may think» («Como podríamos pensar»[3]​) en la revista Atlantic Monthly, donde describió, principalmente, la llegada de dos dispositivos.

      Considero relevante volver a resaltar como en su niñez emergió su curiosidad y amor por la ciencia a través de sus propias experiencias y exploraciones, una especie de fascinación por la ciencia y la tecnología que se reflejó en su trabajo elaborando dispositivos.

    2. En 1945 publica un artículo llamado «As we may think» («Como podríamos pensar»[3]​) en la revista Atlantic Monthly, donde describió, principalmente, la llegada de dos dispositivos.

      Es interesante ver cómo en la historia la creación de dispositivos ha buscado facilitar las acciones humanas, lo cual ha traído velocidad, optimización de tiempo, organización, aportando a las ciencias de la información y desafortunadamente también a las armas para la guerra, pero también es complejo cómo su uso inspirado en el cerebro humano, hace que este ya no se ejercite de igual manera en acciones como hacer operaciones matemáticas, redactar escribir a mano, leer y sean reemplazadas por los dispositivos.

    1. As a rule, humans do not like to be duped. We like to know which kinds of signals to trust, and which to distrust. Being lulled into trusting a signal only to then have it revealed that the signal was untrustworthy is a shock to the system, unnerving and upsetting. People get angry when they find they have been duped. These reactions are even more heightened when we find we have been duped simply for someone else’s amusement at having done so.

      I think this has become a prevalent issue the last few years when looking at politics. The right and the left have felt so divided in recent time and it is difficult to watch politics without feeling like you may be getting "duped."

    1. Author Response

      Reviewer #1 (Public Review):

      Kazrin appears to be implicated in many diverse cellular functions, and accordingly, localizes to many subcellular sites. Exactly what it does is unclear. The authors perform a fairly detailed analysis of Kazrin in-cell function, and find that it is important for the perinuclear localization of TfN, and that it binds to members of the AP-1 complex (e.g., gamma-adaptin). The authors note that the C-terminus of Kazrin (which is predicted to be intrinsically disordered) forms punctate structures in the cytoplasm that colocalize with components of the endosomal machinery. Finally, the authors employ co-immunoprecipitation assays to show that both N and C-termini of Kazrin interacts with dynactin, and the dynein light-intermediate chain.

      Much of the data presented in the manuscript are of fairly high quality and describe a potentially novel function for Kazrin C. However, I had a few issues with some of the language used throughout, the manner of data presentation, and some of their interpretations. Most notably, I think in its current form, the manuscript does not strongly support the authors' main conclusion: that Kazrin is a dynein-dynactin adaptor, as stated in their title. Without more direct support for this function, the authors need to soften their language. Specific points are listed below.

      Major comments:

      1) I agree with the authors that the data provided in the manuscript suggest that Kazrin may indeed be an endosomal adaptor for dynein-dynactin. However, without more direct evidence to support this notion, the authors need to soften their language stating as much. For example, the title as stated would need to be changed, as would much of the language in the first paragraph of the discussion. Alternatively, the manuscript could be significantly strengthened if the authors performed a more direct assay to test this idea. For example, the authors could use methods employed previously (e.g., McKenney et al., Science 2014) to this end. In brief, the authors can simply use their recombinant Kazrin C (with a GFP) to pull out dynein-dynactin from cell extracts and perform single molecule assays as previously described.

      While this is certainly an excellent suggestion, the in vitro dynein/dynactin motility assays are really not straight forward experiments for laboratories that do not use them as a routine protocol. That is why we asked Dr. Thomas Surrey (Centre for Genomic Regulation, Barcelona), an expert in the biochemistry and biophysics of microtubule dynamics, to help us with this kind of analysis. In their setting, TIRF microscopy is used to follow EGFPdynein/dynactin motility along microtubules immobilized on cover slides (Jha et al., 2017). As shown in figure R1, more binding of EGFP-dynein to the microtubules is observed when purified kazrin is added to the assay (from 20 to 400 nM), but there is no increase in the number or processivity of the EGFP-dynein motility events. These results are hard to interpret at this point. Kazrin might still be an activating adaptor but a component is missing in the assay (i. e. an activating posttranslational modification or a particular subunit of the dynein or dynactin complexes), or it could increase the processivity of dyneindynactin in complex with another bona fide activating adaptor, as it has been demonstrated for LIS1 (Baumbach et al., 2017; Gutierrez et al., 2017). Alternatively, kazrin could transport dynactin and/or dynein to the microtubule plus ends in a kinesin 1-dependent manner, in order to load the peripheral endosomes with the minus end directed motor (Yamada et al., 2008).

      Figure R1. Kazrin C purified from E. coli increases binding of dynein to microtubules but does not increase the number or processivity of EGFP-dynein motility events. A. TIRF (Total Internal Reflexion Fluorescence) micrographs of microtubule-coated cover slides incubated in the presence of 10 nM EGFP-dynein and 20 nM dynactin in the presence or absence of 20 nM kazrin C, expressed and purified from E. coli. B. Kymographs of TIRF movies of microtubule-coated cover slides incubated in the presence of purified 10 nM EGFP-dynein, 20 nM dynactin and either 400 nM of the activating adaptor BICD2 (1:2:40 ratio) (left panel) or kazrin C (right panel). Red squares indicate processive dynein motility events induced by BICD2”.

      Investigating the molecular activity of kazrin on the dynein/dynactin motility is a whole project in itself that we feel it is out of the scope of the present manuscript. Therefore, as suggested by the BRE, we have chosen to soften the conclusions and classify kazrin as a putative “candidate” dynein/dynactin adaptor based on its interactome, domain organization and subcellular localization, as well as on the defects installed in vivo on the endosome motility upon its depletion. We also discuss other possibilities as those outlined above.

      2) I'm not sure I agree with the use of the term 'condensates' used throughout the manuscript to describe the cytoplasmic Kazrin foci. 'Condensates' is a very specific term that is used to describe membraneless organelles. Given the presumed association of Kazrin with membrane-bound compartments, I think it's more reasonable to assume these foci are quite distinct from condensates.

      We actually used condensates to avoid implying that the kazrin IDR generates membraneless compartments or induces liquid-liquid-phase separation, which is certainly not a conclusion from the manuscript. However, since all reviewers agreed that the word was misleading, we have substituted the term condensates for foci throughout the manuscript.

      3) The authors note the localization of Tfn as perinuclear. Although I agree the localization pattern in the kazKO cells is indeed distinct, it does not appear perinuclear to me. It might be useful to stain for a centrosomal marker (such as pericentrin, used in Figure 5B) to assess Tfn/EEA1 with respect to MT minus ends.

      We have now changed the term perinuclear, which implies that endosomes surround the nucleus, by the term juxtanuclear, which more accurately define what we wanted to indicate (close to). We thank the reviewer for pointing out this lack of accuracy. We also more clearly describe in the text that in fibroblast, the Golgi apparatus and the Recycling Endosomes (REs) gather around the pericentriolar region ((Granger et al., 2014) and reference therein), which is usually close to the nucleus ((Tang and Marshall, 2012) and references therein). Nevertheless, as suggested by the reviewer, we have included pictures of the TxR-Tfn and EEA1-labelled endosomes accumulating around pericentrin in wild type mouse embryonic fibroblast (MEF) (Figure 1–supplement figure 3) to illustrate these points.

      4) "Treatment with the microtubule depolymerizing drug nocodazole disrupted the perinuclear localization of GFP-kazrin C, as well as the concomitant perinuclear accumulation of EE (Fig. 5C & D), indicating that EEs and GFP-kazrin C localization at the pericentrosomal region required minus end-directed microtubule-dependent transport, mostly affected by the dynactin/dynein complex (Flores-Rodriguez et al., 2011)."

      • I don't agree that the nocodazole experiment indicates that minus end-directed motility is required for this perinuclear localization. In the absence of other experiments, it simply indicates that microtubules are required. It might, however, "suggest" the involvement of dynein. The same is true for the subsequent sentence ("Our observations indicated that kazrin C can be transported in and out of the pericentriolar region along microtubule tracks...").

      We agree with the reviewer. To reinforce the point that GFP-kazrin C localization and the pericentriolar accumularion of EEA1 rely on dynein-dependent transport, we have now added an experiment in figure 5E and F, where we use ciliobrevin to inhibit dynein in cells expressing GFP-kazrin C. In the treated cells, we see that the GFP-kazrin C staining in the pericentrin foci is lost and that EEs have a more dispersed distribution, similar to kazKO MEF. We have also completed and rearranged the in vivo fluorescence microscopy data to more clearly show that small GFP-kazrin C foci can be observed moving towards the cell centre (Figure 5-S1 and movies 6 and 7). Taken all this data together, I think we can now suggest that kazrin might travel into the pericentriolar region, possibly along microtubules and powered by dynein.

      5) Although I see a few examples of directed motion of Tfn foci in the supplemental movies, it would be more useful to see the kymographs used for quantitation (and noted by the authors on line 272). Also related to this analysis, by "centripetal trajectories", I assume the authors are referring to those moving in a retrograde manner. If so, it would be more consistent with common vernacular (and thus more clear to readers) to use 'retrograde' transport.

      We have now included some more examples of the time projections used in the analysis in figure 6-S1 and 2, where we have coloured in blue the fairly straight, longer trajectories, as opposed to the more confined movements that appeared as round dots in the time projections (coloured in red). We have also added more videos illustrating the differences observed in cells expressing endogenous or GFP-kazrin C versus kazKO cells or kazKO cells expressing GFP or GFP-kazrin C-Nt. Movies 8 and 11 show the endosome motility in representative WT and kazKO cells (movie 8) and kazKO cells expressing GFP, GFPkazrin C or GFP-kazrin C Nt (movie 11). Movies 9 and 10 show endosome motility in four magnified fields of different WT and kazKO cells, where longer and faster motility events can be observed when endogenous kazrin is expressed. Movies 12 to 14 show endosome motility in four magnified fields of different kazKO cells expressing, GFP-kazrin C (movie 12), GFP (movie 13) and GFP-kazrin C-Nt (movie 14). Longer and faster movements can be observed in the different insets of movie 12, as compared with movies 13 and 14. Finally, as suggested by the reviewer, we have re-worded centripetal movement to retrograde movement throughout the manuscript.

      6) The error bars on most of the plots appear to be extremely small, especially in light of the accompanying data used for quantitation. The authors state that they used SEM instead of SD, but their reasoning is not stated. All the former does is lead to an artificial reduction in the real deviation (by dividing SD by the square root of whatever they define as 'n', which isn't clear to me) of the data which I find to be misleading and very nonrepresentative of biological data. For example, the error bars for cell migration speed in Figure 2B suggest that the speeds for WT cells ranged from ~1.7-1.9 µm/sec, which I'm assuming is largely underrepresenting the range of values. Although I'm not a statistician, as someone that studies biochemical and biological processes, I strongly urge the authors to use plots and error bars that more accurately describe the data to your readers (e.g., scatter plots with standard deviation are the most transparent way to display data).

      We have now changed all plots to scattered plots with standard deviations, as suggested.

    1. Author Response

      Reviewer #1 (Public Review):

      Nicotine preference is highly variable between individuals. The paper by Mondoloni et al. provided some insight into the potential link between IPN nAchR heterogeneity with male nicotine preference behavior. They scored mice using the amount of nicotine consumption, as well as the rats' preference of the drug using a two-bottle choice experiment. An interesting heterogeneity in nicotine-drinking profiles was observed in adult male mice, with about half of the mice ceasing nicotine consumption at high concentrations. They observed a negative association of nicotine intake with nicotine-evoked currents in the antiparticle nucleus (IPN). They also identified beta4-containing nicotine acetylcholine receptors, which exhibit an association with nicotine aversion. The behavioral differentiation of av vs. n-avs and identification of IPN variability, both in behavioral and electrophysiological aspects, add an important candidate for analyzing individual behavior in addiction.

      The native existence of beta4-nAchR heterogeneity is an important premise that supports the molecules to be the candidate substrate of variabilities. However, only knockout and re-expression models were used, which is insufficient to mimic the physiological state that leads to variability in nicotine preference.

      We’d like to thank reviewer 1 for his/her positive remarks and for suggesting important control experiments. Regarding the reviewer’s latest comment on the link between b4 and variability, we would like to point out that the experiment in which mice were put under chronic nicotine can be seen as another way to manipulate the physiological state of the animal. Indeed, we found that chronic nicotine downregulates b4 nAChR expression levels (but has no effect on residual nAChR currents in b4-/- mice) and reduces nicotine aversion. Therefore, these results also point toward a role of IPN b4 nAChRs in nicotine aversion. We have now performed additional experiments and analyses to address these concerns and to reinforce our demonstration.

      Reviewer #2 (Public Review):

      In the current study, Mondoloni and colleagues investigate the neural correlates contributing to nicotine aversion and its alteration following chronic nicotine exposure. The question asked is important to the field of individual vulnerability to drug addiction and has translational significance. First, the authors identify individual nicotine consumption profiles across isogenic mice. Further, they employed in vivo and ex vivo physiological approaches to defining how antiparticle nuclei (IPn) neuronal response to nicotine is associated with nicotine avoidance. Additionally, the authors determine that chronic nicotine exposure impairs IPn neuronal normal response to nicotine, thus contributing to higher amounts of nicotine consumption. Finally, they used transgenic and viralmediated gene expression approaches to establish a causal link between b4 nicotine receptor function and nicotine avoidance processes.

      The manuscript and experimental strategy are well designed and executed; the current dataset requires supplemental analyses and details to exclude possible alternatives. Overall, the results are exciting and provide helpful information to the field of drug addiction research, individual vulnerability to drug addiction, and neuronal physiology. Below are some comments aiming to help the authors improve this interesting study.

      We would like to thank the reviewer for his/her positive remarks and we hope the new version of the manuscript will clarify his/her concerns.

      1) The authors used a two-bottle choice behavioral paradigm to investigate the neurophysiological substrate contributing to nicotine avoidance behaviors. While the data set supporting the author's interpretation is compelling and the experiments are well-conducted, a few supplemental control analyses will strengthen the current manuscript.

      a) The bitter taste of nicotine might generate confounds in the data interpretation: are the mice avoiding the bitterness or the nicotine-induced physiological effect? To address this question, the authors mixed nicotine with saccharine, thus covering the bitterness of nicotine. Additionally, the authors show that all the mice exposed to quinine avoid it, and in comparison, the N-Av don't avoid the bitterness of the nicotine-saccharine solution. Yet it is unclear if Av and N-Av have different taste discrimination capacities and if such taste discrimination capacities drive the N-Av to consume less nicotine. Would Av and N-Av mice avoid quinine differently after the 20-day nicotine paradigm? Would the authors observe individual nicotine drinking behaviors if nicotine/quinine vs. quinine were offered to the mice?

      As requested by all three reviewers, we have now performed a two-bottle choice experiment to verify whether different sensitivities to the bitterness of the nicotine solution could explain the different sensitivities to the aversive properties of nicotine. Indeed, even though we used saccharine to mask the bitterness of the nicotine solution, we cannot fully exclude the possibility that the taste capacity of the mice could affect their nicotine consumption. Reviewers 1 and 2 suggested to perform nicotine/quinine versus quinine preference tests, but we were afraid that forcing mice to drink an aversive, quinine-containing solution might affect the total volume of liquid consumed per day, and also might create a “generalized conditioned aversion to drinking water - detrimental to overall health and a confounding factor” as pointed out by reviewer 3. Therefore, we designed the experiment a little differently.

      In this two-bottle choice experiment, mice were first proposed a high concentration of nicotine (100 µg/ml) which has previously been shown to induce avoidance behavior in mice (Figure 3C). Then, mice were offered three increasing concentrations of quinine: 30, 100 and 300 µM. Quinine avoidance was dose dependent, as expected: it was moderate for 30 µM but almost absolute for 300 µM quinine. We then investigated whether nicotine and quinine avoidances were linked. We found no correlation between nicotine and quinine preference (new Figure: Figure 1- supplementary figure 1D). This new experiment strongly suggests that aversion to the drug is not directly tied to the sensitivity of mice to the bitter taste of nicotine.

      Other results reinforce this conclusion. First, none of the b4-/- mice (0/13) showed aversion to nicotine, whereas about half of the virally-rescued animals (8/17, b4 re-expressed in the IPN of b4-/- mice) showed nicotine aversion, a proportion similar to the one observed in WT mice. This experiment makes a clear, direct link between the expression of b4 nAChRs in the IPN and aversion to the drug.

      Furthermore, we also verified that the sensitivity of b4-/- mice to bitterness is not different from that of WT mice (new Figure 4 – figure supplement 1B). This new result indicates that the reason why b4-/- mice consume more nicotine than WT mice is not because they have a reduced sensitivity bitterness.

      Together, these new experiments strongly suggests that interindividual differences in sensitivity to the bitterness of nicotine play little role in nicotine consumption behavior in mice.

      b) Metabolic variabilities amongst isogenic mice have been observed. Thus, while the mice consume different amounts of nicotine, changes in metabolic processes, thus blood nicotine concentrations, could explain differences in nicotine consumption and neurophysiology across individuals. The authors should control if the blood concentration of nicotine metabolites between N-Av and Av are similar when consuming identical amounts of nicotine (50ug/ml), different amounts (200ug/ml), and in response to an acute injection of a fixed nicotine quantity.

      We agree with the reviewer that metabolic variabilities could explain (at least in part) the differences observed between avoiders and non-avoiders. But other factors could also play a role, such as stress level (there is a strong interaction between stress and nicotine addiction, as shown by our group (PMID: 29155800, PMID: 30361503) and others), hierarchical ranking, epigenetic factors etc… Our goal in this study is not to examine all possible sources of variability. What is striking about our results is that deletion of a single gene (encoding the nAChR b4 subunit) is sufficient to eliminate nicotine avoidance, and that re-expression of this receptor subunit in the IPN is sufficient to restore nicotine avoidance. In addition, we observe a strong correlation between the amplitude of nicotineinduced current in the IPN, and nicotine consumption. Therefore, the expression level of b4 in the IPN is sufficient to explain most of the behavioral variability we observe. We do not feel the need to explore variations in metabolic activities, which are (by the way) very expensive experiments. However, we have added a sentence in the discussion to mention metabolic variabilities as a potential source of variability in nicotine consumption.

      2) Av mice exposed to nicotine_200ug/ml display minimal nicotine_50ug/ml consumption, yet would Av mice restore a percent nicotine consumption >20 when exposed to a more extended session at 50ug/kg? Such a data set will help identify and isolate learned avoidance processes from dose-dependent avoidance behaviors.

      We have now performed an additional two-bottle choice experiment to examine an extended time at 50 µg/ml. But we also performed the experiment a little differently. We directly proposed a high nicotine concentration to mice (200 µg/ml), followed by 8 days at 50 µg/ml. We found that, overall, mice avoided the 200 µg/ml nicotine solution, and that the following increase in nicotine preference was slow and gradual throughout the eight days at 50 µg/ml (Figure 2-figure supplement 1C). This slow adjustment to a lower-dose contrasts with the rapid (within a day) change in intake observed when nicotine concentration increases (Figure 1-figure supplement 1A). About half of the mice (6/13) retained a steady, low nicotine preference (< 20%) throughout the eight days at 50 µg/ml, resembling what was observed for avoiders in Figure 2D. Together, these results suggest that some of the mice, the non-avoiders, rapidly adjust their intake to adapt to changes in nicotine concentration in the bottle. For avoiders, aversion for nicotine seems to involve a learning mechanism that, once triggered, results in prolonged cessation of nicotine consumption.

      3) The author should further investigate the basal properties of IPn neuron in vivo firing rate activity recorded and establish if their spontaneous activity determines their nicotine responses in vivo, such as firing rate, ISI, tonic, or phasic patterns. These analyses will provide helpful information to the neurophysiologist investigating the function of IPn neurons and will also inform how chronic nicotine exposure shapes the IPn neurophysiological properties.

      We have performed additional analyses of the in vivo recordings. First, we have built maps of the recorded neurons, and we show that there is no anatomical bias in our sampling between the different groups. The only condition for which we did not sample neurons similarly is when we compare the responses to nicotine in vivo in WT and b4-/- mice (Figure 4E). The two groups were not distributed similarly along the dorso-ventral axis (Figure 4-figure supplement 2B). Yet, we do not think that the difference in nicotine responses observed between WT and b4-/- mice is due to a sampling bias. Indeed, we found no link between the response to nicotine and the dorsoventral coordinates of the neurons, in any of the groups (MPNic and MP Sal in Figure 3-figure supplement 1D; WT and b4-/- mice in Figure 4-figure supplement 2C). Therefore, our different groups are directly comparable, and the conclusions drawn in our study fully justified.

      As requested, we have looked at whether the basal firing rate of IPN neurons determines the response to nicotine and indeed, neurons with higher firing rate show greater change in firing frequency upon nicotine injection (Figure 3 -figure supplement 1G and Figure 4-figure supplement 2F). We have also looked at the effect of chronic nicotine on the spontaneous firing rate of IPN neurons (Figure 3 -figure supplement 1F) but found no evidence for a change in basal firing properties. Similarly, the deletion of b4 had no effect on the spontaneous activity of the recorded neurons (Figure 4-figure supplement 2F). Finally, we found no evidence for any link between the anatomical coordinates of the neurons and their basal firing rate (Figure 3-figure supplement 1E and Figure 4figure supplement 2D).

      Reviewer #3 (Public Review):

      The manuscript by Mondoloni et al characterizes two-bottle choice oral nicotine consumption and associated neurobiological phenotypes in the antiparticle nucleus (IPN) using mice. The paper shows that mice exhibit differential oral nicotine consumption and correlate this difference with nicotine-evoked inward currents in neurons of the IPN. The beta4 nAChR subunit is likely involved in these responses. The paper suggests that prolonged exposure to nicotine results in reduced nAChR functional responses in IPN neurons. Many of these results or phenotypes are reversed or reduced in mice that are null for the beta4 subunit. These results are interesting and will add a contribution to the literature. However, there are several major concerns with the nicotine exposure model and a few other items that should be addressed.

      Strengths:

      Technical approaches are well-done. Oral nicotine, electrophysiology, and viral re-expression methods were strong and executed well. The scholarship is strong and the paper is generally well-written. The figures are high-quality.

      We would like to thank the reviewer for his/her comments and suggestions on how to improve the manuscript.

      Weaknesses:

      Two bottle choice (2BC) model. 2BC does not examine nicotine reinforcement, which is best shown as a volitional preference for the drug over the vehicle. Mice in this 2BC assay (and all such assays) only ever show indifference to nicotine at best - not preference. This is seen in the maximal 50% preference for the nicotine-containing bottle. 2BC assays using tastants such as saccharin are confounded. Taste responses can very likely differ from primary reinforcement and can be related to peripheral biology in the mouth/tongue rather than in the brain reward pathway.

      The two-bottle nicotine drinking test is a commonly used method to study addiction in mice (Matta, S. G. et al. 2006. Guidelines on nicotine dose selection for in vivo research. Psychopharmacology 190, 269–319). Like all methods, it has its limitations, but it also allows for different aspects to be addressed than those covered by selfadministration protocols. The two-bottle nicotine drinking test simply measures the animals' preference for a solution containing nicotine over a control solution without nicotine: the animals are free to choose nicotine or not, which allows to evaluate sensitivity and avoidance thresholds. What we show in this paper is precisely that despite interindividual differences in the way the drug is used (passively or actively), a significant proportion of the animals avoids the nicotine bottle at a certain concentration, suggesting that we are dealing with individual characteristics that are interesting to identify in the context of addiction and vulnerability. We agree that the twobottle choice test cannot provide as much information about the reinforcing effects of the drug as selfadministration procedures. We are aware of the limitations of the method and were careful not to interpret our data in terms of reinforcement to the drug. For instance, mice that consume nicotine were called “non-avoiders” and not “consumers”. We added a few sentences at the beginning of the discussion to highlight these limitations.

      The reviewer states that the mice in this 2BC assay (and all such assays) “only ever show indifference to nicotine at best - not preference”. This is seen in the maximal 50% preference for the nicotine-containing bottle. While this is true on average, it isn’t when we look at individual profiles, as we did here. We clearly observed that some mice have a strong preference for nicotine and, conversely, that some mice actively avoid nicotine after a certain concentration is proposed in the bottle.

      Regarding tastants, we indeed used saccharine to hide the bitter taste of nicotine and prevent taste-related side bias. This is a classical (though not perfect) paradigm in the field of nicotine research (Matta, S. G. et al. 2006. Guidelines on nicotine dose selection for in vivo research. Psychopharmacology 190, 269–319). To evaluate whether different sensitivities to the bitterness of nicotine may explain the interindividual differences in nicotine consumption we performed new experiments (as suggested by all three reviewers). In this two-bottle choice experiment, mice were first proposed a high concentration of nicotine (100 µg/ml) which has previously been shown to induce avoidance behavior in mice (Figure 3C). Then, mice were offered three increasing concentrations of quinine: 30, 100 and 300 µM. Quinine avoidance was dose dependent, as expected: it was moderate for 30 µM but almost absolute for 300 µM quinine. We then investigated whether nicotine and quinine avoidances were linked. We found no correlation between nicotine and quinine preference (new Figure: Figure 1- supplementary figure 1D). This new experiment strongly suggests that aversion to the drug is not directly tied to the sensitivity of mice to the bitter taste of nicotine. Other results reinforce this conclusion. First, none of the b4-/- mice (0/13) showed aversion to nicotine, whereas about half of the virally-rescued animals (8/17, b4 re-expressed in the IPN of b4-/- mice) showed nicotine aversion, a proportion similar to the one observed in WT mice. This experiment makes a clear, direct link between the expression of b4 nAChRs in the IPN and aversion to the drug. Furthermore, we also verified that the sensitivity of b4-/- mice to bitterness is not different from that of WT mice (new Figure 4 - figure supplement 1B). This new result indicates that the reason why b4-/- mice consume more nicotine than WT mice is not because they have a reduced sensitivity bitterness. Together, these new experiments strongly suggests that interindividual differences in sensitivity to the bitterness of nicotine play little role in nicotine consumption behavior in mice.

      Moreover, this assay does not test free choice, as nicotine is mixed with water which the mice require to survive. Since most concentrations of nicotine are aversive, this may create a generalized conditioned aversion to drinking water - detrimental to overall health and a confounding factor.

      Mice are given a choice between two bottles, only one of which contains nicotine. Hence, even though their choices are not fully free (they are being presented with a limited set of options), mice can always decide to avoid nicotine and drink from the bottle containing water only. We do not understand how this situation may create a generalized aversion to drinking. In fact, we have never observed any mouse losing weight or with deteriorated health condition in this test, so we don’t think it is a confounding factor.

      What plasma concentrations of nicotine are achieved by 2BC? When nicotine is truly reinforcing, rodents and humans titrate their plasma concentrations up to 30-50 ng/mL. The Discussion states that oral self-administration in mice mimics administration in human smokers (lines 388-389). This is unjustified and should be removed. Similarly, the paragraph in lines 409-423 is quite speculative and difficult or impossible to test. This paragraph should be removed or substantially changed to avoid speculation. Overall, the 2BC model has substantial weaknesses, and/or it is limited in the conclusions it will support.

      The reviewer must have read another version of our article, because these sentences and paragraphs are not present in our manuscript.

      Regarding the actual concentration of nicotine in the plasma, this is indeed a good question. We have actually measured the plasma concentrations of nicotine for another study (article in preparation). The results from this experiment can be found below. The half-life of nicotine is very short in the blood and brain of mice (about 6 mins, see Matta, S. G. et al. 2006. Guidelines on nicotine dose selection for in vivo research. Psychopharmacology 190, 269–319), making it very hard to assess. Therefore, we also assessed the plasma concentration of cotinine, the main metabolite of nicotine. We compared 4 different conditions: home-cage (forced drinking of 100 ug/ml nicotine solution); osmotic minipump (OP, 10 mg/kg/d, as in our current study); Souris-city (a large social environment developed by our group, see Torquet et al. Nat. Comm. 2018); and the two-bottle choice procedure (when a solution of nicotine 100 ug/ml was proposed). The concentrations of plasma nicotine found were very low for all groups that drank nicotine, but not for the group that received nicotine through the osmotic minipump group. This is most likely because mice did not drink any nicotine in the hour prior to being sampled and all nicotine was metabolized. Indeed, when we look at the plasma concentration of cotinine, we see that cotinine was present in all of the groups. The plasma concentration of cotinine was similar in the groups for which “consumption” was forced: forced drinking in the home cage (HC) or infusion through osmotic minipump. This indicates that the plasma concentration of cotinine is similar whether mice drink nicotine (100 ug/ml) or whether nicotine is infused with the minipump (10 mg/kg/d). For Souris city and the two-bottle choice procedure, the cotinine concentrations were in the same range (mostly between 0-100 ng/ml). Globally, the concentrations of nicotine and cotinine found in the plasma of mice that underwent the two-bottle choice procedure are in the range of what has been previously described (Matta, S. G. et al. 2006. Guidelines on nicotine dose selection for in vivo research. Psychopharmacology 190, 269–319).

      Regarding the limitations of the two-bottle choice test, we discuss them more extensively in the current version of the manuscript.

      Statistical testing on subgroups. Mice are run through an assay and assigned to subgroups based on being classified as avoiders or non-avoiders. The authors then perform statistical testing to show differences between the avoiders and non-avoiders. It is circular to do so. When the authors divided the mice into avoiders and non-avoiders, this implies that the mice are different or from different distributions in terms of nicotine intake. Conducting a statistical test within the null hypothesis framework, however, implies that the null hypothesis is being tested. The null hypothesis, by definition, is that the groups do NOT differ. Obviously, the authors will find a difference between the groups in a statistical test when they pre-sorted the mice into two groups, to begin with. Comparing effect sizes or some other comparison that does not invoke the null hypothesis would be appropriate.

      Our analysis, which can be summarized as follows, is fairly standard (see Krishnan, V. et al. (2007) Molecular adaptations underlying susceptibility and resistance to social defeat in brain reward regions. Cell 131, 391–404). Firstly, the mice are segregated into two groups based on their consumption profile, using the variability in their behavior. The two groups are obviously statistically different when comparing their consumption. This first analytical step allows us to highlight the variability and to establish the properties of each sub-population in terms of consumption. Our analysis could support the reviewer's comment if it ended at this point. However, our analysis doesn't end here and moves on to the second step. The separation of the mice into two groups (which is now a categorical variable) is used to compare the distribution of other variables, such as mouse choice strategy and current amplitude, based on the 2 categories. The null hypothesis tested is that the value of these other variables is not different between groups. There is no a priori obvious reason for the currents recorded in the IPN to be different in the two groups. These approaches allow us to show correlations between the variables. Finally, in the third and last step, one (or several) variable(s) are manipulated to check whether nicotine consumption is modified accordingly. Manipulation was performed by exposing mice to chronic nicotine, by using mutant mice with decreased nicotinic currents, and by re-expressing the deleted nAChR subunit only in the IPN. This procedure is fairly standard, and cannot be considered as a circular analysis with data selection problem, as explained in (Kriegeskorte, N., Simmons, W. K., Bellgowan, P. S. F. & Baker, C. I. (2009) Circular analysis in systems neuroscience: the dangers of double dipping. Nature Neuroscience 12, 535-540).

      Decreased nicotine-evoked currents following passive exposure to nicotine in minipumps are inconsistent with published results showing that similar nicotine exposure enhances nAChR function via several measures (Arvin et al, J Neurosci, 2019). The paper does acknowledge this previous paper and suggests that the discrepancy is explained by the fact that they used a higher concentration of nicotine (30 uM) that was able to recruit the beta4containing receptor (whereas Arvin et al used a caged nicotine that was unable to do so). This may be true, but the citation of 30 uM nicotine undercuts the argument a bit because 30 uM nicotine is unlikely to be achieved in the brain of a person using tobacco products; nicotine levels in smokers are 100-500 nM. It should be noted in the paper that it is unclear whether the down-regulated receptors would be active at concentrations of nicotine found in the brain of a smoker.

      We indeed find opposite results compared to Arvin et al., and we give possible explanations for this discrepancy in the discussion. To be honest we don’t fully understand why we have opposite results. However, we clearly observed a decreased response to nicotine, both in vitro (with 30 µM nicotine on brain slices) and in vivo (with a classical dose of 30 µg/kg nicotine i.v.), while Arvin et al. only tested nicotine in vitro.

      Regarding the reviewer’s comment about the nicotine concentration used (30 µM): we used that concentration in vitro to measure nicotine-induced currents (it’s a concentration close to the EC50 for heteromeric receptors, which will likely recruit low affinity a3b4 receptors) and to evaluate the changes in nAChR current following nicotine exposure. We did not use that concentration to induce nAChR desensitization, so we don’t really understand the argument regarding the levels of nicotine in smokers. For inducing desensitization, we used a minipump that delivers a daily dose of 10 mg/kg/day, which is the amount of nicotine mice drink in our assay.

      The statement in lines 440-41 ("we show that concentrations of nicotine as low as 7.5 ug/kg can engage the IPN circuitry") is misleading, as the concentration in the water is not the same as the concentration in the CSF since the latter would be expected to build up over time. The paper did not provide measurements of nicotine in plasma or CSF, so concluding that the water concentration of nicotine is related to plasma concentrations of nicotine is only speculative.

      The sentence “we show that concentrations of nicotine as low as 7.5 ug/kg can engage the IPN circuitry" is not in the manuscript so the reviewer must have read another version of the paper.

      The results in Figure 2E do not appear to be from a normal distribution. For example, results cluster at low (~100 pA) responses, and a fraction of larger responses drive the similarities or differences.

      Indeed, that is why we performed a non-parametric Mann-Whitney test for comparing the two groups, as indicated in the legend of figure 2E.

      10 mg/kg/day in mice or rats is likely a non-physiological exposure to nicotine. Most rats take in 1.0 to 1.5 mg/kg over a 23-hour self-administration period (O'Dell, 2007). Mice achieve similar levels during SA (Fowler, Neuropharmacology 2011). Forced exposure to 10 mg/kg/day is therefore 5 to 10-fold higher than rodents would ever expose themselves to if given the choice. This should be acknowledged in a limitations section of the Discussion.

      The two-bottle choice task is very different from nicotine self-administration procedures in terms of administration route: oral versus injected (in the blood or in the brain), respectively. Therefore, the quantities of drug consumed cannot be directly compared. In our manuscript, mice consume on average 10 mg/kg/day of nicotine at the highest nicotine concentration tested, which is fully consistent with what was already published in many studies (20 mg/kg/day in Frahm et al. Neuron 2013, 5-10 mg/kg/day in Bagdas et al., NP 2020, 10-20 mg/kg/day in Bagdas et al. NP2019, to cite a few...). Hence, we used that concentration of nicotine (10 mg/kg/d) for chronic administration of nicotine using minipumps. This is also a nicotine concentration that is classically used in osmotic minipumps for chronic administration of nicotine: 10 mg/kg/d in Dongelmans et al. Nat. Com 2021 (our lab), 12 mg/kg/d in Arvin et al. J. Neuro. 2019 (Drenan lab), 12 mg/kg/d in Lotfipour et al. J. Neuro. 2013 (Boulter lab) etc… Therefore, we do not see the issue here.

      Are the in vivo recordings in IPN enriched or specific for cells that have a spontaneous firing at rest? If so, this may or may not be the same set/type of cells that are recorded in patch experiments. The results could be biased toward a subset of neurons with spontaneous firing. There are MANY different types of neurons in IPN that are largely intermingled (see Ables et al, 2017 PNAS), so this is a potential problem.

      It is true that there are many types of neurons in the IPN. In-vivo electrophysiology and slice electrophysiology should be considered as two complementary methods to obtain detailed properties of IPN neurons. The populations sampled by these two methods are certainly not identical (IPR in patch -clamp versus mostly IPR and IPC in vivo), and indeed only spontaneously active neurons are recorded in in-vivo electrophysiology. The question is whether this is or not a potential problem. The results we obtained using in-vivo and brain-slice electrophysiology are consistent (i.e., a decreased response to nicotine), which indicates that our results are robust and do not depend on the selection of a particular subpopulation. In addition, we now provide the maps of the neurons recorded both in slices and in vivo (see supplementary figures, and response to the other two referees). We show that, overall, there is no bias sampling between the different groups. Together, these new analyses strongly suggest that the differences we observe between the groups are not due to sampling issues. We have added the Ables 2017 reference and are discussing neuron variability more extensively in the revised manuscript.

      Related to the above issue, which of the many different IPN neuron types did the group re-express beta4? Could that be controlled or did beta4 get re-expressed in an unknown set of neurons in IPN? There is insufficient information given in the methods for verification of stereotaxic injections.

      Re-expression of b4 was achieved with a strong, ubiquitous promoter (pGK), hence all cell types should in principle be transduced. This is now clearly stated in the result section, the figure legend and the method section. Unfortunately, we had no access to a specific mouse line to restrict expression of b4 to b4-expressing cells, since the b4-Cre line of GENSAT is no more alive. This mouse line was problematic anyways because expression levels of the a3, a5 and b4 nAChR subunits, which belong to the same gene cluster, were reported to be affected. Yet, we show in this article that deleting b4 leads to a strong reduction of nicotine-induced currents in the IPR (80%, patch-clamp), and of the response to nicotine in vivo (65%). These results indicate that b4 is strongly expressed in the IPN, likely in a large majority of IPR and IPC neurons (see also our response to reviewer 1). In addition, we show that our re-expression strategy restores nicotine-induced currents in patch-clamp experiments and also the response to nicotine in vivo (new Figure 5C). Non-native expression levels could potentially be achieved (e.g. overexpression) but this is not what we observed: responses to nicotine were restored to the WT levels (in slices and in vivo). And importantly this strategy rescued the WT phenotype in terms of nicotine consumption. Expression of b4 alone in cells that do not express any other nAChR subunit (as, presumably, in the lateral parts of the IPN, see GENSAT images above) should not produce any functional nAChR, since alpha subunits are mandatory to produce functional receptors. As specified in the manuscript, proper transduction of the IPN was verified using post-hoc immunochemistry, and mice with transduction of b4 in the VTA were excluded from the analyses.

      Data showing that alpha3 or beta4 disruption alters MHb/IPN nAChR function and nicotine 2BC intake is not novel. In fact, some of the same authors were involved in a paper in 2011 (Frahm et al., Neuron) showing that enhanced alpha3beta4 nAChR function was associated with reduced nicotine consumption. The present paper would therefore seem to somewhat contradict prior findings from members of the research group.

      Frahm et al used a transgenic mouse line (called TABAC) in which the expression of a3b4 receptor is increased, and they observed reduced nicotine consumption. We do the exact opposite: we reduce (a3)b4 receptor expression (using the b4 knock-out line, or by putting mice under chronic nicotine), and observe increased consumption. There is thus no contradiction. In fact, we discuss our findings in the light of Frahm et al. in the discussion section.

      Sex differences. All studies were conducted in male mice, therefore nothing was reported regarding female nicotine intake or physiology responses. Nicotine-related biology often shows sex differences, and there should be a justification provided regarding the lack of data in females. A limitations section in the Discussion section is a good place for this.

      We agree with the reviewer. We added a sentence in the discussion.

    1. As educators, it is important to understand that asking students to use apps or digital tools for learning activities gives companies the opportunity to collect data on them.

      This is scary to think about. I have never thought about how so many websites may have my information simply because I signed up for websites. I feel like this should be especially concerning for younger students, these websites may have access of students' address, school, name, date of birth, etc. This also made me realize that we need to teach more about internet safety and teach about this kind of stuff.

    2. Similarly, end-user license agreements (EULA) and terms of service (TOS) agreements feature opaque language that may cause you to give away your right to privacy without truly understanding what you are doing when you click “I agree.”

      While I know that companies do this on purpose to gain rights to your information and for that reason any attempt to simplify these agreements will have pushback, I really think their needs to be an extension, tool, or platform that will put these forms into simpler terms. This is a prime example of the U in Pour (understandable), and it reminds me of how error signs need to tell us what's wrong in simple language so we can fix it just as these terms of agreement should be in understandable words and syntax so we know what we're signing.

    3. can you really have privacy?

      First thing that comes to mind is no of course. I think of how much privacy I have on the media. Even though technology is such a great advantage and that many of us use everyday, it's over course not as protected as we may think.

    1. Propuso el Memex en As we may Think.

      ¿De qué manera una maquina puede cambiar la manera en que pienso? ¿Un ejemplo podría estar relacionado con la imprenta y la forma en que leemos?

    1. As we may Think.

      Aquí, describió una máquina que combinaría tecnologías de bajo nivel para lograr un mayor nivel de conocimiento organizado (como los procesos de la memoria humana)

    1. Bacon's dictum regarding the proneness of the mind, in explanation, towards unity and simplicity, at no matter what sacrifice of material, has found no more striking exemplification than that offered in the fortunes of psychology. The least developed of the sciences, for a hundred years it has borne in its presentations the air of the one most completely finished. The infinite detail and complexity of the simplest psychical life, its interweavings with the physical organism, with the life of others in the social organism,-- created no special difficulty; and in a book like James Mill's Analysis we find every mental phenomenon not only explained, but explained by reference to one principle. That rich and colored experience, never the same in two nations, in two individuals, in two moments of the same life,-- whose thoughts, desires, fears, and hopes have furnished the material for the ever-developing literature of the ages, for a Homer and a Chaucer, a Sophocles and a Shakespeare, for the unwritten tragedies and comedies of daily life,-- was neatly and carefully dissected, its parts labeled and stowed away in their proper pigeon-holes, the inventory taken, and the whole stamped with the stamp of un fait accompli. Schematism was supreme, and the air of finality was over all. We know better now. We know that that life of man whose unfolding furnishes psychology its material is the most difficult and complicated subject which man can investigate. We have some consciousness of its ramifications and of its connections. We see that man is somewhat more than a neatly dovetailed psychical machine who may be taken as an isolated individual, laid on the dissecting table of analysis and duly anatomized. We know that his life is bound up with the life of society, of the nation in the ethos and nomos; we know that he is closely connected with all the past by the lines of education, tradition, and heredity; we know that man is indeed the microcosm who has gathered into himself the riches of the world, both of space and of time, the world physical and the world psychical. We know also of the complexities of the individual life. We know that our mental life is not a syllogistic sorites, but an enthymeme most of whose members are suppressed; that large tracts never come into consciousness; that those which do get into consciousness, are vague and transitory, with a meaning hard to catch and read; are infinitely complex, involving traces of the entire life history of the individual, or are vicarious, having significance only in that for which they stand; that psychical life is a continuance, having no breaks into "distinct ideas which are separate existences"; that analysis is but a process of abstraction, leaving us with a parcel of parts from which the "geistige Band" is absent; that our distinctions, however necessary, are unreal and largely arbitrary; that mind is no compartment box nor bureau of departmental powers; in short, that we know almost nothing about the actual activities and processes of the soul. We know that the old psychology gave descriptions of that which has for the most part no existence, and which at the best it but described and did not explain. I do not say this to depreciate the work of the earlier psychologists. There is no need to cast stones at those who, having a work to do, did that work well and departed. With Sir William Hamilton and J. Stuart Mill the school passed away. It is true that many psychologists still use their language and follow their respective fashions. Their influence, no doubt, is yet everywhere felt. But changed conditions are upon us, and thought, no more than revolution, goes backward. Psychology can live no better in the past than physiology or physics; but there is no more need for us to revile Hume and Reid for not giving birth to a full and complete science, than there is for complaining that Newton did not anticipate the physical knowledge of to-day, or Harvey the physiological. The work of the earlier psychologists bore a definite and necessary relation both to the scientific conditions and the times in which it was done. If they had recognized the complexity of the subject and attempted to deal with it, the science would never have been begun. The very condition of its existence was the neglect of the largest part of the material, the seizing of a few schematic ideas and principles, and their use for universal explanation. Very mechanical and very abstract to us, no doubt, seems their division of the mind into faculties, the classification of mental phenomena into the regular, graded, clear-cut series of sensation, image, concept, etc.; but let one take a look into the actual processes of his own mind, the actual course of the mental life there revealed, and he will realize how utterly impossible were the description, much more the explanation, of what goes on there, unless the larger part of it were utterly neglected, and a few broad schematic rubrics seized by which to reduce this swimming chaos to some semblance of order. Again, the history of all science demonstrates that much of its progress consists in bringing to light problems. Lack of consciousness of problems, even more than lack of ability to solve them, is the characteristic of the non-scientific mind. Problems cannot be solved till they are seen and stated, and the work of the earlier psychologists consisted largely in this sort of work. Further, they were filled with the Zeitgeist of their age, the age of the eighteenth century and the Aufklärung, which found nothing difficult, which hated mystery and complexity, which believed with all its heart in principles, the simpler and more abstract the better, and which had the passion of completion. By this spirit, the psychologists as well as the other thinkers of the day were mastered, and under its influence they thought and wrote. Thus their work was conditioned by the nature of science itself, and by the age in which they lived. This work they did, and left to us a heritage of problems, of terminology, and of principles which we are to solve, reject, or employ as best we may. And the best we can do is to thank them, and then go about our own work; the worst is to make them the dividing lines of schools, or settle in hostile camps according to their banners. We are not called upon to defend them, for their work is in the past; we are not called upon to attack them, for our work is in the future. It will be of more use briefly to notice some of the movements and tendencies which have brought about the change of attitude, and created what may be called the "New Psychology." Not the slightest of these movements has been, of course, the reaction of the present century, from the abstract, if clear, principles of the eighteenth, towards concrete detail, even though it be confused. The general failure of the eighteenth century in all but destructive accomplishment forced the recognition of the fact that the universe is not so simple and easy a matter to deal with, after all; that there are many things in earth, to say nothing of heaven, which were not dreamed of in the philosophy of clearness and abstraction, whether that philosophy had been applied along the lines of the state, society, religion, or science. The world was sated with system and longed for fact. The age became realistic. That the movement has been accompanied with at least temporary loss in many directions, with the perishing of ideals, forgetfulness of higher purpose, decay of enthusiasm, absorption in the petty, a hard contentedness in the present, or a cynical pessimism as to both present and future, there can be no doubt. But neither may it be doubted that the movement was a necessity to bring the Antæaus of humanity back to the mother soil of experience, whence it derives its strength and very life, and to prevent it from losing itself in a substanceless vapor where its ideals and purposes become as thin and watery as the clouds towards which it aspires. Out of this movement and as one of its best aspects came that organized, systematic, tireless study into the secrets of nature, which, counting nothing common or unclean, thought no drudgery beneath it, or rather thought nothing drudgery,-- that movement which with its results had been the great revelation given to the nineteenth century to make. In this movement psychology took its place, and in the growth of physiology which accompanied it I find the first if not the greatest occasion of the development of the New Psychology. It is a matter in every one's knowledge that, with the increase of knowledge regarding the structure and functions of the nervous system, there has arisen a department of science known as physiological psychology, which has already thrown great light upon psychical matters. But unless I entirely misapprehend the popular opinion regarding the matter, there is very great confusion and error in this opinion, regarding the relations of this science to psychology. This opinion, if I rightly gather it, is, that physiological psychology is a science which does, or at least claims to, explain all psychical life by reference to the nature of the nervous system. To illustrate: very many professed popularizers of the results of scientific inquiry, as well as laymen, seem to think that the entire psychology of vision is explained when we have a complete knowledge of the anatomy of the retina, of its nervous connection with the brain, and of the centre in the latter which serves for visual functions; or that we know all about memory if we can discover that certain brain cells store up nervous impressions, and certain fibres serve to connect these cells,-- the latter producing the association of ideas, while the former occasion their reproduction. In short, the commonest view of physiological psychology seems to be that it is a science which shows that some or all of the events of our mental life are physically conditioned upon certain nerve-structures, and thereby explains these events. Nothing could be further from the truth. So far as I know, all the leading investigators clearly realize that explanations of psychical events, in order to explain, must themselves be psychical and not physiological. However important such knowledge as that of which we have just been speaking may be for physiology, it has of itself no value for psychology. It tells simply what and how physiological elements serve as a basis for psychical acts; what the latter are, or how they are to be explained, it tells us not at all. Physiology can no more, of itself, give us the what, why, and how of psychical life, than the physical geography of a country can enable us to construct or explain the history of the nation that has dwelt within that country. However important, however indispensable the land with all its qualities is as a basis for that history, that history itself can be ascertained and explained only through historical records and historic conditions. And so psychical events can be observed only through psychical means, and interpreted and explained by psychical conditions and facts. What can be meant, then, by saying that the rise of this physiological psychology has produced a revolution in psychology? This: that it has given a new instrument, introduced a new method,-- that of experiment, which has supplemented and corrected the old method of introspection. Psychical facts still remain psychical, and are to be explained through psychical conditions; but our means of ascertaining what these facts are and how they are conditioned have been indefinitely widened. Two of the chief elements of the method of experiment are variation of conditions at the will and under the control of the experimenter, and the use of quantitative measurement. Neither of these elements can be applied through any introspective process. Both may be through physiological psychology. This starts from the well-grounded facts that the psychical events known as sensations arise through bodily stimuli, and that the psychical events known as volitions result in bodily movements; and it finds in these facts the possibility of the application of the method of experimentation. The bodily stimuli and movements may be directly controlled and measured, and thereby, indirectly, the psychical states which they excite or express. There is no need at this day to dwell upon the advantages derived in any science from the application of experiment. We know well that it aids observation by indefinitely increasing the power of analysis and by permitting exact measurement, and that it equally aids explanation by enabling us so to vary the constituent elements of the case investigated as to select the indispensable. Nor is there need to call attention to the especial importance of experiment in a science where introspection is the only direct means of observation. We are sufficiently aware of the defects of introspection. We know that it is limited, defective, and often illusory as a means of observation, and can in no way directly explain. To explain is to mediate; to connect the given fact with an unseen principle; to refer the phenomenon to an antecedent condition,-- while introspection can deal only with the immediate present, with the given now. This is not the place to detail the specific results accomplished through this application of experiment to the psychological sphere; but two illustrations may perhaps be permitted: one from the realm of sensation, showing how it has enabled us to analyze states of consciousness which were otherwise indecomposable; and the other from that of perception, showing how it has revealed processes which could be reached through no introspective method. It is now well known that no sensation as it exists in consciousness is simple or ultimate. Every color sensation, for example, is made up by at least three fundamental sensory quales, probably those of red, green, and violet; while there is every reason to suppose that each of these qualities, far from being simple, is compounded of an indefinite number of homogeneous units. Thus the simplest musical sensation has also been experimentally proved to be in reality not simple, but doubly compound. First, there is the number of qualitatively like units constituting it which occasion the pitch of the note, according to the relations of time in which they stand to each other; and second, there is the relation which one order of these units bears to other secondary orders, which gives rise to the peculiar timbre or tone-color of the sound; while in a succession of notes these relations are still further complicated by those which produce melody and harmony. And all this complexity occurs, be it remembered, in a state of consciousness which, to introspection, is homogeneous and ultimate. In these respects physiology has been to psychology what the microscope is to biology, or analysis to chemistry. But the experimental method has done more than reveal hidden parts, or analyze into simpler elements. It has aided explanation, as well as observation, by showing the processes which condition a psychical event. This is nowhere better illustrated than in visual perception. It is already almost a commonplace of knowledge that, for example, the most complex landscape which we can have before our eyes, is, psychologically speaking, not a simple ultimate fact, nor an impression stamped upon us from without, but is built up from color and muscular sensations, with, perhaps, unlocalized feelings of extension, by means of the psychical laws of interest, attention, and interpretation. It is, in short, a complex judgment involving within itself emotional, volitional, and intellectual elements. The knowledge of the nature of these elements, and of the laws which govern their combination into the complex visual scene, we owe to physiological psychology, through the new means of research with which it has endowed us. The importance of such a discovery can hardly be overestimated. In fact, this doctrine that our perceptions are not immediate facts, but are mediated psychical processes, has been called by Helmholtz the most important psychological result yet reached. But besides the debt we owe Physiology for the method of experiment, is that which is due her for an indirect means of investigation which she has put within our hands; and it is this aspect of the case which has led, probably, to such misconceptions of the relations of the two sciences as exist. For while no direct conclusions regarding the nature of mental activities or their causes can be drawn from the character of nervous structure or function, it is possible to reason indirectly from one to the other, to draw analogies and seek confirmation. That is to say, if a certain nervous arrangement can be made out to exist, there is always a strong presumption that there is a psychical process corresponding to it; or if the connection between two physiological nerve processes can be shown to be of a certain nature, one may surmise that the relation between corresponding psychical activities is somewhat analogous. In this way, by purely physiological discoveries, the mind may be led to suspect the existence of some mental activity hitherto overlooked, and attention directed to its workings, or light may be thrown on points hitherto obscure. Thus it was, no doubt, the physiological discovery of the time occupied in transmission of a nervous impulse that led the German psychologists to their epoch-making investigations regarding the time occupied in various mental activities; thus, too, the present psychological theories regarding the relation of the intellectual and volitional tracts of minds were undoubtedly suggested and largely developed in analogy with Bell's discovery of the distinct nature of the sensory and motor nerves. Again, the present theory that memory is not a chamber hall for storing up ideas and their traces or relies, but is lines of activity along which the mind habitually works, was certainly suggested from the growing physiological belief that the brain cells which form the physical basis of memory do not in any way store up past impressions or their traces, but have, by these impressions, their structure so modified as to give rise to a certain functional mode of activity. Thus many important generalizations might be mentioned which were suggested and developed in  analogy with physiological discoveries. The influence of biological science in general upon psychology has been very great. Every important development in science contributes to the popular consciousness, and indeed to philosophy, some new conception which serves for a time as a most valuable category of classification and explanation. To biology is due the conception of organism. Traces of the notion are found long before the great rise of biological science, and, in particular, Kant has given a complete and careful exposition of it; but the great rôle which the "organic" conception has played of late is doubtless due in largest measure to the growth of biology. In psychology this conception has led to the recognition of mental life as an organic unitary process developing according to the laws of all life, and not a theatre for the exhibition of independent autonomous faculties, or a rendezvous in which isolated, atomic sensations and ideas may gather, hold external converse, and then forever part. Along with this recognition of the solidarity of mental life has come that of the relation in which it stands to other lives organized in society. The idea of environment is a necessity to the idea of organism, and with the conception of environment comes the impossibility of considering psychical life as an individual, isolated thing developing in a vacuum. This idea of the organic relation of the individual to that organized social life into which he is born, from which he draws his mental and spiritual sustenance, and in which he must perform his proper function or become a mental and moral wreck, forms the transition to the other great influence which I find to have been at work in developing the New Psychology. I refer to the growth of those vast and as yet undefined topics of inquiry which may be vaguely designated as the social and historical sciences,-- the sciences of the origin and development of the various spheres of man's activity. With the development of these sciences has come the general feeling that the scope of psychology has been cabined and cramped till it has lost all real vitality, and there is now the recognition of the fact that all these sciences possess their psychological sides, present psychological material, and demand treatment and explanation at the hands of psychology. Thus the material for the latter, as well as its scope, have been indefinitely extended. Take the matter of language. What a wealth of material and of problems it offers. How did it originate; was it contemporaneous with that of thought, or did it succeed it; how have they acted and reacted upon each other; what psychological laws have been at the basis of the development and differentiation of languages, of the development of their structure and syntax, of the meaning of words, of all the rhetorical devices of language. Any one at all acquainted with modern discussions of language will recognize at a glance that the psychological presentation and discussion of such problems is almost enough of itself to revolutionize the old method of treating psychology. In the languages themselves, moreover, we have a mine of resources, which, as a record of the development of intelligence, can be compared only to the importance of the paleontological record to the student of animal and vegetable life. But this is only one aspect, and not comparatively a large one, of the whole field. Folk-lore and primitive culture, ethnology and anthropology, all render their contributions of matter, and press upon us the necessity of explanation. The origin and development of myth, with all which it includes, the relation to the nationality, to language, to ethical ideas, to social customs, to government and the state, is itself a psychological field wider than any known to the previous century. Closely connected with this is the growth of ethical ideas, their relations to the consciousness and activities of the nation in which they originate, to practical morality, and to art. Thus I could go through the various spheres of human activity, and point out how thoroughly they are permeated with psychological questions and material. But it suffices to say that history in its broadest aspect is itself a psychological problem, offering the richest resources of matter. Closely connected with this, and also influential in the development of the New Psychology, is that movement which may be described as the commonest thoughts of everyday life in all its forms, whether normal or abnormal. The cradle and the asylum are becoming the laboratory of the psychologist of the latter half of the nineteenth century. The study of children's minds, the discovery of their actual thoughts and feelings from babyhood up, the order and nature of the development of their mental life and the laws governing it, promises to be a mine of greatest value. When it was recognized that insanities are neither supernatural interruptions nor utterly inexplicable "visitations," it gradually became evident that they were but exaggerations of certain of the normal workings of the mind, or lack of proper harmony and co-ordination among these workings; and thus another department of inquiries, of psychical experiments performed by nature, was opened to us, which has already yielded valuable results. Even the prison and the penitentiary have made their contributions. If there be any need of generalizing the foregoing, we may say that the development of the New Psychology has been due to the growth, on the one hand, of the science of physiology, giving us the method of experiment, and, on the other, of the sciences of humanity in general, giving us the method of objective observation, both of which indefinitely supplement and correct the old method of subjective introspection. So much for the occasioning causes and method of the New Psychology. Are its results asked for? It will be gathered, from what has already been said, that its results cannot be put down in black and white like those of a mathematical theory. It is a movement, no system. But as a movement it has certain general features. The chief characteristic distinguishing it from the old psychology is undoubtedly the rejection of a formal logic as its model and test. The old psychologists almost without exception held to a nominalistic logic. This of itself were a matter of no great importance, were it not for the inevitable tendency and attempt to make living concrete facts of experience square with the supposed norms of an abstract, lifeless thought, and to interpret them in accordance with its formal conceptions. This tendency has nowhere been stronger than in those who proclaimed that "experience" was the sole source of all knowledge. They emasculated experience till their logical conceptions could deal with it; they sheared it down till it would fit their logical boxes; they pruned it till it presented a trimmed tameness which would shock none of their laws; they preyed upon its vitality till it would go into the coffin of their abstractions. And neither so-called "school" was free from this tendency. The two legacies of fundamental principles which Hume left, were: that every distinct idea is a separate existence, and that every idea must be definitely determined in quantity and quality. By the first he destroyed all relation but accident; by the second he denied all universality. But these principles are framed after purely logical models; they are rather the abstract logical principles of difference and identity, of A is A and A is not B, put in the guise of a psychological expression. And the logic of concrete experience, of growth and development, repudiates such abstractions. The logic of life transcends the logic of nominalistic thought. The reaction against Hume fell back on certain ultimate, indecomposable, necessary first truths immediately known through some mysterious simple faculty of the mind. Here again the logical model manifests itself. Such intuitions are not psychological; they are conceptions bodily imported from the logical sphere. Their origin, tests, and character are all logical. But the New Psychology would not have necessary truths about principles; it would have the touch of reality in the life of the soul. It rejects the formalistic intuitionalism for one which has been well termed dynamic. It believes that truth, that reality, not necessary beliefs about reality, is given in the living experience of the soul's development. Experience is realistic, not abstract. Psychical life is the fullest, deepest, and richest manifestation of this experience. The New Psychology is content to get its logic from this experience, and not do violence to the sanctity and integrity of the latter by forcing it to conform to certain preconceived abstract ideas. It wants the logic of fact, of process, of life. It has within its departments of knowledge no psycho-statics, for it can nowhere find spiritual life at rest. For this reason, it abandons all legal fiction of logical and mathematical analogies and rules; and is willing to throw itself upon experience, believing that the mother which has borne it will not betray it. But it makes no attempts to dictate to this experience, and tell it what it must be in order to square with a scholastic logic. Thus the New Psychology bears the realistic stamp of contact with life. From this general characteristic result most of its features. It has already been noticed that it insists upon the unity and solidarity of psychical life against abstract theories which would break it up into atomic elements or independent powers. It lays large stress upon the will; not as an abstract power of unmotivated choice, nor as an executive power to obey the behests of the understanding, the legislative branch of the psychical government, but as a living bond connecting and conditioning all mental activity. It emphasizes the teleological element, not in any mechanical or external sense, but regarding life as an organism in which immanent ideas or purposes are realizing themselves through the development of experience. Thus modern psychology is intensely ethical in its tendencies. As it refuses to hypostatize abstractions into self-subsistent individuals, and as it insists upon the automatic spontaneous elements in man's life, it is making possible for the first time an adequate psychology of man's religious nature and experience. As it goes into the depths of man's nature it finds, as stone of its foundation, blood of its life, the instinctive tendencies of devotion, sacrifice, faith, and idealism which are the eternal substructure of all the struggles of the nations upon the altar stairs which slope up to God. It finds no insuperable problems in the relations of faith and reason, for it can discover in its investigations no reason which is not based upon faith, and no faith which is not rational in its origin and tendency. But to attempt to give any detailed account of these features of the New Psychology would be to go over much of the recent discussions of ethics and theology. We can conclude only by saying that, following the logic of life, it attempts to comprehend life.

      The thing I derived of this article is that we must understand the past to progress without developing what has been established for us we will fail as a society if there is not a evaluation of the past before doing trying to further psychological breakthrough.

    1. In Russian, by the word krasota (beauty) we mean only that which pleases the sight. And though latterly people have begun to speak of “an ugly deed,” or of “beautiful music,” it is not good Russian. A Russian of the common folk, not knowing foreign languages, will not understand you if you tell him that a man who has given his last coat to another, or done anything similar, has acted “beautifully,” that a man who has cheated another has done an “ugly” action, or that a song is “beautiful.” In Russian a deed may be kind and good, or unkind and bad. Music may be pleasant and good, or unpleasant and bad; but there can be no such thing as “beautiful” or “ugly” music.

      What do you think about this, Meliora students? How much do you think language influences our perception of "beauty" or "art?" Which came first? Find an example of a linguist's interpretation and summarize it.

    1. Diversity as a term stands for the differences that exist among all individuals in a society and then workplaces and other smaller settings. These differences are based on race, religion, ethnicity, age, nationality, political perspectives, and religious commitments. They also include distinct views, values, and ideas. A diverse workspace includes people with varying characteristics and beliefs in an equal and respective manner. Many leaders tend to think that implementing diversity in their company is a challenging task. However, they fail to understand that the benefits are well worth the effort.

      Diversity celebrates uniqueness, stories, perspectives, histories, etc...

      Equity recognizes that every person has different starting point- some of us start with advantages, others may have started with disadvantages

      Inclusiveness... is a result of Diversity + Equity. When the effort is made to celebrate our special sauce AND make room at the tables we sit at, THEN we've got an inclusive workplace

      Either end of the spectrum is either tokenization or erasure.

    1. Author Response

      Reviewer #2 (Public Review):

      I believe the authors succeeded in finding neural evidence of reactivation during REM sleep. This is their main claim, and I applaud them for that. I also applaud their efforts to explore their data beyond this claim, and I think they included appropriate controls in their experimental design. However, I found other aspects of the paper to be unclear or lacking in support. I include major and medium-level comments:

      Major comments, grouped by theme with specifics below:

      Theta.

      Overall assessment: the theta effects are either over-emphasized or unclear. Please either remove the high/low theta effects or provide a better justification for why they are insightful.

      Lines ~ 115-121: Please include the statistics for low-theta power trials. Also, without a significant difference between high- and low-theta power trials, it is unclear why this analysis is being featured. Does theta actually matter for classification accuracy?

      Lines 123-128: What ARE the important bands for classification? I understand the point about it overlapping in time with the classification window without being discriminative between the conditions, but it still is not clear why theta is being featured given the non-significant differences between high/low theta and the lack of its involvement in classification. REM sleep is high in theta, but other than that, I do not understand the focus given this lack of empirical support for its relevance.

      Line 232-233: "8). In our data, trials with higher theta power show greater evidence of memory reactivation." Please do not use this language without a difference between high and low theta trials. You can say there was significance using high theta power and not with low theta power, but without the contrast, you cannot say this.

      Thank you, we have taken this point onboard. We thought the differences observed between classification in high and low theta power trials were interesting, but we can see why the reviewer feels there is a need for a stronger hypothesis here before reporting them. We have therefore removed this approach from the manuscript, and no longer split trials into high and low theta power.

      Physiology / Figure 2.

      Overall assessment: It would be helpful to include more physiological data.

      It would be nice, either in Figure 2 or in the supplement, to see the raw EEG traces in these conditions. These would be especially instructive because, with NREM TMR, the ERPs seem to take a stereotypical pattern that begins with a clear influence of slow oscillations (e.g., in Cairney et al., 2018), and it would be helpful to show the contrast here in REM.

      We thank the reviewer for these comments. We have now performed ERP and time-frequency analyses following a similar approach to that of (Cairney et al., 2018). We have added a section in the results for these analyses as follows:

      “Elicited response pattern after TMR cues

      We looked at the TMR-elicited response in both time-frequency and ERP analyses using a method similar to the one used in (Cairney et al., 2018), see methods. As shown in Figure 2a, the EEG response showed a rapid increase in theta band followed by an increase in beta band starting about one second after TMR onset. REM sleep is dominated by theta activity, which is thought to support the consolidation process (Diekelmann & Born, 2010), and increased theta power has previously been shown to occur after successful cueing during sleep (Schreiner & Rasch, 2015). We therefore analysed the TMR-elicited theta in more detail. Focussing on the first second post-TMR-onset, we found that theta was significantly higher here than in the baseline period, prior to the cue [-300 -100] ms, for both adaptation (Wilcoxon signed rank test, n = 14, p < 0.001) and experimental nights (Wilcoxon signed rank test, n = 14, p < 0.001). The absence of any difference in theta power between experimental and adaptation conditions (Wilcoxon signed rank test, n = 14, p = 0.68), suggests that this response is related to processing of the sound cue itself, not to memory reactivation. Turning to the ERP analysis, we found a small increase in ERP amplitude immediately after TMR onset, followed by a decrease in amplitude 500ms after the cue. Comparison of ERPs from experimental and adaptation nights showed no significant difference, (n= 14, p > 0.1). Similar to the time-frequency result, this suggests that the ERPs observed here relate to the processing of the sound cues rather than any associated memory.“

      And we have updated Figure 2.

      Also, please expand the classification window beyond 1 s for wake and 1.4 s for sleep. It seems the wake axis stops at 1 s and it would be instructive to know how long that lasts beyond 1 s. The sleep signal should also go longer. I suggest plotting it for at least 5 seconds, considering prior investigations (Cairney et al., 2018; Schreiner et al., 2018; Wang et al., 2019) found evidence of reactivation lasting beyond 1.4 s.

      Regarding the classification window, this is an interesting point. TMR cues in sleep were spaced 1.5 s apart and that is why we included only this window in our classification. Extending our window beyond 1.5 s would mean that we considered the time when the next TMR cue was presented. Similarly, in wake the duration of trials was 1.1 s thus at 1.1 s the next tone was presented.

      Following the reviewer’s comment, we have extended our window as requested even though this means encroaching on the next trial. We do this because it could be possible that there is a transitional period between trials. Thus, when we extended the timing in wake and looked at reactivation in the range 0.5 s to 1.6 s we found that the effect continued to ~1.2 s vs adaptation and chance, e.g. it continued 100 ms after the trial. Results are shown in the figures below.

      Temporal compression/dilation.

      Overall assessment: This could be cut from the paper. If the authors disagree, I am curious how they think it adds novel insight.

      Line 179 section: In my opinion, this does not show evidence for compression or dilation. If anything, it argues that reactivation unfolds on a similar scale, as the numbers are clustered around 1. I suggest the authors scrap this analysis, as I do not believe it supports any main point of their paper. If they do decide to keep it, they should expand the window of dilation beyond 1.4 in Figure 3B (why cut off the graph at a data point that is still significant?). And they should later emphasize that the main conclusion, if any, is that the scales are similar.

      Line 207 section on the temporal structure of reactivation, 1st paragraph: Once again, in my opinion, this whole concept is not worth mentioning here, as there is not really any relevant data in the paper that speaks to this concept.

      We thank the reviewer for these frank comments. On consideration, we have now removed the compression/dilation analysis.

      Behavioral effects.

      Overall assessment: Please provide additional analyses and discussion.

      Lines 171-178: Nice correlation! Was there any correlation between reactivation evidence and pre-sleep performance? If so, could the authors show those data, and also test whether this relationship holds while covarying our pre-sleep performance? The logic is that intact reactivation may rely on intact pre-sleep performance; conversely, there could be an inverse relationship if sleep reactivation is greater for initially weaker traces, as some have argued (e.g., Schapiro et al., 2018). This analysis will either strengthen their conclusion or change it -- either outcome is good.

      Thanks for these interesting points. We have now performed a new analysis to check if there was a correlation between classification performance and pre-sleep performance, but we found no significant correlation (n = 14, r = -0.39, p = 0.17). We have included this in the results section as follows:

      “Finally, we wanted to know whether the extent to which participants learned the sequence during training might predict the extent to which we could identify reactivation during subsequent sleep. We therefore checked for a correlation between classification performance and pre-sleep performance to determine whether the degree of pre-sleep learning predicted the extent of reactivation, this showed no significant correlation (n = 14, r = -0.39, p = 0.17). “

      Note that we calculated the behavioural improvement while subtracting pre-sleep performance and then normalising by it for both the cued and un-cued sequences as follows:

      [(random blocks after sleep - the best 4 blocks after sleep) – (random blocks pre-sleep – the best 4 blocks pre-sleep)] / (random blocks pre-sleep – the best 4 blocks pre-sleep).

      Unlike Schönauer et al. (2017), they found a strong correspondence between REM reactivation and memory improvement across sleep; however, there was no benefit of TMR cues overall. These two results in tandem are puzzling. Could the authors discuss this more? What does it mean to have the correlation without the overall effect? Or else, is there anything else that may drive the individual differences they allude to in the Discussion?

      We have now added a discussion of this point as follows:

      “We are at a very early phase in understanding what TMR does in REM sleep, however we do know that the connection between hippocampus and neocortex is inhibited by the high levels of Acetylcholine that are present in REM (Hasselmo, 1999). This means that the reactivation which we observe in the cortex is unlikely to be linked to corresponding hippocampal reactivation, so any consolidation which occurs as a result of this is also unlikely to be linked to the hippocampus. The SRTT is a sequencing task which relies heavily on the hippocampus, and our primary behavioural measure (Sequence Specific Skill) specifically examines the sequencing element of the task. Our own neuroimaging work has shown that TMR in non-REM sleep leads to extensive plasticity in the medial temporal lobe (Cousins et al., 2016). However, if TMR in REM sleep has no impact on the hippocampus then it is quite possible that it elicits cortical reactivation and leads to cortical plasticity but provides no measurable benefit to Sequence Specific Skill. Alternatively, because we only measured behavioural improvement right after sleep it is possible that we may have missed behavioural improvements that would have emerged several days later, as we know can occur in this task (Rakowska et al., 2021).”

      Medium-level comments

      Lines 63-65: "We used two sequences and replayed only one of them in sleep. For control, we also included an adaptation night in which participants slept in the lab, and the same tones that would later be played during the experimental night were played."

      I believe the authors could make a stronger point here: their design allowed them to show that they are not simply decoding SOUNDS but actual memories. The null finding on the adaptation night is definitely helpful in ruling this possibility out.

      We agree and would like to thank the reviewer for this point. We have now included this in the text as follows: “This provided an important control, as a null finding from this adaptation night would ensure that we are decoding actual memories, not just sounds. “

      Lines 129-141: Does reactivation evidence go down (like in their prior study, Belal et al., 2018)? All they report is theta activity rather than classification evidence. Also, I am unclear why the Wilcoxon comparison was performed rather than a simple correlation in theta activity across TMR cues (though again, it makes more sense to me to investigate reactivation evidence across TMR cues instead).

      Thanks a lot for the interesting point. In our prior study (Belal et. al. 2018), the classification model was trained on wake data and then tested on sleep data, which enabled us to examine its performance at different timepoints in sleep. However in the current study the classifier was trained on sleep and tested on wake, so we can only test for differential replay at different times during the night by dividing the training data. We fear that dividing sleep trials into smaller blocks in this way will lead to weakly trained classifiers with inaccurate weight estimation due to the few training trials, and that these will not be generalisable to testing data. Nevertheless, following your comment, we tried this, by dividing our sleep trials into two blocks, e.g. the first half of stimulation during the night and the second half of stimulation during the night. When we ran the analysis on these blocks separately, no clusters were found for either the first or second halves of stimulation compared to adaptation, probably due to the reasons cited above. Hence the differences in design between the two studies mean that the current study does not lend itself to this analysis.

      Line 201: It seems unclear whether they should call this "wake-like activity" when the classifier involved training on sleep first and then showing it could decode wake rather than vice versa. I agree with the author's logic that wake signals that are specific to wake will be unhelpful during sleep, but I am not sure "wake-like" fits here. I'm not going to belabor this point, but I do encourage the authors to think deeply about whether this is truly the term that fits.

      We agree that a better terminology is needed, and have now changed this: “In this paper we demonstrated that memory reactivation after TMR cues in human REM sleep can be decoded using EEG classifiers. Such reactivation appears to be most prominent about one second after the sound cue onset. ”

      Reviewer #3 (Public Review):

      The authors investigated whether reactivation of wake EEG patterns associated with left- and right-hand motor responses occurs in response to sound cues presented during REM sleep.

      The question of whether reactivation occurs during REM is of substantial practical and theoretical importance. While some rodent studies have found reactivation during REM, it has generally been more difficult to observe reactivation during REM than during NREM sleep in humans (with a few notable exceptions, e.g., Schonauer et al., 2017), and the nature and function of memory reactivation in REM sleep is much less well understood than the nature and function of reactivation in NREM sleep. Finding a procedure that yields clear reactivation in REM in response to sound cues would give researchers a new tool to explore these crucial questions.

      The main strength of the paper is that the core reactivation finding appears to be sound. This is an important contribution to the literature, for the reasons noted above.

      The main weakness of the paper is that the ancillary claims (about the nature of reactivation) may not be supported by the data.

      The claim that reactivation was mediated by high theta activity requires a significant difference in reactivation between trials with high theta power and trials with low theta, but this is not what the authors found (rather, they have a "difference of significances", where results were significant for high theta but not low theta). So, at present, the claim that theta activity is relevant is not adequately supported by the data.

      The authors claim that sleep replay was sometimes temporally compressed and sometimes dilated compared to wakeful experience, but I am not sure that the data show compression and dilation. Part of the issue is that the methods are not clear. For the compression/dilation analysis, what are the features that are going into the analysis? Are the feature vectors patterns of power coefficients across electrodes (or within single electrodes?) at a single time point? or raw data from multiple electrodes at a single time point? If the feature vectors are patterns of activity at a single time point, then I don't think it's possible to conclude anything about compression/dilation in time (in this case, the observed results could simply reflect autocorrelation in the time-point-specific feature vectors - if you have a pattern that is relatively stationary in time, then compressing or dilating it in the time dimension won't change it much). If the feature vectors are spatiotemporal patterns (i.e., the patterns being fed into the classifier reflect samples from multiple frequencies/electrodes / AND time points) then it might in principle be possible to look at compression, but here I just could not figure out what is going on.

      Thank you. We have removed the analysis of temporal compression and dilation from the manuscript. However, we wanted to answer anyway. In this analysis, raw data were smoothed and used as time domain features. The data was then organized as trials x channels x timepoints then we segmented each trial in time based on the compression factor we are using. For instance, if we test if sleep is 2x faster than wake we look at the trial lengths in wake which was 1.1 sec. and we take half of this value which is 0.55 sec. we then take a different window in time from sleep data such that each sleep trial will have multiple smaller segments each of 0.55 sec., we then add those segments as new trials and label them with the respective trial label. Afterwards, we resize those segments temporally to match the length of wake trials. We now reshape our data from trials x channels x timepoints to trials x channels_timepoints so we aggregate channels and timepoints into one dimension. We then feed this to PCA to reduce the dimensionality of channels_timepoints into principal components. We then feed the resultant features to a LDA classifier for classification. This whole process is repeated for every scaling factor and it is done within participant in the same fashion the main classification was done and the error bars were the standard errors. We compared the results from the experimental night to those of the adaptation night.

      For the analyses relating to classification performance and behavior, the authors presently show that there is a significant correlation for the cued sequence but not for the other sequence. This is a "difference of significances" but not a significant difference. To justify the claim that the correlation is sequence-specific, the authors would have to run an analysis that directly compares the two sequences.

      Thanks a lot. We have now followed this suggestion by examining the sequence specific improvement after removing the effect of the un-cued sequence from the cued sequence. This was done by subtracting the improvement of the un-cued sequence from the improvement for the cued sequence, and then normalising the result by the improvement of the un-cued sequence. The resulting values, which we term ‘cued sequence improvement’ showed a significant correlation with classification performance (n = 14, r = 0.56, p = 0.04). We have therefore amended this section of the manuscript as follows: We have updated the text as follows: “We therefore set out to determine whether there was a relationship between the extent to which we could classify reactivation and overnight improvement on the cued sequence. This revealed a positive correlation (n = 14, r = 0.56, p = 0.04), Figure 3b.”

    2. Reviewer #2 (Public Review):

      I believe the authors succeeded in finding neural evidence of reactivation during REM sleep. This is their main claim, and I applaud them for that. I also applaud their efforts to explore their data beyond this claim, and I think they included appropriate controls in their experimental design. However, I found other aspects of the paper to be unclear or lacking in support. I include major and medium-level comments:

      Major comments, grouped by theme with specifics below:<br /> Theta.<br /> Overall assessment: the theta effects are either over-emphasized or unclear. Please either remove the high/low theta effects or provide a better justification for why they are insightful.

      Lines ~ 115-121: Please include the statistics for low-theta power trials. Also, without a significant difference between high- and low-theta power trials, it is unclear why this analysis is being featured. Does theta actually matter for classification accuracy?

      Lines 123-128: What ARE the important bands for classification? I understand the point about it overlapping in time with the classification window without being discriminative between the conditions, but it still is not clear why theta is being featured given the non-significant differences between high/low theta and the lack of its involvement in classification. REM sleep is high in theta, but other than that, I do not understand the focus given this lack of empirical support for its relevance.

      Line 232-233: "8). In our data, trials with higher theta power show greater evidence of memory reactivation." Please do not use this language without a difference between high and low theta trials. You can say there was significance using high theta power and not with low theta power, but without the contrast, you cannot say this.

      Physiology / Figure 2.<br /> Overall assessment: It would be helpful to include more physiological data.

      It would be nice, either in Figure 2 or in the supplement, to see the raw EEG traces in these conditions. These would be especially instructive because, with NREM TMR, the ERPs seem to take a stereotypical pattern that begins with a clear influence of slow oscillations (e.g., in Cairney et al., 2018), and it would be helpful to show the contrast here in REM. Also, please expand the classification window beyond 1 s for wake and 1.4 s for sleep. It seems the wake axis stops at 1 s and it would be instructive to know how long that lasts beyond 1 s. The sleep signal should also go longer. I suggest plotting it for at least 5 seconds, considering prior investigations (Cairney et al., 2018; Schreiner et al., 2018; Wang et al., 2019) found evidence of reactivation lasting beyond 1.4 s.

      Temporal compression/dilation.<br /> Overall assessment: This could be cut from the paper. If the authors disagree, I am curious how they think it adds novel insight.

      Line 179 section: In my opinion, this does not show evidence for compression or dilation. If anything, it argues that reactivation unfolds on a similar scale, as the numbers are clustered around 1. I suggest the authors scrap this analysis, as I do not believe it supports any main point of their paper. If they do decide to keep it, they should expand the window of dilation beyond 1.4 in Figure 3B (why cut off the graph at a data point that is still significant?). And they should later emphasize that the main conclusion, if any, is that the scales are similar.

      Line 207 section on the temporal structure of reactivation, 1st paragraph: Once again, in my opinion, this whole concept is not worth mentioning here, as there is not really any relevant data in the paper that speaks to this concept.

      Behavioral effects.<br /> Overall assessment: Please provide additional analyses and discussion.

      Lines 171-178: Nice correlation! Was there any correlation between reactivation evidence and pre-sleep performance? If so, could the authors show those data, and also test whether this relationship holds while covarying our pre-sleep performance? The logic is that intact reactivation may rely on intact pre-sleep performance; conversely, there could be an inverse relationship if sleep reactivation is greater for initially weaker traces, as some have argued (e.g., Schapiro et al., 2018). This analysis will either strengthen their conclusion or change it -- either outcome is good.

      Unlike Schönauer et al. (2017), they found a strong correspondence between REM reactivation and memory improvement across sleep; however, there was no benefit of TMR cues overall. These two results in tandem are puzzling. Could the authors discuss this more? What does it mean to have the correlation without the overall effect? Or else, is there anything else that may drive the individual differences they allude to in the Discussion?

      Medium-level comments<br /> Lines 63-65: "We used two sequences and replayed only one of them in sleep. For control, we also included an adaptation night in which participants slept in the lab, and the same tones that would later be played during the experimental night were played."

      I believe the authors could make a stronger point here: their design allowed them to show that they are not simply decoding SOUNDS but actual memories. The null finding on the adaptation night is definitely helpful in ruling this possibility out.

      Lines 129-141: Does reactivation evidence go down (like in their prior study, Belal et al., 2018)? All they report is theta activity rather than classification evidence. Also, I am unclear why the Wilcoxon comparison was performed rather than a simple correlation in theta activity across TMR cues (though again, it makes more sense to me to investigate reactivation evidence across TMR cues instead).

      Line 201: It seems unclear whether they should call this "wake-like activity" when the classifier involved training on sleep first and then showing it could decode wake rather than vice versa. I agree with the author's logic that wake signals that are specific to wake will be unhelpful during sleep, but I am not sure "wake-like" fits here. I'm not going to belabor this point, but I do encourage the authors to think deeply about whether this is truly the term that fits.

    1. Author Response

      Reviewer #1 (Public Review):

      The authors of this manuscript aimed to systematically evaluate the pleiotropic effects of MCR-1-mediated colistin resistance. They evaluated the effect of MCR-1 and MCR-3 carried on different plasmids on antimicrobial peptides (AMPs) and assessed their ultimate effect on virulence. The authors find that MCR-1-mediated colistin resistance correlates with increased resistance against some host AMPs, but also increased sensitivity to others. The authors also find that MCR-1 alone is associated with resistance to human serum and to elements of the complement system. This highlights a potential selective advantage for MCR-1-mediated resistance to host immune factors and a potential for enhanced virulence.

      The methods have been well established before and adequately support their main findings. While determining the role of MCR-1 in a single genetic background is important to better understand its potential pleiotropic effects against a diversity of AMPs and in a variety of scenarios, the impact and significance of the results are partially ameliorated because different genetic backgrounds, particularly those most relevant to a clinical (or agricultural) context were not considered. The results depicted here are still a necessary and important step towards a more comprehensive understanding of the pleiotropic effects of MCR-1. But, interactions between plasmids and host genomes and their co-evolution can have important effects more generally. The authors do mention this in the discussion and suggest it to be an important avenue for future work. However, given the objective of the study and the clinical and agricultural context in which the authors have framed their work, it seems more relevant to include those distinct genetic backgrounds already here.

      The conclusions stemming from the results found in Figure 3, and Figures 4c and d seem too overreaching to me. The associated resistance to AMPs from pigs seems to be only strong enough against one of the five tested AMPs and hence concluding that these impose a strong selective pressure in the pig's gut seems unsubstantiated. Similarly, the difference in survival probability within their in vivo system, though statistically significant, seems to be very ild between their MCR-1 and empty vector control.

      Thank you for the comment. We agree on the effect of MCR-MOR on AMP susceptibility and have edited the paragraph by removing the lines on strong selective pressure in the pig gut. As regards the 4c and 4d results (4e and 4f in the revised version), it is interesting and statistically convincing that MCR increases bacterial virulence despite the cost of MCR expression. And importantly, this effect is even stronger in the case of LPS treatment where the immune system is stimulated, expressing diverse host AMPs (PMID: 19897755). This shows MCR-mediated advantages to bacteria in the complex host environment.

      Reviewer #2 (Public Review):

      Jangir et al test the hypothesis that resistance to the antimicrobial peptide (AMP) colistin can simultaneously increase resistance to other AMPS with related modes of action. Because AMPS comprise part of innate immunity, their central concern is that colistin resistance may compromise host defenses and thereby increase bacterial virulence. Their results show that MCR-1, whether expressed from naturally circulating or synthetic plasmids, can increase the MIC to AMPS from humans, pigs, and chickens, and impart fitness benefits at sub-MIC concentrations. In addition, they find that MCR-1-containing strains have increased survival in human plasma and are more lethal in an insect infection model.

      The conclusions of the paper are generally well supported by the results, but some aspects could be clearer and better defended with a few small additional experiments.

      Strengths:

      Using both synthetic and natural plasmids makes it possible to cleanly separate the effects of MCR-1 from the effects of other plasmid-borne genes or plasmid copy numbers. This helps confirm the causal role of MCR-1 on altered AMP susceptibility.

      Testing the survival of transformed isolates in human serum and in insects points to relevance in the more immunologically complex host environment where cells are exposed to a suite of factors that reduce bacterial survival.

      Thank you!

      Weaknesses/suggestions:

      Although increases in MIC are evident for different AMPS, the effects are generally modest. To address this, it might be helpful to use pairwise competition assays, as in Figure 1, to establish that even small changes to MIC are associated with clear selective benefits.

      Thank you for the suggestion. We agree that in some cases the change in MIC is modest, however, we would like to highlight that small-level changes in resistance have important clinical implications. For example, resistance mutations conferring a small change in MIC can ensure the survival of pathogenic bacteria in antibiotic-treated hosts (PMID: 30131514). Additionally, a comparison between competition assays (Fig 1) and MICs (Fig 2) clearly shows that small changes in MIC are associated with substantial fitness benefits. For example, for pSEVA:MCR-1, the fold change in MIC of CATH2 (chicken), PMAP23 (pig), and LL37 (human) ranges between 1.05 and 1.5, however, the competitive fitness ranges from 10% to 17%. This issue is discussed in the revised manuscript (lines 306-317, page 13)

      ….This would be especially helpful in assays with human serum and in Galleria where the concentrations of AMPS or other immune components are unknown.

      It is clear that MCR-1 increases resistance to serum and virulence (Figure 4). However, we agree with the reviewer that the selective benefits of MCR-1 in complex host environments are not known (i.e., serum or Galleria). We have revised the final paragraph of the discussion to reflect this limitation of our study (lines 370-382, page 15).

      Assays using human serum are interesting but challenging to interpret given the diverse causes of bacterial killing, including complement. Although this was partly addressed in Supplementary Figure 6, I found the predictions of these experiments unclear. First, I think these experiments are too central to be relegated to the supplemental materials; they belong in the main text. Secondly, it is important to explicitly spell out the expectations of using heat-killed serum (which will degrade any heat-labile components) or complement-deficient serum. It should be clearer under which conditions MCR-1-containing strains are predicted to do better or worse than controls.

      We have addressed this in the revised version. We have moved Supplementary Fig 6 to the main text, and have edited the text, clarifying the model prediction (lines 245-257, page 10).

      Galleria is a useful infection model for virulence, but it is unclear what drives differences between strains. First, bacterial numbers aren't measured in this assay, so it isn't known if increased virulence is due to increased bacterial growth or decreased bacterial clearance. As above, I think these assays would be stronger using the competition-based approach in Figure 1. This would indicate bacterial numbers through time and directly show the selective benefit associated with MCR-1. Second, it would be useful to elaborate on why MCR-1 increases virulence, especially any known similarities between Galleria AMPS and those tested in Figures 1 and 2. Overall, it would help if Galleria were less of a black box.

      We agree that the mechanism underlying increased virulence remains to be explored and thus, we have already discussed this in the discussion as a limitation (lines, 370-382, page 15). However, elucidating the mechanisms by which MCR-1 increases virulence would clearly be an interesting line of research moving forward.

    1. Author Response

      Reviewer #1 (Public Review):

      In this manuscript, Sampaio et al. tackle the role of fluid flow during left-right axis symmetry breaking. The left-right axis is broken in the left-right organiser (LRO) where cilia motility generates a directional flow that permit to dictate the left from the right embryonic side. By manipulating the fluid moved by cilia in zebrafish, the authors conclude that key symmetry breaking event occurs within 1 hour through a mechanosensory process.

      Overall, while the study undeniably represents a huge amount of work, the conclusions are not sufficiently backed up by the experiments. Furthermore, the results provided present a limited advance to the field: the transient activity of the LRO is well established, and narrowing down this activity to 1 hour (even though unclear from the presented data that it is a valid conclusion) does not help to understand better the mechanism of symmetry breaking.

      We thank the reviewer1 for acknowledging the hard experimental set up. However, we must argue that knowing the exact timing that is more sensitive to fluid flow manipulations is a very important advance we provide here. The reason is because this type of experiment is giving us the physiological timing in a WT embryo. It is one thing to know the system can respond to optical tweezers earlier than 5 ss and later than 5 ss, as Yuan lab did recently, but quite another to constrain the physiological timing at which the process occurs in an unperturbed manner (as much as possible). Our aim was the latter. Our rationale is that knowing the physiological time is important to provide clues, for example we had these types of questions at the time: is the physiological time before or after cell rearrangements occur? is it falling in a directional or non-directional flow regime? Is it governed by a mild flow or stronger one? Is it before or after dand5 becomes asymmetric? Some of these questions that we think we all know the answers for, could be challenged by our experiments… so it is indeed very important to not assume we know the answer, and ask the question again in an unbiased way with every new technique available! We wanted to be unbiased, and we think that is the beauty of our time-window experiment. Indeed, it shows the physiological time-window peaks at 5 ss which is later than Yuan’s lab calcium transient recording and before dand5 asymmetric expression. In our opinion this is compatible and makes perfect sense because although the system already shows calcium transients before and can respond to lack of Pkd2 or optical tweezer cilia manipulations at 1 ss – 3 ss, it is from 4 to 6 ss, peaking at 5 ss, that it is most responsive physiologically to the fluid extraction and therefore both mechanical and chemical perturbations.

      We have made additional experiments and used smFISH on WT embryos for detecting dand5 expression with cellular resolution, and we have quantified asymmetries in dand5 number of transcripts as early as 6 ss (new Figure 7 and new author: Catarina Bota) that further support our time-window claim. Degradation of dand5 mRNA has been the mechanism suggested to be at the base of the asymmetric dand5 expression, which is usually a very fast mechanism. This new piece of evidence supports that the physiological breaking of symmetry is stronger around 5 ss. (see new discussion on this subject on page 27).

      Regarding the symmetry breaking. The fact that anterior angular velocity was the major difference between embryos that recovered without LR defects versus those that did not, reveals that angular velocity must be tightly regulated by cilia motility and CFTR activity to bring back fluid and flow directionality, which together confer the robustness of flow. This is now better explained in the manuscript. We agree that the novelty regarding angular velocity may seem incremental compared to our work from 2014, where we only analyzed speed (Sampaio et al, 2014). However, here we provided more resolution and detailed parameters of angular velocity per sections of the LRO as well as tangential and radial velocities, the components of angular velocity. The Radial component shows a trend towards left anterior that is now discussed in the text as evidence for a left difference. The present work shows that anterior angular velocity has a major role in the successful recovery of the symmetry breaking process, which was not claimed before. Here we challenged the embryo to bring to light the most important parameters.

      Importantly, the authors do not provide any convincing experiments to back up the mechanosensory hypothesis because the fluid extraction experiments affect both the chemical and physical features of the LRO, so it is impossible to disentangle the two with this approach.

      We agree the first extraction experiment (Figures 1-3 and Table 1) affects both mechanisms and does not disentangle them, and that was, in fact, our goal for the first experiment - the finding of the exact time-window for symmetry breaking. However, in the second part of the work (Figures 4-5 and Table 2) we provide a 20,000 times dilution experiment, this dilution experiment is very different than the extraction one. We apologize if this was not clear and hope to have made it clear this time.

      We must agree with the reviewer that chemosensing is not excluded, in fact we had provided a paragraph in the discussion about EV secretion rates to tone down our claim and did acknowledge that secretion could still overcome the dilution we are causing. We think we had already addressed this problem in the previous eLife manuscript but now we have discussed the possibilities and the experimental evidence that supports each of them (see page 28, last paragraph). The key experiment that does not fit with secretion is pointed out in the end, and we ask the reviewer to read it in the context of wildtype animals. We agree both scenarios must be discussed and leave space for future data on mmp21 and CIROP. However, so far, in zebrafish we cannot favor chemosensing as much as mechanosensing, we can only wait for more discoveries and be open.

    1. Author Response

      Reviewer #1 (Public Review):

      The model put forward by the authors in this manuscript is a simple and exciting one, explaining the function of AGS3 as a negative regulator of LGN, acting as a 'dominant-negative' version of LGN. Overall, the results support the model very well, and the results shown in Fig 6, which clearly reveal the functional relevance of AGS3, add strength to the paper.

      We thank the reviewer for their enthusiasm regarding our finding that AGS3 acts as an endogenous dominant-negative to inhibit LGN. We appreciate their assertion that the results support the model and that the functional relevance to epidermal stratification is a strength.

      In Figures 3A and B, the authors claim that AGS3 overexpression leads to depolarization of LGN in epidermal stem cells. However, in the example provided in Figure 3A, the LGN signal appears to be stronger than the control, with more LGN still on the apical side (many would categorize this as 'apically polarized'). In the scoring shown in Figure 3B, I am not sure if 'eyeballing' is the right way to decide whether it is polarized/depolarized/absent. The authors should come up with a bit more quantitative method to quantify the localization/amount of LGN and explain the method well in the manuscript. A similar concern regarding the determination of the LGN localization pattern applies to the rest of figure 3 as well.

      We agree with this important critique about the methodology used to assess LGN expression patterns. While we have historically included categorical analyses like those used in Fig. 3A,B in past publications (Williams et al, NCB 2014; Lough et al eLife, 2019), we have also now performed additional, unbiased, quantitative measures of LGN fluorescent intensity, as described in greater detail above. We added these new data in Fig. 4C-J, while the data previously in Fig. 3A,B have now been redistributed between Fig. 3E,F (overexpression) and Fig. 4A,B (knockdown).

      Reviewer #2 (Public Review):

      To date, only a handful of studies have addressed the importance of AGS3, a paralog of the relatively well-characterized spindle orientation factor LGN. The authors now show that AGS3 acts as a negative regulator of LGN and propose that this activity could work through competition for binding partner(s). Remarkably, regulation is temporally restricted in such a way that the conserved role played by LGN in metaphase spindle orientation is unaffected. Instead, AGS3 regulates a post-metaphase function for LGN, namely Telophase Correction. The article is well-written, the experiments are performed at a high level, and the claims are generally supported by the data. Two main points of confusion are raised in the current version. 1) The authors show that AGS3 regulates cortical localization of LGN, but would need to clarify how LGN is being affected. 2) The authors propose in the discussion that AGS3 might exert its regulatory effect through competition for NuMA, an important binding partner for LGN, but would need to clarify how and why NuMA would be involved in Telophase Correction.

      We thank the reviewer for appreciating the novelty of our findings regarding the understudied LGN/pins paralog AGS3. In regards to the first point, as described earlier, we have added additional quantitative analyses of how AGS3 affects cortical LGN fluorescent intensity in Fig. 4C-J. We now show that AGS3 loss leads to broader and higher expression levels throughout mitosis, and therefore we have amended our model to soften the claim that AGS3 primarily operates during telophase correction. This renders the second point somewhat moot, but we nonetheless have expanded our Discussion to note that NuMA can be cortically recruited to the anaphase cortex independent of LGN (lines 531-542). We also contextualize our findings with the Reviewer’s own recent study which proposes a “threshold model” of cortical Insc as a determinant of spindle orientation (Neville et al, 2023), and speculate that a similar model could apply in our system, perhaps with AGS3 binding and sequesting Insc rather than NuMA (lines 543-556).

      Reviewer #3 (Public Review):

      This paper examines the mechanisms that control division orientation in the basal layers of the epidermis. Previous work established LGN as a key promoter of divisions where one of the siblings populates the differentiated layers (perpendicular). This work addresses two important, related issues - the mechanisms that determine whether a particular division is planar vs perpendicular, and the function of AGS3, and LGN paralog that has been enigmatic. A central finding is that AGS3 is required for the normal distribution of planar and perpendicular divisions (roughly equal) such that in its absence the distribution is skewed towards the perpendicular. Interestingly, however, the authors find that AGS3 has no detectable effect on orientation if the orientation is measured at anaphase. This timing aspect builds upon previous work from this group demonstrating a phenomenon they term "telophase correction" in which the orientation changes at the latest phases of division (and possibly post division?). Thus AGS3 seems to exert its effect using these later mechanisms and this is supported by further analysis by the authors. Importantly, the authors show that AGS3 acts through LGN, based on localization data and an epistasis analysis. The function of AGS3 has been highly enigmatic so resolving this issue while providing a useful step towards understanding how the division orientation decision is made, makes for exciting progress towards an important problem. I found the overall narrative and presentation to be quite good and especially appreciated the thoughtful discussion section that did an excellent job of putting the results in context and speculating how unknown aspects of the mechanism might work based on current clues. With that said, I think there are some important issues that should be resolved.

      We thank the Reviewer for this excellent summary of our findings and appreciation of the significance of the issues that our study addresses.

      Regarding the orientation measurements, the authors should specify how the midbody marker was used to mark sibling cells, especially given the midbody can move following division. For example, how can the authors be confident that the siblings in the middle panel of 1A are correct and not an adjacent cell? Regarding quantification, it would be useful for the authors to comment on how the following would influence their measurements: 1) movements along the z-axis, and 2) movement of the nucleus within the cell

      We have used this methodology for over a decade, and while it is not flawless, we have included several safeguards to ensure that sibling cells are correctly identified. We have added additional details to the Methods section (lines 867-869, 873-879).

      A similar question is how much telophase correction really happens in telophase. How confident are the authors that the process actually occurs during division and not subsequent to it? What is drawn in their previous paper and in Figure 7A implies that post-division movements may be important. It would be useful for the authors to comment on whether they can make the distinction and whether or not it might be important.

      Our intent in coining the term “telophase correction” was to imply that this process initiates, rather than completes, during telophase. We apologize for this confusion and have clarified this in the text (lines 80-82). Since most mammalian cells complete M phase in ~1h, with the longest time spent in prophase, in the absence of direct evidence to the contrary, it may be prudent to assume that telophase, like metaphase and anaphase, is relatively short, on the order of minutes. Since we cannot directly observe reformation of the nuclear membrane in our movies, we cannot be sure when telophase ends. Likewise, we do not currently have a suitable marker of the spindle midbody for live-imaging, so cannot be sure when cytokinesis completes. That said, we feel confident that most of the reorientation is occurring prior to cytokinesis, because we have previously reported that the greatest changes in daughter cell positioning occur within the first 10-15 minutes of anaphase onset, when a gap in membrane-GFP/TdTomato is still visible (Lough et al, eLife, 2019). However, while we feel that there are many interesting questions that our work raises about the timing or reorientation relative to specific mitotic stages—e.g. is the midbody asymmetrically positioned, inherited, or ejected?—these questions are beyond the scope of the present study.

      Does the division angle in the AGS3 OE experiment (Figure 1D) correlate with AGS3 levels within the cell?

      This is an interesting question, and indeed, we our hypothesis would predict that it would. However, it is not straightforward to quantify AGS3 or mRFP1 levels, and as we explain in a new section of the Results (lines 212-237), we have some concerns that N-terminally tagged AGS3 may not be fully functional. We have added new data with C-terminally tagged AGS3-mKate2, which we feel provides even stronger evidence that mKate2+ cells show a planar shift compared to mKate2- cells (Fig. 3C,D). In the future, we could test this hypothesis at the population level by comparing division orientation profiles for AGS3-mKate2+ cells carrying either a non-targeting scramble or Gpsm11147 shRNA. We would predict that knocking down endogenous AGS3 while overexpressing AGS3-mKate2 should give an intermediate phenotype.

      I found the localization data to be the weakest part of the paper and feel that some reconsideration and reanalysis are warranted. First, the quantifications in Figures 2C, 3B, and 3F are unnecessarily vague scoring-based metrics. In 2C, "Localization pattern" should be replaced with membrane/cytoplasm ratio or an equivalent quantification. In 3B "LGN localization" should be replaced with apical/cytoplasmic and apical/basal ratios or equivalents. In 3F, "Polarized LGN frequency" should be replaced with apical/basal ratio or equivalent. It seems to me that non-AI processed data would be most appropriate for these quantifications unless such processing can be justified.

      This issue was raised by the previous two Reviewers and has been addressed by new data added to Figure 4.

      Second, it is important to note that the cytoplasmic localization of AGS3 does not allow one to conclude that AGS3 is not on the membrane. Unfortunately, high cytoplasmic signal can preclude the determination of membrane-bound signal.

      We agree with the Reviewer and have softened our language throughout the text.

      Finally, I had difficulty reconciling the images of LGN shown in Figure 3 with the conclusions made by the authors.

      We have added additional, representative images of LGN expression in control and AGS3 KD cells in Figure 4C-E.

      The challenge of the localization data is troubling because an important conclusion of the paper is that AGS3 acts via LGN. The localization data provided one leg of support for this conclusion and the other is provided by an epistasis analysis. Unfortunately, this data seems to be right on the edge because it is based on the difference between the solid and dashed blue lines in Figure 5B not being significant. However, we can see how close this is by comparing the solid and dashed red lines in the adjacent 5C, which are significantly different. Between the localization data, which doesn't seem clear cut, and the epistasis experiment, which is on the razor's edge, I'm concerned that the conclusion that AGS3 acts through LGN may be going beyond what the data allows.

      We appreciate the Reviewer’s comments about the importance of these two lines of experimentation: 1) AGS3’s effect on LGN localization, and 2) epistasis experiments between AGS3/Gpsm1 and LGN/Gpsm2. We feel we have significantly strengthened this first pillar with the additional data presented in Fig. 4C-J. Regarding the second point, we would like to emphasize that we present three lines of evidence for the existence of an epistatic relationship between LGN and AGS3: 1) the static division orientation data comparing LGN single KOs to both LGN KO + AGS3 KD and AGS3+LGN dKOs (Fig. 6B); 2) live imaging division orientation/telophase correction comparing LGN KOs to AGS3+LGN dKOs (Fig. 6C-E); 3) lineage tracing data comparing LGN KOs to AGS3+LGN dKOs (Fig. 7H,I). Further, we think the reviewer may have misconstrued the data presented in Fig. 5C (now Fig. 6C). The dashed lines indicate orientation at anaphase and solid lines 1h after anaphase, so the shift between dashed and solid lines indicates telophase correction, which occurs to similar (and statiscially significant) degrees in both LGN single mutants and AGS3+LGN dKOs. Comparisons between the single and double mutant would be between red and magenta solid lines or red and magenta dashed lines, and neither of these are statistically significant. We realize that our use of dashed lines in Fig. 5B (now Fig. 6B), which we normally only use to refer to anaphase entry in live imaging data, may have caused this confusion. Therefore, we have changed all plots to solid lines¬ in Fig. 6B, and use light and dark magenta, respectively, to differentiate between LGN KO + AGS3 KD and AGS3+LGN dKOs.

    2. Reviewer #3 (Public Review):

      This paper examines the mechanisms that control division orientation in the basal layers of the epidermis. Previous work established LGN as a key promoter of divisions where one of the siblings populates the differentiated layers (perpendicular). This work addresses two important, related issues - the mechanisms that determine whether a particular division is planar vs perpendicular, and the function of AGS3, and LGN paralog that has been enigmatic. A central finding is that AGS3 is required for the normal distribution of planar and perpendicular divisions (roughly equal) such that in its absence the distribution is skewed towards the perpendicular. Interestingly, however, the authors find that AGS3 has no detectable effect on orientation if the orientation is measured at anaphase. This timing aspect builds upon previous work from this group demonstrating a phenomenon they term "telophase correction" in which the orientation changes at the latest phases of division (and possibly post division?). Thus AGS3 seems to exert its effect using these later mechanisms and this is supported by further analysis by the authors. Importantly, the authors show that AGS3 acts through LGN, based on localization data and an epistasis analysis. The function of AGS3 has been highly enigmatic so resolving this issue while providing a useful step towards understanding how the division orientation decision is made, makes for exciting progress towards an important problem. I found the overall narrative and presentation to be quite good and especially appreciated the thoughtful discussion section that did an excellent job of putting the results in context and speculating how unknown aspects of the mechanism might work based on current clues. With that said, I think there are some important issues that should be resolved.

      Regarding the orientation measurements, the authors should specify how the midbody marker was used to mark sibling cells, especially given the midbody can move following division. For example, how can the authors be confident that the siblings in the middle panel of 1A are correct and not an adjacent cell?

      Regarding quantification, it would be useful for the authors to comment on how the following would influence their measurements: 1) movements along the z-axis, and 2) movement of the nucleus within the cell.

      A similar question is how much telophase correction really happens in telophase. How confident are the authors that the process actually occurs during division and not subsequent to it? What is drawn in their previous paper and in Figure 7A implies that post-division movements may be important. It would be useful for the authors to comment on whether they can make the distinction and whether or not it might be important.

      Does the division angle in the AGS3 OE experiment (Figure 1D) correlate with AGS3 levels within the cell?

      I found the localization data to be the weakest part of the paper and feel that some reconsideration and reanalysis are warranted.

      First, the quantifications in Figures 2C, 3B, and 3F are unnecessarily vague scoring-based metrics. In 2C, "Localization pattern" should be replaced with membrane/cytoplasm ratio or an equivalent quantification. In 3B "LGN localization" should be replaced with apical/cytoplasmic and apical/basal ratios or equivalents. In 3F, "Polarized LGN frequency" should be replaced with apical/basal ratio or equivalent. It seems to me that non-AI processed data would be most appropriate for these quantifications unless such processing can be justified.

      Second, it is important to note that the cytoplasmic localization of AGS3 does not allow one to conclude that AGS3 is not on the membrane. Unfortunately, high cytoplasmic signal can preclude the determination of membrane-bound signal.

      Finally, I had difficulty reconciling the images of LGN shown in Figure 3 with the conclusions made by the authors.

      The challenge of the localization data is troubling because an important conclusion of the paper is that AGS3 acts via LGS. The localization data provided one leg of support for this conclusion and the other is provided by an epistasis analysis. Unfortunately, this data seems to be right on the edge because it is based on the difference between the solid and dashed blue lines in Figure 5B not being significant. However, we can see how close this is by comparing the solid and dashed red lines in the adjacent 5C, which are significantly different. Between the localization data, which doesn't seem clear cut, and the epistasis experiment, which is on the razor's edge, I'm concerned that the conclusion that AGS3 acts through LGN may be going beyond what the data allows.

    1. Lateral reading is a strategy that enables people to emulate how professional fact checkers establish the credibility of online information. It involves opening up new browser tabs to search for information about the organisation or individual behind a site before diving into its contents. Only after consulting the open web do skilled searchers gauge whether expending attention is worth it. Before critical thinking can begin, the first step is to ignore the lure of the site and check out what others say about its alleged factual reports.

      I've always heard about lateral reading but never reallly used it. I think it wasn't until a couple years ago did I really start to implement lateral reading into my own learning. I would ever go further and say that this may because of the growth of the digital world. What I mean by this is that there has definitely been an influx in "fake news" the last decade or so, and so more and more people who at least try to be media literate may go out of their way to read and decipher the truth more carefully.

      I am actually a firm advocate for lateral reading because I know it can work. It has helped me evaluate credibility of articles online and it can probably help others as well.

    1. Antagonistic bots can also be used as a form of political pushback that may be ethically justifiable.

      I think this is a very interesting point to bring up. Could bots fall under free speech if they are making a political statement? Bots in general raise many questions about how they can be used and the actions of bots/ the people behind them can be morally gray. I think that points to more consideration towards regulation of some type, but how can we do that without infringing upon rights?

    2. Fake Bots

      I wonder to what extent can we find out if something is a bot vs something is not. I think this can be very interesting as there are many reasons as to why someone may want to pretend to be a bot, for example, they do not need to face the same consequences as a bot is treated usually not to the same ethical framework as humans are.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01771

      Corresponding author(s): Franck Pichaud and Rhian Walther

      1. General Statements [optional]

      We are grateful for the reviewers’ comments and suggestions. Both reviewers agree that our work addresses a poorly understood questions in biology and medicine, and that it will be of interest to the community of cell and developmental biologists.

      We note that most of the comments/suggestions, especially from Rev#2, are concerned with the text. These include suggested references to be added, a need to expand on the Method description and suggested points of discussion. We have addressed all these issues in the revised manuscript.

      Our work aims to understand which pathways control the basal geometry of epithelial cells, and how cells coordinate remodeling of their basal geometry to organize a tissue in 3D, from apical (top) to basal (bottom). This is a relatively understudied area, especially when compared to the breadth of work related to the pathways that control the apical geometry of epithelial cells.

      The apical geometry of an epithelial cell is a direct function of the number of adherens junctions the cell shares with their neighbors. Suppression or extension of adherens junctions underpins apical geometry remodeling. Basally, this same cell will be attached to the basement membrane though integrin receptors. We use the fly retina, where cells adopt stereotyped basal geometry, to investigate whether and how integrin adhesion might induce cell basal geometry remodeling in morphogenesis.

      The novel finding we report that a temporal sequence of event seems to underpin cell basal geometry remodeling in the retina, whereby i) laminin accumulates at specific location within the basement membrane, which is ii) accompanied by a concomitant accumulation of Dystroglycan (DG), and subsequently iii) integrin receptors are recruited to these sites of high Laminin-DG. This, along with our genetic experiments, suggests that a Laminin-DG-Integrin axis controls the basal geometry of retinal cells. In this axis, we envisage patterning of the basement membrane through Laminin-DG directs integrin recruitment, which in turn induces cell basal geometry remodeling. To our knowledge, this pathway in epithelial morphogenesis, spanning from ECM regulation to integrin polarization, has not been reported before. As the function of these components in basal adhesion is conserved across phyla, we anticipate our findings will be broadly relevant for our understanding of epithelial morphogenesis.

      2. Description of the planned revisions

      The main suggestion, common to both our reviewers, is that we should provide further re-assurance that the RNAi strains we use to target basement membrane components and the DG and integrin pathways are specific, and that these strains do not come with off-target effects.

      We will follow this recommendation by i) including referencing when a line that we have used has been validated elsewhere, ii) by using at least two independent RNAi strains to target a gene of interest, iii) by making use of the deGrad-FP system (Caussinus et al., 2013) to target proteins instead of genes, iv) by making use of available mutant strains. This is all relatively straightforward, and I will detail the proposed experiments as part of the following point-by-point rebuttal and revision plan.

      REVIEWER #1

      Commenting on the need to provide further controls related to some of our RNAi experiments

      1)* All the genetics experiments are based on RNAi induced knock-down approach. Although such an approach is easy to justify for genes associated with lethality when mutated, it becomes less relevant for non-lethal ones as Dystroglycan complex components (Dg, Dys, Sgc) for which null and viable mutants are published and available. The phenotype of such mutants should be provided. *

      AND

      *There is no data explaining how these RNAi lines were validated. The fact that it gives the phenotype expected by the authors is obviously not sufficient. This point is essential to exclude off-target effects and to be able to compare the different genotypes (see #2). For instance, the strong effect of sarcoglycan could be questioned. Is it really specific? If yes, is the difference with other Dystroglycan complex members only due to RNAi efficiency or does it have a specific function? *

      AND

      Line 255, "These perturbations led to a failure of bPS/Mys to accumulate at the grommet". Dg mutants are viable (PMID: 18093579); do they show consistent phenotypes?

      __RE: __Our main methodology has been to use available RNAi strains to perturb composition of the basement membrane and to inhibit the expression of components of the DG and Integrin pathways. As pointed out by the reviewer, this approach allows us to assess the function of genes that might be embryonic lethal and allows us to specifically target the basal geometry remodeling step without perturbing earlier steps of retinal morphogenesis. This is important for the basement membrane and integrins, which are required although retinal tissue development. See for example: (Fernandes et al., 2014, Thuveson, 2019 #3787).

      We are aware that mutant alleles are available for dg, dys and sgc allow for recovering adult homozygous (or trans-heterozygous) animals. However, based on our previous experience using mutants for which only very few flies make it to adulthood, we feel it is best not to examine those animals. Compensatory pathways might be at play that could mask a phenotype (Please see our recent work on the viable roughest null allele in cell intercalation (Blackie et al., 2021).

      Therefore, we propose to induce mutant clones for dg, dys and sgc using the Flp/FRT system, using the strongest alleles that are available to us. Of note, in our experience stable proteins might not show a phenotype in small clones, but will develop a phenotype in larger ones, as the protein becomes further diluted upon multiple rounds of cell division. Bearing this in mind, we will generate animals where the whole retina is mutant for these genes. This will be done using the GMR-hid system (Stowers and Schwarz, 1999).

      Specifically, we will target Dg, Dys and Sgc using:

      Dystroglycan:

      • The dg nonsense mutations, leading to expression of truncated proteins: DgO86 (stop codon at the R87 residue) and dgO43 (stop codon at the W462 residue) (Christoforou et al., 2008). While previous studies have suggested that these alleles are homozygous viable (Christoforou et al., 2008; Zhan et al., 2010), we have obtained this strain from the Bloomington Stock Centre, and note that no homozygous flies make it to adult. In preliminary work, we also note that clones mutant for the dgO86 allele generated with the flp-FRT system are very small, comprised of only one or two cells. This suggests that DG is required for cell proliferation or viability. These dg alleles are available on the G13 FRT which is not compatible with any FRT system designed to eliminate the wild type cells. To use the GMR-hid system, we will have to first recombine these dg alleles onto the appropriate FRT chromosome. Dystrophin:

      • The dys3397 allele, which is semi-lethal P-element insertion in the dys Very few adult flies homozygous for this allele flies are recovered (Christoforou et al., 2008). We will have to recombine this allele onto an FRT chromosome to generate whole mutant retinas.

      • The deficiency Df(3R)Exel6184, which removes the dys coding frame (Christoforou et al., 2008).
      • We will also use dysE17, because it has been used before (Catalani et al., 2021; Cerqueira Campos et al., 2020; Mirouse et al., 2009). This lesion is a Q2807 Stop codon in the C-terminal region common to all 6 dys The Df(3R)Exel6184 and dysE17 alleles have been recombined onto FRT82B, which will allow us to make use of the GMR-hid system to generate whole mutant retinas. Sarcoglycan:

      • Sgc (three subunits in Drosophila) using the deletion allele dscg169 (Allikian et al., 2007). We will have to recombine this mutation onto an FRT chromosome to generate whole mutant retinas. In addition, we will reproduce our RNAi phenotypes using additional available RNAi lines from stock centers and from previous studies, targeting different regions of dg, dys and scg. For dys we will use a validated RNAi line. For dg we will use a second RNAi line previously used in (Cerqueira Campos et al., 2020; Villedieu et al., 2023) For dys, we will use a second line previously used in (Cerqueira Campos et al., 2020). For Sarcoglycans, we will complement our work targeting scgd by also targeting scga.

      Moreover, since a functional endogenously GFP-tagged Dg strain is now available (Villedieu et al., 2023) along with the Dys::GFP strain we have already used, we will target these proteins using the DeGrad-FP system (Caussinus et al., 2013). The main advantage with this system is that, as with RNAi, we can target a specific time window without affecting earlier steps in retinal morphogenesis. In addition, these experiments will address the possibility that DG and Dys might be stable in cells – inhibiting genes expression in flp-FRT induced clones does not always correlate with inhibiting protein function. We think that the well-established deGrad-GFP will be useful here to address the reviewer’s comment.

      We trust these complementary approaches will more than address the reviewers’ comment by further ascertaining that the RNAi phenotypes we report here for Laminin, and the DG and integrin pathway, are specific.

      Please note that we show in Fig.3 that the basal geometry phenotype we report for the talin RNAi, using an RNAi line reported in several previous studies (Lemke et al., 2019; Perkins et al., 2010; Xie and Auld, 2011; Xie et al., 2014), is comparable the phenotype we observed using the Flp-FRT system to induce mys1 mutant clones. So, we are confident this RNAi line is specific of talin. Nevertheless, we will also show results using second RNAi line targeting *talin. *

      *- Authors claimed that laminin RNAi (or MMPs overexpression) affects cell geometry but why it is not analyzed by PCA? It is not consistent with the other figures. *

      __RE: __To address this comment, we will provide the PCA analysis for the Laminin and MMP phenotypes.

      __REVIEWER #2 __

      • Line 208, "we found that LanB2 RNAi leads to defects in bPS/Mys Integrin localization". Here, because the authors use only single RNAi, there remains the possibility that the observed phenotype was caused by an off-target effect. The authors should exclude this possibility by using another RNAi or mutants. In case of LanB2, however, showing that one RNAi against LanB1 shows the same phenotype would be enough, because LanB1 is another single subunit of fly Laminin __RE: __We have now included loss-of-function mutant clones for LanB1, using the LanB1KG003456 allele, showing defects in integrin localization resembling the LanB2 RNAi (please refer to section 3: revision already done, Section). We trust that this is good validation of the LanB2 RNAi strain. These new results have been added to Figure 6 (6E-6F).

      RE:This is the same for all the RNAi experiments”. Please refer to our response to Reviewer 1, above.

      2) *As the authors write "Laminin-rich domains", I suppose that they assume that LanA/B1 accumulates in a restricted region of the BM. However, it has been reported that the majority of Laminin in the fly embryo is soluble and floating in the haemolymph (fly's 'blood' or body fluid) (PMID: 29129537). Therefore, the LanA/B1 observed in the figures might be just floating in the intercellular space and doing nothing on the BM. The authors should exclude this possibility to support their idea that Laminin localised in a specific region of the BM recruits Integrin. For example, does secreted GFP (PMID: 12062063) not behave in the same way as LanA/B1? Can the authors show that the LanA/B1 is indeed incorporated in the BM by FRAP or any methods? *

      RE: While formally possible, our data suggest that it is unlikely that “LanA/B1 is just floating in the intercellular space and doing nothing on the BM”. For instance, our results show that the DG pathway component Scgd is required for accumulation of LanA::GFP (Fig.7E-F). The most likely explanation for this requirement is DG binding to Laminin fibers.

      Nevertheless, we will follow up on the reviewer’s comment and perform FRAP on LanA::GFP, as this is relatively straightforward. We will also try the GFP secretion experiment using the suggested GFPsecr transgene generated by the Vincent lab in 2000.

      3) Line 240. "RNAi against dSarcoglycan led to a decrease in LanA::GFP expression at the presumptive grommet at 20h APF (Figure 7F)". As to this result, the authors seem to interpret that Laminin is not recruited to the "specific BM domain" in grommet in the absence of Dg signalling. However, other possibilities exist, e.g., that the global expression level of Laminin was reduced, or that the intercellular space into which soluble Laminin (see the issue 4 above) flows was narrowed down. The authors should show the data that exclude (or at least reduce) these possibilities.

      __RE: __Addressing Rev2 point (1) will rule out that Laminin is in soluble form. To address the comment that the global expression level of Laminin might be decreased, we will quantify the amount of LanA::GFP that is not at the grommet and compare wild type animals with the scgd ones.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      __REVIEWER #1 __

      • Line 208, "we found that LanB2 RNAi leads to defects in bPS/Mys Integrin localization". Here, because the authors use only single RNAi, there remains the possibility that the observed phenotype was caused by an off-target effect. The authors should exclude this possibility by using another RNAi or mutants. In case of LanB2, however, showing that one RNAi against LanB1 shows the same phenotype would be enough, because LanB1 is another single subunit of fly Laminin __RE: __We have included new results – LanB2 loss of function – showing the role of Laminin in being required for Integrin localization in the secondary and tertiary pigment cells (revised Figure 6 – panels E-F)

      Line 237: For this, we used both RNAi against LanB2 and a loss-of-function allele of LanB1. Consistent with our model, we found that in both cases bPS/Mys Integrin localization was affected. bPS/Mys failed to accumulate at the grommet, and instead was distributed at the basal plasma membrane into punctate domains (Figure 6A-F). In addition, these perturbation experiments affected cell basal geometry remodeling (Figure 6A, 6C, 6E).

      2)* Methods section describing genetic conditions is really sketchy. The genotype corresponding to each figure is not provided and I guess that GMR-Gal4 has been used in all experiments using the Gal4 system but it is never clearly stated. *

      __RE: __We have revisited the Methods section and Figure Legends to ensure all appropriate information is readily accessible to the reader. The reviewer is correct that the retinal GMR-Gal4 driver was used to express the RNAi used in this study.

      3) PCA analysis. - In the WT situation it would be really informative to know which variable(s) is/are really discriminant between the two cell populations and then maybe to focus a bit more on these parameters. For instance, a PCA correlation circle plotting both cells and variables would be very helpful.

      __RE: __We have followed the reviewer’s advice and amended the Methods section accordingly. We now provide the PCA correlation circle plotting both cells and variables in Suppl. Fig. 3, for talin RNAi and MysDN, and Suppl. Fig. 10 for DG and Scgd RNAi

      *Methods: *

      Line 522 : Principle component analysis

      Principal component analysis (PCA) was carried out using the Scikit-learn library in Python. The Standard scaler package was used to standardize the data across all metrics before calculating the principal components. The PCA package was then used to perform the PCA. Metrics included in the PCA were as follows: extent, major axis length, minor axis length, eccentricity, roundness, circularity, area, cell shape index, perimeter.

      The cell types (secondary and tertiary pigment cells) were assigned by following the cells in 3D to the apical surface where the cell types could be identified. Cells that could not be clearly assigned as either secondary or tertiary pigment cells were excluded from the PCA.

      Extent is the area of an object divided by the area or the smallest rectangle (bounding box) that can fit around the object.

      Major axis length is the longest line that can be drawn through an object.

      Minor axis length is the line that can be drawn through an object which is perpendicular to the major axis.

      __Eccentricity __is the ratio of the length of the short (minor) axis to the length of the long

      (major) axis.

      Roundness is a comparison of an object to the best fit circle of an object. The closer the object is to a perfect circle, the more round it will be.

      Circularity is a measure of the smoothness of an object.

      Cell shape index is a dimensionless parameter to describe cell shape. When cells have smaller contacts with their neighbours the cell shape index is small.

      Correlation circle plots were generated using the mlxtend plotting package in python using the plot PCA correlation graph function.

      • Please also see the graphs we now provide in Suppl. Fig.4*. *

      We are also commenting on these results.

      Line 174: To understand which parameters explained most of the variance in the PCA analysis we generated correlation circle plots (Supplementary Figure 4). For wildtype cells, perimeter and circularity contribute most to the variance between secondary and tertiary pigment cells along the PC1 axis. Eccentricity and minor axis length contribute most to variance along the PC2 axis (Supplementary Figure 4A). For talin RNAi and MysDN cells, the correlation circle plots are remarkably similar (Supplementary Figure 4B-C), indicating that these genetic perturbations have similar effects on cell basal geometry. To confirm this result, we performed PCA comparing secondary and tertiary pigment cells for these two genotypes. In both genotypes, cells fail to form discrete clusters (Supplementary Figure 4D-E). For the secondary pigment cells, expressing talin RNAi or MysDN leads to an increase in cell roundness. For the tertiary pigment cell, these genotypes lead to an increase in circularity (Supplementary Figure 4D-E). Examining the original segmentation data confirmed that, relative to wildtype cells, either genetic perturbation has a similar effect on key cell shape parameters (Supplementary Figure 4F-G).

      *- In loss of function conditions, when the tissue is strongly affected, how do the authors recognize the two cell populations if PCA cannot? *

      __RE: __In these genotypes, each cell type is identified based on their apical position and geometry. When a cell cannot be identified it is not included in the analysis. This allowed us to track the cells from apical to basal. We now make this clear in the Methods section.

      Line 529: The cell types (secondary and tertiary pigment cells) were assigned by following the cells in 3D to the apical surface where the cell types could be identified. Cells that could not be clearly assigned as either secondary or tertiary pigment cells were excluded from the PCA.

      - On the opposite, based on the provided image, Dys RNAi seems to have a mild effect and it seems that my eyes can easily recognize those two cell populations based on their shape. So why PCA cannot?

      __RE: __We respectfully disagree with this comment. In the Dys RNAi, one cannot tell which is a secondary and which is a tertiary by visual inspection of the basal surface only. This is consistent with the PCA analysis, now described more thoroughly in Supplemental Figure 4. The Dys RNAi cells tend to remain elongated and they do not round up as much as the Scgd RNAi cells, which gives the false impression that the phenotype is closer to that of the wild type.

      - Based on the proposed images, some phenotypes look clearly different depending on the genotype, e.g. Talin and Mys (figure 3) or Dys and Sgc (Figure 8). In other words, the fact that PCA cannot separate the cell pollutions in these different genotypes does not necessarily mean that their effect is identical. Could authors perform PCA analysis between mutants? If they are different, again it might be very interesting to identify the discriminating parameters.

      RE: We did not claim the defect was identical__. __

      The basal geometries look somewhat different depending on the genotype, and we envisage this is due to differences in RNAi strength and perhaps differences in protein stability. This is the case for Dys and Scgd, as outlined in the preceding point. With respect to talin and mys, none of the authors can distinguish by eye the talin RNAi from mys1 phenotypes. We have informally asked our institutional colleagues, and they were also unable to distinguish these genotypes.

      Nevertheless, we have expanded our PCA analysis between phenotypes, considering one cell type at a time. This analysis shows that these phenotypes show partial overlap, outside of the wildtype range. While there are similarities, it does not reveal, however, any specific relationship between genes of interest (see previous).

      Line 178: For talin RNAi and MysDN cells, the correlation circle plots are remarkably similar (Supplementary Figure 4B-C), indicating that these genetic perturbations have similar effects on cell basal geometry. To confirm this result, we performed PCA comparing secondary and tertiary pigment cells for these two genotypes. In both genotypes, cells fail to form discrete clusters (Supplementary Figure 4D-E). For the secondary pigment cells, expressing talin RNAi or MysDN leads to an increase in cell roundness. For the tertiary pigment cell, these genotypes lead to an increase in circularity (Supplementary Figure 4D-E). Examining the original segmentation data confirmed that, relative to wildtype cells, either genetic perturbation has a similar effect on key cell shape parameters (Supplementary Figure 4F-G).

      *- From what I can understand, each PCA analysis has been done on a single retina. If true, more replicates should be included. If not true, the number of independent retinas should be mentioned. *

      __RE: __All PCA analyses have been done using multiple retinas from different animals. We have clarified this in the figure legends.

      4) Minor comments: - Globally, the article suffers from a lack of details, especially in the methods section and/or in figure legends.

      RE: please see what we have done to address this comment, in section (2) above.

      *- Also, several points could be advantageously discussed. For instance, why MMPs have different effects according to their specificity? Also, what could be the meaning of the nice differential pattern between integrin alpha subunits? *

      __RE: __We were concerned this would be seen as too speculative by our reviewers. Following the reviewer’s advice, we are happy to share our current working model and speculations on this.

      Results:

      Line 242: Moreover, and consistent with basement membrane regulation being important for cell basal geometry remodeling, we found that degrading the basement membrane by expressing Matrix Metalloproteases MMP1 or MMP2 in retinal cells leads to a failure in bPS/Mys localization at the grommet and prevented cell basal geometry remodeling (Figure 6G-J). While recombinant Drosophila MMP1 and 2 can degrade Col-IV, only MMP2 can degrade Laminin (Wen et al., 2020). The MMP2 phenotype we observed in basal surface organization is stronger than that of the MMP1 overexpression. Our results, therefore, suggest that both Col-IV and Laminin play a role in controlling the basal geometry of retinal cells. This suggestion is consistent with our finding that both these basement membrane proteins are enriched at the grommet once cells have acquired their basal geometry.

      Discussion:

      Line 386: Integrins can bind to Col-IV and to Laminin (Hynes, 2002). Our experiments show that MMP2 overexpression leads to a stronger phenotype than MMP1. In addition to catalyzing Collagen-IV proteolysis, MMP2 can degrade Laminin, which is something MMP1 does not seem to be able to do (Wen et al., 2020). Therefore, our results suggest that both Col-IV and Laminin are required for cell basal geometry remodeling.

      Line 408*: *

      The cone cells express two Integrin receptors, ____a____PS1/Mew-____b____PS/Mys and ____a____PS2/if-____b____PS/Mys

      We found that while the interommatidial cells express aPS1/Mew-bPS/Mys, the cone cells express both aPS1/Mew-bPS/Mys and aPS2/if-bPS/Mys. Thus, different cell types express different aPS subunits. It is not clear why the cone cells express two a-subunits. In the developing follicular epithelium of the fly oocyte, cells switch from expressing aPS1/Mew-bPS/Mys, to expressing aPS2/if-bPS/Mys (Delon and Brown, 2009). In this tissue, the developmental switch between aPS1 and aPS2 expression was shown to correlate with a change in stress fiber orientation. In addition, aPS1-bPS/Mys was also shown to be required to control F-actin levels basally. aPS1 mutant cells presented elevated levels of F-actin, a phenotype not seen in aPS2 mutant cells. Remarkably, in this tissue, aPS2-bPS/Mys, but not aPS1/Mew-bPS/Mys was able to recruit the integrin adapter Tensin. The authors envisaged that the aPS2 Tensin interaction might confer robustness in basal surface remodeling. With analogy to the follicular epithelium, we speculate that in the cone cells, aPS1-bPS/Mys and aPS2/Mew-bPS/Mys synergize in mediating robust attachment to the basement membrane, to ensure these cells do not detach as the retina lengthens along the apical-basal axis (Longley and Ready, 1995). We also note that in retinal development, the cone cells form new adherens and septate junctions at their basal feet (Banerjee et al., 2008). These cells, therefore, present two sets of adherens and Septate junctions. It is also possible that the atypical situation seen with the cone cells expressing two a subunits, is linked to the formation of these new junctions at the basal pole of these cells. It will be interesting to examine these possibilities, and to establish the role these two a-subunits play in cone cell morphogenesis. Further, the presence of two distinct integrin subunits within the cone cells may have implications when considering Integrin signaling during cone cell morphogenesis.

      *- In Methods, a list of metrics is given for the PCA analysis but some look very similar and it would be helpful to define them briefly. *

      RE: Please refer to what we have done to address this comment in section (2) above.

      *- Figures are not always color-blind adjusted (e.g. dots on PCA graphs). *

      __RE: __We have rectified this oversight.

      __REVIEWER #2 __

      1)* Line 169, "From these experiments, we conclude that Integrin adhesion is required for cell basal geometry remodeling during retinal morphogenesis". It has been long known that integrin is necessary for the gross morphogenesis of the eye (e.g., Zusman et al. 1993, PMID: 8076515). The authors need to cite these preceding researches and should clarify what new findings this new work adds to the previous knowledge. *

      __RE: __Following the reviewer’s suggestion, we have added this reference which precedes (Longley and Ready, 1995)mentioned in the paper. Both references show that integrins are required for eye integrity and attribute this function to the contraction phase of retinal development. Notably, contraction occurs after cells have remodelled their basal geometry, which we have focused on in this study.

      Line 128: The Integrin bPS subunit (Myspheroid, Mys) is required to maintain surface integrity late in retinal development, as the tissue surface undergoes basal contraction (Longley and Ready, 1995; Zusman et al., 1993).

      4) Line 180, "Using available functional GFP protein traps [49, 50]", the authors investigate the behaviour of Laminin subunits LanA and LanB1. First, ref [50] is not relevant here and should be removed. Moreover, the Laminin-GFPs the authors used are not protein traps, but transgenic strains harbouring genes and most of their regulatory information, with the ORFs tagged with GFP [49]. Furthermore, while the ref [49] reported the functionality of LanB1-GFP, this reference did not fully address the functionality of LanA-GFP. The authors need another reference on it (PMID: 29129537), which demonstrated that LanA-GFP rescues LanA mutants.

      5) Related to the issue above, in addition to LanA and LanB1, the authors examine the localisation of the following BM proteins using GFP-fusion: Perlecan/Trol, Collagen IV/Viking, Nidogen, and SPARC. The authors do not explicitly describe the nature of these GFP fusions, but I am afraid that the authors think all of them are "functional protein traps". However, in fact while Perlecan and Collagen IV are protein traps, Nidogen and SPARC are transgenics including regulatory sequences made in the ref [49]. This must be clarified. Moreover, to rely on the data obtained using these GFP fusions, their functionality must be confirmed by appropriate references or/and the authors' own data. For information, ref [62] showed the functionality of Perlecan-GFP and Collagen IV-GFP protein traps (they are both homozygous viable), and the Nidogen-GFP transgene rescues the BM deficiency of Ndg mutants (PMID: 30260959). These reports must be explained in the text, and I would like the authors collect and show more information.

      __RE: __We have deleted ref 50. We thank the reviewer for flagging the issue with our referencing. We have now amended this section.

      Line 204: To this end, we examined the localization and requirement of the Laminin A and B1 subunits (Laminina, LanA and Lamininb, LanB1), Perlecan/Trol, Collagen-IV/Viking (Col-IV), the glycoprotein Nidogen (Entactin/Ndg), and the secreted glycoprotein protein-acidic-cysteine-rich (Sparc), which are all components of the basement membrane (Walma and Yamada, 2020). For Laminin, Ndg and SPARC, we used strains generated from a fosmid library, and expressing a functional GFP-tagged transgene under the control of their own respective promoter (Dai et al., 2018; Matsubayashi et al., 2017; Sarov et al., 2016). For Col-IV and Perlecan, we used functional GFP exon-trap strains (Morin et al., 2001).

      6) Line 200, "These specific patterns of expression for LamininA/B1, Collagen IV, Perlecan, Nidogen and Sparc". I have several comments here: - 5A. These patterns are discussed only using single optical sections. To highlight the difference in their localisation patterns more objectively, multiple sections and/or 3D images should be shown.

      RE: (a) These are all projections of 3 to 5 confocal sections, and we have amended the manuscript to make this point clearer. (b) Following the reviewer’s advice, we now provide sagittal sections so the reader can better appreciate what is detected above and below the grommet. Please see new Fig. 5.

      5B. Can the authors discuss, hypothesise, or speculate the biological meaning of the difference? * AND*

      *5C. It has been reported that in the mammalian skin BM, different components show distinct localisation patterns (PMID: 33972551). It would be interesting to cite this paper and discuss the generality of the non-uniform distribution of BM components. *

      __RE: __The revised manuscript offers a short discussion in this topic.

      Line: 367 The idea that different cell types in a tissue can express different ECM components, and thus induce localized specialization of a basement membrane is well-supported by recent work in the mouse hair follicle. In this sensory organ, the architecture and composition of the basement membrane is highly specialized depending on the cell-cell and cell-tissue interface considered (Cheng et al., 2018; Fujiwara et al., 2011; Joost et al., 2016). Moreover, different cell populations – epithelial stem cells and fibroblasts, express different ECM components in the hair follicle (Tsutsui et al., 2021), supporting the notion that specific basement membrane organization contributes to cell-cell communication and overall 3D tissue architecture.

      7) Line 215, "However, inhibiting the expression of Collagen IV, Ndg, Perlecan and Sparc individually, by expressing RNAi against these genes in all retinal cells, did not lead to defects in bPS/Mys localization". To conclude so, the authors must demonstrate that the used RNAis efficiently removed its target proteins.

      __RE: __We have removed this section referring to Collagen IV, Ndg, Perlecan and Sparc.

      Instead, we now focus solely on Laminin. Because Laminin accumulation at the presumptive grommet precedes that of the other ECM factors examined in our study, we favor a model in which Laminin plays a key role in promoting integrin localization.

      8)* Line 222, "DG is required to organize the ECM in several experimental settings [42, 43, 45, 51]". Here, the authors must mention to a preceding paper that reported the eye deficiency of Dg mutant flies (PMID: 20463973), and discuss what new findings authors can add to the previous report. *

      __RE: __We have followed this recommendation.

      Line 441: We also note that a previous study showed that early in retinal development, DG localizes at the apical membrane of the photoreceptors. This study proposed that DG promotes elongation of these sensory neurons, independently to any potential role this surface receptor might play in basement membrane organization (Zhan et al., 2010). This conclusion was based on Df(2R)Dg248 mutant clones and trans-heterozygous retinas, where DG function was impaired not only in photoreceptors, but in all interommatidial cell types. Moreover, the basement membrane was not examined in this study. Our work, and the fact the bulk of retinal cell elongation occurs late in retinal development(Longley and Ready, 1995), is consistent with DG playing a role in retinal cell elongation and overall tissue thickening.

      Under “Advance”:

      *The 3D imaging of ommatidia development is beautiful and of good descriptive value. ** However, as mentioned in the major comments 1, 2, 3, and 8 above, I am afraid that the search of preceding literature seems insufficient, and it is often unclear what this manuscript add to existing knowledge. *

      __RE: __The logic of how the reviewer links points 2, and 3 they raise as part of their review, to their assessment of how our work advances the field, is unclear to me. Their Points 2 and 3 have to do with making sure we better explain how the functional ECM transgenes were generated and by whom. The importance the reviewer places on points 2, 3 when considering the Advance our work provides to the field does not appear justified to me.

      Point 1 refers to a previous study by Zusman et al., published in 1993. Using partial loss of function alleles and heat-shock inducible rescue constructs they show that bPS/Mys plays a role in eye development. They note that in adult eyes, retinal cells are not attached to their basement membrane. They show this is accompanied by a failure for the retina to elongate along the apical-basal axis. These phenotypes are consistent with a role for integrins in mediating attachment of epithelial cells to the basement membrane, and we are now referring to this work in the revised manuscript. A much more relevant reference to our work however, is (Longley and Ready, 1995), which we have used repeatedly in our manuscript to stress what was novel about our work.

      Point 8 refers to a previous report implicating DG in photoreceptor elongation, which is a developmental phase that mostly occurs after the process we are studying here (please see Fig.3 of (Longley and Ready, 1995) for quantification using sections). The photoreceptors do no contribute basal profiles at the basal surface of the retina. The DysGFP signal we detect at this tissue surface, in the presumptive and established grommet, is clearly coming from the pigment cells, not from the photoreceptor axons which are found at this basal location. We now discuss this previous report, to make what is clearer what is novel about our own work.

      .

      Minor comments: - Line 85, "This is the case in the follicular epithelium for example". Here, the text would be more reader-friendly if the authors could clarify this is the follicular epithelium of the fly ovary.

      __RE: __We have modified the text to address this comment.

      - Line 203-, regarding all the experiments involving the Gal4-UAS system. Not all the readers are familiar with the system. A brief explanation on it should be added in the main text. Moreover, in the Results section, not in the Methods, the authors should show what Gal4 they used, and where is the Gal4 expressed.

      __RE: __We have amended the manuscript accordingly.

      *- Line 239, "We found that inhibiting the expression of the DG cofactor, dSarcoglycan [53] was most effective in inhibiting this pathway in retinal cells". Here, the authors should show the data. *

      __RE: __This statement is based on the results shown in Fig.8 and Suppl. Fig.9, which make use of a PCA representation to quantify the Dg, Dys and dScg RNAi phenotypes in cell basal geometry. We have re-phrased this statement to make it clear that we are referring to the RNAi-based perturbation of these genes’ expression.

      4. Description of analyses that authors prefer not to carry out

      We will address all the reviewer comments as they will consolidate our findings.

      Our further validation of the few RNAi lines used in our study that have not been used before in publications will also be valuable to the community.

      References:

      Allikian, M.J., Bhabha, G., Dospoy, P., Heydemann, A., Ryder, P., Earley, J.U., Wolf, M.J., Rockman, H.A., and McNally, E.M. (2007). Reduced life span with heart and muscle dysfunction in Drosophila sarcoglycan mutants. Hum Mol Genet 16, 2933-2943.

      Banerjee, S., Bainton, R.J., Mayer, N., Beckstead, R., and Bhat, M.A. (2008). Septate junctions are required for ommatidial integrity and blood-eye barrier function in Drosophila. Dev Biol 317, 585-599.

      Blackie, L., Tozluoglu, M., Trylinski, M., Walther, R.F., Schweisguth, F., Mao, Y., and Pichaud, F. (2021). A combination of Notch signaling, preferential adhesion and endocytosis induces a slow mode of cell intercalation in the Drosophila retina. Development 148.

      Catalani, E., Bongiorni, S., Taddei, A.R., Mezzetti, M., Silvestri, F., Coazzoli, M., Zecchini, S., Giovarelli, M., Perrotta, C., De Palma, C., et al. (2021). Defects of full-length dystrophin trigger retinal neuron damage and synapse alterations by disrupting functional autophagy. Cell Mol Life Sci 78, 1615-1636.

      Caussinus, E., Kanca, O., and Affolter, M. (2013). Protein knockouts in living eukaryotes using deGradFP and green fluorescent protein fusion targets. Current protocols in protein science / editorial board, John E Coligan [et al] 73, Unit 30 32.

      Cerqueira Campos, F., Dennis, C., Alegot, H., Fritsch, C., Isabella, A., Pouchin, P., Bardot, O., Horne-Badovinac, S., and Mirouse, V. (2020). Oriented basement membrane fibrils provide a memory for F-actin planar polarization via the Dystrophin-Dystroglycan complex during tissue elongation. Development 147.

      Cheng, C.C., Tsutsui, K., Taguchi, T., Sanzen, N., Nakagawa, A., Kakiguchi, K., Yonemura, S., Tanegashima, C., Keeley, S.D., Kiyonari, H., et al. (2018). Hair follicle epidermal stem cells define a niche for tactile sensation. Elife 7.

      Christoforou, C.P., Greer, C.E., Challoner, B.R., Charizanos, D., and Ray, R.P. (2008). The detached locus encodes Drosophila Dystrophin, which acts with other components of the Dystrophin Associated Protein Complex to influence intercellular signalling in developing wing veins. Dev Biol 313, 519-532.

      Dai, J., Estrada, B., Jacobs, S., Sanchez-Sanchez, B.J., Tang, J., Ma, M., Magadan-Corpas, P., Pastor-Pareja, J.C., and Martin-Bermudo, M.D. (2018). Dissection of Nidogen function in Drosophila reveals tissue-specific mechanisms of basement membrane assembly. PLoS Genet 14, e1007483.

      Delon, I., and Brown, N.H. (2009). The integrin adhesion complex changes its composition and function during morphogenesis of an epithelium. J Cell Sci 122, 4363-4374.

      Fernandes, V.M., McCormack, K., Lewellyn, L., and Verheyen, E.M. (2014). Integrins regulate apical constriction via microtubule stabilization in the Drosophila eye disc epithelium. Cell reports 9, 2043-2055.

      Fujiwara, H., Ferreira, M., Donati, G., Marciano, D.K., Linton, J.M., Sato, Y., Hartner, A., Sekiguchi, K., Reichardt, L.F., and Watt, F.M. (2011). The basement membrane of hair follicle stem cells is a muscle cell niche. Cell 144, 577-589.

      Hynes, R.O. (2002). Integrins: bidirectional, allosteric signaling machines. Cell 110, 673-687.

      Joost, S., Zeisel, A., Jacob, T., Sun, X., La Manno, G., Lonnerberg, P., Linnarsson, S., and Kasper, M. (2016). Single-Cell Transcriptomics Reveals that Differentiation and Spatial Signatures Shape Epidermal and Hair Follicle Heterogeneity. Cell Syst 3, 221-237 e229.

      Lemke, S.B., Weidemann, T., Cost, A.L., Grashoff, C., and Schnorrer, F. (2019). A small proportion of Talin molecules transmit forces at developing muscle attachments in vivo. PLoS Biol 17, e3000057.

      Longley, R.L., Jr., and Ready, D.F. (1995). Integrins and the development of three-dimensional structure in the Drosophila compound eye. Dev Biol 171, 415-433.

      Matsubayashi, Y., Louani, A., Dragu, A., Sanchez-Sanchez, B.J., Serna-Morales, E., Yolland, L., Gyoergy, A., Vizcay, G., Fleck, R.A., Heddleston, J.M., et al. (2017). A Moving Source of Matrix Components Is Essential for De Novo Basement Membrane Formation. Curr Biol 27, 3526-3534 e3524.

      Mirouse, V., Christoforou, C.P., Fritsch, C., St Johnston, D., and Ray, R.P. (2009). Dystroglycan and perlecan provide a basal cue required for epithelial polarity during energetic stress. Dev Cell 16, 83-92.

      Morin, X., Daneman, R., Zavortink, M., and Chia, W. (2001). A protein trap strategy to detect GFP-tagged proteins expressed from their endogenous loci in Drosophila. Proc Natl Acad Sci U S A 98, 15050-15055.

      Perkins, A.D., Ellis, S.J., Asghari, P., Shamsian, A., Moore, E.D., and Tanentzapf, G. (2010). Integrin-mediated adhesion maintains sarcomeric integrity. Dev Biol 338, 15-27.

      Sarov, M., Barz, C., Jambor, H., Hein, M.Y., Schmied, C., Suchold, D., Stender, B., Janosch, S., K, J.V., Krishnan, R.T., et al. (2016). A genome-wide resource for the analysis of protein localisation in Drosophila. Elife 5, e12068.

      Stowers, R.S., and Schwarz, T.L. (1999). A genetic method for generating Drosophila eyes composed exclusively of mitotic clones of a single genotype. Genetics 152, 1631-1639.

      Tsutsui, K., Machida, H., Nakagawa, A., Ahn, K., Morita, R., Sekiguchi, K., Miner, J.H., and Fujiwara, H. (2021). Mapping the molecular and structural specialization of the skin basement membrane for inter-tissue interactions. Nat Commun 12, 2577.

      Villedieu, A., Alpar, L., Gaugue, I., Joudat, A., Graner, F., Bosveld, F., and Bellaiche, Y. (2023). Homeotic compartment curvature and tension control spatiotemporal folding dynamics. Nat Commun 14, 594.

      Walma, D.A.C., and Yamada, K.M. (2020). The extracellular matrix in development. Development 147.

      Wen, D., Chen, Z., Zhang, Z., and Jia, Q. (2020). The expression, purification, and substrate analysis of matrix metalloproteinases in Drosophila melanogaster. Protein Expr Purif 171, 105629.

      Xie, X., and Auld, V.J. (2011). Integrins are necessary for the development and maintenance of the glial layers in the Drosophila peripheral nerve. Development 138, 3813-3822.

      Xie, X., Gilbert, M., Petley-Ragan, L., and Auld, V.J. (2014). Loss of focal adhesions in glia disrupts both glial and photoreceptor axon migration in the Drosophila visual system. Development 141, 3072-3083.

      Zhan, Y., Melian, N.Y., Pantoja, M., Haines, N., Ruohola-Baker, H., Bourque, C.W., Rao, Y., and Carbonetto, S. (2010). Dystroglycan and mitochondrial ribosomal protein L34 regulate differentiation in the Drosophila eye. PLoS One 5, e10488.

      Zusman, S., Grinblat, Y., Yee, G., Kafatos, F.C., and Hynes, R.O. (1993). Analyses of PS integrin functions during Drosophila development. Development 118, 737-750.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Cell shape remodelling is essential for tissue morphogenesis. To model this event, the fruit fly Drosophila melanogaster has been widely used. In the pupal retina, ommatidial cells change their structure to form the photo-sensing machinery in the compound eye. Previous studies investigating this event mainly focused on the cell shape change at the apical plane. However, the cell shape at the basal side and the three-dimensional (3D) structure of the cells have been little studied.

      In this manuscript, the authors address this issue by combining state-of-art 3D imaging and fly genetics. They report that at the initial stage of eye development, a basement membrane (BM) component Laminin accumulates at the basal side of the ommatidial cells in a manner dependent on the BM-receptor molecule dystroglycan (Dg). The authors propose that this Dg-dependent Laminin accumulation induces the polarisation of integrin at the basal surface, which is essential for proper ommatidia morphogenesis.

      Major comments:

      The beautiful images presented here provide interesting descriptions of the events occurring during eye development. Also, the authors propose an attractive and simple hypothesis that the Dg-dependent recruitment of Laminin leads to integrin polarisation and tissue morphogenesis. However, I'm afraid that this hypothesis is not supported enough by the presented data. In addition, the novelty of some conclusions and the reliability of a number of reagents used are unclear. Specific concerns are described below:

      1. Line 169, "From these experiments, we conclude that Integrin adhesion is required for cell basal geometry remodeling during retinal morphogenesis". It has been long known that integrin is necessary for the gross morphogenesis of the eye (e.g., Zusman et al. 1993, PMID: 8076515). The authors need to cite these preceding researches and should clarify what new findings this new work adds to the previous knowledge.
      2. Line 180, "Using available functional GFP protein traps [49, 50]", the authors investigate the behaviour of Laminin subunits LanA and LanB1. First, ref [50] is not relevant here and should be removed. Moreover, the Laminin-GFPs the authors used are not protein traps, but transgenic strains harbouring genes and most of their regulatory information, with the ORFs tagged with GFP [49]. Furthermore, while the ref [49] reported the functionality of LanB1-GFP, this reference did not fully address the functionality of LanA-GFP. The authors need another reference on it (PMID: 29129537), which demonstrated that LanA-GFP rescues LanA mutants.
      3. Related to the issue above, in addition to LanA and LanB1, the authors examine the localisation of the following BM proteins using GFP-fusion: Perlecan/Trol, Collagen IV/Viking, Nidogen, and SPARC. The authors do not explicitly describe the nature of these GFP fusions, but I am afraid that the authors think all of them are "functional protein traps". However, in fact while Perlecan and Collagen IV are protein traps, Nidogen and SPARC are transgenics including regulatory sequences made in the ref [49]. This must be clarified. Moreover, to rely on the data obtained using these GFP fusions, their functionality must be confirmed by appropriate references or/and the authors' own data. For information, ref [62] showed the functionality of Perlecan-GFP and Collagen IV-GFP protein traps (they are both homozygous viable), and the Nidogen-GFP transgene rescues the BM deficiency of Ndg mutants (PMID: 30260959). These reports must be explained in the text, and I would like the authors collect and show more information.
      4. Line 182-, LanA and LanB1 "accumulate at the center of the ommatidium, in a pattern resembling the grommet structure (Figure 4A and Supplementary Figure 4)"... "LamininA/B1 accumulation at the presumptive grommet precedes Integrin accumulation at this location. It suggests that localized Laminin might control Integrin localization in the interommatidial cells". Based on these results, the authors discuss that "generating specific polygonal geometries at the basal surface of cells starts with organizing the ECM to establish a pattern of Laminin-rich domains, distributed across the tissue basal surface" (Line 267).

      As the authors write "Laminin-rich domains", I suppose that they assume that LanA/B1 accumulates in a restricted region of the BM. However, it has been reported that the majority of Laminin in the fly embryo is soluble and floating in the haemolymph (fly's 'blood' or body fluid) (PMID: 29129537). Therefore, the LanA/B1 observed in the figures might be just floating in the intercellular space and doing nothing on the BM. The authors should exclude this possibility to support their idea that Laminin localised in a specific region of the BM recruits Integrin. For example, does secreted GFP (PMID: 12062063) not behave in the same way as LanA/B1? Can the authors show that the LanA/B1 is indeed incorporated in the BM by FRAP or any methods? 5. Line 200, "These specific patterns of expression for LamininA/B1, Collagen IV, Perlecan, Nidogen and Sparc". I have several comments here: 5A. These patterns are discussed only using single optical sections. To highlight the difference in their localisation patterns more objectively, multiple sections and/or 3D images should be shown. 5B. Can the authors discuss, hypothesise, or speculate the biological meaning of the difference? 5C. It has been reported that in the mammalian skin BM, different components show distinct localisation patterns (PMID: 33972551). It would be interesting to cite this paper and discuss the generality of the non-uniform distribution of BM components. 6. Line 208, "we found that LanB2 RNAi leads to defects in bPS/Mys Integrin localization". Here, because the authors use only single RNAi, there remains the possibility that the observed phenotype was caused by an off-target effect. The authors should exclude this possibility by using another RNAi or mutants. This is the same for all the RNAi experiments. In case of LanB2, however, showing that one RNAi against LanB1 shows the same phenotype would be enough, because LanB1 is another single subunit of fly Laminin. 7. Line 215, "However, inhibiting the expression of Collagen IV, Ndg, Perlecan and Sparc individually, by expressing RNAi against these genes in all retinal cells, did not lead to defects in bPS/Mys localization". To conclude so, the authors must demonstrate that the used RNAis efficiently removed its target proteins. 8. Line 222, "DG is required to organize the ECM in several experimental settings [42, 43, 45, 51]". Here, the authors must mention to a preceding paper that reported the eye deficiency of Dg mutant flies (PMID: 20463973), and discuss what new findings authors can add to the previous report. 9. Line 240. "RNAi against dSarcoglycan led to a decrease in LanA::GFP expression at the presumptive grommet at 20h APF (Figure 7F)". As to this result, the authors seem to interpret that Laminin is not recruited to the "specific BM domain" in grommet in the absence of Dg signalling. However, other possibilities exist, e.g., that the global expression level of Laminin was reduced, or that the intercellular space into which soluble Laminin (see the issue 4 above) flows was narrowed down. The authors should show the data that exclude (or at least reduce) these possibilities. 10. Line 255, "These perturbations led to a failure of bPS/Mys to accumulate at the grommet". Dg mutants are viable (PMID: 18093579); do they show consistent phenotypes?

      Minor comments:

      1. Line 85, "This is the case in the follicular epithelium for example". Here, the text would be more reader-friendly if the authors could clarify this is the follicular epithelium of the fly ovary.
      2. Line 203-, regarding all the experiments involving the Gal4-UAS system. Not all the readers are familiar with the system. A brief explanation on it should be added in the main text. Moreover, in the Results section, not in the Methods, the authors should show what Gal4 they used, and where is the Gal4 expressed.
      3. Line 239, "We found that inhibiting the expression of the DG cofactor, dSarcoglycan [53] was most effective in inhibiting this pathway in retinal cells". Here, the authors should show the data.

      Referee cross-commenting

      This session includes comments from both reviewers

      Reviewer 2: I almost totally agree with Reviewer 1, who is also mainly concerned about the functional analyses part of the paper while being impressed by the authors' beautiful imaging. One issue that Reviewer 1 and I apparently disagree with is the Estimated time to Complete Revisions: while they say 1-3 months, I say 3-6. However, actually I don't think this is a serious discrepancy. Thinking of the time to obtain flies and carry out their crosses necessary for the requested experiments, I'm afraid that the revision cannot be done in 1 month. However, if the authors are fortunate, they may finish the revision in 2-3 months. As I still think that the authors may struggle, I would say the time 2-6 months. I'd be glad if the comments of Reviewer 1 and me could complement with each other to help the revision of the manuscript.

      Reviewer 1:As Reviewer #2 mentioned, there is a strong convergence of our opinions on this article, which should make the work of the authors easier. In fact, I hesitated between 1-3 or 3-6 months for the estimated revision time.

      Reviewer2: Thank you Reviewer #1 for your response. I guess we (Reviewers #1 and #2) have reached an agreement now, haven't we?

      Significance

      General assessment:

      The beautiful images presented here provide interesting descriptions of the events occurring during eye development. Also, the authors' hypothesis on the Dg-dependent recruitment of Laminin leading to integrin polarisation and tissue morphogenesis is simple and attractive. However, I'm afraid that this hypothesis is not supported enough by the presented data. In addition, the novelty of some conclusions and the reliability of a number of reagents used are unclear. Therefore, I cannot say that the conclusions of this manuscript are solid.

      Advance:

      The 3D imaging of ommatidia development is beautiful and of good descriptive value. However, as mentioned in the major comments 1, 2, 3, and 8 above, I am afraid that the search of preceding literature seems insufficient, and it is often unclear what this manuscript add to existing knowledge.

      Audience:

      If the issues mentioned above have been solved, this manuscript would be of general interest to researchers in various fields in cell and developmental biology. Would not be restricted to those using Drosophila.

    1. Men who write openly as gay men have also often been excluded from the consensus of the traditional canon and may operate more forcefully now within a specifically gay /lesbian canon

      I think this is an interesting point to bring up. Maybe the "queer media" we consume and have consumed could have been more widely accepted if homophobia wasn't so prevalent.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We would truly like to thank all 3 reviewers for insightful, helpful and thus constructive comments.

      Reviewer #1

      Summary

      In this manuscript, Lockyer et al. provide novel insights into the mechanism by which Toxoplasma gondii avoids parasite restriction in IFNγ-activated human cells. To identify potentially secreted proteins supporting parasite survival in IFNγ-activated human foreskin fibroblasts (HFF), the authors designed a CRISPR screen of Toxoplasma secretome candidates based on hyperLOPIT protein localization data. By this approach, they identified novel secreted proteins supporting parasite growth in IFNγ-activated cells. Among the gene identified, they found MYR3 a known component of the putative translocon in charge of protein export through the parasitophorous vacuole membrane. Therefore, the authors focused their investigations on GRA57, a dense granule protein of unknown function, which affects parasite survival to a lesser extent than the MYR component. The resistance phenotype conferred by GRA57 was confirmed by fluorescence microscopy. Importantly, the authors provide evidence that the protective function of GRA57 is not as well conserved in murine cells of the same type (MEF) as in HFF. To further explore the mechanism by which GRA57 protect the parasites in IFNγ-activated cells, the authors searched for protein partners by biochemistry. By immunoprecipitation and tandem mass spectrometry, they identified two other putative dense granule proteins, GRA70 and GRA71, which co-purified with GRA57-HA tagged protein. Noteworthy, both proteins were also found in the CRISPR screens with significant score conferring resistance. High-content imaging analysis confirmed the protective effect conferred by GRA57, GRA70, and GRA71 individually at similar levels. After ruling out an effect of tryptophan deprivation in parasite clearance, or a role of GRA57 in protein export normally mediated by the MYR translocon, and a role on host cell gene expression by RNA-Seq, the authors investigated the ubiquitination of the parasitophorous vacuole membrane, a marker previously thought to initiate parasite clearance. A reduction in ubiquitin labeling around the vacuole of mutant parasites is observed, which is quite surprising given the correlated increase in parasite clearance. The authors concluded that ubiquitin recruitment may not be directly linked to the parasite clearance mechanism.

      Major comments

      • Figure 2C. In this figure, the restriction effect of IFNγ is about 60% (or 40% survival) for RHdeltaUPRT parasites grown in HFFs, which is quite different from the 85% mentioned earlier in the results section. How was actually done the first assay? Settings with 60% restriction sounds reasonable and indicates that a substantial fraction of the parasite population evades the restrictive effect of IFNγ, which provides a clear rationale for the main objective of this study, namely the identification of effectors supporting parasite development in human cells in the presence of IFNγ.

      This discrepancy in restriction likely arises from the differences in the parasites used in these assays and the measurements of restriction. The 85%/90% restriction initially mentioned is from the pooled CRISPR screens using the effector knockout pool. This restriction level was assessed by counting of parasites retrieved following infection of IFNg-stimulated HFFs. The 60% restriction of wildtype parasites seen in Figure 2 is a separate assay. This percentage was calculated by measuring total mCherry fluorescence area within infected HFFs. We expect the restriction of the pooled CRISPR population to be higher than in restriction assays performed with either wild type parasites or single genetic knockouts. We included the 85%/90% numbers to highlight that the HFFs were highly restrictive in the screen, but we have now removed references to these numbers in the results section to avoid confusion with later results that use more accurate measures of survival. We refer to this restriction level instead in the discussion section.

      Optional comment: GRA70 and GRA71 were both copurified with GRA57, but what about GRA71 expression and localization? Is there a reason why this protein partner has not been studied further just like GRA70?

      Tagging of GRA71 was attempted but was not successful in a first attempt. We have not re-attempted this tagging as Krishnamurthy et al 2023 (PMID: 36916910) recently tagged and localised GRA71, demonstrating it is also an intravacuolar dense granule protein with similar localisation to GRA57 and GRA70- we feel there is minimal value in us repeating this.

      *Is there any change in GRA57, GRA70, and GRA71 localization and/or amount when cells were pretreated with IFNγ? *

      Thank you for this suggestion, we have now conducted further investigation to address this. We checked the localisation of GRA57-HA and GRA70-V5 in IFNg-stimulated HFFs and found no change to their localisation. This data has been added in Supplementary Figure S4 in our revised manuscript. Alignment of our RNA-Seq data to the Toxoplasma genome, now included as Supplementary Data 4, also shows there is no significant up or downregulation in expression of any of the three proteins when HFFs are pretreated with IFNg.

      Do they still form a complex in the absence of IFNγ?

      We did not investigate this in this manuscript, however in Krishnamurthy et al 2023 (PMID: 36916910) CoIPs using GRA57 and GRA70 in the absence of IFNγ also identified these three proteins as interaction partners, so formation of the complex is likely IFNg-independent.

      • In the absence of GRA70 or GRA71 is GRA57 expression and/or localization affected?*

      We did not investigate this possibility in this manuscript, however doing so would require the generation of epitope tagged lines in knockout backgrounds. We believe this represents a significant body of work and would therefore be suitable for a future study focused on the further characterisation of this complex. The RNA-Seq data shows that GRA70 and GRA71 expression levels are not significantly different in the RH∆GRA57 strain (Supplementary Data 4) which we have now included as a statement in the results section.

      • *Page 13, result section. To determine whether GRA57 has any direct or indirect effect on host cell gene expression, the authors performed RNA-Seq analysis of HFF cells pretreated or not with IFNγ. First, as for proteomic data, were the data deposited on GEO or another repository database? *

      Second, were any effect detected on parasite gene expression? Reads alignment could be done using the T. gondii reference genome to determine whether IFNg or gra57 KO has any effect on parasite genes. Possibly, other secreted proteins not necessarily expressed at the tachyzoite stage and therefore not captured in the hyperLOPIT protein analysis are specifically expressed in these conditions.

      We will deposit the RNA-Seq data on GEO prior to final publication. We did perform read alignment using the Toxoplasma gondii reference genome, and we agree it would be useful to include this analysis. We have now provided this data in Supplementary Data 4. Comparison of parasite gene expression between RH∆Ku80 and RH∆GRA57 revealed very few major changes (L2FC 2) that were also rescued in the RH∆GRA57::GRA57 line, irrespective of IFNg stimulation. Of the few genes that were up or downregulated in the RH∆GRA57 parasites, these were all uncharacterised. Collectively this data did not provide any mechanistic insight into the function of GRA57, and we think it unlikely the GRA57 phenotype is related to major changes in host or parasite gene expression. We have amended the manuscript to highlight this.

      Optional comment: RNA-Seq analysis points to a clear induction of GBPs upon IFNγ treatment in HFF. Given the clear function of GBP in parasite clearance, have the authors ever hypothesized that GRA57 could be involved in preventing GBP binding to the PVM?

      We have not tested if GBP recruitment is influenced by GRA57, however GBPs have previously been shown to be dispensable for restriction of Toxoplasma growth in HFFs (Niedelman et al 2013, PMID: 24042117) despite being robustly induced by IFNg stimulation (Kim et al 2007, PMID: 17404298). We have modified the manuscript to highlight this.

      Minor comments

      • Page 4, introduction, 8th paragraph. Regarding the role of IST, it might be less prone to controversy to state: 'a condition that may only be met in the early stages of infection.'

      We agree and have changed this.

      • Page 4, end of introduction. Changing '... indicating that the three proteins function in a complex'. Changing to '... indicating that the three proteins function in the same pathway.' might be more appropriate for the conclusion.

      We agree and have changed this.

      • Page 4, result section, first paragraph. 'strain specific and independent effectors'. Are the authors talking about strain-specific and non-strain-specific factors?

      Yes- we have changed the text to reflect this.

      - Page 6, result section. 'GRA25, an essential virulence factor in mice'. It is not clear to the reviewer how a virulence factor is essential since both parasite and mouse survival is achieved in the GRA25 mutant. I suggest to replace 'essential' by 'major'.

      We agree and have changed this.

      - Page 7. 'showing that GRA57 resides in the intravacuolar network (IVN) (Figure 2A)'. From the image shown, GRA57 clearly localizes into the PV, but it is hard to tell whether GRA57 is associated with the intravacuolar network. Colocalization assay or electron microscopy would be necessary to draw such conclusions.

      We agree and have changed all references to this localisation as ‘intravacuolar’ instead of specifically the IVN.

      - 'uprt locus'. Lower case letters and italic are generally preferred to designate mutants, whereas upper case letters are generally used for wild type alleles. (Sibley et al., Parasitology Today, 1991. Proposal for a uniform genetic nomenclature in Toxoplasma gondii).

      We agree and have changed this.

      - The authors mentioned in the introduction that ROP1 contributes to T. gondii resistance to IFNγ in murine and human macrophages. However, they did not comment on whether ROP1 was found important in the screen performed here in human HFF cells. It may be useful to reference ROP1 in Figure 1 as GRA15, GRA25, etc.

      ROP1 was not found to be important in the HFF screens (+IFNg L2FCs in RH: -0.1, PRU: -0.46). As ROP1 was characterised as an IFNg resistance effector in macrophages, this discrepancy may therefore represent a cell type-specific difference, so we feel it is not relevant to highlight for the purposes of the screens presented here.

      - Figure 2D. The authors compared the restriction effect of IFNγ on parasites grown in HFF and MEF host cells. However, as represented - % + IFNγ/- IFNγ - it cannot be estimated whether the parasites grew similarly in the two host cell types in the absence of IFN. Please indicate whether or not the growth was similar in both cell types.

      As these restriction assays were not carried out concurrently and were designed to measure IFNg survival, we feel it would be inaccurate to compare parasite growth between the two cell types using this data. The focus of these experiments was to investigate the restrictive effect of IFNg across parasite strains, using the -IFNg condition to control for differences in growth rate or MOI. Therefore we feel it is appropriate for the focus of our manuscript to represent the data in this way.

      - pUPRT plasmid. Any reference or vector map would be appreciated.

      We have added the reference for this plasmid.

      - Page 9, figure 3A, mass spectrometry analysis. I did not find the MS data in supplementals. Were the data deposited in on PRIDE database or another data repository?

      The table was included as Supplementary Data 2, however this was not referred to in the main text. We have now amended the text to include this. The data will be deposited on PRIDE prior to final publication.

      - Figures 3E and 3F. It might be worth mentioning, at least in the figure legend, that GRA3 localizes at PV membrane and is exposed to the host cell cytoplasm (to mediate interactions with host Golgi). The signal for GRA3 following saponin treatment is here an excellent control that should be highlighted, indicating that saponin effectively permeabilized the host cell membrane.

      We agree and have updated the figure legend and the main text. We have also added a reference to Cygan et al 2021__ (__PMID: 34749525) in support of this data, which found GRA57, but not GRA70 or GRA71, enriched at the PVM.

      • Page 11, section title. I think that the authors meant 'GRA57, GRA70 and GRA71 confer resistance to vacuole clearance in IFNγ-activated HFFs.'

      We agree and have changed this.

      • Page 11, in the result section comparing the effect of GRA57 mutant with MYR component KO, the authors are referring to host pathways that are counteracted by MYR-dependent effectors released into the host cell. It is not clear which pathways the authors are referring to.

      It is not known exactly which host pathways mediate vacuole clearance or parasite growth restriction, or which MYR-dependent parasite effectors specifically resist these defences, therefore we have removed this statement from the text for clarity.

      • Page 16, discussion, end of 4th paragraph. '... to promote parasite survival in IFNγ activated cells' sounds better.

      We agree and have changed this.

      • Page 22-23, Methods section, c-Myc nuclear translocation assays and elsewhere. Please indicate how many events were actually analyzed. For example, in this assay, to determine the median nuclear c-Myc signal, how many infected cells were analyzed for each biological replicate?

      We have updated the methods section for the c-Myc nuclear translocation and ubiquitin-recruitment assays to include details on how many events were analysed.

      **Referees cross-commenting**

      Overall, I agree with most of the co-reviewers' remarks. I agree with reviewer #2 that this manuscript reports interesting data for the field of parasitology, but that the broad interest for immunologists is somewhat limited by the lack of a description of the mechanism by which these effectors oppose IFNgamma-inducible cell-autonomous defenses. I also agree with the other reviewers' comments regarding the GRA57, 70, and 71 heterotrimeric complex, which would require further description. In its present form, the manuscript undoubtedly represents an interesting starting point for further investigations and any additional data regarding the mode of interaction of the identified effectors and their function related or not to ubiquitylation would bring a significant added value.

      Reviewer #1 (Significance (Required)):

      Despite the fact that humans are accidental intermediate hosts for Toxoplasma gondii, the parasite may develop a persistent infection, demonstrating that it has effectively avoided host defenses. While Toxoplasma gondii has been extensively studied in mice, much less is known about the mechanisms by which the parasite establishes a chronic infection in humans. In this context, this article described very interesting data about the way this parasite counteracts human cell-autonomous innate immune system. This is a fascinating and important topic lying at the interface between parasitology and immunology. Indeed, the highly specialized secretory organelles characteristics of apicomplexan parasites are key to govern host-cell and parasite interactions ranging from host cell transcriptome modification to counteracting immune defense mechanisms. Overall, this article presents a significant contribution to the field of parasitology by identifying novel players involved in Toxoplasma gondii's evasion of human cell-autonomous immunity. Most conclusions are generally well supported by cutting-edge approaches and state of the art methods. Despite being a highly competitive field, this article stands out as the first screen designed specifically to identify virulence factors for human cells and extends our understanding of the secreted dense granule proteins resident of the parasitophorous vacuole. Importantly, the authors provide evidence that these players are active in different strain backgrounds and act in a way that is independent of the export machinery in charge of delivering effector proteins directly into the host cell. However, substantial further research is needed to fully understand the mechanism by which these novel players confer resistance to the parasite in IFNγ activated human cells and how their mode of action differs from that mediated by the translocation machinery (MYR complex). As a microbiologist and biochemist, I find this work of a particular interest to a broad audience, especially to parasitologists and immunologists, as it may unveil unexpected aspects of human innate immunity involved in parasite clearance with proteins unique to Apicomplexa phylum.

      Reviewer #2

      This paper reports high-quality genetic screening data identifying three novel Toxoplasma virulence factors (Gra57,70, and 71) that promote survival of two distinct Toxoplasma strains (type I RH and type II Pru) inside IFN-gamma primed human fibroblasts. Follow-up studies, exclusively focused on type I RH Toxoplasma, confirm the screening data. Gra57 IP Mass-Spec data suggest that Gra57, 70, and 71 may form a protein complex, a model supported by comparable IF staining patterns

      Major:

      - It is unclear what statistical metric was used to define screen hits as strain-dependent vs strain-independent. A standard approach would be to use a specific z-score value (often a z-score of 2) above or below best fit linear relationship between L2FCU for RH vs Pru as depicted in Fig.1D. Gra25 and Gra35 appear to be specific for Pru but it would be helpful to approach this type of categorization statistically. Also, such an analysis may reveal that only Pru-specific but not RH-specific hits were identified. Could the authors speculate why that would be?

      We did not use a specific statistical metric to define screen hits as strain-dependent vs strain-independent, but GRA57 was selected as a strain-independent hit based on having a L2FC of RH specific: TGME49_309600 (GRA71) & CST9

      PRU specific: GRA35, GRA25, ROP17, GRA23 & GRA45

      Strain-independent: MYR3, GRA57, TGME49_249990 (GRA70) & MYR1

      This agrees with our selection of strain-independent hits. However, we feel that using either L2FC or Z-score cut-offs is equally arbitrary, and we would therefore prefer to leave the data displayed without these cut-offs. It is indeed interesting that there appear to be more strain-specific hits in the PRU screen, but we cannot speculate as to why this may be as we did not explore this further here.

      *- The paper proposes that Gra57, 70, and 71 form a heterotrimeric complex. This is based on the Mass-spec data from the original Gra57 pulldown, similar IF staining patterns, and comparable phenotypic presentation of the individual KO strains. However, only the MS data provide somewhat direct evidence for the formation a trimeric complex, and these data are by no means definitive. As this is a key finding of the MS, it should be further supported by additional biochemical data. Ideally, the authors should reconstitute the trimeric complex in vitro using recombinant proteins. Admittedly, this could be quite an undertaking with various potential caveats. Alternatively, reciprocal pulldowns of the 3 components could be performed. Super-resolution microscopy of the 3 Gra proteins might present another avenue to obtain more compelling evidence in support of the central claim of this work, *

      We attempted a reciprocal pulldown using our GRA70-V5 line which unfortunately failed to verify the MS data, but we believe this is primarily due to differences in the affinity matrix that we used for this pulldown (anti-V5 vs anti-HA) and would require further optimisation or generation of a GRA70-HA line. However, while these revisions were being performed, another group published data demonstrating through pulldown of GRA57 and GRA70 that these proteins interact with each other, GRA71, and GRA32__ (__Krishnamurthy et al 2023, PMID: 36916910). We also identified GRA32 as enriched in our MS data, but to a less significant degree than GRA70 and GRA71. Together we believe that this independent data set is a robust validation of our findings, and strongly justifies the conclusion that these proteins form a complex.

      We agree with the reviewer that further biochemical characterisation of the complex will be an interesting avenue for future research, but we feel it would require a substantial amount of further work. As suggested, super-resolution microscopy of the 3 proteins would require the generation of either double or triple tagged Toxoplasma lines, or antibodies against one or more of the complex members. Again, we feel this would represent a substantial body of further work. Reconstitution of the complex in vitro would require recombinant expression and purification of multiple large proteins that are all multidomain and possibly membrane associated/integrated. Assuming a 1:1:1 stoichiometric assembly this complex would be 446kDa. Purification of such proteins and reconstitution of the complex in vitro is therefore likely to represent many challenges and we do not feel this would be trivial to accomplish.

      - The ubiquitin observations made in this paper are a bit preliminary and the authors' interpretation of their data is vague. The authors may want to re-consider that ubiquitylated delta Gra57 PVs are being destroyed with much faster kinetics than ubiquitylated WT PVs. The reduced number of ubiquitylated delta Gra57 PVs compared to ubiquitylated WT PVs across three timepoints (as shown by the authors in Fi. S8) does not disprove the 'fast kinetics model.' To test the fast kinetics ubiquitin-dependent null hypothesis, video microscopy could be used to measure the time from PV ubiquitylation onset to PV destruction

      We agree with the reviewer that the possibility remains that GRA57 knockouts are cleared within the first hour of infection, and we have amended our text to reflect this. However, we think this is unlikely given that GRA57 knockouts are also less ubiquitinated in unstimulated cells, yet do not show any growth differences in unstimulated HFFs. Also considering the new data we have provided showing reduced recognition of GRA57 knockouts by the E3 ligase RNF213 (Figure 5D), we expect that the observed reduction in ubiquitination is highly likely to be unlinked to the increased susceptibility of GRA57 knockouts to IFNg. We have amended the discussion to state this conclusion more strongly.

      The recently published manuscript that also identified GRA57/GRA70/GRA71 as effectors in HFFs showed that deletion of these effectors leads to premature egress from IFNg-activated HFFs__ (__Krishnamurthy et al 2023, PMID: 36916910). In light of this new data, we hypothesised that early egress could be causing the apparent reduction in ubiquitination. We have now provided data that disproves this hypothesis (Figure S10), as inhibition of egress did not rescue the ubiquitination phenotype. We also did not observe enhanced restriction of GRA57 knockout parasites at 3 hours post-infection (Figure S10B), suggesting clearance, or egress, happens after this time point.

      We agree with the reviewer that determining the kinetics of IFNg restriction of these knockouts in HFFs would be interesting, however we feel this is more suited to future work. Imaging ubiquitin recruitment in live cells would also require the generation of new reporter host cell lines which would require a substantial amount of further work.

      - Related to the point above. We know that different ubiquitin species are found at the PVM in IFNgamma-primed cells but to what degree each Ub species exerts an anti-parasitic effect is not well established. The paper only monitors total Ub at the PVM. Could it be that delta Gra57 PVs are enriched for a specific Ub species but depleted for another? The authors touch on this in the Discussion but these are easy experiments to perform and well within the scope of the study. At least the previously implicated ubiquitin species M1, K48, and K63 should be monitored and their colocalization with Toxo PVMs quantified

      We agree that these experiments are within the scope of this study. We have now investigated the ubiquitin phenotype further by assessing the recruitment of M1, K48 and K63 ubiquitin linkages to the vacuoles of GRA57 knockouts. We observed depletion of both M1 and K63 linked ubiquitin. This data is now included in Figure 5 and Figure S8.

      The E3 ligase RNF213 has recently been shown to facilitate recruitment of M1 and K63-linked ubiquitin to Toxoplasma vacuoles in HFFs (Hernandez et al 2022, PMID: 36154443 & Matta et al 2022, DOI: https://doi.org/10.1101/2022.10.21.513197 ). We therefore additionally assessed the recruitment of RNF213 to GRA57 knockouts, and found RNF213 recruitment was also reduced. Given that a reduction in RNF213 recruitment should correlate with a decrease in restriction, this data further supports our conclusion that the ubiquitin and restriction phenotypes are not causally linked. The observation that GRA57 knockouts are less susceptible to recognition by RNF213 also opens an exciting avenue for further research into the host recognition of Toxoplasma vacuoles by RNF213, for which currently the target is unknown.

      Minor:

      - For readers not familiar with Toxo genetics, the authors should include a sentence or two in the results section explaining the selection of HXGPRT deletion strains for the generation of Toxo libraries

      We agree and have added this in.

      - the highest scoring hits from the Pru screen (Gra35 &25) weren't investigated further. These hits appear to be specific for Pru. Some discussion as to why there are Pru-specific factors (but maybe not RH-specific factors) seems warranted

      As mentioned above, we agree that it is indeed interesting that there appear to be more strain-specific hits in the PRU screen, but we cannot speculate as to why this may be as we did not explore the reasons for this further in this manuscript. Without substantial further investigation it cannot be determined whether these represent true strain-specific differences or reflect technical variability between the independent screens. We therefore feel it is sufficient to highlight effectors with the strongest phenotypes in each screen, without drawing strong conclusions regarding strain-specificity.

      **Referees cross-commenting**

      My reading of the comments is that there's consensus that this is a high quality study revealing novel Toxo effectors that undermine human cell-autonomous immunity and an important study in the field of parasitology. I might be the outlier that doesn't see much of an advance for the field of immunology since we don't really know what these effectors are doing, and the preliminary studies addressing this point are not well developed, with some confusing results.

      My major comment #2 and rev#1's major comment #2 are, I think, essentially asking for the same thing, namely some more robust data on substantiating the formation of a trimeric complex.

      My co-reviewers made great comments all across and I don't see any real discrepancies between the reviewers' comments - just some variation in what we, the reviewers, focused on

      Reviewer #2 (Significance (Required)):

      The discovery of a novel set of secreted Gra proteins critical for enhanced Toxoplasma survival specifically in IFNgamma primed human fibroblasts (but not mouse fibroblasts) is an important discovery for the Toxoplasma field. However, the study is somewhat limited in its scope as it fails to determine which, if any, specific IFNgamma-inducible cell-autonomous immune pathway is antagonized by Gra57 &Co. Instead, the paper reports that parasitophorous vacuoles (PVs) formed by Gra57 deletion mutants acquire less host ubiquitin than PVs formed by the parental WT strain. Because host-driven PV ubiquitylation is generally considered anti-parasitic, this observation is counterintuitive, and no compelling model is presented to explain these unexpected findings. Overall, this is a well conducted Toxoplasma research study with a few technical shortcomings that need to be addressed. However, in its current form, the study provides only limited insights into possible mechanisms by which Toxoplasma undermines human immunity. This study certainly provides an exciting starting point for further explorations.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      Toxoplasma gondii virulence and immune responsed upon infection in mice are well described. In contrast, little is known about human responses, particularly upon IFNγ-activation. However, host ubiquitination of the parasitophorous vacuole has been shown to be associated with parasite clearence in human cells.

      Targeted CRISPR screens were used in the type I RH and type II Pru strain of Toxoplasma gondii to identify dense granule and rhoptry proteins. Human foreskin fibroblasts (HFFs) stimulated with IFNγ were used for infection of the knock-out parasites to identify guide RNAs and thus their corresponding genes to identify genes conferring growth benefits. Beside components of the MYR translocon, gra57 was identified. This gene was then knock-out or epitope-tagged in RH. The tagged line confirmed GRA57 localisation in the intravacuolar network confirming previously published work from another lab. Knock-out of gra57 lead to a moderate decrease in survival in HFFs, but not in mouse cells. Co-immunoprecipitation experiments with GRA57 identified 2 dense granule proteins that also display IFNγ-specific phenotypes with similar localisation as GRA57, and all are resistance factors in IFNγ-activated HFFs. Knock-out of GRA57 does not impact tryptophan metabolism, effector export of gene expression of the host cells. However, deletion of GRA57 or its interaction partners reduces ubiquitination of the parasitophorous vacuole.

      Major comments:

      This is a well executed study with informative, novel data. Here a few comments and questions:

      - LFC cut-off of the CRISPR screen should be clearly stated.

      We have amended this in the text.

      - What is the rationale for using Prugniaud as the type II strain of choice and not ME49?

      Both ME49 and PRU strains are widely used in the field, but as the PRU strain was used previously by our group for in vivo screens of Toxoplasma effectors (Young et al 2019 PMID: 31481656, Butterworth et al 2022 PMID: 36476844) ,using PRU here allows for direct comparison of our screening datasets.

      - Figure 4A does not list all the significant genes that are then mentioned in the text below. This should be amended.

      It is unclear what the reviewer is referring to here (Figure 4A displays restriction assay data).

      *- RNA-Seq data is inadequately presented. Although, the actual genes regulated may be of secondary importance in this study, it would still be good to have a few key genes mentioned as a quality control statement. *

      This was also raised by reviewer 1. We have now modified the manuscript to highlight that we observed robust induction of interferon-stimulated genes in our IFNg-treated conditions, but minimal differential gene expression between HFFs infected with the different parasite strains.

      *- It is stated that "...GRA57 is not as important for survival in MEFs as in HFFS". With no significant change observed, it should be re-phrased to something like ""...indicatin that GRA57 is s important for survival in MEFs as in HFFS." *

      We have re-phrased this statement.

      *- Optional: GRA57 was described by the Bradley lab to be in the PV in tachyzoites and in the cyst wall in bradyzoites. Although it tissue cysts are not the focus of this paper and the knock-out is created also in a cyst-forming strain, it would have been useful to look for a phenotype of the knockout in cysts, in vitro at least, better both in in vitro and in vivo. In future, this could also be useful for the authors bringing in more citations. *

      We agree with the reviewer that the impact of GRA57 on cyst formation would be an interesting topic for further exploration, however the focus of our study is on the role of secreted Toxoplasma effectors during the acute stages of infection.

      Minor comments:

      - Line numbers would be useful for an efficient review process.

      We have added these to the revised manuscript.

      - Strictly speaking, we have to talk about the sexual development taking place in felid and not feline hosts (Introduction; Felidae versus Felinae).

      We have amended this in the text.

      - Please insert spaces between numbers and units.

      We have corrected this.

      - Domain structures are presented, but maybe the AlphaFold 3D predictions could be added in a supplemental figure?

      For GRA70 and GRA71 the AlphaFold 3D predictions are readily available on ToxoDB, whereas for GRA57 the prediction is not available due its size. We therefore independently analysed GRA57 using the full implementation of AlphaFold 2 (not ColabFold). We attempted submissions of putative discrete domains as well as the full-length protein, however both approaches yielded predictions with low confidence and low structural content, except for a ~100aa region of helical residues. We chose not to include the AlphaFold 3D predictions for all three proteins as the confidence for these predictions is low with pLDDT scores of commonly *- To improve the confidence of the co-immunoprecipitation, it would be necessary to use another tagged protein GRA70 or 71) and see if the same complex can be pulled down. Like this, one could also address what happens in a GRA57KO line? Do GRA70 and 71 stay together in the absence of GR57 forming a dimer? *

      Reviewer 2 raised a similar point regarding the reciprocal pulldown, please see above for our detailed response to this. As suggested, we attempted a reciprocal pulldown using our GRA70-V5 line which unfortunately did not reconstitute the complex, but we believe this was due to technical differences in the epitope tag (V5 vs HA) and affinity matrix used. Overall, we believe that more detailed study of the assembly and biochemistry of this complex will require substantially more work and the generation of further cell lines, which would be beyond the scope of this study.

      Reviewer #3 (Significance (Required)):

      Significance:

      This study endeavours to start closing an important knowledge gab of host defence in non-rodent hosts, especially humans. The data is solid using two different strains and yields novel insights into players of host cell resistance in humans against T. gondii. Using a targeted screening approach of rhoptry and dense granule proteins, they focused their interest on a subcategory of secreted proteins. The authors have not limited themselves to the screening and localisation study, but also investigated effect on host cells and host cell response. The identification of GRA57 being an important resistance factor and forming a heterodimer with GRA70 and GRA71 is novel. This study is of interest to cell biologists in the field of cyst-forming Coccidia, especially T. gondii and researchers interested in host resistance, parasite clearance by the host and parasite virulence.

      I am a cell biologist working in Toxoplasma gondii and other Coccidians.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This paper reports high-quality genetic screening data identifying three novel Toxoplasma virulence factors (Gra57,70, and 71) that promote survival of two distinct Toxoplasma strains (type I RH and type II Pru) inside IFN-gamma primed human fibroblasts. Follow-up studies, exclusively focused on type I RH Toxoplasma, confirm the screening data. Gra57 IP Mass-Spec data suggest that Gra57, 70, and 71 may form a protein complex, a model supported by comparable IF staining patterns

      Specific criticisms

      Major:

      • It is unclear what statistical metric was used to define screen hits as strain-dependent vs strain-independent. A standard approach would be to use a specific z-score value (often a z-score of 2) above or below best fit linear relationship between L2FCU for RH vs Pru as depicted in Fig.1D. Gra25 and Gra35 appear to be specific for Pru but it would be helpful to approach this type of categorization statistically. Also, such an analysis may reveal that only Pru-specific but not RH-specific hits were identified. Could the authors speculate why that would be?
      • The paper proposes that Gra57, 70, and 71 form a heterotrimeric complex. This is based on the Mass-spec data from the original Gra57 pulldown, similar IF staining patterns, and comparable phenotypic presentation of the individual KO strains. However, only the MS data provide somewhat direct evidence for the formation a trimeric complex, and these data are by no means definitive. As this is a key finding of the MS, it should be further supported by additional biochemical data. Ideally, the authors should reconstitute the trimeric complex in vitro using recombinant proteins. Admittedly, this could be quite an undertaking with various potential caveats. Alternatively, reciprocal pulldowns of the 3 components could be performed. Super-resolution microscopy of the 3 Gra proteins might present another avenue to obtain more compelling evidence in support of the central claim of this work
      • The ubiquitin observations made in this paper are a bit preliminary and the authors' interpretation of their data is vague. The authors may want to re-consider that ubiquitylated delta Gra57 PVs are being destroyed with much faster kinetics than ubiquitylated WT PVs. The reduced number of ubiquitylated delta Gra57 PVs compared to ubiquitylated WT PVs across three timepoints (as shown by the authors in Fi. S8) does not disprove the 'fast kinetics model.' To test the fast kinetics ubiquitin-dependent null hypothesis, video microscopy could be used to measure the time from PV ubiquitylation onset to PV destruction
      • Related to the point above. We know that different ubiquitin species are found at the PVM in IFNgamma-primed cells but to what degree each Ub species exerts an anti-parasitic effect is not well established. The paper only monitors total Ub at the PVM. Could it be that delta Gra57 PVs are enriched for a specific Ub species but depleted for another? The authors touch on this in the Discussion but these are easy experiments to perform and well within the scope of the study. At least the previously implicated ubiquitin species M1, K48, and K63 should be monitored and their colocalization with Toxo PVMs quantified

      Minor:

      • For readers not familiar with Toxo genetics, the authors should include a sentence or two in the results section explaining the selection of HXGPRT deletion strains for the generation of Toxo libraries
      • the highest scoring hits from the Pru screen (Gra35 &25) weren't investigated further. These hits appear to be specific for Pru. Some discussion as to why there are Pru-specific factors (but maybe not RH-specific factors) seems warranted

      Referees cross-commenting

      My reading of the comments is that there's consensus that this is a high quality study revealing novel Toxo effectors that undermine human cell-autonomous immunity and an important study in the field of parasitology. I might be the outlier that doesn't see much of an advance for the field of immunology since we don't really know what these effectors are doing, and the preliminary studies addressing this point are not well developed, with some confusing results.

      My major comment #2 and rev#1's major comment #2 are, I think, essentially asking for the same thing, namely some more robust data on substantiating the formation of a trimeric complex.

      My co-reviewers made great comments all across and I don't see any real discrepancies between the reviewers' comments - just some variation in what we, the reviewers, focused on

      Significance

      The discovery of a novel set of secreted Gra proteins critical for enhanced Toxoplasma survival specifically in IFNgamma primed human fibroblasts (but not mouse fibroblasts) is an important discovery for the Toxoplasma field. However, the study is somewhat limited in its scope as it fails to determine which, if any, specific IFNgamma-inducible cell-autonomous immune pathway is antagonized by Gra57 &Co. Instead, the paper reports that parasitophorous vacuoles (PVs) formed by Gra57 deletion mutants acquire less host ubiquitin than PVs formed by the parental WT strain. Because host-driven PV ubiquitylation is generally considered anti-parasitic, this observation is counterintuitive, and no compelling model is presented to explain these unexpected findings. Overall, this is a well conducted Toxoplasma research study with a few technical shortcomings that need to be addressed. However, in its current form, the study provides only limited insights into possible mechanisms by which Toxoplasma undermines human immunity. This study certainly provides an exciting starting point for further explorations.

    1. Be this as it may, the peculiar relations between the United States and the Indians occupying our territory are such, that we should feel much difficulty in considering them as designated by the term foreign state, were there no other part of the constitution which might shed light on the meaning of these words. But we think that in construing them, considerable aid is furnished by that clause in the eighth section of the third article; which empowers congress to ‘regulate commerce with foreign nations, and among the several states, and with the Indian tribes.’ In this clause they are as clearly contradistinguished by a name appropriate to themselves, from foreign nations, as from the several states composing the union. They are designated by a distinct appellation; and as this appellation can be applied to neither of the others, neither can the appellation distinguishing either of the others be in fair construction applied to them. The objects, to which the power of regulating commerce might be directed, are divided into three distinct classes-foreign nations, the several states, and Indian tribes.

      This is important information as this is what is going to be used as the body and closing arguments, to strip the Cherokee nation of any help from the Federal government, making it so they have it continue to listen to the laws that Georgia's people set.

    1. Author Response

      Reviewer #2 (Public Review):

      The authors use data from 3 cross-sectional age-stratified serosurveys on Enterovirus D68 from England between 2006 and 2017 to examine the transmission dynamics of this pathogen in this setting. A key public health challenge on EV-D68 has been its implication in outbreaks of acute flaccid myelitis over the past decade, and past circulation patterns and population immunity to this pathogen are not yet well-understood. Towards this end, the authors develop and compare a suite of catalytic models as fitted to this dataset and incorporate different assumptions on how the force of infection varies over time and age. They find high overall EV-D68 seroprevalence as measured by neutralizing antibodies, and detect increased transmission during this time period as measured by the annual probability of infection and basic reproduction number. Interestingly, their data indicate very high seroprevalence in the youngest children (1 year-olds), and to accommodate this observation, the authors separate the force of infection in this age class from the other groups. They then reconstruct the historical patterns of EV-D68 circulation using their models and conclude that, while the serologic data suggest that transmissibility has increased between serosurvey rounds, additional factors not accounted for here (e.g., changes in pathogenicity) are likely necessary to explain the recent emergence of AFM outbreaks, particularly given the broader age-profile of reported AFM cases. The Discussion mentions important current unknowns on the biological interpretation of EV-D68 neutralizing antibody titers for protection against infection and disease. The analysis is rigorous and the conclusions are well-supported, but a few aspects of the work need to be clarified and extended, detailed below:

      1) Due to the lack of a clear single cut-point for seropositivity on this assay, the authors sensibly present results for two cut-points in the main text (1:16 and 1:64). While some differences that stem from using different cut-points are fully expected (i.e., seroprevalence being higher using the less stringent cut-point), differences that are less expected should be further discussed. For instance, it was not clear in Figure 2 why the annual probability of infection decreased after 2010 using the 1:64 cut-point, while it continued to increase using the 1:16 cut-point. It would also be helpful to explain why overall seroprevalence and R0 continue to increase over this time period using the 1:64 cut-point. Lastly, it would be useful to see the x-axis in Figure 4 extended to the start of the time period that FOI is estimated, with accompanying credible intervals.

      For the discussion on differences between the two cut-offs, please see response to essential comment 1.

      Extending the x-axis before 2006 in Figure 4 is not possible. Estimates of the overall seroprevalence at a year y require FOI estimates up until y-40. This implies the first estimates we can provide are for 2006.

      Credible intervals have been added to Figure 4.

      2) Additional context of EV-D68 in the study setting of England would be useful. While the Introduction does mention AFM cases "in the UK and elsewhere in Europe" (line 53), a summary of reported data on EV-D68/AFM in England prior to this study would provide important context. The Methods refers to "whether transmission had increased over time (before the first reported big outbreak of EV-D68 in the US in 2014)" (lines 133-134), rather than in this setting. It would be useful to summarize the viral genomic data from the region for additional context - particularly since the emergence of a viral clade is highlighted as a co-occurrence with the increased transmissibility detected in this analysis.

      We have added a figure (new Figure 1 – figure supplement 1) showing the annual number of EV-D68 detections reported by Public Health England from 2004 to 2020.

      We have also added the following text to the introduction: “Similarly, in the UK, reported EV-D68 virus detections also show a biennial pattern between 2014 and 2018 (Figure 1 – figure supplement 1).”

      We have also amended the sentence in the Methods.

      Finally, below is a screenshot of the nexstrain tree for EV-D68 based on the VP1 region and with tips representing sequences from the UK (light blue) and European countries in colour. There is a lot of mixing between sequences from different regions, indicating widespread transmission and small regional clustering. We have added the following text to the Discussion: “Reported EV-D68 outbreaks in 2014 and 2016 were due to clade B viruses, while the 2018 outbreaks were reported to be linked to both B3 and A2 clade viruses in the UK (10), France (32) and elsewhere.”

      Reviewer #3 (Public Review):

      In the proposed manuscript, the authors use cross-sectional seroprevalence data from blood samples that were tested for evidence of antibodies against D68 for the UK. Samples were collected at 3 time points from individuals of all ages. The authors then fit a suite of serocatalytic models to explain the changing level of seropositivity by age. From each model they estimate the force of infection and assess whether there have been changes in transmissibility over the study period. D68 is an important pathogen, especially due to its links with acute flaccid myelitis, and its transmission intensity remains poorly understood.

      Serocatalytic models appear to be appropriate here. I have a few comments.

      The biggest challenge to this project is the difficulty in assigning individuals as seronegative or seropositive. There is no clear bimodal distribution in titers that would allow obvious discrimination and apparently no good validation data with controls with known serostatus. The authors tackle this problem by presenting results to four different cut-points (1:16 to 1:128) - resulting in seropositivity ranging from around 50% to around 80%. They then run the serocatalytic models with two of these (1:16 and 1:64) - leading to a range of FoI values of 0.25-0.90 for the 1 year olds and 0.05-0.25 for older age groups (depending on model and cutpoint). This represents a substantial amount of variability. While I certainly see the benefit of attacking this uncertainty head on, it does ultimately limit the inferences that can be made about the underlying risk of infection in UK communities, except that it's very uncertain and possibly quite high.

      I find the force of infection in 1 year olds very high (with a suggestion that up to 75% get infected within a year) and difficult to believe, especially as the force of infection is assumed much lower for all other ages.

      The authors exclude all <1s due to maternal antibodies, which seems sensible, however, does this mean that it is impossible for <1s to become infected in the model? We know for other pathogens (e.g., dengue virus) with protection from maternal antibodies that the protection from infection is gone after a few months. Maybe allowing for infections in the first year of life too would reduce the very large, and difficult to believe, difference in risk between 1 year olds and older age groups. I suspect you wouldn't need to rely on <1 serodata - just allow for infections in this time period.

      Relatedly, would it be possible to break the age data into months rather than years in these infants to help tease apart what happens in the critical early stages of life.

      Yes. We have added two figures (new Figures 1C and 1D) showing the prevalence of antibodies in children <1 yo. We show these data for the three serosurveys combined, because the number of individuals per month of age is very small.

      One of the major findings of the paper is that there is a steadily increasing R0. This again is difficult to understand. It would suggest there are either year on year increases in inherent transmissibility of the virus through fitness changes, or year on year increases in the mixing of the population. It would be useful for the authors to discuss potential explanations for an inferred gradual increase in R0.

      We have removed the estimates of R0 from the manuscript.

      On a similar note, I struggle to reconcile evidence of a stable or even small drop in FoI in the 1:64 models 4 and 5 from 2010/11 (Figure 3) with steadily increasing R0 in this period (Figure 4). Is this due to changes in the susceptibility proportion. It would be good to understand if there are important assumptions in the Farrington approach that may also contribute to this discrepancy.

      We have removed the estimates of R0 from the manuscript and only present the reconstruction of the annual number of new infections per age class and year (new Figure 5). We think this measure is more adapted to the discussion of the results.

      In addition, when using the classical expression R{0t}=1/(1-S(t)), with S(t) the annual proportion seropositive, the high seroprevalence estimates (new Figure 4) result in extremely high estimates of the basic reproduction number (median ranges: 11.6 – 29.7 for 1:16 and 3.3 – 7.6 for 1:64 during the period 2006 to 2017).

      We had previously used the Farrington approach as it is adapted to cases when the force of infections is different for different age classes.

      The R0 estimates (Figure 4) should also be presented with uncertainty.

      R0 no longer presented, but estimates of overall seroprevalence now presented with uncertainty.

      Finally, given the substantial uncertainty in the assay, it seems optimistic to attempt to fit annual force of infections in the 30 year period prior to the start of the sampling periods. I would be tempted to include a constant lambda prior to the dates of the first study across the models considered.

      We thank the reviewers for the suggestion.

      We implemented this change (constant FOI before 2006) in the previous models without maternal antibodies and the result for the random-walk-based models was that the variance of the random walk was estimated over a very short period, thus resulting in a rather non- smoothed FOI.

      Implementing this change with the new models with maternal antibodies and random-walk on the FOI was technically a bit complex. We therefore kept the simple random-walk over the whole period and added the following paragraph to the Discussion:

      “It is important to interpret well the results for the estimates of the FOI over time from our analysis under the assumptions of the models. First, as the best model uses a random walk on the FOI, the change in transmission that we infer happens continuously over several years. In reality, this may have occurred differently (e.g. in a shorter period of time). Our ability to recover more complex changes in transmission is limited by the data available. It would not be surprising if EV-D68 has exhibited biennial (or longer) cycles of transmission in England over the last few years, as it has been shown in the US (7) and is common for other enteroviruses (30). However, it is difficult to recover changes at this finer time scale with serology data unless sampling is very frequent (at least annual). Therefore, our study can only reveal broader long-term secular changes. Second, interpretation of the results before 2006 must be avoided for two resasons. On the one hand, as we go backwards in time, there is more uncertaintly about the time of seroconversion of the individuals informing the estimates of the FOI. On the other hand, because age and time are confounded in cross-sectional seroprevalence measurements, the random walk on time may account for possible differences in the FOI through age (possibly higher in the youngest age classes, and lowest in the oldest), which are note explicitly accounted for here. This may explain the decline in FOI when going backwards in time before the first cross-sectional study in 2006.”

    1. How do you interpret the term mental model and why do you think that it is important for learning?

      Having learned the concept of 'schema' in my high school's psychology course, I think the mental models should mean the same thing as schema: we either assimilate new knowledge to pre-existing mental structures or we accommodate new knowledge to form new structures. These mental structures play essential roles in increasing the speed of memory encoding and recalling. Even though sometimes schema may cause memory distortions since it's based on pre-existing notions, with enough rehearsal, this could be avoided.

  4. Mar 2023
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to reviewers

      We thank all reviewers for their comments and suggestions. In line, below, are our responses, marked in Bold. Textural changes in the manuscript are also marked in Bold.

      Reviewer #1__ (__Evidence, reproducibility and clarity (Required)):

      **Summary:**

      Anuculeate red blood cell (RBC) is one of the interesting biological models that indicate the presence of eukaryotic circadian system independent of transcription-translation feedback. In this manuscript, the authors set up a new method for quantifying the circadian rhythmicity in RBC. The method called "Bloody Blotting" was developed through the careful and insightful investigation of "non-specific band" observed in the western blotting of peroxiredoxin, which has been used for the circadian monitoring of RBC. The authors characterized that the "non-specific' circadian-fluctuating signals, which can be observed by ECL imaging without any antibodies(-HRP), were attributed to ferrous-haem, but not ferric-haem, cross-linked to Hb upon cell lysis. Through the Bloody Blotting, this study suggests that the circadian fluctuation of ferrous-/ferric-haem exist in human and mouse RBC, and the period of rhythmicity is not affected by the canonical clock genes.

      **Major comments:**

      1)Although the authors conducted a careful biochemical evaluation of the "Bloody Blotting" signal, it is still unclear whether the changes in the Hb* (or Hb2*) signal corresponds to the changes in the ferrous-haem level in vivo. A direct perturbation on the level of in vivo ferrous-/ferric-haem is required. For example, is the Hb* (or Hb2*) signal decreased by the administration of amyl nitrite (in mice)?

      __Thank you for the suggestion. We have addressed this and the second reviewer’s comment in a new Figures 4 & S4 and section titled “Effect of rhythms in metHb on vascular flow and body temperature”. __

      For clarity, we have relabelled the schematic in A to “rest phase” and “activity phase” to consolidate data from humans and mice which both feature in the manuscript. We performed two experiments to test the model in Fig 4A and perturb metHb in vivo. The first is a direct perturbation of metHb levels in vivo with sodium nitrite, an oxidising agent that causes methaemoglobinemia. Reflecting our results ex vivo, RBC from differentially entrained mice sampled at the same external time, but 12h apart in terms of the light:dark cycle, contained significantly different metHb levels, with more metHb in the rest phase (revised Figure 4B, C). Whereas, RBCs from mice also given nitrite in their active phase contained more metHb (and thus lower Hb2* activity) than control (revised Figure 4B and S4). The second experiment tests the effect of sodium nitrite on core body temperature. Our hypothesis predicts that nitrite should accentuate the daytime drop in core body temperature, via the increased metHb-mediated production of NO to stimulate increased vasodilation (Ignacio et al 1981 and Cosby et al 2003). Revised figures 4D and E show that the effect of nitrite on body temperature (which has a large active vs inactive difference) is indeed daytime-specific.

      Methods for these experiments have been added to Experimental Procedures.

      2)The authors speculated that the higher PRX-SO2/3 signal during the first 24 hrs in mice is due to the sapling time at the resting phase (line ~235). The effect of sampling time should be easily tested by maintaining the mice group in 12-hr shifted L/D cycles and sampling the blood in the same o'clock (i.e., now the active phase). This type of experiment is also critical for the evaluation of Bloody Blotting because the level of Hb*/Hb2* signals may be affected by not only the circadian timing of mice but also the daily environmental fluctuation of a biochemistry laboratory (this is particularly important for the Bloody Blotting because some of the critical steps including the cross-linking between haem and Hb are supposed to occur in a test tube). If the signal of Bloody Blotting reflects the in vivo circadian rhythmicity, the 12-hr shifted L/D mice RBCs should have 12-hr shifted Bloody Blotting fluctuation pattern.

      __We acknowledge this possibility. To test this, we sampled RBCs from mice kept under DL and LD conditions, as detailed in the new sections in the Experimental Procedures, harvesting blood at the same clock time. This gave us blood from mice in the “active” phase and “rest” phase – labels as per Figure 4B. Figure 4B shows that Hb2* signal significantly differs between mice in active and rest phases, even though these samples were collected and processed at the same external time. __

      Separately from Hb2* activity, upon further reading of the literature we suspect that the higher PRX-SO2/3 signal detected in mouse RBCs (Fig 2) compared with human may be due to blood acidification during animal sacrifice by CO2. Additional text has been added to Supplementary Figure S3 to remark upon this, as follows:

      "Interestingly, compared with human RBC time courses (Henslee et al., 2017; O’Neill and Reddy, 2011), we observed that murine PRX-SO2/3 immunoreactivity was extremely high during the first 24 hours of each 72-hour time course (Figure S3A). We attribute this to the different conditions under which blood was collected: blood was collected from mice culled by CO2 asphyxiation during their habitual rest phase by cardiac puncture and exposed immediately to atmospheric oxygen levels, whereas human blood was collected from subjects during their habitual active phase through venous collection into a vacuum-sealed collection vial. Thus, the initial high PRX-SO2/3 signal in mice may be related to CO2-acidification of the blood during culling, which affects PRX-SO2/3 but does not affect Hb oxidation status____."

      3)Do the casein kinase inhibitors (ref: Beale, JBR 2019) affect the period of Bloody Blotting signals?

      We have not experimentally addressed this as we consider it beyond the scope of the current study, which has instead focused on the in vivo relevance of the rhythms in metHb. Nevertheless, given the identical periodicity of PRX rhythms and Hb* rhythms (this paper), and the periodicity of PRX rhythms and rhythms in membrane conductance (Henslee et al, Nat Commun, 2018), we see no reason why the period lengthening of rhythms in membrane conductance reported in Beale et al, JBR, 2019 would not also been seen in PRX or Hb* rhythms.

      **Minor comments:**

      4)The authors quantify the dimer of Hb (Hb2*). This is important information but only explained in the supplementary figure legend. It should be explained in the main text. In addition, it is difficult to evaluate the fluctuation of Hb* (not Hb2*) because, as the authors stated, most of the Hb* signals are saturated. The saturation problem should be easily solved by reducing the sample loading volume. Quantification of Hb* is important at least experiments shown in figure 1A-G because the dimerization of Hb can be also affected by factors other than the in vivo ferrous-/ferric-haem conversion.

      Thank you for pointing this out. Indeed the data throughout the original manuscript is Hb2*. We have brought this explanation into Figure 1 legend and labelled all figures consistently with Hb2*. We include quantification of Hb* and Hb2* of the in vivo metHb perturbation experiment (Figure 4) in the uncropped membranes shown in Supplementary Figure 4. The quantification of Hb* (Supplementary Figure 4D) gives the same result as the quantification of Hb2* (Figure 4B).

      5)In the quantification of Hb2* (Figure 1A, 2E, 3C), were the signals normalized to Total Hb?

      In the quantification of Hb2* throughout, signals were normalised to total protein through coomassie stain, apart from Figure 4B which used SYPRO Ruby. Each figure presents the Hb band of coomassie or SYPRO Ruby for simplicity, but the full gels are included in Supplementary Figures 1, 3 and 4.

      6)The explanation and interpretation of the experiment shown in figure 3D should be more careful. The pulse-oximetry was conducted in normal working day conditions (real world setting) and thus should be affected by environmental and social daily signals.

      __We have changed the section to the following (edits in bold): __

      "Remarkably, in contrast to total Hb (SpHb) that displayed no significant 24h variation, the proportion of metHb (SpMet) in the blood exhibited a striking daily variation that rose during the evening and peaked during the night (Figure 3D). These subjects were in a real-world setting, and thus affected by environmental and social cues from a normal working day. However, the evening rise and night-time peak is consistent with ____the reduction in Hb2* activity at the end of the waking period in laboratory conditions (Figure 3B)____."

      7)Typos at figure indicators in supplementary figure legends. Sup figure 1A legend refers to main figure "2" (should be 1), and figure S3 legend refers to main figure 1 (should be 3).

      Thank you for pointing this out. We have corrected these legends.

      Reviewer #1 (Significance (Required)):

      The detection of circadian oscillation in RBC has been not easy because the experiment requires careful sample preparation and specific antibodies (Milev Methods Enzymol 2016) or a specific instrument for dielectrophesis (Henslee). The Bloody Blotting technic developed in this study will overcome this technical problem because Bloody Blotting does not rely on specific antibody and only requires conventional tools for western blotting. Because circadian biology of RBC is particularly important in the field of circadian research to evaluate the presence of eukaryotic circadian oscillator without transcription-translation feedback loops, this study will be interested a wide community of circadian clock researchers. This reviewer has expertise in the field of circadian genomics, biochemistry, animal experiments in mice as well as human.

      Thank you for taking the time to read and constructively comment on our work

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): **Summary**

      The study aims to provide a new tool for detecting the hemoglobin oxidative status named "Bloody blotting". It is based on redox- sensitive covalent linkage between the haem and the haemoglobin. This linkage is a consequence of an artifactual reaction provoked by the protein extraction, due to the lysis buffer's properties. In addition, using an in vitro (red blood cells) or in vivo (patients' blood) model the authors provide insight in the oscillating nature in the oxygen-carrying and nitrite reductase capacity of the blood, which is unaffected by the mutation of CK1εtau/tau and Fbxl3aafh/fh

      **Major comments:**

      In my honest opinion, the work does not provide interesting addition to what it is known in literature. The conclusions are summarized into a model (Fig.4) t, which is too speculative related to the amount and quality of results showed in the paper.

      __We are disappointed by the reviewer's response. The physiological basis for daily rhythms in body temperature cooling is not currently understood, this work provides a testable basis for understanding it. Whilst we understand that the reviewer might not find immediate value in the biochemical mechanisms that initially informed our investigation, the recent publication of our investigation of human brain temperature rhythms (Rzechorzek et al., Brain, 2022) demonstrates that daily biological temperature rhythms are of broad interest (Altmetric score >2000). Daily temperature rhythms have almost exclusively been assumed to result from daily rhythms in heat production, yet the evidence for a contribution via daily rhythms of cooling is equally strong yet has received scant attention. __

      __The speculative model that the reviewer refers to was a hypothesis that drew together multiple lines of published evidence for future experimental testing, not a conclusion, and was labelled as such in the original manuscript. To accommodate the reviewer's critique, however, we tested the model with new experiments, that are included in the revised Figure 4 and section titled “Effect of rhythms in metHb on vascular flow and body temperature”. __

      For clarity, we have relabelled the schematic in A to “rest phase” and “active phase” to consolidate data from humans and mice which both feature in the manuscript, and described it as a hypothesis to avoid confusion. We performed two experiments to test this model. The first is a direct perturbation of metHb levels in vivo with sodium nitrite, an oxidising agent that causes methaemoglobinemia. RBCs from mice given nitrite in their active phase contain more metHb (and thus lower Hb2* activity) than control (Figure 4B). Reflecting our results ex vivo, RBC from mice sampled 12h apart contain significantly different metHb levels, with more metHB in the rest phase (Figure 4B, C). The second experiment tests the effect of sodium nitrite on core body temperature. Our model predicts that nitrite should further reduce core body temperature in the daytime, via the increased production of metHb (Figure 4C) and vasodilation (Ignacio et al 1981 and Cosby et al 2003). Figure 4D and E show that body temperature (which has a large active vs inactive difference) is further lowered upon nitrite treatment, and that this effect is restricted to the daytime, consistent with our hypothesis.

      __Methods for these experiments have been added to Experimental Procedures. __

      The title is misleading. The authors did not use any mutant for clock factors, but they used a kinase (CK1εtau/tau) and a ubiquitin ligase (Fbxl3aafh/afh) mutant, which are important in the regulation of proteins belonging to the clock machinery.

      We respectfully disagree that the title was misleading. Mice and cultured cells/tissues that are mutant for CK1 and FBXL3 demonstrably show altered clock gene activity (See Godhino et al, Science, 2007, also Meng et al, Neuron, 2008, also Fig 2). Moreover, CK1 and FBXL3 are generally regarded as key components of the circadian clock due to their critical function in the regulation of clock proteins (e.g., Hirano et al, Nat. Struct. Mol. Biol., 2016). Being anucleate, RBCs lack the capacity for changes in clock gene activity and the period of oscillation is not affected by mutations that affect the activity and period of clock gene-oscillations in nucleated cells and whole mice. Since the rhythms of Hb oxidation persist in isolated RBCs, they cannot be dependent on clock gene activity and so must be considered to function independent of clock genes.

      In light of the new data on mouse body temperature presented in revised Fig 4D/E, however, we have changed the title to better communicate the revised scope of the manuscript, as follows:

      "Mechanisms and physiological function of daily haemoglobin oxidation rhythms in red blood cells"

      Speaking of the specific points described in the paper, there are aspects that are not convincing. First, the bloody blotting is a consequence of a specific reagent contained in the lysis buffer used for the protein extraction, which reacts with the haemoglobin beta and alpha (as shown by Mass Spec). The peroxidase reaction is an artifact coming from this reaction, which simply follows the rhythmicity of peroxiding accumulation in the red blood cells, whose rhythmicity is known to be circadian. I do not really understand the utility of this technique, which anyway is limited to the specific lysis buffer, but for scientific reasons, researchers need often a different kind of lysis buffer. This means that the approach shows strong limitation to the chemical environment of the lysis buffer. I do not see in it a useful tool that can replace antibodies.

      Apologies, we have not been clear enough. The bloody blotting is indeed a consequence of lysis, since that lysis condition fixes the cellular state at the time of lysis. In this case, the variation in Hb oxidation status is fixed at the time of lysis. The peroxidase activity we report is indeed revealed on membranes by the covalent interaction of the haem and Hb, which occurs at the point of lysis, and reports the oxidation state of the haem at the point of lysis. As we detail, haem exhibits peroxidase activity, so the signal we observe at molecular weights corresponding to Hb and Hb2 is peroroxidase activity due to covalently bound haem, where the peroxidase activity varies with the oxidation state of the haem. We have reorganised text associated with Figure 1, including changes to the final paragraph of the section to make explicitly clear that that the rhythm is due to a fixing of the redox state of Hb at the time of lysis – that a true underlying rhythm is revealed.

      This technique is indeed limited to the observation of haem-peroxidase activity in RBCs on membranes. But as we explain in the manuscript, this is a far quicker and simpler method of observing RBC circadian rhythms than other methods, including immunoblotting for peroxiredoxins. Furthermore, it is common to change lysis buffer according to the downstream purpose.

      Second, the oscillation in the peroxidase activity of PRX-SO2/3 is well known to be circadian (Edgar et al., 2012. doi:10.1038/nature11088.).

      Many apologies, we do not understand the point. It is indeed correct that PRX-SO2/3 abundance oscillations have been reported in RBCs and other cells and organisms. Here we report another rhythm, separate to PRX: the rhythm in Hb:metHb. The PRX-SO2/3 blots serve as a positive control for rhythmicity.

      Finally the circadian rhythms of red blood cells is already described and the corresponding author already published different papers about. The info provided in this paper do not add any new piece to the puzzle.

      Respectfully, we report a novel rhythm in RBCs and demonstrate its functional relevance in vivo in humans (Figure 3) and mice (Figure 4), i.e., it is the identity of the rhythmic species that is novel, not that there are rhythms. What we further add with this study is that rhythms are not influenced by the cellular/organismal environment during RBC development (Figure 2), occur in vivo, in freely moving people (Figure 3) and metHb has a functionally significant role in body temperature rhythms (Figure 4). Furthermore, we report a novel technique for uncovering this rhythm in RBCs.

      At this stage I do not consider the paper suitable for a publication. Other observations. Authors should describe how cells were synchronized.

      RBCs in vitro were not synchronised by external cues. As reported in the Methods section, they were maintained at constant temperature after isolation. Fibroblasts were synchronised by temperature cycles as detailed and employed previously.

      In experiments performed in vitro should be used the SD instead of the SEM.

      We respectfully disagree. The SEM quantifies how precisely you know the true mean of the population - in each case we use it, we also present replicates’ data from which the mean is calculated (e.g. Fig 1A, Fig 2E, Fig 3B and Fig 4B). This gives the real scatter of the data, as a SD would.

      **Minor comments:**

      There are many English mistakes in the article, also errors in naming figures in the figure legends.

      We have carefully re-examined the manuscript to find and fix these errors.

      Figure 1B needs an appropriate loading control.

      We have added the coomassie loading control to revised Figure 1B, with uncropped membranes shown in revised Supplementary Figure 1B

      In experiments performed in vitro should be used the SD instead of the SEM.

      SD vs SEM, see reply above.

      Reviewer #2 (Significance (Required)):

      Nature and significance of the advance At this stage I do not see any significance or advance in the field.

      Compared to existing published knowledge. The The bloody blotting seems to be an original approach although full of limitation and based on artifactual reactions. PRX-SO2/3 is well known to be circadian (Edgar et al., 2012. doi:10.1038/nature11088.), therefore the paper does not add any new insight. The clock mutation do not affect the circadian rhythm in RBC is also known (O' Neill and Reddy, 2011). Therefore the results showed in the figure 3 support already published observations but do not add any particular insight.

      It is unclear to us how the reviewer has misunderstood the scope and focus of the manuscript to such an extent. All previous work in this area by our own and other labs has been appropriately acknowledged. To reiterate the novel elements of this work:

      - A daily rhythm of Hb redox state in mouse and human red blood cells, in vitro and in vivo. This was speculated about in O'Neill & Reddy (Nature, 2011) but never directly tested until now.

      - That clock gene mutations that post-translationally regulate circadian period in nucleated mammalian cells do not affect circadian period in anucleate mammalian cells. O'Neill & Reddy (Nature, 2011) did not show this, rather we looked at (nucleated) fibroblasts that were deficient for Cry1/2 (a transcriptional repressor).

      - A novel assay for measuring mammalian RBC rhythms - nowhere is it proposed that the assay would be useful in any other context, as the reviewer seems to imply.

      - A mechanistic basis for understanding how daily rhythms in cooling of body temperature might arise, a poorly studied aspect of mammalian physiology.

      __The elements in this work that are not completely novel are included as controls, they are not the focus of the manuscript e.g. PRX-SO2/3 rhythms have not previously been shown under these conditions in mouse RBCs, only human, so these blots are included as a control for rhythmicity in Fig2. Similarly, the period of oscillation of a genetically-encoded Cry1:Luc reporter in mouse fibroblasts would be predicted to be longer and shorter in Fbxl3 and Ck1 mutants, respectively, but nowhere this been published so we have included it as a control. __

      Audience Chronobiologists, and medical science.

      Fild of expertises (reviewer) Chronobiology, molecular biology, medical science.

      **Referees cross-commenting**

      I read your comment and they were very detailed. From my point of view I am very skeptical, as I discussed about the utility of the Bloody Blotting. Also the results showed in the paper are not very innovative fro my point of view. I would like to know what do you think about.

      The rhythmicity is given by the elements present in the protein extraction. The reaction is given by the specific lysis buffer used in that experiment. Using another lysis buffer would not allow anybody to see some signal without a proper antobody. The authors claim that bloody blotting is useful because a researcher does not need to buy an antibody, but what if you don't work with a total total extract of proteins? In that case, you need to change the lysis buffer, and, therefore, the bloody blotting is not useful anymore. However, If you believe in that way, and you are two people agreeing in that, I will not oppose myself although I do not agree.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)): **Summary:** Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      The manuscript "Clock gene-independent daily regulation of haemoglobin oxidation in red blood cells" describes a new assay for quantification of haemoglobin oxidation status (bloody blotting) in anucleate red blood cells". This study furthers our understanding of the role of a post-translational oscillator (PTO) in generating circadian rhythms in biology. The authors first describe how earlier work demonstrated 24h rhythms in the intensity of chemiluminescent bands on membranes blotted with protein from red blood cells (RBCs) in the absence of antibodies after exposure to ECL. They go on to address what these bands represent (through various approaches including the use of chemical inhibitors and mass spectrometry) and conclude that they are observing haemoglobin oxidation status. It is proposed that this assay represents a novel manner (complementary to earlier work) in which to report circadian rhythms in RBCs. The manuscript goes on to demonstrate the persistence of 24h rhythms in haemoglobin oxidation status in murine RBCs, including cells isolated from two clock mutant mice. Finally, the study utilises RBCs collected from human volunteers maintained under controlled conditions and demonstrate robust rhythms via "blood blotting", this data is presented alongside pulse co-oximetry data to examine physiological relevance of these rhythms.

      **Major comments:** -Are the key conclusions convincing?

      The key conclusions are well supported by the data. The discussion does become quite speculative, and this needs to be addressed.

      -Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      The discussion around the physiological relevance of daily regulation of haemoglobin redox status is extensive (lines 366-403) as is the discussion on RBCs and the TTFL-less clock mechanisms (lines 405-429). Whilst interesting and well thought out, and well supported by the literature, these sections are very speculative and in my opinion should be toned down.

      Thank you to the reviewer for both the compliment and suggestion. Indeed, these discussion sections were too long. We have reorganised the physiological relevance section to reduce its length and better accommodate the new data presented in the new experiments in Figure 4.

      We have cut the TTFL-less section text by more than half.

      -Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      Further experiments would be required to support the discussion about the role of daily rhythms in haemoglobin oxidation status in regulating oxygen carrying capacity of the blood, vascular tone, body temperature and sleep-wake cycle. As the authors state, these experiments are beyond the scope of this study, but are of course of major interest. It would be more appropriate to limit the discussion to what has been demonstrated directly by the data presented, with just a few sentences speculating on physiological relevance.

      __As above, we acknowledge that we were speculative in that section and we have curtailed the discussion as suggested. __

      -Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      If the focus of the discussion is shifted as suggested, there is no need to pursue any further experiments. -Are the data and the methods presented in such a way that they can be reproduced? Yes. The methods are complete, and data presented very well. -Are the experiments adequately replicated and statistical analysis adequate?

      Yes

      **Minor comments:**

      -Specific experimental issues that are easily addressable.

      1.In the murine fibroblasts/RBC experiments in Figure 2 - what genotype were the wildtype controls? The main text suggests PER2::luc (line 226) but methods suggest Cry1:luc - could the authors clarify this?

      __Thank you for pointing out this mistake, corrected text to Cry1:luciferase __

      2.In figure 2B and 2D the blots show two samples for each time point (except for 72h where there is just one) are these technical repeats? This should be clarified.

      Apologies, the labelling of this figure was not clear – for space reasons we only labelled every 2nd timepoint – the time course was 3-hourly. We have corrected the figure to label each timepoint.

      3.The controls for the bloody blots are referred to as coomassie in Figure 1. In Figure 2, the controls for PRX-SO2/3 are referred to as "loading" but are coomassie stained gels - could this be standardised? Also Figure 2D - no controls? In Figure 3B controls are referred to as 'Total Hb from coomassie staining - I wasn't clear what this was.

      Thank you. Throughout we have now labelled loading controls by their method (coomassie or SYPRO Ruby). Figure 2D is taken from the same gel as Figure 2B and so the same coomassie gel stain is used as a loading control. We have altered the figure legend to reflect this. Each figure presents the Hb band of coomassie or SYPRO Ruby for simplicity, but the full gels are included in Supplemetary Figures 1, 3 and 4. We have changed each figure legend to reflect: “coomassie stained gels were used as loading controls; the Hb band from the coomassie stained gel is shown”.

      4.Figure 3A "S1" and "S2" stated in legend but only "S" used in the schematic

      Many thanks for pointing this out. We have corrected the schematic to S1 and S2.

      -Are prior studies referenced appropriately? Yes absolutely. -Are the text and figures clear and accurate? Mostly, few comments above.

      -Do you have suggestions that would help the authors improve the presentation of their data and conclusions? No

      Reviewer #3 (Significance (Required)): -Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      The study describes a rapid and relatively simple assay for observing 24h rhythms in RBC function. On a technical basis - this will likely be of significant use to others in the field. Further work examining rhythms in haemoglobin oxidation in RBCs in clock mutant mice confirms independence from the transcriptional-translational feedback loop, which further supports earlier work in this field. Finally, studies in humans (bloody blotting in combination with pulse co-oximetry) provide a glimpse into the functional relevance of these daily oscillations

      -Place the work in the context of the existing literature (provide references, where appropriate).

      The authors have done an excellent job of reviewing the literature in the field and contextualising their data. This current data is a significant advance in the field.

      -State what audience might be interested in and influenced by the reported findings.

      This work will be of interest to circadian biologists and adds weight to the relatively new concept of a post-translational oscillator (PTO). Further work showing the relevance of this PTO on physiological function will be of great interest.

      -Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Circadian, Clock genes, mouse models,

      I do not have a background in biochemistry and do not feel overly qualified to comment constructively on approaches taken to address what is driving the observed rhythmic peroxidase activity in RBCs (e.g NiNTA affinity chromatography, use of reductants to reduce thioester bonds and use of NEM to alkylate Hb cysteine residues).

      **Referees cross-commenting**

      In terms of the utility, as my review indicated, I do feel that this manuscript advances the field, providing a rapid and relatively simple way to measure rhythms in RBCs. Reviewer 1 explained this nicely in their significance summary.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC- 2023-01819

      Corresponding author(s): Gernot Längst and Harald Wodrich

      Full revision of the manuscript

      1. General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      2. Point-by-point description of the revisions

      Dear Reviewers, thank you very much for your appreciation of our study and your input. In this point-to-point response, we amended our text marked in blue colour.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The authors have addressed the nucleoprotein structure of human adenovirus during the very early stages of infection, and its relationship to onset of expression of viral genes, using a combination of RNA-seq, MNase-seq, ChIP-seq and single genome imaging. They show that in the virion and the newly-infecting DNA, protein VII is precisely position at specific sites on the viral DNA, with greater accessibility at early gene promoters compared to other regions. Nucleosomes containing H3.3 replace specific protein VII at distinct positions at the transcription start sites of genes, which are then acetylated. Association with histones and nucleosomes occurs prior to transcription. These studies confirm and greatly expand on results already in the literature, and also elucidate a novel role for protein VII in orchestrating positioning of nucleosomes prior to initiation of transcription.

      The authors provide excellent data in support of their conclusions and, in many instances, use alternative experiments (i.e. two different approaches) to support their claims. The details of methods are adequate (with small exceptions outlined below) and statistical methods appropriate.

      Minor comments:

      Line 561 "Protein VII molecules were exchanged for positioned nucleosomes at the +1 site of actively transcribed genes". This statement seems to suggest that the +1 position almost acts as a nucleating site, where replacement of a single, specific protein VII molecule at +1 is an initiating event, which then spreads from that site and into the rest of the gene. Data shown in Figure 6G and 6H shows that H3.3 appears to be found equally along the full length of E1A as early as 1 hr post infection (with no real "enhancement" at the +1 position), and that the overall levels simply increase over the next 4 hrs.

      As the reviewer pointed out, the histone ChIP-seq peaks are broader than the +1 nucleosome region, extending into the transcribed regions of the gene. This is expected, as the mean length of the immunoprecipitated DNA is about 400bp long. Still, ChIP-seq peaks are in proximity to the transcription start site and overlap with the position of the +1 nucleosome. As we do not have the required resolution, we toned done our statement. The text now reads as follows: “Protein VII molecules were exchanged for nucleosomes downstream of the transcription start site, overlapping the +1 nucleosome site, of actively transcribed genes“ (line 568 ff).

      Curiously, the authors chose not to use a wildtype virus for their studies - the virus contains a deletion in the E3 region. For clarity, I suggest that the authors should preferentially use an alternative designation for their virus rather than HAd-C5. Perhaps HAd-C5delE3 to differentiate this work from studies that truly use wildtype virus.

      As requested by the reviewer we have updated the nomenclature to HAd-C5dE3 throughout the text and the figures.

      The obvious limitation of the studies using the fluorescent TAF1-beta to label Ad genomes is that as protein VII is replaced by nucleosomes, the genomes would have declining detection by this method. Genomes devoid of protein VII would be "invisible".

      Our MNase data show that within the first 4h only a fraction of pVII is removed from the viral genome e.g. at early genes, while most of the genome remains bound by protein VII. This should provide enough binding sites for TAF1-beta to label Ad genomes without a significant drop in the signal. Furthermore, our recent work (PMID:29997215, Fig. 1D) compared the TAF1-beta labelling system with a second in vivo detection system (AnchOR3) that directly labels the viral DNA independently of protein VII in the same cells. This direct comparison of two technically non-related methods to detect individual incoming adenoviral genomes in living cells showed the equivalence of both methods, at least for the first hours of infection showing that partial removal of protein VII does not affect the fluorescent TAF1-beta staining.

      Line 275 "Interestingly, a central region of the viral genome (Late3) and a region between the E3 and E4 genes exhibited almost no peaks" for protein VII. The virus utilized in this study lacked at least part of the E3 region. Did this deletion "cause" this region to be devoid of protein VII? Is the same absence of protein VII peaks observed in a fully wildtype virus? Also, can the authors provide any speculation as to why the Late3 region also lacks protein VII?

      We confirm the reviewer's observation. The region marked as Late3 and the region between E3 and E4 is present in the genome and is, as the reviewer observed, not chromatinized in our analysis. At this point, we can only speculate. We have two not mutually exclusive hypotheses. First, both regions could be involved in the proper packaging of the viral genome into the capsid. Physical constraints during packaging may preclude this region from being packaged into pVII. Second, as we observed that pVII positioning correlates with distinct DNA sequence patterns (revised Fig.4 D and E, see response to reviewer 3 for details), it might be that the sequence composition at the pVII depleted regions disfavour pVII assembly to keep those regions available for cellular factors that drive processes post genome delivery, such as transcription. Our time-resolved MNase analysis shows that indeed post genome delivery, this site in the Late3 region becomes protected (Fig. 5C), suggesting the binding of one or more cellular factors. As shown in Figure S6 we find conserved binding sites for several transcription factors at this MNase protected site.

      Whether the chromatinization devoid regions would shift in position, remain in place or be chromatinized in a wildtype virus has to be addressed in the future and cannot be answered at this point. To address the comment, we have expanded the discussion (line 620 ff)

      Line 569 "Reasons could be that the few genomes undergoing nucleosome assembly and active transcription produce the replication enzymes, whereas the bulk of genomes enters replication without activation as an elegant way to avoid repeated chromatinization." This argument may make sense in the context of a high MOI infection, but would certainly limit virus function during normal, pathogenic infection where the MOI is likely extremely low. Essentially, the authors data predicts that 80% of normal, low MOI infections don't progress to gene expression (at least during the first 4 hrs analyzed in this study).

      We follow the argument of the reviewer. The high MOI in our study was necessary to perform the combined ‘omics’ approach to arrive at meaningful data within reasonable sequencing depth. To have equivalence we also used high MOI for the imaging approach. A detailed analysis for the effect of low MOI as well as positioning effects (see reviewer comment below) on transcriptional activation is an important question and will be addressed in future studies that require different techniques in addition. To address this comment, we have updated the discussion to emphasize the importance of MOI and positioning effects (line 587 ff).

      Line 576 "This observation is in agreement with recent pVII-ChIP experiments showing transcription and replication independent pVII removal in early infection (Giberson et al., 2018; Komatsu and Nagata, 2012; Komatsu et al., 2011)." The authors can also state that histone and nucleosome deposition is also independent of transcription and replication, as has been alluded to in the same cited studies but proven more directly in this study.

      We have changed the text accordingly (line 576 and 598).

      Line 672 - the authors should be more definitive in the MOI that are used in all of their experiments. Line 672 states that an MOI of 3000 physical particles are applied per cell. There can be great variation between cell lines in how much virus binds to (and enters) a cell based on the surface levels of Ad receptors on different cell types. However, in general, 3000 is very high. Work by Wang et al. (PMID:24139403) showed that at an MOI of 200 or below most Ad will traffic correctly to the nucleus, whereas at an MOI above 200 there is a significant defect in Ad trafficking within the cell. How is this expected to affect all of the results in this study?

      We agree with this and the other reviewer that this is an important issue. The actual dose of virus that enters a given cell is dependent on the concentration of virus particles in the inoculum and the time and temperature this inoculum is in contact with the cells and the cells respective susceptibility to the virus. We applied an infection dose of 3000 physical particles per cell in a defined volume (1ml) at 37˚C for 30min followed by inoculum removal. We prefer this description because with these infection conditions, we find on average well below 100 virus particles that enter the cell (=> This is e.g., reflected in the number of accumulating genomes shown in figure 2A). In contrast, this permits to have enough viruses inside the cell to perform the different “omics” techniques applied in our study to obtain meaningful results at reasonable sequencing depths. This experimental setting was carefully chosen in full awareness of the work by Wang et al., cited by the reviewer, to avoid e.g., overloading the nuclear import rate. Thus, our experimental conditions do not exceed the “MOI of >200” that would affect nuclear import rates. The number (>200) in the Wang et al. study refers to the number of virus particles inside the cell, the infection condition used in the Wang study was an MOI of 30 bound to Hela cells in the cold for 30min and warmed for 150min which is significantly more virus than we have used in our study. We have expanded the information on the MOI used in the material and methods section to clarify this point (line 685 ff).

      Figure 5 is of low resolution and was difficult to read.

      We thank the reviewer for spotting it. It seems that the Figure quality was compromised during the PDF conversion. We updated the Figures and checked the resolution after PDF conversion.

      Figure S3 is missing a box from the top set of images indicating the region that is expanded in the detail picture.

      We updated Figure S3

      While I realize it is supplemental data, the difference in quality between the agarose gels shown in Figure S4A and S5A is shocking.

      The nature of the experiments is very different and therefore the expected MNase digestion profiles on agarose gels look different. In Figure S5 viral particles were digested with MNase, resulting in a smeary decrease in DNA size. This looks very different from the regular MNase pattern of whole cells that is dominated by the regularly spaced nucleosomes in the heterochromatic regions of the genome. As pVII protects only about 70bp of DNA and its spacing is not as homogenous as the nucleosomal spacing, the pictures shown in Figure S5A were expected as they are.

      Figure S7 is of low resolution.

      We updated the Figures and checked the resolution after PDF conversion.

      Reviewer #1 (Significance (Required)):

      At least in the field of adenovirus research, this is a very important study. There has been considerable debate in the field regarding the timing and degree of protein VII removal and histone deposition, and the necessity of active transcription for these two events. The data provided in this manuscript clearly shows that some protein VII is removed from early active genes and replaced by nucleosomes, and that these events occur prior to initiation of transcription. The authors speculate that the specific placement of protein VII, a protamine-like protein, on the Ad genome prescribes where nucleosomes are placed. This finding should be of interest to a broad general audience, as it provides novel information on chromatin assembly within mammalian cell. Key words for this reviewer: adenovirus research, HAdV nucleoprotein structure

      Reviewer #2 (Evidence, reproducibility and clarity (Required))

      The submitted manuscript presents a detailed and comprehensive analysis of the adenoviral nucleoprotein complexes as infection progresses, starting with the "adenosome" assembled with pVII which are then progressively replaced with H3.3.-containing nucleosomes as the infection progresses. The submission presents a combination of in situ and populational analyses of the viral DNA accessibility and complexes through infection. I brief, the infecting viral genomes are assembled in some 250 adenosomes with pVII, which become progressively replaced as infection progresses with nucleosomes containing H3.3 and acetylated H3K17, starting at the active promoters of the E genes. Chromatin remodeling precedes transcription, and the accessibility differs for genes of different kinetic classes at differ times after infection, although there is no correlation between accessibility and H3.3. or acetylation content. Only about 20% of the genomes become transcriptionally active, though, which somewhat complicates the analyses of the populational studies of accessibility and occupancy. Overall, the study is well conceived, performed and presented. A few issues that deserve further analyses and discussion, as described below.

      Major issues.

      As figure 2 nicely shows, only about 20% of the intranuclear genomes become transcriptionally active. However, MNase and ChIP analyses cannot differentiate these genomes from the 80% that are transcriptionally inactive. The interpretation of the positioning of pVII (figure 4) or the changes in compaction of the adenoviral chromatin at different loci (figure 5) does not appear to consider this heterogeneity other than for a brief comment about the stringent MNase digestion in page 11. The authors favor a model in which the changes in compaction shown in figure 5, at mild MNase digestions, directly correlate with transcription of the respective genes. This could well be correct, and in fact the correlation may be underestimated as 80% of the genomes may not undergo any changes, but it may also be incorrect. The analyses presented cannot differentiate whether the changes in chromatin compaction occur in only a subset of genomes or in all the genomes, regardless of whether they are transcribed or not, or even only in the non-transcribed genomes (which appears extremely unlikely). This intrinsic limitation to the methods used (and I know of no better alternative) should be acknowledged and discussed for the benefit of the reader. This limitation also impacts the analyses of the lack of correlation between H3.3 and acetylated H3K27 occupancy and compaction.

      A discussion is amended and located starting from line 571 in the text. “The heterogeneity of 80% inactive genomes and 20% activated genomes complicates the analysis of the MNase-seq data. High MNase concentrations do not differentiate between both states, and we suggest that low MNase conditions capture the dynamic viral proportion, changing and preparing its genome for gene activation. The data nicely suggest such a scenario, but there is the caveat that we catch an effect of the mixed population that we cannot differentiate.”

      The analysis of the histone ChIP is discussed below.

      Perhaps out of necessity to reach the required sensitivity, a high multiplicity of infection was used (although the actual moi is not stated, there are about 25-30 pVII foci/ per nuclei). The presentation, analyses and discussion of the results should emphasize this context. For example, one would presume that at low moi, when only one genome enters each cell, the percentage of transcriptionally active genomes in a given cell will be either 0 or 100%, but the "system" becomes saturated as more and more genomes enter the nucleus at higher moi resulting in only a subset of them being transcriptionally active. Along this line of reasoning, it is intriguing that the percentage of genomes estimated to be in nucleosomes at 4 hpi (14%) approaches the percentage of transcribed genomes.

      This issue was also raised by reviewer 1 (see detailed comment above). The reviewer is correct that we chose to use a higher MOI to reach the required sensitivity in our different “Omics” assays. The imaging approach was adapted to reach the morphological equivalence to fit this analysis. We agree that it would be interesting to also study the MOI effect on transcriptional activation (as well as positioning effects, see comment below) but this requires different approaches and will be addressed in a future study. To address this comment (and others in this review) we revised the text in the discussion to emphasize the importance of MOI and possible other effects such as positioning (line 587 ff).

      The changes in chromatin compaction presented in figure 5 are in some respect puzzling. The compaction of most of the late genes increases as infection progresses, at least for the first four hours, as the authors discuss. However, the L genes appear to be at least as accessible as the E ones at the early times, when only the E are transcribed to high levels. This appears counterintuitive, and may not be consistent with the main conclusion that increase accessibility to a given gen directly correlates to its transcriptional activity level. The data presented in Figure 5C deserves a more nuanced analysis and discussion, parsing out the changes in accessibility to each given gene at different times from the different accessibility to the different genes at any given time. The later does not appear to support the main conclusion reached by the authors that accessibility to each individual gen correlates with its transcriptional level.

      We thank the reviewer for raising this point. While the viral genomes enter the nucleus, the viral chromatin structure is tightly condensed. Therefore, it is unlikely that after nuclear entry the viral chromatin undergoes further compaction. With our analysis, we expect to detect only decompaction of genomic sites relative to 0 hpi, when the virus has not entered the nucleus yet. At some sites and particularly at the Late genes the signal is decreasing, most likely due to normalization to sequencing depth and the variation in the number of viral genomes but not due to changes in compaction. We realized that the negative accessibility scores we used in the study are misleading and give a false impression. Therefore, we changed the analysis in that way, that negative values were not permitted and converted to zeros.

      Additionally, we raised the temporal resolution of the analysis and compared the accessibility at all available timepoints against 0 hpi as suggested by the reviewer. Now, we clearly observe, that most accessibility changes are accomplished rapidly after nuclear import, already at 1 hpi and do not change much after, until 4 hpi. Regions of decompaction coincide with early expressed genes and occur before transcription, underscoring the conclusions made in the study. Nevertheless, while most genomic regions covering late genes do not show decompaction, we observed some local sites showing a high accessibility score. As transcription at those sites appears later in the life cycle of the virus, we can only speculate about the function e.g. as enhancer elements.

      The Text and Figures were changed accordingly (line 347 ff).

      New legend:

      __C) __Profile illustrates HAd-C5dE3 genome coverage by low MNase-seq fragments. The average of two replicates is shown, except at timepoint 0 hpi where only one replicate was available. The accessibility score was calculated as the log(fold-change) between the indicated timepoint and 0 hpi. The score was assessed for each pVII peak (orange bars) and negative scores were set to 0. A new accessibility peak arising during infection in the Late3 region is marked by an asterisk. __D) __Boxplot showing the accessibility score distribution in each domain at each tested timepoint after infection.

      Minor comments

      The authors may wish to highlight in the discussion that the analyses are so far limited to a single adenovirus.

      We have taken up the suggestion of the reviewer and included it in the discussion part, starting at line 607:

      “The structural analysis is still limited to a single adenovirus genotype and it will be interesting to test whether these dynamic changes are conserved among other adenoviruses. Furthermore, reproducing such organization in adenoviral vectors could result in efficient and sustained transgene expression.”

      The y-axes in the transcriptome figures (figure 1 B, S2) could be presented in Log(2) scale, such that transcript levels at all times can be appreciated in the same graph (the earlier times are just not visible in a linear scale)

      As requested by the reviewer we changed the data to log2 scale. As there is no qualitative difference to the log10 scale, presented in the original version, we would like to keep the figure as it is. To highlight changes at early time points we generated the average expression of early genes in Fig1C.

      As an information for the reviewer, we provide here the data plotted as log2 scale.

      The (lack of) phenotype of the 24xMS2 binding site recombinant adenovirus used should be shown.

      We observed no difference in phenotype between the parental and the MS2 modified virus. We updated Figure S3 and included a gel analysis and specific infectivity data to show this absence of difference.

      The kymograph analyses presented in figure 3B appear to show that there are some sites of transcript accumulation sites which do not harbor viral genomes (i.e., green only tracks). Moreover, the interpretation of the TAF1beta-mCherry signal is complicated by the (fully expected) significant "background" signal. Although these results are consistent with those obtained by RNAscope/pVII staining, there appears to be intrinsic limitations to the system, which preclude reaching strong conclusions from it. These confirmatory analyses should probably be moved to the supplementary information section and removed from the main text and figures. The longer evaluation data mentioned as not shown in page 8 is critical to the conclusions and should be shown.

      Here we disagree with the reviewer and prefer to keep the data as main figure. All (immobile) transcript accumulation sites are identified by the kymograph analysis and coincide with a genome while free transcripts show a high mobility that is not picked up in the kymograph analysis. This is independently verified in the provided supplemental movies. Depending on the positioning of the genome inside the living cell, accumulating transcripts can appear adjacent to or on top of a genome. This explains the slight shift between RNA and DNA signal for some genomes in the merged image of the kymograph. This is expected as only fully transcribed transcripts and not nascent transcripts are marked by MS2 (the MS2 loops are positioned in the 3’UTR). Also, all genomes (transcribing and non-transcribing) can be identified in the kymograph above background level. To clarify the representation, we have added labels to the kymograph to show which signal is DNA and RNA and a merge respectively. We are convinced that this data set is in strong support of our study, as it is the only technique that permits the discrimination of transcribing and non-transcribing genomes in living cells at real time.

      As requested, we have also added two additional examples for a longer observation period (10min) into the supplemental data Fig. S3C.

      Although the plot of cleavage frequency presented in figure S5 is clear, it would be beneficial to the reader if the actual peaks were also presented to compare their distribution (if any) in gDNA and virus particle.

      In Figure S5 we wanted to test whether the regions lacking pVII peaks are resulting from the absence of pVII, protecting the DNA, and therefore being fully hydrolyzed by MNase, or whether this region is tightly packed by pVII thereby protecting DNA from MNase digestion. To test both possibilities we used a very limited MNase digestion approach, where even free DNA is not fully hydrolyzed, allowing the capture of DNA fragments. Therefore, the sequenced fragments comprise a mixture of protected and un-protected fragments. In this assay, the pVII protected fragments are not fully digested to the monomeric state, but a mix of mono-, di- and other multimers are present. As reflected by the fragment size distribution with the peak between 100-200 bp (Fig S5B), pVII dimers are predominantly enriched when compared to the high MNase digestion used to map pVII positions (compare to Fig4 B). Therefore, the peaks in the S5 data set have a low resolution and do not provide exact pVII positions (see below). Therefore, we would like to keep S5 as it is. We clarified this point in the text (line 279 ff)

      Legend:

      Fragment coverage plot of MNase digestions of gDNA (black) or Ad chromatin in virus particles (purple).

      The mRNA analyses of selected transcription factors provides little information, as there is no context, there is variability between experiments, and in most cases the changes appear modest. As these results are not critical to the conclusions or analyses, perhaps the authors may wish to remove them from the manuscript. Alternatively, more in-depth analyses would be required.

      We agree with the reviewer, that more information for the reader is needed. Therefore, we performed a statistical analysis of expression changes between 0 hpi and 4 hpi of the shown transcription factors using DESeq2. We added the corresponding log2(fold-change) and p-values to the figure. And adapted the text (line 471) and figure accordingly.

      Legend:

      Gene expression changes of transcription factors over the infection time course. P-values and log2(fold-changes) from differential gene expression analysis between 4 hpi and 0 hpi using DESeq2 are indicated. ns = not significant

      It is unclear why the even distribution of H3.1-flag signal across the genome is considered indicative of no specific recruitment. The results presented are equally consistent with equal incorporation across the genome. Perhaps the authors have some additional information, such as an irrelevant antibody, input DNA, or the like, to support the conclusion. If so, that evidence should be presented and discussed. If not, the interpretation should be revisited. As an added complexity, endogenous H3.1 is normally expressed during S-phase. It is possible that Adenovirus infection may induce higher levels of expression of (untagged)endogenous H3.1, which would outcompete the tagged ectopically expressed histone. These analyses deserve a more nuanced and in-depth analysis.

      We have taken several measures in the study to address the concern of the reviewer. We consider timepoint 0 hpi as background control as the viral genome has not entered the nucleus yet. Consistently, we observe very few reads mapped to the Ad genome regardless of antibody and construct used (Fig 6B). Additionally, all samples at 0 hpi cluster together in PCA (Fig 6C) and correlation analysis (Fig S7D)

      H3.1 Flag tagged samples show at later timepoints (1 - 4hpi) slightly higher percentages of mapped reads to Ad, but plateau already at 1 hpi (Fig 6B) and cluster together in PCA (Fig 6C) and correlation analysis (Fig S7D) with 0 hpi samples. The low background signal starting at 1 hpi for H3.1 might arise due to the change of Ad genome location to the nucleus.

      Even though, the number of Ad mapped reads at later timepoints was low in H3.1 Flag tagged samples, it could still be that they accumulate at few sites on the Ad genome indicating a specific deposition. We tested this by plotting the signal across the whole Ad genome (Fig S7E) and zooming into the data (compare scale of H3.3 and H3.1 plot), but we could not detect any reproducible local enrichments. To enable the reader a better comparison between the levels of H3.1 incorporation with H3.3 we put now both on the same scale (Fig 6D and Fig S7D) clearly showing that we cannot detect H3.1 incorporation at Ad genomes in the first 4 hours of infection. The H3.1 signal corresponds to the background noise. We think for two reasons, that it is very unlikely that endogenous H3.1 outcompetes the tagged H3.1:

      • The time scale for the cells to transition into S-Phase and upregulate endogenous H3.1 would be only 1-2 hours in our timeseries experiments and therefore too short. To also show these experimentally we amended an experiment for the reviewer that is not included in the manuscript. The Western blots below show that the protein amount of H3 does not increase in the first 4hours of infection. Cells were infected and whole cell extracts were prepared 4hpi.
      • As most cells are not in S-phase in our experiments, the expression levels of H3.3 variant is higher than H3.1. With the Flag ChIPs we can clearly show that the tagged H3.3 are not outcompeted by endogenous H3.3. As there is a high sequence similarity between H3.3 and H3.1 it is very unlikely that they behave in that regard differently.

        It is highly unlikely that the somewhat higher H3K27ac signal observed in the H3.3 than in the H3.1 expressing cells may result from higher H3.3. occupancy in the viral genome as speculated in page 13. The total levels of H3.3. are unlikely to increase by the ectopically expressed one, and even if they did it is not likely that the occupancy of the viral genome would be limited by the levels of H3.3. This speculation should be removed.

      We removed the speculation.

      Materials and methods are too concise. A longer more detailed version, as supplementary information, would be highly desirable.

      We extended the materials and methods part.

      Reviewer #2 (Significance (Required)):

      The major strengths of this manuscript lie on its comprehensiveness, using several in situ and populational approaches to address biologically critical questions regarding the regulation of viral replication by chromatin and epigenetics. Experiments appears very well designed and performed and are mostly clearly presented. The interpretation analyses and discussion of the results may benefit from a more nuanced analysis of the issues posed by the existence of different populations of viral genomes in the cells infected at high moi and the accessibility across different genes at any given time versus the levels of transcription of the different genes, which appears not to be fully consistent with one of the main conclusions reached.

      This study makes a very significant contribution, describing the dynamic changes in the adenoviral nucleoprotein complexes at the early times of infection and providing a full description of both the adenosomes and the nucleosomes in more and less transcribed loci. The results are properly analyzed in context of what is known about the regulation of viral gene transcription by chromatin dynamics in other systems, including similarities and differences. This study is likely to be of high interest to a wide audience, ranging from virologist to epigeneticists, to those working in gene therapy and vectored vaccines.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The manuscript "Adenoviral chromatin organization primes for early gene activation" combines RNA-seq, MNase-seq, ChIP-seq, and single genome and transcript imaging (immunofluorescence, RNA-scope, and live cell techniques) during early Adenovirus infection in vitro to characterise the spatiotemporal dynamics of viral chromatin organisation and association with gene transcription. The manuscript is an interesting read and the authors have combined multiple complimentary techniques to make a substantial contribution to understanding the early events occurring after nuclear import of viral genomes. Adenoviruses are important causes of human and animal pathology, are a useful model of non-integrating extra-chromosomal DNA virus infection in mammalian cells, and are useful vectors for vaccination and the discoveries may influence gene therapy DNA vector design. The chromatin organisation in adenovirus infection is distinct from other DNA viruses, and is relatively poorly understood compared to, for example, SV40 or herpesviruses. The manuscript describes an early transition from purely viral chromatin with Adenovirus protein pVII packaging the virus in virions, to a viral-human hybrid chromatin pattern with apparently strategically positioned H3.3 nucleosomes and viral pVII "Adenosomes" in the early hours after nuclear import of the viral genome. The data shows that packaged Adenoviruses are in a transcriptionally accessible form and gene expression occurs rapidly after infection, the combination of the MNase-seq data with ChIP-seq data is particularly interesting demonstrating and average ~238 adenosomes positioned by specific DNA code protecting 60-70bp of DNA, and that the genome is accessible at loci that also decondense on infection, with adenosomes being replaced by cellular H3.3 containing nucleosomes at distinct sites. Particularly they show that +1 H3K27 acetylated nucleosomes are acquired at the TSS of key early genes. The authors argue that their spatiotemporal data imply that this chromatin transition "primes" for early gene transcription. The manuscript is well written, uncovers important viral chromatin biology by combining multiple experimental techniques, and the data is generally very clearly presented. A few comments follow. Major concerns: • Abstract and Title: o the abstract and title suggest that because the chromatin changes are observed coincidentally or before transcriptional changes, and that this means that these chromatin changes "prime" (title) or are "required" and play a "central role" (abstract) in early gene expression. The temporal relationship would be consistent with chromatin changes being required for transcriptional changes, but do not imply necessity. Experiments to demonstrate the necessity of these changes for early gene transcription are lacking, and I recommend amending the text or additional experiments to provide this evidence directly.

      We observe a clear timing of events, with chromatin opening, nucleosome assembly at the 5’ end of the gene followed by transcriptional activation, suggesting that these structural changes are essential for gene activation. Still, we cannot prove the direct dependency. Therefore we toned down the title of our manuscript and formulate the findings more conservatively.

      The title now reads: “Changes in adenoviral chromatin organization precede early gene activation”

      Results:o The IF data in Fig S1 is convincing, showing viral particles are accessible quickly in the nucleus. Although no statistics are provided for S1B and C, pVII foci appear at 0.5hpi and appear to mostly accumulate between 0.5hpi and 1hpi with further import between 1hpi and 4hpi. Can the authors be sure that a single pVII IF focus represents a single genome? If genomes tend to aggregate as they accumulate the number of foci per nucleus may not increase linearly with the number of genomes imported. Have the authors considered analysing the intensity of the individual pVII foci over the time points? A related question is whether the authors assume that all packaged virions contain intact complete viral genomes? Many viruses comprise some mixture of complete and incomplete packaged genomes, and the subsequent analyses determine the proportion of transcriptionally active copies with RNA-Scope to a single transcript E1A which lies at one end of the viral genome. Please comment explicitly on whether this is assumed and whether this assumption is realistic in light of known Adenovirus biology.

      We appreciate the reviewer's concern. Several studies in the adenovirus field have shown equivalence between protein VII nuclear foci and individual genomes, including our own (PMID: 26332038). Probably the most accurate study was performed by Daniel Engels lab PMID: 19406166, who used nuclear protein VII foci to titrate viral as well as vector genomes. In contrast, a different study from Patrick Hearings lab PMID: 21345950 showed that past 4hpi, the number of nuclear protein VII foci gradually declines. Based on our experience and because our study is limited to 4 hpi we are confident that protein VII foci accurately reflect individual viral genomes.

      Concerning genome packaging, adenovirus particles contain a single viral genome that is protected at each end by a covalently attached protein preventing its degradation. The packaging of adenoviruses is extremely efficient and only complete genomes are packaged into fully assembled particles. All viruses used in this study have been purified by double CsCl gradient purification. This density gradient based purification protocol removes all particles that are either empty or damaged or would contain partial genomes.

      o The RNA-Seq data in Fig 1 and Fig S2 and Table S1 demonstrates transcription of early genes is barely observable at 1hpi but is observable by 2hpi and is clearly much increased by 4hpi. Fig 2C, visualising pVII foci directly within single cells, suggests that approximately 80% of foci are observed by 1hpi and a further 20% between 1hpi and 2hpi and little thereafter. These data convincingly demonstrate that nuclear import is rapid, typically occurring in the first hour. The E1A RNA-Scope data in figure 2, visualising individual mRNA transcripts of E1A, is more sensitive than the bulk RNA-Seq, and shows transcripts at 1hpi with clearly discernible transcription by 2hpi (2A&D) which suggests that transcription occurs early, by 2hpi. Thus transcription lags nuclear genome import by approximately one hour by these methods. However, the conclusions of the subsequent analyses depend on the chromatin changes clearly preceding, rather than being approximately coincident with transcription, therefore transcription being evident by 2hpi is relevant as figure 6A and D suggest that the chromatin remodelling is subtle before 2hpi on the bulk sequencing analyses. The authors should comment on this given the importance to their argument.

      As stated by the reviewer we observe a clear lag between nuclear import and transcriptional activation. And we do also observe the largest changes in nucleosome occupancy (ChIP-seq data) between 1 and 2 hpi (Fig6A and D). Compared to 0hpi, we observe the strongest increase of nucleosome occupancy between 1hpi and 2hpi (4-8fold effect), whereas depending on the area a 2-3fold increase in occupancy can be observed from 2hpi to 4hpi (Fig6D). An effect that one would expect with chromatin structure preceding gene activation. Furthermore, the timing of nucleosome assembly perfectly matches the increase of MNase accessibility at 1 hpi, supporting our conclusions.

      o The validation of the E1A probe specificity in Fig 2B looks convincing, but there are no data presented for multiple cells to reassure that this image is representative. The equivalent figure for 2D for the Ad5-GFP control would address this.

      We include a large field overview with multiple cells for virus and vector control as new supplemental figure S2B showing that the RNAscope detection of the E1A transcript is highly specific.

      o Figure 2E is presented as a colocalization analysis but appears to be a ratio of mRNA foci to pVII foci per cell. If this is an incorrect interpretation then some clarification in the figure legend would be helpful. If this interpretation of these data is correct, then it is not truly a colocalization analysis, as a single genome may give rise to multiple transcripts and so a ratio We apologize that this figure was not clear. The data are based on real colocalizations and represent the number of pVII dots positive for E1A normalized with the total number of nuclear pVII. We have clarified the figure legend accordingly.

      o The live cell imaging experiments are elegant and convincing, but the agreement in Fig 3D of the % colocalization in MS2-BP data with the RNA-scope data is potentially misleading for the reasons outlined in the prior comment. Is the data in Fig 2E the same as the data in the right hand panel of Fig 3D. If so please comment on the n discrepancy (n=30 in 2E vs n=22 in 3D). The observation that 20% of genomes are transcriptionally active, via bursting or otherwise, is interesting, and would be consistent with the Suomalainen et al reference. The authors discuss two hypotheses to explain these findings: transcriptional bursting or a subset ~20% of genomes being transcriptionally active. This is an interesting and begs the question as to why this may occur. Assuming all imported genomes are intact (previous comment), it appears from the presented images that the foci at the radial periphery of the nucleus may be more frequently transcriptionally active, despite the nuclear periphery being enriched for heterochromatin. The authors might consider analysing the radial position of their TAF1B-mCherry genomes (active and inactive) as this might support position effect variegation rather than bursting as an explanation and they appear to already have the data to perform such analyses.

      o In the presented images (Fig 3A and Fig S3) it appears a higher proportion of genomes than 20% appear to be transcriptionally active, particularly in the low MOI experiment. The authors may wish to comment on this and quantify whether the proportion of transcribing genomes was affected by the input MOI.

      This and the previous comment concerning the influence of MOI, transcriptional bursting and the positioning effect of the genome on the transcriptional activity have also been in part raised above. As stated in our response to reviewer 1 we have used a high MOI in our experiments to have equivalence between all experimental approaches. We agree with the reviewers that all aspects (dose, bursting and positioning) merit a detailed investigation, which we plan in future studies. To be consistent and comparable in our comprehensive approach we decided to not include such studies here as they would address a different question. Nevertheless, to address this (and the above) comments we now mention positioning effects in the results (line 214) and enlarged the discussion (line 587 ff) where we especially raised awareness that such pertinent questions can be addressed with the tools presented in our study.

      We also decided to visually separate the comparison of MS2 and RNAscope data to avoid misleading the reader. Furthermore, the RNAscope data have been replaced. The RNAscope data are indeed from Fig. 2. The difference in n was due to our mistake showing two different normalized data sets. Data were either normalized using total amount of nuclear protein VII (Fig. 2E) or the total amount of nuclear E1A signals (Fig 3D), which due to the more heterogenous signal did not include all cells. In the updated version both figures display data normalized by total amount of nuclear protein VII

      o Fig 4C suggests that there is a large GC preference (or bias) in the pVII occupied regions. The authors may wish to comment on this and present a track with Adenovirus GC composition in Fig 4D.

      We thank the reviewer for raising this point. As suggested by the reviewer we analysed the GC content under pVII peaks and in the linker DNA. Indeed, pVII occupied regions have a significant higher GC content indicating that pVII preferentially positions at GC rich regions. We included this analysis as an additional Figure 4E (line 302 f).

      Legend:

      Boxplot showing GC content of pVII occupied (pVII) or free (linker) regions. Two biological replicates are shown side by side and the p-value of a students t-test of the corresponding pairs is indicated above.

      o Figure 6 presents convincing data showing H3.3. nucleosome positioning and acetylation at E1A and the data is nicely presented showing these changes occur early being observable by 2 and 4 hpi. Again, these changes are not convincingly prior to early gene activation but are certainly occurring early, and may occur prior to early gene activation at the level of individual foci, however, this is not demonstrated definitively.

      This question belongs to the same context addressed by the reviewer above. Please refer to the answer given above.

      Minor comments:

      Introduction: o Paragraph 1 - Introduction for DNA viruses in general, but the authors appear to be talking about Adenoviruses specifically, "little is known about the structural organization of the genome" and "nuclear viral genomes could undergo different parallel fates", arguably these statements are not accurate for other DNA viruses (e.g. Epstein Barr Virus) suggest amending the wording for clarity.

      The manuscript text was updated as suggested.

      o paragraph 2 - Why do the authors say that Adenoviruses are prototypic DNA viruses?

      We removed the term prototypic.

      o Paragraph 3 - A recent study is referenced but multiple references are given.

      The references were updated

      o "Protein VII stays associated with the viral genome imported to the nucleus, while pV dissociates from the viral DNA following ubiquitylation (Puntener et al., 2011). The fate of the μ-peptide is not known". - The reference suggests that pV dissociates on entry to the cytoplasm and during capsid disassembly at the nuclear pore. I find this sentence confusing as it doesn't make it clear that pV is lost before nuclear entry which is important for interpreting the data.

      We clarified this in the manuscript text

      Results:

      o Figure 5 is almost unreadable due to low resolution.

      We updated the Figures and checked the resolution after PDF conversion.

      o Reference to Fig 4C in text comes after Fig4D.

      The order of Figure panels was changed accordingly.

      Reviewer #3 (Significance (Required)):

      The manuscript is well written, uncovers important viral chromatin biology by combining multiple experimental techniques, and the data is generally very clearly presented

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      *Summary: This paper illustrates the role of GDF15 in ganglionic eminence featuring the influence on neurogenesis and progenitor proliferation. Using the conventional knockout GDF-15 mouse lines, the authors provide a series of counting data indicating that GDF15 controls neurogenesis in the late embryonic or adult neural stem cells in the ventral forebrain. *

      We would like to thank the reviewer to reading our manuscript and providing comments. As the reviewer highlights, we have used a conventional GDF15 knock-in knock-out model since the growth factor is expressed not only at tissue levels but also at a systemic level. Therefore, targeted ablation of GDF15 would be complicated and the results difficult to interpret. Moreover, GDF15 and its receptor GFRAL in normal conditions are expressed at very low levels in adult mice and mutant mice do not display any obvious developmental phenotype. However, GDF15 is characteristically expressed during development and our previous data have shown that its expression is particularly increased at later developmental ages and particularly in neural precursors, which are both the focus of our analysis. Although these previous analyses have long highlighted that GDF15 is particularly expressed in the V-SVZ and in the choroid plexus, its physiological role in this system remains to the best of our knowledge unknown. It is important to investigate this issue because, as we mention below, GDF15 expression is increased, including in NSCs, upon brain injury and aging. As the reviewer rightly mentions, using this conventional approach and straightforward quantitative analyses, instruments available and normally used for the investigation of biological phenomena, we have discovered that GDF15 directly affects the number of ependymal cells and neural stem cells thereby providing a first function for the expression of the growth factor in this region.

      Major comments:

      The entire story in this manuscript seems similar to the previous findings where the authors demonstrated the role of GDF15 in the hippocampal neural stem cells in relation to EGF and CXCR4.

      As the reviewer rightly mentions we have in a previous paper investigated the effect of GDF15 on the stem cells of the hippocampus. However, we would like respectfully to disagree with the reviewer that our current manuscript describes similar findings. Whereas we have previously shown that the effect of GDF15 in hippocampal stem cells and neurogenesis is a transient inhibition of proliferation due to reduced EGFR expression, we here found that absence of GDF15 leads instead to increased proliferation and a permanent increase of stem cell number, besides of ependymal cells. As the reviewer rightly mentions, on the basis of our previous observations, also in this study we have analyzed EGFR expression and signalling. However, although our data clearly show that EGFR expression and more importantly signalling are altered also in this niche, we could show that the effect of GDF15 is more complex than altering EGFR signalling. Since we show for the first time in this study, that besides GDF15, neural progenitors also express GFRAL, our data point at a selective effect of GDF15 in the development of neural stem cell in the GE and in the deriving adult niche, which include change in EGFR signalling.

      *The data and the conclusion presented here sound reasonable to me. The current manuscript, however, gave me the impression that the story is less impactful and rather descriptive. To improve the quality of the current draft, the authors may wish to clearly highlight the novelty of the findings, not simply apply the previous strategy to the other anatomical brain regions. Alternatively, the draft could emphasize the similarity of utilising the same biological strategies to control the number of adult stem cells in the distinct stem cell niche. *

      We would like to thank again the reviewer for the positive consideration of our quantitative analyses, although we cannot agree with the referee’s conclusion about the impact of our current study. In particular, we would like to focus here not only on the specific findings of the two studies that, as we mentioned above, reach different conclusions, but also on the general meaning of our new study in the context of understanding the role of GDF15. Besides during development and aging, GDF15 expression is promptly upregulated in several pathologies including, cancer, injury and neurodegeneration. Indeed, a growing body of evidence has highlighted the possible role of GDF15 as stress hormone and mitokine, which could be part of a conserved mechanism of the body to signal and respond to stress. Within this context, it would be very important to understand the effect of GDF15 on stem cells, as it may prompt not only understanding of the physiological role of GDF15 but also of the mechanism underlying the response and contribution of stem cell to stress and injury. Supporting this view, it was recently shown that GDF15 is upregulated in quiescent neural stem cells following brain injury (Llorens-Bobadilla et al., 2015). However, the main problem in investigating the physiological role of GDF15 is that the expression of its recently discovered receptor is very limited in the brain. Therefore, our data here are important not only because of the novel effect that they describe for GDF15 in NSC development, but also because they show that GDF15 directly affects stem cell behavior. We have now followed the suggestion of the reviewer and re-edited several parts of the manuscript including editorial changes to highlight the relevance of our findings and the figures to better illustrate them. In particular, we have added also additional analyses to support our conclusions. These include data illustrated in Fig.1B, C; Fig. 4B, G and Fig. 5G.

      *I also have two concerns which may help to improve the current draft. Firstly, I would suggest considering the data presentation as many of the counting data are not accompanied by representative images or a detailed description of the methods, which could impede the credibility of the data. For instance, it is not clear to me how the authors judged the apical vs subapical progenitor counting as neither the pictures nor the methods clearly specified how these are distinguished. Same to the Figures 3 and 8. *

      We thank the reviewer for these suggestions, which we have now implemented in our revised manuscript. These changes include addition of representative images to Figs. 1, 2, 3, 4, 5, 6, as well as an illustration of how apical vs subapical cells were counted in Fig. 2A, an illustration showing how ependymal and single-ciliated cells were determined in Fig. 8A, and more detailed descriptions in the materials and methods section.

      *I would also suggest the authors may wish to carefully review the text as many abbreviations are not properly stated (for example, what are P+ progenitors?), or not properly explained why the particular gene expression is analysed (EGF, Sox2, CXCR4). This is also applied to the anatomical jargon (like apical, subapical etc), which can be specified in the figure by introducing the cartoons or point-on images. *

      We thank the reviewer for pointing these shortcomings out. We have now done a careful proofreading of the text and implemented changes in the relative figures.

      The changes include: Explanation of Prominin-1-expressing (P+) progenitors (p. 10, line 35) and apical and subapical progenitors (p.3, line 11-23), as well as more detailed reasoning for the analysis of gene expression for EGFR, Sox2 and CXCR4 (p. 9, line 5-7; p.12, line 44 and following)

      *Last not least, I should point out that the author uses less commonly used terminology. To my knowledge, the SVZ progenitors in the GE are now called basal progenitors (Bandler et al 2017 for example) and the word intermediate progenitors is used for the Tbr2+ IP cells in the developing cortex. MASH-1 is an old gene name as it is revised as Ascl1 (please refer to any recent papers and web databases such as Mouse Genome Informatics, and NCBI, for instance). The use of prevailed wording will help the readers to understand the presented story. *

      We have now revised the terminology according to the reviewer’s suggestion.

      Minor comments:

      Abstract Line 12-15 is confusing: What does "genotype" means?

      We used the term to refer to the genetic differences between WT and GDF15 mutant mice with respect to GDF15 expression. We apologize for the lack of clarity. We have now modified the paragraph in an effort to improve its clarity.

      Introduction Despite the focus of this paper being on the proliferation in GE, the introduction mixed up the references describing the dorsal telencephalon. It's better to cite the ventral GE as some progenitor behaviours are different from the ones in the dorsal. Maybe it's better to dedicate more to describing the lineage trajectory in the ventral GE and the molecular players (such as EGF), which makes it harder to understand the rationales of the several experiments.

      We would like to thank the reviewer for making this helpful comment. In the introduction we have tried to make two points: firstly, to clarify how the different progenitor types in the VZ can be distinguished based on the localization of their site of mitosis and secondly the importance of studying GDF15 in the context of NSCs in the subependymal zone of the lateral ventricle. For the first point we have several studies referring to dividing dynamics of radial glia in the developing cortex. This reflect the fact that many papers have studied both apical and subapical radial glia within the context of the developing cortex, unlike subapical progenitors, which were first discovered in the developing ganglionic eminence. A similar problem applies to the analysis of EGFR expression in the context of VZ progenitors, although we agree with the reviewer that it should introduced for a better understanding of our analyses. Therefore, we have now introduced several changes in our introduction to eliminate the shortcomings and to offset the imbalance in terms of citations.

      Results

      Fig1: V/SVZ -> VZ? I think V means ventricles while VZ is for the ventricular zone. Single-channel images should be presented to demonstrate the positive or negative cells for each antigen. Only a subset of progenitors in the adult SVZ is GDF15 positive although this is not described in the text.

      We have now replaced V/SVZ with V-SVZ, meaning ventricular-subventricular zone, throughout the manuscript and added single channel images to all figures where it is relevant.

      *Why the GDP15 staining was performed only in the adult sections, but not in E18 while the GFRAL is shown in both stages? The text claims "GDF15 is particularly expressed in the germinal region of the GE" but I did not find the data shown in this draft. *

      The fact that GDF15 is expressed in the choroid plexus and in the subependymal region of the lateral ventricle was first observed in the neonatal rat brain and prompted us to investigate the hypothesis that GDF15 may affect NSCs. Moreover, in our previous manuscript we have confirmed that GDF15 is expressed in neural progenitors of the embryonic murine GE (Carrillo-Garcia et al., 2014). In this new manuscript, we have complemented these data by adding the missing information concerning the expression of the protein in the adult V-SVZ. Notably, we also investigate for the first time the expression of the receptor in this area. This is key issue in the field, since the expression of GFRAL has been reported only in few regions of the brain, which is in apparent contrast with the growing list of effects in which GDF15 has been involved. For completeness of information and to further strengthen our conclusions we have now added new set of images in figure 1B, C showing co-expression of GFRAL and EGFR.

      *Line 24-25: I did not understand this statement. *

      The sentence refers to the results published in our previous paper, as mentioned in our reply above, which illustrate expression of GDF15 in the GE at different ages of development and in the adult V-SVZ. In an effort to improve its clarity, we have now modified the sentence into: “Consistent with these observations, we have previously reported that in the GE, Gdf15 transcripts increase at late developmental age and remain high in the adult V-SVZ.”

      Fig. 2 Line 33: what are apical P+ progenitors?

      We apologize for this shortcoming. P+ is the abbreviation for Prominin-1 immunopositive progenitors. This information has been now added to the text.

      *Fig. 2A: The total analysed cells are not described in M&M. *

      We have now added this information in the relevant section of the manuscript (p. 7, lines 36-43).

      *Fig 2 C and D. While the counting of apical or subapical progenitors has been done respectively, the representative images of which regions are judged as apical or subapical are not shown. This comment also applies to Line 41: I did not get the logic of how this analysis will be able to distinguish apical or subapical cell division. *

      Mitotic apical and subapical progenitors have been detected on the basis of the position of their nuclei. Namely, mitosis was considered apical if the nucleus of the dividing cell was within two nuclei distance (~ 10 µm) of the apical surface, and considered subapical if the nuclei of the dividing cells was at a greater distance from the apical surface. Besides adding this information to the manuscript, see “Image analysis” in the “Materials and Methods” section, we have now illustrated our approach in the new Fig. 2B.

      *Fig. 2 E and F: I am not sure why the proliferation was assessed in vitro whole mount cultures. IP injection in vivo animals would be more convincing. *

      We have used the same whole mount preparation to determine changes in proliferation upon acute fixation of the tissue. We have then determined the effect of growth factors and pharmacological modulators in whole-mount explants preparation as this would allow us to test their effect in standardized conditions. For the sake of consistency, we have then used the same whole mount explant setting to investigate proliferation by means of IdU incorporation. We selected this mean of analysis because changes in proliferation were already detected upon tissue fixation, and direct exposure of the tissue to the pharmacological modulators allowed us to investigate the direct effect of the drugs on proliferation behavior. Using this setting, we have obtained data that are compatible and consistent with our analysis on acutely fixed preparations. We agree with the reviewer that these experiments could be also repeated by injecting the IdU in vivo, however this would be against the current animal 3R guidelines that prompt to minimize the use of animal in vivo experiments and only when they cannot be replaced by alternative approaches in vitro or ex vivo.

      *Fig. 3 I am not sure the mitotic spindle orientation analysis is very informative to stand out as one independent figure. In some contexts (Noctor et al 2008), it does not correlate to the asymmetric or symmetric division modes. *

      We would like to like to respectfully disagree on this issue. The reason why we think this data set is important is twofold. Firstly, previous papers pointing at changes in the number of NSCs in the GE, have established that this was caused by a change in the spindle orientation leading to the generation of extra SNP (Falk et al., 2017), indicating a role for the orientation of the mitotic spindle in this context. Since we observed that GDF15 promotes not only progenitor proliferation but also apical divisions, it is important to show that this effect does not reflect a change in spindle orientation. Secondly, these data set highlights an age-dependent effect on the orientation of the mitotic spindle that is fully consistent with previously published data supporting the solidity of our findings. However, since we did not see any significant differences between the WT and Gdf15-/- animals, we have decided to move this data to a supplementary figure (new supplementary figure S3).

      *Fig. 4 I am not convinced by this data since how the fluorescent intensity is measured is not described. If the internal controls to adjust the staining variation among samples are not used, the data is not convincing to me. The representative pictures are not convincing either to claim the substantial differences. Perhaps immunoblotting is better to be employed to quantify the protein expression difference. *

      We have now added additional pictures with higher magnification to show the difference in EGFR intensity, including a calibration bar (Fig. 4). Quantitative analysis showed a trend decrease at E18 which is strongly significant in the adult V-SVZ. We now also show analysis of phEGFR and modified extensively the relative result section (see also our reply to the comment on Evidence, reproducibility and clarity of reviewer 2). Furthermore, we have added the following paragraph to the methods section:

      “For fluorescence intensity measurements of EGFR, slices stained at the same time with the same antibody solutions, and imaged on the same day with constant confocal microscope settings (laser intensity, gain, pixel dwell time), were measured using Fiji/ImageJ. Raw immunofluorescence intensity was normalized by subtracting background fluorescence levels, i.e. fluorescence in cells considered negative for EGFR. To rule out any unspecific secondary antibody binding, fluorescence was compared to slices incubated with secondary, but not primary antibodies (2nd only control); no difference was found between 2nd only control and cells considered negative in EGFR-labelled samples, or between 2nd only controls of different genotypes.”

      *Fig. 5 This is very busy figure composed of mainly counting graphs of different experiments. I think at least it is better to separate the data in vivo or culture. *

      We have now rearranged the figure according to the reviewer’s suggestion. We have moved, also according to Reviewer 2’s suggestions, some less relevant data to supplementary figure S4 (that is, previous panels G-J) and added confocal images to illustrate the results of previous panels C-E. We hope that this improved the focus and clarity of this figure.

      *Fig. 6. Pictures of Day 2 and Day 7 should be presented to highlight the difference between them. *

      We have now added pictures showing the cell culture at DIV2 and DIV7, as well as the different treatments, in Fig. 6A.

      *Fig. 7 Mash-1 should be rephrased as Ascl1. *

      We have now changed the name of the gene throughout the manuscript.

      Fig. 8 A: I am not sure why these pictures are B&W even though the two antigens are stained. The main text needs more description since no explanation of FOP, b-catenin etc. The picture of GFD15 KO looks having massive numbers of FOP+ cells, which is not correlated to the counting, I guess?

      We apologize for the lack of clarity. We have now added additional panels (Fig. 8A) to illustrate the rationale behind the analysis and to demonstrate how ependymal and single-ciliated cells were counted. We have added the following sections to the manuscript text:

      Materials and methods:

      “Both the β-catenin and fibroblast growth factor receptor 1 oncogene partner (FOP) primary antibodies are mouse monoclonal antibodies of the same immunoglobulin class. Therefore, for this double immunostaining both antigens were revealed using the same secondary antibody and each was distinguished based on the localization and morphology of the labelling, which is lining the cell boundaries or at the basal body of the cilia for β-catenin and FOP, respectively.”

      Results:

      “We here used β-catenin to label cell-cell contacts, thereby visualizing cell boundaries, and fibroblast growth factor receptor 1 oncogene partner (FOP), a centrosomal protein, to visualize the basal body of the cilia. As both β-catenin and FOP-antibodies where derived from the same host species, the antigens were labelled in a single fluorescent channel and differentiated based on label localisation and intensity (Fig. 8A). Cells with a single centrosome or centrosome pair (one to two FOP+ dots) were counted as single-ciliated (SC), whereas cells with more than two centrosomes, i.e. multiciliated cells, were counted as ependymal (Epen; Fig. 8A).”

      We have also added the following paragraph to the figure legend:

      “(A) Schematic showing counting of ependymal (Epen) and single-ciliated (SC) cells using FOP and β-catenin as markers. (a’) Closeup of WT image in (B), showing β-catenin, indicating cell-cell-contacts, and FOP, indicating ciliary basal bodies/centrosomes, in a single channel. Scale bar = 10 µm. (a’’) β-catenin and FOP labels are distinguished by location, morphology and label intensity, with FOP being single dots that are more intense than β-catenin and located within the cell boundaries. (a’’’) Cells containing one or two centrosomes were considered SC cells (red), while cells with more than two centrosomes were considered multiciliated and therefore Epen (blue).”

      We would also like to point out to the reviewer that since FOP was used as a label for the ciliary base, the “massive numbers of FOP+ cells” (i.e., multiciliated cells) were indeed quantified in Fig. 8B (now 8C) as ependymal cells (Epen).

      ** Referees cross-commenting**

      I agree with Reviewer #2's comment that despite the amount of data presented, they are not presented in a coherent manner. I would suggest revising carefully before submitting to any journals. As detailed above the manuscript has been revised to improve clarity and coherence according the reviewer’s suggestions.

      Reviewer #1 (Significance (Required)):

      *The presented finding of the role of GDF15 in the ventral progenitors are evident and a new finding has not been reported. Since the same effects and signalling pathways involved in adult hippocampus neurogenesis are previously published by the same authors, the impact of the current manuscript is limited. I think heightening the role of GDF15 in the biological context of ventral progenitors, or alternatively, making a comparison to the previous finding would greatly improve the quality of the draft. In my opinion, this work would be appealing to the community of neural stem cells but maybe not to the broad audience. My expertise is neurodevelopmental biology focusing on neuronal lineages and neurogenesis. *

      We have already clarified that the effect that we report here of GDF15 on NSCs is not only novel, but is also very different from what we have previously observed in the hippocampus (see also our reply above to the comment on evidence, reproducibility and clarity of reviewer 1). Although many environmental signals and growth factors have been implicated in the regulation of NSC proliferation and self-renewal, GDF15 is, to our knowledge, one of the few factors directly regulating the number of apical NSCs. Following the suggestion of the reviewer, we have now revised our manuscript in order to highlight the difference between the two studies. Besides being important within the field of NSCs, we believe that our data are also important for understanding the physiological role of GDF15, whose expression is increased during development and in the response to a growing list of stressors. For such an understanding, it is essential to identify target cell populations which can directly respond to the growth factor. Our finding that the GDF15 receptor is expressed in NSCs provide first evidence that GDF15 can directly modulate stem cell development, providing a first function for the increase in its expression. Moreover, our observation that GFRAL continues to be expressed in adult NSCs opens up to the possibility that the increase in the expression of the growth factor in the stress response is to recruit/modulate stem cell behavior. Consistent with this scenario, it was recently observed that brain injury promoted and increase of GDF15 expression in NSCs (Llorens-Bobadilla et al., 2015).

      • Reviewer #2 (Evidence, reproducibility and clarity (Required)): *

      * Here the authors explore the role of GDF15 during development of the adult neural stem cell niche at the lateral wall of the lateral ventricle using GDF15 knock-out mice. They find increased progenitor proliferation at neonatal stages and at 8weeks, compensated by neuronal death. Further they report that EGFR+ cells are arranged differently in the GDF15 mutants (in clusters rather than columns) with also lower levels of EGFR. This is surprising to me, as the authors observe an increase in proliferation. They then report that addition of EGF leads to an increase in prominin+ progenitors in the GDF15KO, but not the WT, but there is lower levels of EGFR in the KOs. They then block CXCR4, which is allegedly required for GDF15 to modulate EGFR expression, and they find that this blocking reduces proliferation mostly in WT cells. As can be seen from this summary, to me, the model of how GDF15 loss is supposed to increase proliferation is not clear. Even less so, when in adult SVZ, EGFR+ progenitors were increased, while EGFR was reduced at postnatal stages. Beyond this, the authors show convincingly that ependymal cells are increased at adult stages, while I see no data supporting their claim of NSCs to be increased (at least not reaching levels of significance).*

      • Taken together, this manuscript contains a lot of data, but to me no coherent picture emerges. If the picture is that the higher proliferation rate of apical progenitors at E18 generates more ependymal cells, then this should be shown (by including analysis e.g. at P5 when ependymal cells emerge). How GDF15 would affect proliferation in general is also not clear to me - maybe an unbiased analysis by RNA-seq could help to separate the main effects from diving into known candidates that seem not to explain the main aspects.*

      The reviewer mentions multiple aspects of our study that we would like to clarify. Therefore, we apologize for our lengthy reply.

      Firstly, the apparent contradiction between the increase in progenitor proliferation despite the concomitant decrease in EGFR levels. It is long known activation of EGFR regulates multiple aspects of progenitor behavior including proliferation, migration and differentiation. It is also known that responsiveness of NSCs to EGF increases during development, a process that is paralleled by an increase in expression of EGFR expression increase with developmental age, peaking around the perinatal age. However, since NSCs start to slow their cell cycle and enter quiescence exactly at the same time in which the increase in responsiveness to EGF and in EGFR expression in NSCs during this same period, it is clear that there is not a linear correlation between NSC proliferation and EGFR expression. A possible explanation for this apparently counterintuitive observation is that the increase in EGFR expression is paralleled by an increase in the expression of Lrig1, a developmental negative regulator of EGFR (Jeong et al., 2020). Moreover, in our study we report that lack of GDF15 leads to a decrease in the expression of EGFR protein and not in the levels of EGFR mRNA. In light of the well-known feedback mechanism by which in the presence of high EGFR activation the receptor transits to late endosome for degradation, a decrease in the protein levels could actually represent a higher level of activation EGFR signalling in the mutant progenitors than in the wild type counterpart. This is consistent with the characteristics of punctuate EGFR immunostaining we see in the mutant tissue and our analysis of EGFR activation. Our data show that despite difference in activation kinetics, wild type and mutant progenitors similarly respond to exogenous stimulation with EGF. Moreover, there is no difference between the two genotypes in the expression of EGFR in mitotic cells, and blockade of EGFR more dramatically affects the proliferation of mutant than WT progenitors. Finally, exposure to exogenous EGF promotes the proliferation of WT but not mutant progenitors. Taken together, these observations suggest that endogenous activation of EGFR driving proliferation is higher in mutant than WT progenitors. Consistent with this hypothesis, our new data illustrated in Fig. 5A-G of the revised manuscript, show that EGFR is similarly phosphorylated in mutant and in WT progenitors and that levels of Phospho-EGFR are observed in regions with low levels of EGFR expression, especially in the postnatal V-SVZ.

      With respect to the effect of CXCR4, we have investigated its effect with respect to the ability of GDF15 to promote EGFR expression at the cell surface and secondly with respect to affect proliferation in vivo and in vitro. Both experiments reveal a permissive role of the receptor, whose activity is necessary for GDF15 to promote EGFR expression at the cell surface and for cell to undergo proliferation. While these observations confirm in part our previously published data, the molecular mechanisms underlying these effects remain unclear. However, since AMD on its own does not affect EGFR expression in either WT or mutant progenitors, the two effects are not related. Despite the absence of a clear mechanism by which CXCR4 affects proliferation, our data indicate that the permissive effect of CXCR4 is more important for the proliferation of TAPs rather than for NSCs. Therefore, the different effect of CXCR4 inhibition between WT and mutant progenitors likely reflect the fact that the latter are enriched in NSCs.

      Finally, the evidence that NSCs are significantly increased in the mutant V-SVZ is reported in Fig. 8. In this figure, it is clearly reported that compared to the WT counterpart at early postnatal stages, only the multiciliated ependymal cells are significantly increased in the mutant niche, whereas uniciliated progenitors display only a trend increase (Panel C). However, the total number of apical cells is increased in the mutant V-SVZ, indicating that the number of both cell types are likely increased. Consistent with this hypothesis, in panel G of the same figure we show that in the adult V-SVZ, when the ependymal cells are fully differentiated, also apical GFAP+ NSCs are significantly increased. Notably, in this figure we show that GFAP+ NSCs also display a primary cilium, an elongated morphology and lack multiple cilia, and therefore are not atypical ependymal cells. Finally, in supplementary table S6, we show no difference in terms of % of clone forming cells between dissociated cell preparations of the WT and mutant V-SVZ. These observations and our finding of increased Ki67 apical expression in the adult V-SVZ, illustrated in supplementary figure S2B, clearly show that apical NSCs are increased.

      We have now introduced multiple modification in text and figures to clarify the mechanisms underlying the effect of GDF15 ablation on EGFR expression and activation and the differential effect of CXCR4 on WT and mutant progenitors. The new data set illustrating phosphoEGFR are illustrated in figure 4B, G. We have also modified figure 8 in an effort to illustrate more clearly the effect of lack of GDF15 on NSC number.

      *Major comments: *

      *1) Inconsistencies start already in Figure 1: The authors show expression of the receptor at neonatal stages (much higher) and adult stages, but GDF15 is shown only in adult stages and the citations of their previous work suggests that indeed it may not be present at this early stage in the GE VZ (p.8, line 10). If it is, please show. If it isn't, could it be that it is in the CSF and signals only to apical cells? *

      As the reviewer rightly mentions, GDF15 is present in the CSF and signals mainly to apical cells, as it is known to be secreted by the choroid plexus (Böttner et al., 1999; Schober et al., 2001). However, we would like to respectfully disagree with the reviewer. In our previous work, we clearly showed that GDF15 expression increases at E16 and is highest at E18 in the GE, which is the reason we specifically chose this timepoint for analysis (compare Carrillo-Garcia et al. (2014), Fig. 1A). In this manuscript we have also shown that EGFR expressing progenitors in the GE express GDF15. As the expression of GDF15 at embryonic and neonatal ages has already been investigated by us and others for two decades (see also Schober et al. (2001)), we refrained from showing expression of GDF15 at these ages again. However, we have now modified the relevant result section to clearly highlight the existence of this previous findings.

      *2) An overview of the KO phenotype by lower power pictures would be helpful. For example an overview over the GE and PH3 immunostaining WT and KO at comparable section levels. *

      Our analysis is based on whole-mount preparation of the whole GE. To standardize our analysis the same number of pictures were taken at similar locations to obtain a quantification representative of the apical surface of the whole GE. Therefore, the areas of interest were not selected on the basis of the number of mitotic cells and the differences observed do not reflect a positional effect. Lower power pictures illustrating the whole GE, are unlikely to be helpful, because they would not show the nuclear immunostaining. However, we have now modified the relevant Material and Methods section as follows to describe the standardization of our quantitative analysis: “Whole mounts were imaged using a Leica TCS SP8 confocal microscope with a 40x or 63x oil immersion objective and LASX software (Leica). For the quantification, an average of three different regions of interest were chosen at fixed rostral, dorsal and ventral position of the GE or V-SVZ and averaged for the collection of a single data set.”

      * 3) Figure 2B- where is the apical surface, where are we in the GE? Where was quantification done? *

      We have now added images detailing the localization of apical and subapical cells in new Fig. 2A, as well as further clarifications of the imaging and quantification in the materials and methods section.

      * 4) Clarify the part with EGF signaling and/or take a more comprehensive view by a proteomic or transcriptomic approach, as EGFR and CXCR4 which were already investigated previously, may not explain the phenotype. *

      We agree with the reviewer that our data should prompt a more comprehensive approach. However, this is surely a work worthy of a separate manuscript, since we agree with the reviewer that changes in EGFR and CXCR4 do not fully explain the effect of GDF15 on proliferation. We have now clarified our conclusions about EGF signalling, modifying the relevant part in the result section. We have also modified the abstract as follows:

      “From a mechanistic point of view, we show that active EGFR is essential to maintain proliferation in the developing GE and that GDF15 affects EGFR trafficking and signal transduction. Consistent with a direct involvement of GDF15, exposure of the GE to the growth factor normalized proliferation and EGFR expression and it decreased the number of apical progenitors. A similar decrease in the number of apical progenitors was also observed upon exposure to exogenous EGF. However, this effect was not associated with reduced proliferation, illustrating the complexity of the effect of GDF15.”

      *5) Do the authors actually think that the effects on EGFR are in the cells expressing the GDF15 receptor? Then please show co-localization. *

      As both EGFR and GFRAL are widely expressed in the embryonic GE (see Figs 1B and 4B), making overlap inevitable, we did previously not assume the need to show co-localization. We have now added images showing co-localization of EGFR and GFRAL in E18 and adult brain sections in Fig. 1B.

      *6) Figure 5D shows virtually no apical mitosis in WT, but indeed there are apical mitosis in WT E18 GE as one can also see in panel 5A. *

      We apologize for the confusion. In the manuscript, we use Ki67 and analysis of nuclear morphology to determine the number of cells undergoing cell division, i.e. in meta-, ana- or telophase and immunostaining with antibody with phH3+, which stains additionally cells also at late G2 and early mitotic stages. Consistent with this, the number of mitotic cells scored with Ki67 and quantified in Fig. 5D is smaller than the number of phH3+ cells that is illustrated in Fig. 5A. Throughout the manuscript, cells labeled by phH3 immunoreactivity are named “phH3+ cells”, as quantified in Fig. 5B, whereas with “dividing cells” we refer to cells with Ki67 labeling that show nuclear morphology of meta-, ana- or telophase. We have, also according to the suggestions of reviewer 1, added images of the whole mounts analyzed for Fig. 5D, as well as the following text in the materials and methods section: “Cells were considered dividing if the nuclei were labelled with Ki67 and the nuclear morphology showed signs of division, i.e. meta-, ana- or telophase, in DAPI and Ki67-channels. For the sake of clarity, “dividing cells” only refers to this way of detection, while cells positive for phH3 are termed “phH3+ cells”, as phH3 also labels cells in interphase and prophase, as well as late G2 phase.”

      *7) For the effect on ependymal cell generation it could be good to include an intermediate age, such as P5-7, when ependymal cells differentiate, staining e.g. for Lynkeas or Mcidas, known fate determinants regulating ependymal cell differentiation at that time. *

      Most of our research was performed in either E18 or adult animals, where ependymal cells are either not yet present or already fully differentiated. Since ependymal cell differentiation starts at birth, we used P2 animals to look at ependymal cell differentiation. As shown in Fig. 8B, C this age is appropriate to study early ependymal differentiation, as a lot of multiciliated ependymal cells are already present at this age, and the difference between WT and Gdf15-/- animals is clearly visible and significant. While another age or additional markers might be interesting, we argue that it would not add to the conclusion or significance of this paper, as we can see this phenotype already at age P2 and it can still be detected it in adult animals.

      *Minor comments: *

      *-) p. 8, subapical progenitors are mentioned in line 42 without explaining how they are defined. *

      We have now added more detailed definitions of apical and subapical progenitors to the introduction.

      *-) p.8, line 44: the word increased in mentioned 2x *

      We have removed the additional word.

      *-) In the description of Figure 8 C and D seem to have been mixed up. *

      We have changed the description of Fig. 8.

      * ** Referees cross-commenting***

      *I also fully agree with the point that this manuscript is very difficult to read. I think that anyhow the results have to be reorganized to focus on the most important data, so rewriting will have to be done for clarity either way. *

      We apologize for the lack of clarity we have now extensively modified and re-edited the manuscript in an effort to improve its clarity.

      * Reviewer #2 (Significance (Required)): *

      * Exploring signmalling factors important for the stem cell niche is important, and the GDF15 indeed seems to have an effect there. The problem is, that much has been done with this factor already, but of course a mechanistic understanding of whats going on is important and could be the strength of this manuscript. However, it is really not clear, which mechanisms causes what. What is clear, is that the increased proliferation of neuronal progenitors is counterbalanced by death. Its also clear that ependymal cells are increased, which is an interesting effect. But how and why is not clear and may be the best to focus in this paper. *

      As the reviewer mentions, several publications focus on GDF15. However, there is only one publication investigating the effect of GDF15 on neural stem cells and this focuses on the hippocampus. Therefore, we would like to respectively disagree with the conclusion of the reviewer “that much has been done with this factor”. Moreover, a serious problem with previous studies investigating GDF15 is the fact that its receptor is scarcely expressed and therefore it is not clear if these studies investigate direct or indirect effects of the growth factor. Since we here for the first time show that neural stem cells in the GE and V-SVZ express GDF15-receptor GFRAL, our study for the first time show a direct involvement of GDF15 on proliferation, number of ependymal cells and, as detailed in our reply above, apical NSCs. This knowledge is not only relevant to the field of normal and cancer stem cells, but also within the context of the role of GDF15 as mitokine and as stress hormone (see also our reply to major comments 2 of reviewer 1). Therefore, although we agree with the reviewer that the molecular mechanisms underlying the effect of GDG15 need further investigation, our data are novel and of relevance to the general scientific community.

      References:

      Böttner, M., Suter-Crazzolara, C., Schober, A., Unsicker, K., 1999. Expression of a novel member of the TGF-beta superfamily, growth/differentiation factor-15/macrophage-inhibiting cytokine-1 (GDF-15/MIC-1) in adult rat tissues. Cell Tissue Res 297, 103-110.

      Carrillo-Garcia, C., Prochnow, S., Simeonova, I.K., Strelau, J., Hölzl-Wenig, G., Mandl, C., Unsicker, K., von Bohlen Und Halbach, O., Ciccolini, F., 2014. Growth/differentiation factor 15 promotes EGFR signalling, and regulates proliferation and migration in the hippocampus of neonatal and young adult mice. Development 141, 773-783.

      Falk, S., Bugeon, S., Ninkovic, J., Pilz, G.A., Postiglione, M.P., Cremer, H., Knoblich, J.A., Gotz, M., 2017. Time-Specific Effects of Spindle Positioning on Embryonic Progenitor Pool Composition and Adult Neural Stem Cell Seeding. Neuron 93, 777-791 e773.

      Jeong, D., Lozano Casasbuenas, D., Gengatharan, A., Edwards, K., Saghatelyan, A., Kaplan, D.R., Miller, F.D., Yuzwa, S.A., 2020. LRIG1-Mediated Inhibition of EGF Receptor Signaling Regulates Neural Precursor Cell Proliferation in the Neocortex. Cell Rep 33, 108257.

      Llorens-Bobadilla, E., Zhao, S., Baser, A., Saiz-Castro, G., Zwadlo, K., Martin-Villalba, A., 2015. Single-Cell Transcriptomics Reveals a Population of Dormant Neural Stem Cells that Become Activated upon Brain Injury. Cell Stem Cell 17, 329-340.

      Schober, A., Böttner, M., Strelau, J., Kinscherf, R., Bonaterra, G.A., Barth, M., Schilling, L., Fairlie, W.D., Breit, S.N., Unsicker, K., 2001. Expression of growth differentiation factor-15/ macrophage inhibitory cytokine-1 (GDF-15/MIC-1) in the perinatal, adult, and injured rat brain. J Comp Neurol 439, 32-45.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Here the authors explore the role of GDF15 during development of the adult neural stem cell niche at the lateral wall of the lateral ventricle using GDF15 knock-out mice. They find increased progenitor proliferation at neonatal stages and at 8weeks, compensated by neuronal death. Further they report that EGFR+ cells are arranged differently in the GDF15 mutants (in clusters rather than columns) with also lower levels of EGFR. This is surprising to me, as the authors observe an increase in proliferation. They then report that addition of EGF leads to an increase in prominin+ progenitors in the GDF15KO, but not the WT, but there is lower levels of EGFR in the KOs. They then block CXCR4, which is allegedly required for GDF15 to modulate EGFR expression, and they find that this blocking reduces proliferation mostly in WT cells. As can be seen from this summary, to me, the model of how GDF15 loss is supposed to increase proliferation is not clear. Even less so, when in adult SVZ, EGFR+ progenitors were increased, while EGFR was reduced at postnatal stages. Beyond this, the authors show convincingly that ependymal cells are increased at adult stages, while I see no data supporting their claim of NSCs to be increased (at least not reaching levels of significance). Taken together, this manuscript contains a lot of data, but to me no coherent picture emerges. If the picture is that the higher proliferation rate of apical progenitors at E18 generates more ependymal cells, then this should be shown (by including analysis e.g. at P5 when ependymal cells emerge). How GDF15 would affect proliferation in general is also not clear to me - maybe an unbiased analysis by RNA-seq could help to separate the main effects from diving into known candidates that seem not to explain the main aspects.

      Major comments:

      1. Inconsistencies start already in Figure 1: The authors show expression of the receptor at neonatal stages (much higher) and adult stages, but GDF15 is shown only in adult stages and the citations of their previous work suggests that indeed it may not be present at this early stage in the GE VZ (p.8, line 10). If it is, please show. If it isn't, could it be that it is in the CSF and signals only to apical cells?
      2. An overview of the KO phenotype by lower power pictures would be helpful. For example an overview over the GE and PH3 immunostaining WT and KO at comparable section levels.
      3. Figure 2B- where is the apical surface, where are we in the GE? Where was quantification done?
      4. Clarify the part with EGF signaling and/or take a more comprehensive view by a proteomic or transcriptomic approach, as EGFR and CXCR4 which were already investigated previously, may not explain the phenotype.
      5. Do the authors actually think that the effects on EGFR are in the cells expressing the GDF15 receptor? Then please show co-localization.
      6. Figure 5D shows virtually no apical mitosis in WT, but indeed there are apical mitosis in WT E18 GE as one can also see in panel 5A.
      7. For the effect on ependymal cell generation it could be good to include an intermediate age, such as P5-7, when ependymal cells differentiate, staining e.g. for Lynkeas or Mcidas, known fate determinants regulating ependymal cell differentiation at that time.

      Minor comments:

      • p. 8, subapical progenitors are mentioned in line 42 without explaining how they are defined.
      • p.8, line 44: the word increased in mentioned 2x
      • In the description of Figure 8 C and D seem to have been mixed up.

      ** Referees cross-commenting**

      I also fully agree with the point that this manuscript is very difficult to read. I think that anyhow the results have to be reorganized to focus on the most important data, so rewriting will have to be done for clarity either way.

      Significance

      Exploring signmalling factors important for the stem cell niche is important, and the GDF15 indeed seems to have an effect there. The problem is, that much has been done with this factor already, but of course a mechanistic understanding of whats going on is important and could be the strength of this manuscript. However, it is really not clear, which mechanisms causes what. What is clear, is that the increased proliferation of neuronal progenitors is counterbalanced by death. Its also clear that ependymal cells are increased, which is an interesting effect. But how and why is not clear and may be the best to focus in this paper.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We thank all the reviewers for their time and effort in the peer-review process. We appreciate the positive reflections on the study and the feedback comments which were well thought-out and articulated. Considering these comments has led us to deeper reflections on the conceptualization of the mechanisms at play, and we hope that our responses here and revisions of the manuscript have improved the presentation of the data and our interpretation of these complex matters. As a result, we have now incorporated a new supplementary figure 5 and present a new model figure with the corresponding comments in the text.

      1. Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this manuscript Sanchez-Martinez et al investigated the function of ntc, a Drosophila homologue of FBXO7. The mechanisms by which mutations in this protein cause autosomal recessive PD are poorly understood. The protein has previously been implicated in PINK1/Parkin mitophagy however the mechanistic detail is lacking. The data presented here provide an important insight into the molecular functions of ntc as well as mitophagy in vivo in general. Ntc was found to promote ubiquitination of mitochondrial proteins which is counteracted by USP30. The basal ubiquitination regulated by these two enzymes is proposed to act as a permissive factor for the initiation of Pink1/Parkin mitophagy. The conclusions are based on strong data in vivo and there is a lot to like about this paper. The analyses are done rigorously, conclusions are balanced and well supported and there is a lot of conceptual novelty in the dataset. At the same time the paper raises some questions with regard to the role of mitophagy, at least in Drosophila. Not all of these could be answered during revisions but it would be useful to address the points outlined below.

      1. The functional measurements such as climbing, flight and lifespan are used to complement the data on mitophagy and mitochondrial health. However, it is clear that these do not correlate. Ntc KOs and Pink1/Parkin flies have reduced climbing and flight ability, however ntc KO does not affect mitochondrial function. In case of Pink1/Parkin the assumption is that impaired fly functionality is due to damaged mitochondria. This is clearly not the case with Ntc. How relevant is climbing/flight/lifespan to the role of Ntc in mitophagy?
      • The reviewer raises a very good point, and we agree that there isn’t a strict, linear connection between the cellular process of mitophagy and the presentation of organismal neuromuscular phenotypes such as motor behaviours and lifespan. Considering this point further, starts to highlight the complexity of the situation at hand: it is becoming clear that there are many different forms of mitophagy, and these perform different functions in cellular remodelling and homeostasis. And, of course, there are many ways to interfere with neuromuscular function (as well as lifespan). So, it follows that some forms of mitophagy may dramatically impact neuromuscular homeostasis when disrupted, while others may not. We and others have described that basal mitophagy is minimally by loss of Pink1/parkin in vivo, so the organismal phenotypes clearly do not relate to this biology. But it is currently unclear how the phenotypes may relate to physiologically relevant stress-induced mitophagy as the precise nature of this, as well as the methods to experimentally manipulate it, are lacking.

      Here, we are initially documenting new phenotypes for ntc, with no bias for the mechanistic cause, all of which are worthy of description to gain a holistic view of the overall contribution of this gene function to organismal integrity. It is clear from the literature that ntc/FBXO7 has multiple functions, for instance, regulating proteasome function and caspase activation, so it follows that genetic loss is likely to impinge on multiple cellular functions causing pleiotropic effects.

      We have always been careful not to consider (or claim) that the organismal phenotypes, such as motor function or lifespan, are specifically due to defective mitophagy but are an overall readout of the health and functioning of the neuromuscular system. Nevertheless, these phenotypes are useful in investigating manipulations that improve or worsen the effect of Pink1/parkin (or ntc) mutants, which, a priori, may or may not also modulate mitophagy. While we have documented these new organismal phenotypes of ntc mutants and analysed the impact on basal and stress- induced mitophagy, we have not drawn a cause-effect link between the two but a correlation at least. Nevertheless, this issue raises some important considerations for the field in terms of the different kinds of mitophagy, and when they may be needed, and the impact on cells, systems or whole-organisms when they are defective. This issue is explored more in answer to Q3 below.

      1. Fig 3 is somewhat patchy, mitophagy is shown for USP30 and ntc KO epistasis but climbing index used in OE setting. These data do not match, and it feels like experiments that are shown are the ones that worked. The relevance of climbing index to mitophagy is unclear as mentioned above. Does KO/OE of ntc and USP30 affect levels of mitochondria, e.g. Marf used as a maker for mitochondria in Fig. 1? And if not why not, considering that ntc/USP30 but not PINK/Parkin control basal mitophagy?
      • The main purpose of the data in Figure 3 was to document the impact of ntc manipulation on basal mitophagy, and by extension to link this to the known mitophagy regulator, USP30, whose loss of function has been documented to promote stress-induced mitophagy. Here, we successfully demonstrated USP30 RNAi and ntc OE cause an increase in mitophagy, and established their genetic relationship. However, it is very important to our modus operandi that we have orthogonal evidence for this relationship and understand the impact at an organismal level. As the reviewer indicates, the obvious choice that would align with the mitophagy data would be to assess whether loss of ntc prevents a USP30 RNAi phenotype. However, in our hands USP30 knockdown using the same RNAi line had no discernible impact on viability or behaviour in adult flies, precluding this experiment. Of note, we did observe a detrimental impact of USP30 knockdown on adult viability using a different RNAi line (KK) but this has 2 known off-targets so this result is unreliable. An alternative approach to genetically test the antagonistic relationship between USP30 and ntc, and equally valid in our view, is to assess whether ntc OE can counteract a USP30 OE phenotype. Here, we were fortunate that USP30 OE does indeed provoke an organismal phenotype, and this was suppressed by ntc OE consistent with the mitophagy data. It is unfortunate that the more obvious option was not workable on this occasion, but we hold that the genetic relationship was nevertheless substantiated as expected, albeit with alternative manipulations. Importantly, we established the validity of this approach by demonstrating the known genetic interaction between USP30 and parkin, whereby USP30 OE locomotor phenotype is suppressed by parkin OE (now, Fig. S3D). To substantiate this approach more clearly, we have now added to the text (lines 200-201) and figure (Fig. S3C) the lack of observable effect by USP30 knockdown as noted above.

      As to the second point, assessing whether levels of mitochondria are changed by ntc/USP30 manipulations; according to the immunoblot presented in new Supplementary Figure 5A (and replicates), the levels of ATP5a are not notably changed by ntc O/E or USP30 RNAi. Marf levels are also unchanged though this is not shown. This is in line with our expectations since, as discussed above, ntc/USP30 are only one set of regulators of one type of mitophagy and several others exist. The reviewer will likely be aware that the levels of mitochondria are tightly regulated and fine-tuned for the specific need in different tissue types, and that substantial changes to mitochondrial content can be catastrophic for cell and tissue viability. While this is relatively straightforward to achieve in cultured cells, substantial reductions in mitochondrial content are non-viable in an in vivo context. Of course, in physiological conditions, rates of degradation are kept in fine-balance with biogenesis and proliferation so non-catastrophic alterations in mitochondrial content are usually counteracted by compensatory changes in proliferation or degradation.

      1. What is the role of mitophagy in the maintenance of mitochondrial function in Drosophila in general? Pink1/Parkin KO assumed to result in dysfunctional mitochondria due to impairment of damage-induced mitophagy which is a minor contributor to mitophagy as has previously been published by the authors and confirmed in this dataset using mitophagy reporter. At the same time ntc is clearly required for mitophagy, but mitochondria remains structurally and functionally intact in ntc KO. The most straightforward interpretation of these data is that Pink1/Parkin contribute little to mitophagy in flies and their effect on mitochondria and fly function is independent of mitophagy. Instead ntc (and USP30) strongly regulate mitophagy but mitophagy is not important for the maintenance of mitochondrial function. The effect of ntc on fly function is also independent of its role in mitophagy/mitochondria. Unless there is an alternative explanation the entire dataset would need to be reinterpreted and discussed differently.
      • We agree that this is an important point raised by the study findings and needs to be clearly articulated in the text, but we don’t think it is as simple as whether ‘mitophagy’ contributes to mitochondrial and organismal integrity. First, as mentioned above, it is becoming apparent that it is crucial for the field to clearly and consciously distinguish between basal and induced forms of mitophagy. Basal mitophagy is likely, though not yet proven, to be an important component of mitochondrial quality control in metazoans and largely act in a house-keeping manner providing continual surveillance of mitochondrial quality and quantity. As such, like many other critical biological processes, it is likely to be supported in a ‘belt-and-braces’ manner by several mechanisms working in parallel with a degree of functional redundancy. In contrast, induced mitophagy is presumed to be quiescent until stimulated into action at specific times for specific purposes. For instance, it is assumed, though not yet proven, that PINK1/Parkin stress-induced mitophagy is stimulated in response to some kind of physiological stress or damage to mitochondria that may be catastrophic if left unchecked.

      We and others have shown before that PINK1/Parkin are minimally involved in basal mitophagy in vivo but they are well-established to promote stress-induced mitophagy. In contrast, we have found that ntc regulates basal mitophagy and, we posit, facilitates PINK1/Parkin mitophagy by providing the initiating ubiquitination. How does this map onto the mitochondrial/organismal phenotypes? There are clear disruptions to energy-intensive, mitochondria-rich tissues inPink1/parkin mutants which are not grossly affected in ntc mutants. On the other hand, ntc mutants show a dramatically short lifespan, much shorter than Pink1/parkin mutants, while other measures of mitochondrial integrity are fine.

      The Pink1/parkin phenotypes are consistent with a catastrophic loss of tissue integrity caused by the lack of a crucial protective measure (induced mitophagy) for a specific circumstance (we think, mitochondrial ‘damage’ arising from a huge metabolic burst). In contrast, while loss of ntc causes a partial (but not complete) loss of basal mitophagy, these same tissues appear to be able to cope with this impact on house-keeping QC but importantly are also able to mount a stress-induced response via Pink1/parkin still being present. On the other hand, it should be remembered that ntc is known to perform other important cellular functions, such as regulating proteasome function and caspase activation, and it is perhaps loss of these functions that causes the dramatic loss of vitality.

      Importantly, although Pink1/parkin do not contribute to basal (steady-state) mitophagy, we think it is not appropriate to think of Pink1/parkin mitophagy as a ‘minor’ contribution. Since, under the particular triggering conditions of damage or stress that stimulate Pink/parkin mitophagy, apparently only Pink1/parkin can perform this role in certain Drosophila tissues, and this stress- induced mitophagy is crucial to tissue integrity, as exemplify by the fact that increasing basal mitophagy via ntc O/E still is not sufficient to rescue Pink1 mutants. In this specific context, this is a major mitophagy pathway.

      In summary, the connection between mitochondrial autophagic degradation and mitochondrial/organismal health is not a simple one and we would avoid conflating different aspects of mitochondrial QC with the expectation that the consequences of their dysfunction would be the same. Nevertheless, these well-considered feedback comments have crystallised the need to elaborate these ideas in the Discussion where we have added a new section (lines 359-387).

      Reviewer #1 (Significance (Required)):

      Very strong genetic data presented; novel functions for human Park15 homologue in Drosophila; mechanistic insight into the ubiquitination of mitochondria by two opposing enzymes. Overall very interesting paper but interpretation is less clear which needs to be addressed.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this ms. Sanchez-Martinez and colleagues study the role of the ub ligase FBXO7, in regulating mitophagy - highlighting that mutations in FBXO7 associate with Parkinson's disease and defects in mitochondrial homeostasis. Using the fly as model, they carry out a series of expts. investigating ntc (Drosophila ortholog of FBX07) demonstrating that it can functionally rescue Parkin but not PINK1 deficiency. Expanding on this, they propose a model whereby ntc/FBX07 regulates basal mitophagy and also acts as a priming Ub-ligase for Parkin mediated mitophagy, finding that the dub USP30 counteracts these ntc function. Overall the data are robust and support the authors' conclusions and model, the manuscript is well written and I think can be accepted as is.

      • We thank the reviewer for their appreciation of the work and the time taken to provide supportive feedback.

      While outside the scope of this study to understand why, I find it very interesting that ntc cannot rescue the PINK1 deficient phenotype, argues that PINK1 may be having additional effects beyond regulating mitochondrial ubiquitylation.

      • We entirely agree with the reviewer, this is a very intriguing finding. Indeed, there are several examples in the literature showing that PINK1 performs additional functions than just triggering mitophagy. But in the current context we interpret these data as further support for a clear mechanistic distinction between basal mitophagy and stress-induced mitophagy as discussed at length to the other reviewers’ comments.

      Reviewer #2 (Significance (Required)):

      Importance for understanding the role of FBX07 function - relevant for Parkinson's disease, also demonstrates a role for it in priming for PINK1/Parkin dependent mitophagy.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Sanchez-Martinez et al characterise the role of nutracker (ntc), the presumed Drosophila orthologue of human FBXO7 (whose gene is mutated in autosomal recessive PD), in mitophagy and phenotypes associated with neurodegeneration in flies (climbing index, dopaminergic neuron loss, rough eye phenotype, and others). FBXO7 (human) has been previously shown to restore parkin (not Pink1) phenotypes and mitochondrial morphology in Drosophila and implicated in Pink1-parkin mitophagy, however the role of ntc in basal mitophagy and its genetic interaction with USP30 has not been previously reported. Key findings include: evidence for functional homology between ntc and FBXO7, and that Ntc/FBXO7 is required for basal mitophagy (and reverses USP30 function) in a Pink1-parkin independent manner.

      FBXO7/ntc is clearly an important regulator of mitophagy and its overexpression can suppress Parkin phenotypes, however FBX07/ntc has not been studied as intensively as Parkin and Pink1, therefore this work represents important insight into mitophagy regulators (broad interest to many overlapping fields).

      However, in addition to minor points and controls requested below, some further characterisation of the signals on the mitochondria induced by ntc/FBXO7 would improve the novelty of the study and the mechanistic insight provided. For example, the authors look at total ubiquitin and pS65- Ub, whereas if they looked at specific substrates that they mention in the discussion (e.g. OMM and translocon proteins) it would allow a less speculative discussion.

      Figure 1: The authors show that overexpression of ntc can rescue Parkin null phenotypes but not Pink1 phenotypes. In very similar experiments, overexpression of FBXO7 (human) has been shown to rescue Parkin phenotypes but not pink1 phenotypes (Burchell), appropriately mentioned by the authors.

      The western blots are not terribly clear and would benefit from quantification (particularly H).

      • We had previously performed the quantification on replicate experiments but had considered that the result was clear enough without quantification, and that including a quantification may make the figure too crowded. However, we have now added the quantification to the figure to support these results.

      Specific ubiquitylated substrates like translocon proteins would be very interesting (alternatively, this could be provided in figure 7).

      • We agree that this would be a very interesting aspect to investigate, but we feel that since the emphasis of the current study is clearly on the regulation of mitophagy, and not on specific substrates as has been published elsewhere (Phu et al., Mol Cell, 2020 (Ref. 42); Ordureau et al., Mol Cell, 2020 (Ref. 39)), investigating the impact of ntc and USP30 on the ubiquitination of the translocon would be a distraction to the focus of the study.

      If the rough eye phenotype is highly homogenous, state in text otherwise, a relative roughness quantification would be more informative.

      • The rough eye phenotype described here is indeed highly stereotyped and homogeneous. We have added this comment to the text for clarity (lines 124-126).

      Figure 2: Although mainly in agreement with Burchell et al findings (that there is no disruption of mito morphology or dopaminergic neuron loss caused by ntc loss), the loss flight ability in the ntc mutant is partially discrepant with Burchell et al (results not shown in Burchell et al). Can the authors explain the discrepancy? It is important finding that the ntc functionally orthogue of FBXO7 and differs from the Burchell et al conclusions.

      • The reviewer raises a good point regarding discrepant interpretations with earlier preliminary work that we didn’t specifically elaborate in the current manuscript. For the Burchell et al study we performed a series of non-exhaustive analyses with reagents that were most readily available at the time. The flight data described in Burchell et al (as ‘data not shown’) were done with what we now know to be a hypomorphic allele, which did not give a strong flight defect that we were expecting to see as a phenocopy of parkin mutants. Moreover, experiments aimed at testing the functional homology sought to rescue the only reported ntc phenotype at that time – male sterility – which did not work. It is worth noting that GAL4/UAS-mediated expression is known to be very inefficient in the male germline, so we originally interpreted the lack of rescue with this caveat in mind. It is also worth adding that, subsequent to the Burchell et al study, we have seen that expression of FBXO7 can rescue the caspase-3 activation in ntc mutant spermatocytes, supporting their functional homology. Importantly, during the Burchell et al study we did not have reagents to test the effects of ntc overexpression, obtained subsequently, which have provided compelling data that support a functional homology between ntc and FBXO7. At the time of writing the current manuscript we did not specifically revisit the Burchell et al text to note this strongly stated conclusion. We realise that this requires unequivocal clarification and thank the reviewer for pointing this out. We have amended the text to clarify this important point (lines 285-293).

      Figure 7: A,B. It is not clear that mitochondria have been enriched - can the authors show on mitochondria or show the fractionation quality?

      • Mitochondrial enrichment is a standard procedure in our lab, with consistently acceptable results, so we apologize for omitting a demonstration of this. We have now added these data to a new supplementary figure S5A. The corresponding information has also been added to the text (line 253). We have also extended this analysis to now show that total ubiquitination is not changed in ntc OE or USP30 RNAi, highlighting the specificity for accumulated ubiquitination on the mitochondria. This has been added to supplementary figure S5Band text lines 253-254.

      C/D. The text that accompanies these figures needs further explanation and clarification and I found this result hard to understand without referring to the discussion. I think the authors are concluding that pS65 is ubiquitylated by FBXO7? I think this should be re-written in the results section. If it is a major point that the authors want to make, a complementary approach would be advised - possibly human cells/mass spectrometry.

      • We apologise that this was confusing and have simplified the text accordingly to improve the clarity (lines 260-262). While this specific analysis is not a major point of the study, it provides a useful additional measure of how ntc/USP30 contributes to mitochondrial ubiquitination which *is* a key focus of the study so we have revised the Discussion to better highlight this point (lines359- 387).

      As for Figure 1, specific ubiquitylated substrates at the OMM such as the translocon subunits would be informative.

      • As discussed above, the role of USP30, at least, on ubiquitination of protein import in the translocon has been documented elsewhere and further specific analysis on this here would be a distraction from the main focus of the study.

      Minor points

      Figure 8 model and discussion: Nice discussion. However, unless protein import/ubiquitylation of translocon factors/localisation of FBXO7 to the translocon is shown in the manuscript, I would recommend more clarity in the figure legend to emphasise what is speculation based on other papers and what are new findings from the paper.

      • This is a fair point and we agree that it is good to be clear about which aspects of the working model are reflections of the data presented here and which are extrapolation/speculation from the literature. We have modified the figure and the figure legend accordingly.

      Reviewer #3 (Significance (Required)):

      FBXO7/ntc is clearly an important regulator of mitophagy however its mechanism of action has not been studied as intensively as Parkin and Pink1, therefore this work contains important insight into mitophagy regulators.

      It will be of broad interest to many overlapping fields, and has translational impact in that mitophagy is disrupted in many diseases and FBXO7 itself is mutated in Parkinson's disease.

    1. Author Response

      Reviewer #1 (Public Review):

      This study investigates how pathogens might shape animal societies by driving the evolution of different social movement rules. The authors find that higher disease costs induce shifts away from positive social movement (preference to move towards others) to negative social movement (avoidance from others). This then has repercussions on social structure and pathogen spread.

      Overall, the study comprises a good mixture of intuitive and less intuitive results. One major weakness of the work, however, is that the model is constructed around one pathogen that repeatedly enters a population across hundreds of generations. While the authors provide some justification for this, it does not capture any biological realism in terms of the evolution of the pathogen itself, which would be expected. The lack of co-evolution in the model substantially limits the generality of the results. For example, a number of recent studies have reported that animals might be expected to become very social when pathogens are very infectious, because if the pathogen is unavoidable they may as well gain the benefits of being social. The authors make some arguments about being focused on introduction events, but this does not really align well with their study design that carries through many generations after the introduction. Given the rapid evolutionary dynamics, perhaps the study could have a more focused period immediately after the initial introduction of the pathogen to look at rapid evolutionary responses (albeit this may need some sensitivity analyses around the parameters such as the mutation rates).

      We appreciate the reviewer’s evaluation of our work, and acknowledge that we have not currently included evolutionary dynamics for the pathogen.

      One conceptual impediment to such inclusion is knowing how pathogen traits could be modelled in a mechanistic way. For example, it is widely held that there is a trade-off between infection cost and transmissibility, with a quadratic relationship between them, but this is a pattern and not a process per se. We are unsure which mechanisms could be modelled that impinge upon both infection cost and transmissibility.

      On the practical side, we feel that a mechanistic, individual-based model that includes both pathogen and host evolution would become very challenging to interpret. It might be more tractable to begin with a mechanistic, spatial model that examines pathogen trait evolution with an unchanging host (such as an adaptation of Lion and Boots, 2010). We would be happy to take this on in future work, with a view to combining models thereafter.

      We have taken the suggestion to focus on the period immediately after the introduction, and we now focus on the following 500 generations. While 500 generations is still a long time, we would note that our model dynamics typically stabilise within 200 generations. We show the following generations primarily to check that some stability in the dynamics has indeed been reached (but see our new scenario 2).

      We also appreciate the point regarding mutation rates. Our mutation rates are relatively high to account for the small size of our population. We have found that with smaller mutation rates (0.001 rather than 0.01), evolutionary shifts in our population do not occur within the first 500 generations. This is primarily because prior to pathogen introduction, the ‘agent avoiding’ strategy that becomes common later is actually quite rare. Whether a rapid transition takes place thus depends on whether there are any agent avoiding individuals in the population at the moment of pathogen introduction, or on whether such individuals emerge rapidly thereafter through mutations on the social weights. We expect that with larger population sizes, we would be able to recover our results with smaller mutation rates as well.

      A final, and much more minor comment is whether this is really a paper about movement. The model does not really look at evolutionary changes in how animals move, but rather at where they move. How important is the actual movement process under this model? For example, would the results change if the model was constructed without explicit consideration of space and resources, but instead simply modelled individuals' decisions to form and break ties? (Similar to the recent paper by Ashby & Farine https://onlinelibrary.wiley.com/doi/full/10.1111/evo.14491 ). It might help to provide more information about how putting social decisions into a spatially explicit framework is expected to extend studies that have not done so (e.g.., because they are analytical).

      This paper is indeed about movement, as where to move is a key part of the movement ecology paradigm (Nathan et al. 2008). That said, we appreciate the advice to emphasise the importance of social decisions in a spatial context, we have added these to the Introduction (L. 79 – 81) and Discussion (L. 559 – 562). In brief, we do expect different dynamics that result from the explicit spatial context, as compared to a model in which social associations are probabilistic and could occur with any individual in the population.

      In our models, individual social tendency (whether they are prefer moving towards others) is separated from individual sociality (whether they actually associate with other individuals). This can be seen from our (new) Fig. 3D, in which individuals of each of the social strategies can sometimes have similar numbers of associations (although modulated by movement). This separation of the pattern from the underlying process is possible, we believe, due to the heterogeneity in the social landscape created by the explicit spatial context.

      Reviewer #2 (Public Review):

      This theoretical study looks at individuals' strategies to acquire information before and after the introduction of pathogens into the system. The manuscript is well-written and gives a good summary of the previous literature. I enjoyed reading it and the authors present several interesting findings about the development of social movement strategies. The authors successfully present a model to look at the costs and benefits of sociality.

      I have a couple of major comments about the work in its current form that I think are very important for the authors to address. That said, I think this is a promising start and that with some revisions, this could be a valuable contribution to the literature on behavioral ecology.

      We appreciate the reviewer’s kind words.

      Before starting, I would like to be precise that, given the scope of the models and the number of parameter choices that were necessary, I am going to avoid criticisms of the decisions made when designing the models. However, there are a few assumptions I rather find problematic and would like to give proper attention to.

      The first regards social vs. personal information. Most of the model argumentation is based on the reliance on social information (considering four, but to me overlapping, social strategies that are somehow static and heritable) but in fact, individuals may oscillate between relying on their personal information and/or on social information -- which may depend on the availability of resources, population density, stochastic factors, among others (Dall et al. 2005 Trends Ecol. Evol., Duboscq et al. 2016 Frontiers in Psychology). In my opinion, ignoring the influence of personal and social information decreases the significance of this work. I am aware that the authors consider the detection of food present in the model, but this is considered to a much smaller extent (as seen in their weight on individual decisions) than the social information cues.

      We appreciate the point that individuals can switch between relying on social and personal information. However, we would point out that in our model, the social strategies are not static. The social strategy is a convenient way of representing individuals’ position in behavioural trait-space (the ‘behavioural hypervolume’ of Bastille-Rousseau and Wittemeyer 2019). This essentially means that the importance assigned to each of the three cues available in our model varies among individuals. There are indeed individuals that are primarily guided by the density of food items, and this is the commonest ‘overall’ movement strategy before the pathogen is introduced. We represent this by showing how the importance of social information is low before pathogen introduction (Fig. 2B).

      While we primarily focus on the importance of social information, this is because the population quite understandably evolves a persistent preference for moving towards food items (i.e., using personal information if available). We have made this clearer in the text on lines 367 – 371.

      Critically, it is also unclear how, if at all, the information and pathogen traits are related to each other. If a handler gets sick, how does this affect its foraging activity (does it stop foraging, slow its activities, or does it show signs of sickness)? Perhaps this model is attempting to explore the emergence of social movement strategies only, but how they disentangle an individual's sickness status and behavioral response is unclear.

      We appreciate that infection may lead to physiological effects (e.g. altered metabolic rates, reduction in cognitive capacity) that may then influence behaviour. Our model aims to be relatively simple and general one, and does not consider the explicit mechanisms by which infection imposes a cost on fitness. Thus we do not include any behavioural modifications due to infection, as we feel that these would be much too complex to include in such a model. We would be happy to explore, in future work, phenomena such as the evolution of self-isolation and infection detection which is common among animals such as social insects (Stroeymeyt et al. 2018, Pusceddu et al. 2021).

      However, we have considered an alternative implementation of our model’s scenario 1 which could be interpreted as the infection reducing foraging efficiency by a certain percentage (other interpretations of the redirection of energy away from reproduction are also possible). We show how this implementation leads to very similar outcomes as those seen in our

      Very little is presented about the virulence of the pathogens and how they could affect the emergence of social strategies. The authors keep their main argumentation based on the introduction of novel pathogens (without distinctions on their pathogenicity), but a behavioral response is rather influenced by how fast individuals are infected and which are their chances of recovering. Besides, they consider that only one or two social interactions would be enough for pathogen transmission to occur.

      We have indeed considered a fixed transmission probability of 0.05, a relatively modest attack rate. Setting transmission probability to two other values (0.025, 0.1), we find that our general results are recovered - there is an evolutionary transition away from sociality, with the proportion of agent avoidance evolved increasing with the transmission probability. While we do not show these results in the main text, we have included figures showing the proportions of each social movement strategy here for the reviewers’ reference.

      Figures showing the proportion of social movement strategies in two simulation runs of our default implementation of scenario 1 (dE = 0.25, R = 2, pathogen introduction begins from G = 500). Top: Probability of transmission = 0.025 (half of the default). Bottom: Probability of transmission = 0.10 (double the default). Overall, the proportion of agent avoidance evolved (purple) increases with the probability of transmission. Each figure shows a single replicate of each parameter combination, for only 1,000 generations.

      Another important component is that individuals do not die, and it seems that they always have a chance (even if it is small) to reproduce. So, how the authors consider unsuccessful strategies in the model outputs or how these social strategies would be potentially "dismissed" by natural selection are not considered.

      We appreciate the point that our simulation does not include mortality effects, and that all individuals have some small chance of reproducing. There are a few practical and conceptual challenges when incorporating this level of realism in a general model. Including mortality effects could allow for the emergence of more complex density-dependent dynamics, as dead individuals would not be able to transmit the pathogen to other foragers (although for some pathogens, this could be a valid choice), nor would they be sources of social information. This would make the model much more challenging to interpret, and we have tried to keep this model as simple as possible.

      We have also sought to keep the model’s focus on the evolutionary dynamics, and to not focus on mortality. In order to balance this aim with the reviewer's suggestion, we have included a new implementation of the model’s scenario 1 which has a threshold on reproduction. That means that only individuals with a positive energy balance (intake > infection costs) are allowed to reproduce. We show a potentially counter-intuitive result, that the more social ‘handler tracking’ strategy persists at a higher frequency than in our default implementation, despite having a higher infection rate than the ‘agent avoiding’ strategy. We suggest that this is because the ‘agent avoiding’ individuals have very low or no intake. This is sufficient in our default implementation to have relatively higher fitness than the more frequently infected handler tracking individuals.

      Reviewer #3 (Public Review):

      Gupte and colleagues develop an individual-based model to examine how the introduction of a novel pathogen influences the evolution of social cue use in a population of agents for which social cues can both facilitate more efficient foraging, but also expose individuals to infection. In their simulations, individuals move across a landscape in search of food, and their movements are guided by a combination of cues related to food patches, individuals that are currently handling food items, and individuals that are not actively handling food. The latter two cues can provide indirect information about the likely presence of food due to the patchiness of food across the landscape.

      The authors find that prior to introducing the novel pathogen, selection favors strategies that home in on agents, regardless of whether those agents are currently handling food items. The overall contribution of these social cues to movement decisions, however, tends to be relatively small. After pathogen introduction, agents evolve to rely more heavily on social information and to either be more selective in their use of it (attending to other agents that are currently handling food and avoiding non-handlers) or avoiding other agents altogether. Gupte and colleagues further examine the ecological consequences of these shifts in social decision-making in terms of individuals' overall movement, food consumption, and infection risk. Relative to pre-introduction conditions, individuals move more, consume less food, and are less likely to be infected due to reduced contact with others. Epidemiological models on emergent social networks confirm that evolved behavioral changes generate networks that impede the spread of disease.

      The introduction of novel pathogens into wild populations is expected to be increasingly common due to climate change and increasing global connectedness. The approach taken here by the authors is a potentially worthwhile avenue to explore the potential eco-evolutionary consequences of such introductions. A major strength of this study is how it couples ecological and evolutionary timescales. Dominant behavioral strategies evolve over time in response to changing environmental conditions and impact social, foraging, and epidemiological dynamics within generations. I imagine there are many further questions that could be fruitfully explored using the authors' framework. There are, however, important caveats that impact the interpretation of the authors' findings.

      First, reproduction bears no cost in this model. Individuals produce offspring in proportion to their lifetime net energy intake, which is increased by consuming food and decreased by a set amount per turn once infected. However, prior to reproduction, net energy intake is normalized (0-1) according to the lowest individual value within the generation. This means that individuals need not maintain a positive energy balance nor even consume food at all to successfully reproduce, so long as they perform reasonably well relative to other members of the population. Since consuming food is not necessary to reproduce, declining per capita intake due to evolved social avoidance (Fig. 1d) likely decreases the importance of food to an individual's reproductive success relative to simply avoiding infection. This dynamic could explain the delayed emergence of the 'agent avoiding' strategy (Fig. 1a), as this strategy potentially is only viable once per capita intake reaches a sufficiently low level across the population (Fig. 1d). I am curious to know what the results would be if reproduction required some minimal positive net energy, such that individuals must risk food patches in order to reproduce. It would also be useful for the authors to provide information on how net energy intake changes across generations, as well as whether (and if so, how) attraction to the food itself may change over time.

      We thank the reviewer for their assessment of our work, and appreciate the point raised here (and in an earlier review) about individuals potentially reproducing without any intake. We have addressed this by running our default model [repeated introductions, R = 2, dE = 0.25], with a threshold on reproduction such that only individuals with a positive energy balance can reproduce. We mention these results in the text (L. 495 – 500), and show related figures in the SI Appendix. In brief, as the reviewer suggests, agent avoiding is less common for our default parameter combination, but becomes as common as the default combination when the infection cost is doubled (to dE = 0.5).

      We appreciate the reviewer’s suggestion about decreasing per-capita intake being a precondition for the proliferation of the agent avoiding strategy. With our new results, we now show that there is no overall decrease in intake, but the agent avoiding strategy still becomes a common strategy after pathogen introduction. As the reviewer suggests, this is because these individuals have an equivalent net energy as handler tracking individuals, as they are less frequently infected.

      We suggest that the delayed emergence of the agent avoiding strategy is primarily due to mutation limitations – such individuals are uncommon or non-existent in the simulation before pathogen introduction, and random mutations are required for them to emerge. As we have noted in response to an earlier comment, this becomes clear when the mutation rate is reduced from 0.01 to 0.001 – agent avoidance usually does not evolve at all.

      A second important caveat is that the evolutionary responses observed in the model only appear when novel pathogen introductions are extremely frequent. The model assumes no pathogen co-evolution, but rather that the same (or a functionally identical) pathogen is re-introduced every generation (spillover rate = 1.0). When the authors considered whether evolutionary responses were robust to less frequent introductions, however, they found that even with a per-generation spillover rate of 0.5, there was no impact on social movement strategies. The authors do discuss this caveat, but it is worth highlighting here as it bears on how general the study's conclusions may be.

      We appreciate the reviewer’s point entirely. We would point out that current knowledge about pathogen introductions across species and populations in the wild is very poor. However, the ongoing highly pathogenic avian influenza outbreak (Wille and Barr 2022), the spread of multiple strains of SARS-CoV-2 to wild deer in several different human-to-wildlife transmission events, and recent work on the potential for coronavirus spillovers from bats to humans, all suggest that at least some generalist pathogens must circulate quite widely among wildlife, often crossing into novel host species or populations. We have added these considerations to the text on lines 218 – 231.

      We have also added, in order to confront this point more squarely, a new scenario of our model in which the pathogen is introduced just once, and then transmits vertically and horizontally among individuals (lines 519 – 557). This scenario more clearly suggests when evolutionary responses to pathogen introductions are likely to occur, and what their consequences might be for a pathogen becoming endemic in a population. This scenario also serves as a potential starting point for models of host-pathogen trait co-evolution, and we have added this consideration to the text on lines 613 – 623.

      References

      ● Albery, G. F. et al. 2021. Multiple spatial behaviours govern social network positions in a wild ungulate. - Ecology Letters 24: 676–686.

      ● Bastille-Rousseau, G. and Wittemyer, G. 2019. Leveraging multidimensional heterogeneity in resource selection to define movement tactics of animals. - Ecology Letters 22: 1417–1427.

      ● Gupte, P. R. et al. 2021. The joint evolution of animal movement and competition strategies. - bioRxiv in press.

      ● Lion, S. and Boots, M. 2010. Are parasites ‘“prudent”’ in space? - Ecology Letters 13: 1245–1255.

      ● Lloyd-Smith, J. O. et al. 2005. Superspreading and the effect of individual variation on disease emergence. - Nature 438: 355–359.

      ● Nathan, R. et al. 2008. A movement ecology paradigm for unifying organismal movement research. - PNAS 105: 19052–19059.

      ● Pusceddu, M. et al. 2021. Honey bees increase social distancing when facing the ectoparasite varroa destructor. - Science Advances 7: eabj1398.

      ● Sánchez, C. A. et al. 2022. A strategy to assess spillover risk of bat SARS-related coronaviruses in Southeast Asia. - Nat Commun 13: 4380.

      ● Stroeymeyt, N. et al. 2018. Social network plasticity decreases disease transmission in a eusocial insect. - Science 362: 941–945.

      ● Wilber, M. Q. et al. 2022. A model for leveraging animal movement to understand spatio-temporal disease dynamics. - Ecology Letters in press.

      ● Wille, M. and Barr, I. G. 2022. Resurgence of avian influenza virus. - Science 376: 459–460.

    1. Author Response

      Reviewer #1 (Public Review):

      This study focuses on the role of polo like kinase 1 (PLK-1) during oocyte meiosis. In mammalian oocytes, Plk1 localizes to chromosomes and spindle poles, and there is evidence that it is required for nuclear envelope breakdown, spindle formation, chromosome segregation, and polar body extrusion. However, how Plk1 is targeted to its various locations and how it performs these functions is not well understood. This study uses C. elegans oocytes as a model to explore PLK-1 function during meiosis. They take advantage of an analogue-sensitive allele of plk-1, which enabled them to bypass nuclear envelope breakdown defects that occur following PLK-1 RNAi. This allowed them to dissect later roles of PLK-1 in oocytes, demonstrating that depletion causes defects in spindle organization, chromosome congression, segregation, and polar body extrusion. Moreover, the authors defined mechanisms by which PLK-1 is targeted to chromosomes, showing that CENP-C (HCP-4) is required for localization to chromosome arms and that BUB-1 is required for targeting to the midbivalent region. Finally, they demonstrate that upon removal of PLK-1 from both domains, there are severe meiotic defects. These findings are interesting. However, there is a need for additional analysis to better support some of their conclusions, and to aid in interpretation of particular phenotypes. Specific comments are below.

      • For many important claims of the paper, a single representative image is shown but the n is not noted. This is an issue throughout the paper for much of the localization analysis (e.g. Figure 1B, 1C, 1D, 2A, 2B, 3A, 3B, 3C, etc.); in cases like this, numbers should be included to increase the rigor of the presented data. How many images or movies were analyzed that looked like the one shown? For linescans, were they done only on one image? How many independent experiments were done, etc?

      We had initially chosen a representative image. Localisation was the same in all images that allowed ‘proper’ assessment of PLK-1 localisation. In our case, this means that we can only analyse bivalents that are perpendicular to the light path to distinguish between bivalent, chromosome arms, and kinetochore. We now report the number of oocytes (N) and bivalents (n) analysed for each condition. The line scans were done in one representative image.

      • In the abstract, it is stated that PLK-1 plays a role in spindle assembly/stability (this is also stated elsewhere, e.g. line 101). This phrasing implies that the authors have demonstrated roles in both spindle assembly and stability. However, to distinguish between these roles, they would have to show that removal of PLK-1 before spindle assembly causes defects, and also that removal of PLK-1 from pre-formed spindles causes collapse. I don't think it is necessary to do this, as the spindle roles of PLK-1 are not a focus of the paper. However, the language should be altered so that it does not imply that the paper has demonstrated roles in both. A good place to do this would be in the section from lines 144-147, where they first discuss the spindle defects. It would be straightforward to explain that their approach does not distinguish between spindle assembly and stability, and that PLK-1 could have a role in either or both.

      We fully agree with this comment. We cannot distinguish between spindle assembly and stability, and it is also not the focus of our current work. We have changed the text accordingly.

      • It is stated that there is kinetochore localization of PLK-1 (and I do see some dim cup-like localization in images after PLK-1 is removed from the chromosome arms via HCP-4 RNAi). However, this cup-like localization is not clear in most wild-type images (e.g. Figure 1B, 1D, 2A, 3A, etc.). Although I recognize that the chromatin staining might be obscuring kinetochore localization, if PLK-1 was truly a kinetochore protein I would also expect it to localize to filaments within the spindle (as many other kinetochore proteins do), especially since the authors state that BUB-1 targets PLK-1 to the kinetochore (and BUB-1 is in the filaments). In fact, the only images where it looks like PLK-1 may be localized to filaments are in Figure 4C and 6A, when HCP-4 has been depleted (though I don't know if this generally true across all HCP-4 RNAi images). For me, this calls into question the conclusion that PLK-1 truly is on the kinetochore in wild type conditions - could it be that PLK-1 only localizes to the kinetochore (and to the filaments) when HCP-4 is depleted? The authors need to resolve this issue and provide better evidence that PLK-1 normally localizes to the kinetochore, if they want to make this claim. Additionally, the observation that PLK-1 is not on the kinetochore filaments (in wild type conditions) should be addressed in the text somewhere - do the authors think that this is a special type of kinetochore protein that does not localize to the filaments?

      While our initial claim of PLK-1 kinetochore localisation was based on its cup-like localisation, we have now performed additional analysis and experiments to confirm this claim. First, we corroborated that PLK-1 cup-like pattern co-localises with the Mis12 complex component KNL-3 (New Figure 5-figure supplement 1). Second, we show that PLK-1 is present in the so called ‘linear elements’ (filaments) both within the spindle and in the cortex. Since PLK-1 presence in these filaments is seen in wild type as well as hcp-4 mutant oocytes, we conclude that PLK-1 likely localises in kinetochore in normal conditions.

      • The authors should provide a control experiment, treating wild-type worms with 10uM 3-IB-PP1. This would be important to ensure that the spindle defects seen at this concentration in the plk-1as strain are not non-specific effects of the inhibitor. There is a control in Figure 1 - figure supplement 3 using 1uM 3-IB-PP1 but didn't see a control for 10uM (the concentration at which spindle defects are observed).

      This control has now been included in Figure 1-figure supplement 3.

      • In Figure 2F, the gels for BUB-1+PLK-1 look different in the presence and absence of phosphorylation by Cdk1 - for these data, I agree with the authors that it looks as if the complex elutes at a higher volume if BUB-1 is not phosphorylated (lines 200-204). However, Figure 2G has a repeat of the condition with phosphorylated BUB-1, and in this panel, the complex appears to elute at a higher volume than it did on the gel in panel F. The gel in panel G looks much more similar to the unphosphorylated condition in panel F. The authors need to explain this discrepancy (i.e., Is there a reason why the gels cannot be compared between panels? How reproducible are these data?). Ideally, the authors would include a repeat of the unphosphorylated BUB-1 + PLK-1 condition in panel G, done at the same time as the conditions shown in that panel, to avoid the impression that their results may not be reproducible.

      The specific elution volume cannot be compared in different experiments as the column has proven to “drift” over time – with proteins eluting at a later volume than they did previously despite extensive washing. What is reproducible under the experimental conditions is that the unphosphorylated wild type proteins, or the phosphorylated T527A/T163A mutant proteins A) elute at a later volume than the phosphorylated wild type proteins and B) bind to a lower proportion of the MBP-PLK1PBD (as you can see in the relative absorbance profiles and Coomassie gels).

      • The authors would need to provide convincing evidence that co-depletion of BUB-1 and HCP-4 delocalizes PLK-1 from the chromosomes entirely, and that this co-depletion condition is more severe than either single depletion alone.

      We now provide a quantitation on the total PLK-1 levels to go along the images (New Figure 8-figure supplement 1).

      Additionally, the bub-1T527A and hcp-4T163A alleles are nice tools to, in theory, more specifically delocalize PLK-1 from the midbivalent and chromosome arms, respectively, to explore the functions of chromosome-associated PLK-1. However, I think the authors cannot rule out the possibility that other proteins are also being depleted from the midbivalent and/or chromosome arms in their conditions, and that this delocalization may contribute to the phenotypes observed. For example, hcp-4 depletion was recently shown to delocalize KLP-19 from the chromosome arms (Horton et.al. 2022), so in the experiment shown in Figure 6E (HCP-4 RNAi in the bub-1 mutant), PLK-1 was likely not the only protein missing from the chromosome arms. Therefore, understanding if other proteins are absent from these domains (in the bub-1T527A and hcp-4T16A3 mutants) would help the reader understand and interpret the presented phenotypes (and how specific they are to PLK-1 loss). Consequently, I think that to better understand the co-depletion analysis presented in Figure 6 (and Figure 6 supplement 1), the authors should analyze other midbivalent and chromosome arm proteins, to determine if any are also delocalized (e.g. SUMO, KLP-19, MCAK, etc.).

      As stated above, this paper focuses on identifying the specific meiotic events PLK-1 plays a role in and characterising its targeting mechanism. We are following on this work to understand what proteins are regulated by PLK-1 in different chromosome domains and how this relates to the observed phenotypes.

      For the current, we should emphasise that mutating a single Thr residue within an STP motif in a largely disordered region is far more specific than depleting HCP-4 or BUB-1, making it likely that the observed effects are mediated through PLK-1 targeting. It should be noted that the finding presented in Horton et.al. 2022 is in contradiction with another study in which hcp-4 depletion did not impact KLP-19 localisation (Hattersley et al 2022).

      Additionally, instead of performing a combination of mutant and RNAi analysis (i.e. HCP-4 RNAi in the bub-1 mutant (Figure 6) and BUB-1 RNAi in the hcp-4 mutant (Figure 6 figure supplement 1)), it would be more powerful to generate a double mutant - this has a higher chance of being a more specific depletion condition.

      We have performed these experiments, which are now presented in Figure 9.

    1. Author Response

      Reviewer #1 (Public Review):

      This work by Shen et al. demonstrates a single molecule imaging method that can track the motions of individual protein molecules in dilute and condensed phases of protein solutions in vitro. The authors applied the method to determine the precise locations of individual molecules in 2D condensates, which show heterogeneity inside condensates. Using the time-series data, they could obtain the displacement distributions in both phases, and by assuming a two-state model of trapped and mobile states for the condensed phase, they could extract diffusion behaviors of both states. This approach was then applied to 3D condensate systems, and it was shown that the estimates from the model (i.e., mobile fraction and diffusion coefficients) are useful to quantitatively compare the motions inside condensates. The data can also be used to reconstruct the FRAP curves, which experimentally quantify the mobility of the protein solution.

      This work introduces an experimental method to track single molecules in a protein solution and analyzes the data based on a simple model. The simplicity of the model helps a clear understanding of the situation in a test tube, and I think that the model is quite useful in analyzing the condensate behaviors and it will benefit the field greatly. However, the manuscript in its current form fails to situate the work in the right context; many previous works are omitted in this manuscript, exaggerating the novelty of the work. Also, the two- state model is simple and useful, but I am concerned about the limits of the model. They extract the parameters from the experimental data by assuming the model. It is also likely that the molecules have a continuum between fully trapped and fully mobile states, and that this continuum model can also explain the experimental data well.

      We thank the reviewer for the warm overview of our work and the insightful comments on the areas that need to be improved. We are very encouraged by the reviewer’s general positive assessment of our approach. We have addressed these comments in the revised manuscript

      Reviewer #2 (Public Review):

      In this paper, Shen and co-workers report the results of experiments using single particle tracking and FRAP combined with modeling and simulation to study the diffusion of molecules in the dense and dilute phases of various kinds of condensates, including those with strong specific interactions as well as weak specific interactions (IDR-driven). Their central finding is that molecules in the dense phase of condensates with strong specific interactions tend to switch between a confined state with low diffusivity and a mobile state with a diffusivity that is comparable to that of molecules in the dilute phase. In doing so, the study provides experimental evidence for the effect of molecular percolation in biomolecular condensates.

      Overall, the experiments are remarkably sophisticated and carefully performed, and the work will certainly be a valuable contribution to the literature. The authors' inquiry into single particle diffusivity is useful for understanding the dynamics and exchange of molecules and how they change when the specific interaction is weak or strong. However, there are several concerns regarding the analysis and interpretation of the results that need to be addressed, and some control experiments that are needed for appropriate interpretation of the results, as detailed further below.

      We thank the reviewer for the warm support of our work (assessing that our work is “remarkably sophisticated and carefully performed” and “will certainly be a valuable contribution”) and for the constructive comments/critiques, which we have now addressed in the revised manuscript (please refer to our detailed responses below).

      (1) The central finding that the molecules tend to experience transiently confined states in the condensed phase is remarkable and important. This finding is reminiscent of transient "caging"/"trapping" dynamics observed in diverse other crowded and confined systems. Given this, it is very surprising to see the authors interpret the single-molecule motion as being 'normal' diffusion (within the context of a two-state diffusion model), instead of analyzing their data within the context of continuous time random walks or anomalous diffusion, which is generally known to arise from transient trapping in crowded/confined systems. It is not clear that interpreting the results within the context of simple diffusion is appropriate, given their general finding of the two confined and mobile states. Such a process of transient trapping/confinement is known to lead to transient subdiffusion at short times and then diffusive behavior at sufficiently long times. There is a hint of this in the inset of Fig 3, but these data need to be shown on log-log axes to be clearly interpreted. I encourage the authors to think more carefully and critically about the nature of the diffusive model to be used to interpret their results.

      We thank the reviewer for the insightful comments and suggestions, which have been very helpful for us to think deeper about the experimental data and the possible underlying mechanism of our findings. Indeed, the phase separated systems studied here resemble previously studied crowed and confined systems with transient caging/trapping dynamics in the literature ((Akimoto et al., 2011; Bhattacharjee and Datta, 2019; Wong et al., 2004) for examples)(references have been added in the revised manuscript). In our PSD system in Figure 3, The caging/trapping of NR2B in the condensed phase is likely due to its binding to the percolated PSD network. Thus, NR2B molecules in the condensed phase should undergo subdiffusive motions. Indeed, from our single molecule tracking data, the motion of NR2B fits well with the continuous time random walk (CTRW) model, as surmised by this reviewer. We have now fitted the MSD curve of all tracks of NR2B in the condensed phase with an anomalous diffusion model: MSD(t)=4Dtα (see Response Figure 1 below). The fitted α is 0.74±0.03, indicating that NR2B molecules in the condensed phase indeed undergo sub- diffusive motions. The fitted diffusion coefficient D is 0.014±0.001 μm2/s. We have now replaced the Brownian motion fitting in Figure 3E in the original manuscript with this sub- diffusive model fitting in the revised manuscript to highlight the complexity of NR2B diffusion in PSD condensed phase we observed.

      Response Figure 1: Fitted the MSD curve (mean value as red dot with standard error as error bar) in condensed phase with an anomalous diffusion model (blue curve, MSD=4Dtα). The fitting gives D=0.014±0.001 μm2/s and α=0.74±0.03.

      We find it useful to interpret the apparent diffusion coefficient (D=0.014±0.001 μm2/s) derived from this particular anomalous diffusion model as containing information of NR2B motions in a broadly construed mobile state (i.e., corresponding to the network unbound form) as well as in a broadly construed confined state (i.e., corresponding to NR2B molecules bound to percolated PSD networks). The global fitting using the sub-diffusive model does not pin down motion properties of NR2B in these different motion states. This is why we used, at least as a first approximation, the two-state motion switch model (HMM model) to analyse our data (please refer also to our detailed response to the comment #7 from reviewer 1 and corresponding additional analyses made during the revision as highlighted in Response Figure 4).

      As described in our response to the comment points #4 and #7 from reviewer 1, the two- state model is most likely a simplification of NR2B motions in the condensed phase. Both the mobile state and the confined state in our simplified interpretative framework likely represent ensemble averages of their respective motion states. However, the tracking data available currently do not allow us to further distinguish the substates, but further analysis using more refined model in the future may provide more physical insight, as we now emphasize in the revised “Discussion” section: “With this in mind, the two motion states in our simple two-state model for condensed-phase dynamics should be understood to be consisting of multiple sub-states. For instance, one might envision that the percolated molecular network in the condensed phase is not uniform (e.g., existence of locally denser or looser local networks) and dynamic (i.e., local network breaking and forming). Therefore, individual proteins binding to different sub-regions of the network will have different motion properties/states. … In light of this basic understanding, the “confined state” and “mobile state” as well as the derived diffusion coefficients in this work should be understood as reflections of ensemble-averaged properties arising from such an underlying continuum of mobilities. Further development of experimental techniques in conjunction with more refined models of anomalous diffusion (Joo et al., 2020; Kuhn et al., 2021; Muñoz-Gil et al., 2021) will be necessary to characterize these more subtle dynamic properties and to ascertain their physical origins” (p.23 of the revised manuscript).

      A practical reason for using the two-state motion switch HMM model to analyse our tracking data in the condensed phase is that the lifetime of the putative mobile state (when the per-frame molecular displacements are relatively large) is very short and such relatively faster short trajectories are interspersed by long confined states (see Response Figure 4C for an example). Statistically, ascertaining a particular anomalous diffusion model by fitting to such short tracks is likely not reliable. Therefore, here we opted for a semi-quantitative interpretative framework by using fitted diffusion coefficients in a two-state HMM as well as the new correlation-based approach for demarcating a low-mobility state and a high- mobility state (see our detailed response to reviewer 1’s point #7) in the present manuscript (which is quite an extensive study already) while leaving refinements of our computational modelling to future effort.

      Even in the context of the 'normal' two-state diffusion model they present, if they wish to stick with that-although it seems inappropriate to do so-can the authors provide some physical intuition for what exactly sets the diffusivities they extract from their data. (0.17 and 0.013 microns squared per second for the mobile and confined states). Can these be understood using e.g., the Stoke-Einstein or Ogston models somehow?

      As stated above, we are in general agreement with this reviewer that the motion of NR2B in the condensed phase is more complex than the simple two-state picture we adopted as a semi-quantitative interpretation that is adequate for our present purposes. Within the multi-pronged analysis we have performed thus far, NR2B molecules clearly undergo anomalous diffusions in solution containing dense, percolated, and NR2B-binding molecular networks. As a first approximation, our simple two-state HMM analysis yielded two simple diffusion coefficients (0.17 μm2/s for the mobile state and 0.013 μm2/s for the confined state). For the diffusion coefficient in the mobile state, we regard it as providing a time scale for relatively faster diffusive motions (which may be further classified into various motion substates in the future) that are not bound or only weakly associated with the percolated network of strong interactions in the PSD condensed phase. For the confined or low-mobility state in our present formulation, these molecules are likely bound relatively tightly to the percolated networks, thus the diffusion coefficient should be much smaller than the unbounded form (i.e., the mobile state) according to the Stoke-Einstein model. However, due to the detection limitation of the supper resolution imaging method (resolution of ~20 nm), we could not definitively tell the actual diffusivity beyond the resolution limit. So the diffusion coefficient in the confined state can also be interpreted as a Gaussian distributed microscope detection error (𝑓(𝑥) =1 , which is x~N(0, σ2), where σ is the standard deviation of the Gaussian distribution viewed as the resolution of localization-based microscopy, x is the detection error between recorded localization and molecule’s actual position). The track length in the confined state is the distance between localizations in consecutive frames, which can be calculated by subtraction of two independent Gaussian distributions, and the distribution of this track length (r) will be r~N(0, 2σ2). To link the detection error with the fitted diffusion coefficient, we calculated the log likelihood function of Gaussian distributed localization error (, where σ is the standard deviation of the Gaussian distribution) for the maximum likelihood estimation process to fit the HMM model. The random walk shares a similar log likelihood term () in performing maximum likelihood estimation.

      These two log likelihood functions will produce same fitting results with 2σ2 equivalent to 4Dt according to the likelihood function. In this way, the diffusion coefficient yielded by our HMM analyses for the confined state (0.0127 μm2/s) can be interpreted as the standard deviation of localization detection error (or microscope resolution limit), which is 𝜎 =√2𝐷𝑡 = 19.5 𝑛𝑚. We have included this consideration as an alternate interpretation of the confined-state or low-mobility motions with the results now provided in the “Materials and Methods” section in the sentence, viz., “… the L-component distribution may be reasonably fitted (albeit with some deviations, see below) to a simple-diffusion functional form with a parameter s =13.6 ± 3.7 nm, where s may be interpreted as a microscope detection error due to imaging limits or alternately expressed as s = DLt with DL = 0.006149 μm2/s being the fitted confined-state diffusion coefficient and t = 0.03s is the time interval of the time step between experimental frames. (The HMM-estimated confined-state Dc = 0.0127 μm2/s corresponds to s = 19.5 nm)” (p.32 of the revised manuscript).

      (2) Equation 1 (and hence equation 2) is concerning. Consider a limit when P_m=1, that is, in the condensed phase, there are no confined particles, then the model becomes a diffusion equation with spatially dependent diffusivity, \partial c /\partial t = \nabla * (D(x) \nabla c). The molecules' diffusivity D(x) is D_d in the dilute phase and D_m in the condensed phase. No matter what values D_d and D_m are, at equilibrium the concentration should always be uniform everywhere. According to Equation 1, the concentration ratio will be D_d/D_m, so if D_d/D_m \neq 1, a concentration gradient is generated spontaneously, which violates the second law of thermodynamics. Can the authors please justify the use of this equation?

      Indeed, the derivation of Equation 1 appears to be concerning. The flux J is proportional to D * dc/dx (not kDc as in the manuscript). At equilibrium dc/dx = 0 on both sides and c is constant everywhere. Can the authors please comment?

      So then another question is, why does the Monte Carlo simulation result agree with Equation 1? I suspect this has to do with the behavior of particles crossing the boundary. Consider another limit where D_m = 0, that is, particles freeze in the condensed phase. If once a particle enters the condensed phase, it cannot escape, then eventually all particles will end up in the condensed phase and EF=infty. The authors likely used this scheme. But as mentioned above this appears to violate the second law.

      Thanks for the incisive comment. After much in-depth considerations, we are in agreement with the reviewer that Eq.1 should not be presented as a relation that is generally applicable to diffusive motions of molecules in all phase-separated systems. There are cases in which this relation can need to unphysical outcomes as correctly pointed out by the reviewer.

      Nonetheless, based on our theoretical/computational modeling, it is also clear, empirically, that Eq.1 holds approximately for the NR2B/PSD system we studied, and as such it is a useful approximate relation in our analysis. We have therefore provided a plausible physical perspective for Eq.1’s applicability as an approximate relation based upon a schematic consideration of diffusion on an underlying rugged (free) energy landscape (Zhang and Chan, 2012) of a phase-separated system (See Figure 3G in the revised manuscript), while leaving further studies of such energy landscape models to future investigations.

      This additional perspective is now included in the following added passage under a new subheading in the revised manuscript:

      "Physical picture and a two-state, two-phase diffusion model for equilibrium and dynamic properties of PSD condensates"

      (3) Despite the above two major concerns described in (1) and (2), the enrichment due to the presence of a "confined state", is reasonable. The equilibrium between "confined" and "mobile" states is determined by its interaction with the other proteins and their ratio at equilibrium corresponds to the equilibrium constant. Therefore EF=1/Pm is reasonable and comes solely from thermodynamics. In fact, the equilibrium partition between the dilute and dense phases should solely be a thermodynamic property, and therefore one may expect that it should not have anything to do with diffusivity. Can the authors please comment on this alternative interpretation?

      Thanks for this thought-provoking comment. We agree with the reviewer that the relative molecular densities in the condensed versus dilute phases are governed by thermodynamics unless there is energy input into the system. However, in our formulation, the mobile ratio should not be the only parameters for determining the enrichment fold in a phase separated system. In fact, the approximate relation (Eq.1) is EF ≈ Dd/PmDm, and thus EF ≈ 1/Pm only when Dd ≈ Dm . But the speed of mobile-state diffusion in the condensed phase is found to be appreciably smaller than that of diffusion in the dilute phase (Dd > Dm). In general, a hallmark of a phase separation system is to enrich involved molecules in the condensed phase, regardless whether the molecule is a driver (or scaffold) or a client of the system. Such enrichment is expected to be resulted from the net free energy gain due to increased molecular interactions of the condensed phase (as envisioned in Response Figure 9). For example, in the phase separation systems containing PrLD-SAMME (Figure 4 of the manuscript), Pm is close to 1, but the enrichment of PrLD-SAMME in the condensed phase is much greater than 1 (estimated to be ~77, based on the fluorescence intensity of the protein in the dilute and condensed phase; Figure 5—figure supplement 1). As far as Eq.1 is concerned, this is mathematically correct because the diffusion coefficient of PrLD-SAMME in the condensed phase (D ~0.2 μm2/s) is much smaller than the diffusion coefficient of a monomeric molecule with a similar molecular mass in dilute solution (D~ 100 μm2/s, measured by FRAP-based assay; the mobility of the molecules in the dilute solution in 3D is too fast to be tracked). Physically, it’s most likely that the slower molecular motion in the condensed phase is caused by favorable intermolecular interactions and the same favorable interactions underpinning the dynamic effects lead also to a larger equilibrium Boltzmann population.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):


      Summary:

      In this manuscript, Roberts et al. hypothesised that the 5:2 diet (a popular form of IF, a dietary strategy within the Intermittent fasting that is thought to increase adult hippocampal neurogenesis - AHN) would enhance AHN in a ghrelin-dependent manner. To do this, the Authors used immunohistochemistry to quantify new adult-born neurons and new neural stem cells in the hippocampal dentate gyrus of adolescent and adult wild-type mice and mice lacking the ghrelin receptor, following six weeks on a 5:2 diet. They report an age-related decline in neurogenic processes and identify a novel role for ghrelin-receptor in regulating the formation of new adult

      born neural stem cells in an age-dependent manner. However, the 5:2 diet did not affect new neuron or neural stem cell formation in the dentate gyrus, nor did alter performance on a spatial learning and memory task. They conclude that the 5:2 diet used in their study does not increase AHN or improve associated spatial memory function.

      Major comments:

      One criticism might be the fact that many aspects are addressed at the same time. For instance it is not fully clear the role of ghrelin with respect to testing the DR effects on AHN. Although the link between ghrelin, CR and AHN is explained by citing several previous studies, it is difficult to identify the main focus of the study. Maybe this is due to the fact that the Authors analyse and comment throughout the paper the different experimental approaches used by different

      Authors to study effect of DR to AHN. This is not bad in principle, since I think the Authors have a deep knowledge of this complex matter, but all this results in a difficulty to follow the flow of the rationale in the manuscript.

      We appreciate the reviewer’s critique regarding the rationale of the studies presented in the manuscript.

      The role of ghrelin in the regulation of AHN by dietary interventions such as CR and IF is a major interest of our lab and is the main focus of the study. We, and others, have shown that ghrelin mediates the beneficial effects of CR on AHN. It is often assumed that ghrelin will elicit similar effects in other DR paradigms. We selected the 5:2 diet since it is widely practiced by humans, but it has not been well tested experimentally.

      We sought to empirically test how the neurogenic response to 5:2 differed between mice with functional and impaired ghrelin signaling.

      Given that plasma ghrelin levels and AHN are reduced during ageing, we also wanted to determine if 5:2 diet could slow or even prevent neurogenic decline in ageing mice.

      We will re-write the manuscript to ensure that our primary aim is clearly presented. We will also reanalyze the data, with genotype and 5:2 diet as key variables. To help maintain focus, the variable of age will be analyzed separately. This amendment will, we hope, help the reader follow the narrative of our manuscript.

      Another major point: the Discussion is too long. The Authors analyse all the possible reasons why different studies obtained different results concerning the effectiveness of DR in stimulating adult neurogenesis. Thus, the Discussion seems more as a review article dealing with different methods/experimental approaches to evaluate DR effects. We know that sometimes different results are due to different experimental approaches, yet, when an effect is strong and clear, it occurs in different situations. Thus, I think that the Authors must be less shy in expressing their conclusions, also reducing the methodological considerations. It is also well known that sometimes different results can be due to a study not well performed, or to biases from the Authors.

      In our discussion, we felt that it was particularly important to be as rigorous as possible in contextualizing our findings with other published data, whilst highlighting methodological differences. Our aim was to be as precise as possible when comparing findings across studies, however, this resulted in the narrative drifting from the key objectives of our study – namely, to determine the effect of 5:2 diet on neurogenesis and whether or not ghrelin-signalling regulated the process. We will amend the text of the discussion to ensure that the key points of our study are only compared and contrasted with relevant studies in the field. We thank the reviewer for their candid comment.


      Minor comments:

      • This sentence: "There is an age-related decline in adult hippocampal neurogenesis" cannot be put in the HIGHLIGHTS, since is a well known aspect of adult hippocampal neurogenesis

      The reviewer is correct to state this. Our study replicates this interesting age-related phenomenon. However, we will remove it from the ‘Highlights’ section.

      • Images in Figure 5 are not good quality.

      We apologise for this oversight. We will review each figure and panel to ensure that high-resolution images, that are appropriately annotated, are used throughout the manuscript.

      • In general, there are not a lot of images referring to microscopic/confocal photographs across the entire manuscript.

      We structured the manuscript with a limited number of figures and associated microscope captured panels, with the aim of presenting representative images to illustrate the nature and quality of the IHC protocols. However, we will amend the figures for the revised manuscript to provide representative microscopy images, with each group included and clearly annotated.

      • The last sentence of the Discussion "These findings suggest that distinct DR regimens differentially regulate neurogenesis in the adult hippocampus and that further studies are required to identify optimal protocols to support cognition during ageing" is meaningless in the context of the study, and in contrast with the main results. Honestly, my impression is that the Authors do not want to disappoint the conclusions of the previous studies; an alternative is that other Reviewers asked for this previously.

      We do not believe that this statement is contradictory to our findings, as distinct DR paradigms do appear to regulate AHN in different ways. However, we agree that we can be more explicit with regards to our own study findings and will prioritize the conclusions of our study over those of the entire field during revision.

      Reviewer #1 (Significance (Required)):

      value the significance of publishing studies that will advance the field.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):


      In this manuscript, Roberts et al. investigate the effect of the 5:2 diet on adult hippocampal neurogenesis (AHN) in mice via the ghrelin receptor. Many studies have reported benefits of dietary restriction (DR) on the brain that include increasing neurogenesis and enhancing cognitive function. However, neither the mechanisms underlying the effects of the 5:2 diet, nor potential benefits on the brain, are well understood. The authors hypothesize that the 5:2 diet enhances AHN and cognitive function via ghrelin-receptor signaling. To test this, they placedadolescent and adult ghrelin receptor knockout or wild type mice on either the 5:2 or ad libitum (AL) diet for 6 weeks, followed by spatial memory testing using an object in place (OIP) task. The authors also assessed changes in AHN via IHC using multiple markers for cell proliferation and neural stem cells. The authors observed a decrease in AHN due to age (from adolescent to adult), but not due to diet or ghrelin-receptor signaling. While loss of the ghrelin-receptor impaired spatial memory, the 5:2 diet did not affect cognitive function. The authors conclude that the 5:2 diet does not enhance AHN or spatial memory.

      We thank the reviewer for this summary. We note that there was a significant reduction in new neurones (BrdU+/NeuN+) cells in GHS-R null animals, regardless of age or diet (3 way ANOVA of age, genotype and diet (sexes pooled): Genotype P = 0.0290). These data suggest that the loss of ghrelin receptor signalling does impair AHN. However, we will re-analyse our data in light of reviewer 1 comments to remove ‘age’ as a variable. The new analyses and associated discussion will be presented in our revised manuscript.

      The authors use a 5:2 diet but fail to provide a basic characterization of this dietary intervention. For example, was the food intake assessed? In addition to the time restriction of the feeding, does this intervention also represent an overall caloric restriction or not? According to the provided results, the 5:2 diet does not appear to regulate adult hippocampal neurogenesis contrary to the authors' original hypothesis. Did the authors measure the effects of the 5:2 diet on any other organ system? Do they have any evidence that the intervention itself resulted in any well documented benefits in other cell types? Such data would provide a critical positive control for their intervention.

      This is an important point raised by the reviewer. Currently, we carefully quantified weight change across the duration of the study. However, we do not know whether the 5:2 diet reduced overall food intake or whether it impacted the timing of feeding events. To overcome this limitation, we will now test what impact the 5:2 dietary regime has on food intake and the timing of feeding. This study will allow us to correlate any changes with 5:2 diet. In addition, we have collected tibiae to quantify skeletal growth and have collected both liver and plasma (end point) samples which will be used to assess changes in the GH-IGF-1 axis. These additional studies will allow us to characterise the effects of the 5:2 paradigm on key indicators of physiological growth. These new data will be incorporated into the revised manuscript.

      Based on the effects of ghrelin in other dietary interventions, the authors speculate that the effect of the 5:2 diet is similarly mediated through ghrelin. However, the authors do not provide any basic characterization of ghrelin signaling to warrant this strong focus on the GSH-R mice. While the GSH-R mice display changes in NSC homeostasis and neurogenesis, none of these effects appear to be modified by the 5:2 diet. Thus, the inclusion of the GSH-R mice does not seem warranted and detracts from the main 5:2 diet focus of the manuscript.

      The role of ghrelin signalling via its receptor, GHSR, is a central tenet of our hypothesis. The loxTB-GHS-R null mouse is a well validated model of impaired ghrelin signalling, in which insertion of a transcriptional blocking cassette prevents expression of the ghrelin receptor (ZIgman et al.2005 JCI). We have previously shown that this mouse model is insensitive to calorie restriction (CR) mediated stimulation of AHN, in contrast to WT mice (Hornsby et al. 2016), justifying its suitability as a model for assessing the role of ghrelin signalling in response to DR interventions, such as the 5:2 paradigm. Whilst our findings do not support a role for ghrelin signalling in the context of the 5:2 diet studied, we did follow the scientific method to empirically test the stated hypothesis. While critiques of experimental design are welcome, the removal of these data may perpetuate publication bias in favour of positive outcomes and is something we wish to avoid.

      Neurogenesis is highly sensitive to stress. The 5:2 diet may be associated with stress which could counteract any benefits on neurogenesis in this experimental paradigm. Did the authors assess any measures of stress in their cohorts? Were the mice group housed or single housed?

      We thank the reviewer for raising this point. We have open-field recordings that will now be analysed to assess general locomotor activity, anxiety and exploration behaviour. Additionally, we will assess levels of the stress hormone, ACTH, in end point plasma samples. These datasets will be incorporated into the revised manuscript.

      The authors state that the 5:2 diet led to a greater reduction in body weight (31%) in adolescent males compared to other groups. However, it appears that the cohorts were not evenly balanced and the adolescent 5:2 male mice started out with a significantly higher starting weight (Supplementary Figure 1). The difference in starting weight at such a young age is significantly confounding the conclusion that the 5:2 diet is more effective at limiting weight gain specifically in this group.

      We thank the reviewer for highlighting this limitation. In the revision we will re-focus our discussion around the Δ Body weight repeated measures data, which compares the daily body weight of each group to its baseline value - thereby normalising any intergroup differences in starting weight. Furthermore, we will restructure figures 1 and S1 so that figure 1 presents only the repeated measure Δ Body weight data, while data for body weight both at baseline and on the final day of the study will be presented in figure S1.

      The authors count NSCs as Sox2+S100b- cells. However, the representative S100b staining does not look very convincing. Instead, it would be more appropriate to count Sox2+GFAP+ cells with a single vertical GFAP+ projection. Alternatively, the authors could also count Nestin-positive cells. Additionally, the authors label BrdU+ Sox2+ S100B- cells as "new NSCs". However, it appears that the BrdU labeling was performed approximately 6 weeks before the tissue was collected (Figure 1A). Thus, these BrdU-positive NSCs most likely represent label retaining/quiescent NSCs that divided during the labeling 6 weeks prior but have not proliferated since. As such, the term "new NSC" is misleading and would suggest an NSC that was actively dividing at the time of tissue collection.

      We apologise for presenting low-resolution images – these will be replaced by high-resolution images in the revised manuscript. In this study we have quantified the actively dividing BrdU+/Sox2+/S100B- cells that represent type II NSCs (rather than GFAP+ or Nestin+ type I NSCs) that have incorporated BrdU within the time period of the 6-week intervention. We appreciate the reviewer’s comments concerning the “new NSCs” terminology. We agree that we should be more specific in clarifying that the NSCs identified are those labelled during the 1st week of the 6-week intervention. We will amend this throughout the revised manuscript by re-naming these cells as 6-week old NSCs.

      Overall, this manuscript lacks a clear focus and narrative. Due to a lack of an affect by the 5:2 diet on hippocampal neurogenesis, the authors mostly highlight already well-known effects of aging and Grehlin/GSH-R on neurogenesis. Moreover, the authors repeatedly use age-related decline and morbidities as a rational for their study. However, they assess the effects of the 5:2 diet on neurogenesis only in adolescent and young mature but not aged mice.

      To provide greater clarity, and in accordance with reviewer 1’s comments, we will amend the text throughout to provide a focus on the data obtained. The objective of the changes will be to re-enforce the original study narrative. In relation to the use of the term ‘age-related decline’ or ‘age-related changes’, we think that these are appropriate to our study. Physiological ageing doesn’t begin at a specific point of chronological time, but is a process that is continuously ongoing. Indeed, our data is in agreement with previous studies reporting an age-related reduction in AHN at 6 months of age (e.g Kuhn et al.1996).

      Minor Points

      The authors combine the data from both male and female mice for most bar graphs. While this does not appear to matter for neurogenesis or behavioral readouts, there are very significant sexually dimorphic differences with respect to body size and weight. As such, male and female mice in Figure 1D,F should not be plotted in the same bar graph.

      We agree that sexual dimorphism exists with respect to body size and weight. We used distinct male and female symbols for each individual animal on these bar graphs, but do agree with the reviewer that sexual dimorphic differences should be emphasized. To achieve this, we will include additional supplementary graphs presenting the sex differences in starting weight, final weight, and weight change versus starting weight.

      The Figure legends are very brief and should be expanded to include basic information of the experimental design, statistical analyses etc.

      We thank the reviewer for this comment. We will provide specific experimental detaisl in the revised figure legends.

      Many figures include a representative image. However, it is often unclear if that is a representative image of a WT or mutant mouse, or a 5:2 or control group (Figure 2A, 3A, 4A, 5A).

      We structured the manuscript with a limited number of figures and associated microscope captured panels, with the aim of presenting representative images to illustrate the nature and quality of the IHC protocols. However, we will amend the figures for the revised manuscript to provide representative microscopy images, with each group included and clearly annotated.

      It would be helpful to provide representative images of DCX-positive cells in Figure 3A-F. Additionally, the authors should include a more extensive description of how this quantification was performed in the method section.

      We will revise the manuscript to provide representative high-resolution Dcx+ images displaying cells of each category. The method will also be revised to include a detailed description of how the quantification and classification was performed.

      The authors state "the hippocampal rostro-caudal axis (also known as the dorsoventral[] axis". However, the rostral-caudal and dorsal-central axis are usually considered perpendicular to one another.

      We agree that the dorso-ventral and rostral-caudal axes are anatomically distinct. The terms are often used interchangeably in the literature, which can lead misinterpretations (e.g the caudal portion of dorsal hippocampus is often mislabelled as ventral hippocampus). To avoid ambiguity, mislabelling or misidentification, we will include a supplementary figure detailing our anatomical definitions of the rostral and caudal poles of the hippocampus, alongside representative images and the bregma coordinates.

      Reviewer #2 (Significance (Required)):


      Understanding the mechanisms of a popular form of intermittent fasting (5:2 diet) that is not well understood is an interesting topic. Moreover, examining the effect of this form of intermittent fasting on the brain is timely. Notwithstanding, while the authors use multiple markers to validate the effect of the 5:2 diet on adult hippocampal neurogenesis, concerns regarding experimental design, validation, and data analysis weaken the conclusions being drawn.

      We thank reviewer 2 for this significance statement. We will revise the manuscript, as mentioned above, to clarify the experimental design, improve presentation of the data, and re-focus the narrative of the primary aims of the study.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):


      Summary


      In this study, Roberts and colleagues used a specific paradigm of intermitted fasting, the 5:2 diet, meaning 5 days ad libitum food and 2 non-consecutive days of fasting. They exposed adolescent and adult wild-type mice and ghrelin receptor knockout mice (GHS-R-/-) for 6 weeks to this paradigm, followed by 1 week ad libitum food. They further used the "object in place task" (OIP) to assess spatial memory performance. At the end of the dietary regime, the authors quantified newborn neurons and neural stem cells (NSCs) by immunohistochemistry. Roberts

      et al. show that the 5:2 diet does not change the proliferation of cells in the hippocampus, but report an increased number of immature neurons (based on DCX) in all the mice exposed to the 5:2 diet. This change however did not result in an increased number of mature adult-born neurons, as assessed by a BrdU birthdating paradigm. The authors further show diet-independent effects of the ghrelin receptor knockout, leading to less adult born neurons, but more NSCs in the adolescent mice and a lower performance in the OIP task.

      Major comments:

      The main conclusion of this study is that a specific type of intermitted fasting (5:2 diet) has no effects on NSC proliferation and neurogenesis. As there are several studies showing beneficial effects of intermitted fasting on adult neurogenesis, while other studies found no effects, it is important to better understand the effects of such a dietary paradigm.

      The experimental approaches used in this manuscript are mostly well explained, but it is overall rather difficult to follow the results part, as the authors always show the 4 experimental groups together (adolescent vs adult and wt vs GHS-R-/-). They highlight the main effects comparing all the groups, which most of the time is the factor "age". Age is a well-known and thus not surprising negative influencer of adult neurogenesis. Instead of focusing on the main tested factor, namely the difference in diet, the authors show example images of the two age classes

      (adolescent vs adult), which does not underly the major point they are making. Most of the time, they do not provide a post hoc analysis, so it is difficult to judge if the results with a significant main effect would be significant in a direct 1 to 1 comparison of the corresponding groups. The authors point out themselves that previous rodent studies did not use such a 5:2 feeding pattern, so having diet, age and genotype as factors at the same time makes the assessment of the diet effect more difficult.

      The manuscript would improve if the authors restructure their data to compare first the diet groups (adolescent wt AL vs 5:2 and in a separate comparison adult wt AL vs 5:2) and only in a later part of the results check if the Ghrelin receptor plays a role or not in this paradigm.

      We thank the reviewer for these comments. In line with comments from the other reviewers we will re-formulate the presentation of our datasets. We will remove ‘age’ as a key variable as age related changes are to be expected. For the revision, we will separate the adolescent and adult mouse data sets, plotting individual graphs for both. This should provide a clearer focus on 5:2 responses in both assessed genotypes.

      This re-configuration will impact the data being analysed and, therefore, the statistical analysis presented. In our original manuscript post hoc analyses were performed, however, only significant post hoc comparisons were highlighted (e.g figure 5). Non-significant post hoc comparisons have not been presented. In the method section of the revised manuscript, we will clarify that we’ll report post hoc differences when they are observed.

      During our study design, we decided to assess diet and genotype in parallel - as part of the same analysis. This seemed to us to be the most appropriate statistical method, so that we assessed dietary responses in both WT and GHS-R null mice.

      As this 5:2 is a very specific paradigm, it is furthermore difficult to compare these results to other studies and the conclusions are only valid for this specific pattern and timing of the intervention (6 weeks). It remains unclear why the authors have not first tried to establish a study with wildtype mice and a similar duration as in previous studies observing beneficial effects of intermitted fasting on neurogenesis. Like this, it would have been possible to make a statement if the 5:2 per se does not increase neurogenesis or if the 6 weeks exposure were just too short.

      The reviewer raises this relevant point which we considered during the study design period. Given that we had previously reported significant modulation of AHN with a relatively short period of 30% CR (14 days followed by 14 days AL refeeding (Hornsby et al.2016)), we predicted that a 6 week course on the 5:2 paradigm (totalling 12 days of complete food restriction over the 6 week period) would provide a similar dietary challenge. The fact that we did not observe similar changes in AHN with this 5:2 paradigm is notable.

      The graphical representation of the data could also be improved. Below are a few

      examples listed:

      1.) Figure 1 B and C, the same symbol and colours are used for the adolescent and adult animals, which makes the graphs hard to read. One colour and symbol per group throughout the manuscript would be better.

      We thank the reviewer for this comment. We will amend the presentation of the graphs throughout the manuscript to ensure that they are easier to interpret.

      2.) The authors found no differences in the total number of Ki67 positive cells in the DG. However, Ki67 staining does not allow to conclude the type of cell which is proliferating. It would thus strengthen the findings if this analysis was combined with different markers, such as Sox2, GFAP and DCX.

      Double labelling of Ki67 positive cells would allow for further insight into the identity of distinct proliferating cell populations. However, quantifying Ki67 immunopositive cells within the sub-granular zone of the GCL, as a single marker, is commonly used in studies of AHN. Given that studies of intermittent fasting, calorie restriction and treatment with exogenous acyl-ghrelin report no effect on NPC cell division, we decided not to pursue this line of inquiry.

      3.) In Figure 3, the authors say that the diet increases the number of DCX in adolescent and adult mice, which is not clear when looking at the graph in 3B. Are there any significant differences when directly comparing the corresponding groups, for instance the WT AL vs the WT 5:2? It is further not clear how the authors distinguished the different types of DCX morphology-wise. The quantification in C and D would need to be illustrated by example images. Furthermore, the colour-code used in these graphs is not explained and remains unclear

      While the 3 way ANOVA does yield a significant overall effect for diet, we agree that it is indeed difficult to see a difference on the graph, although the mean values of the adolescent 5:2 animals are more prominent than the AL counterparts. Mean +/- SEM will be provided in the supplementary section of the revised manuscript. Furthermore, we will clarify the method used to identify distinct DCX+ morphologies, include representative high-resolution images of each DCX+ cell category, and amend the colour coding to avoid misinterpretation.

      4) In Figure 5, the authors show that the number of new NSCs is significantly increased in the adolescent GHS-R-/- mice, independent of the diet, but this increase does not persist in the adult mice. They conclude that "the removal of GHS-R has a detrimental effect on the regulation of new NSC number..." this claim is not substantiated and needs to be reformulated. As the GHS-R-/- mice have a transcriptional blockage of Ghrs since start of its expression, would such an effect on NSC regulation not result in an overall difference in brain development, as ghrelin is also important during embryonic development?

      This is an interesting point. However, we disagree that the statement "the removal of GHS-R has a detrimental effect on the regulation of new NSC number..." is unsubstantiated, since it does not exclude any developmental deficits in these mice that may account for the differences observed. Nonetheless, we will rephrase the sentence to clarify our intended point and remove any ambiguity.

      5.) In Figure 6, the authors asses spatial memory performance with a single behavioral test, the OIP. As these kind of tests are influenced by the animal's motivation to explore, it's anxiety levels, physical parameters (movement) etc., the interpretation of such a test without any additional measured parameters can be problematic. The authors claim that the loss of GHS-R expression impairs spatial memory performance. As the discrimination ratio was calculated, it is not possible to see if there is an overall difference in exploration time between genotypes. This would be a good additional information to display.

      We thank the reviewer for this insight. We have open-field recordings that will now be analysed to assess general locomotor activity, anxiety and exploration behaviour. These data, alongside exploratory time of the mice during the OIP task will be incorporated into the revised manuscript.

      Besides these points listed above, the methods are presented in such a way that they can be reproduced. The experiments contained 10-15 mice per group, which is a large enough group to perform statistical analyses. As mentioned above, the statistical analysis over all 4 groups with p-values for the main effects should be followed by post hoc multiple comparison tests to allow the direct comparison of the corresponding groups.

      Reviewer #3 (Significance (Required)):

      In the last years, growing evidences suggested that IF might have positive effect on health in general and also for neurogenesis. However, a few recent studies report no effects on neurogenesis, using different IF paradigms. This study adds another proof that not all IF paradigms influence neurogenesis and shows that more work needs to be done to better understand when and how IF can have beneficial effects. This is an important finding for the neurogenesis field, but the results are only valid for this specific paradigm used here, which limits its significance. The reporting of such negative findings is however still important, as it shows that IF is not just a universal way to increase neurogenesis. In the end, such findings might have the potential to bring the field together to come up with a more standardized dietary intervention paradigm, which would be robust enough to give similar results across laboratories and mouse strains, and would allow to test the effect of genetic mutations on dietary influences of neurogenesis.

      We thank the reviewer for their insightful and thorough feedback.

      1. Description of the revisions that have already been incorporated in the transferred manuscript

      Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty.

      The manuscript has not been revised at this stage.

      2. Description of analyses that authors prefer not to carry out

      Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

      • *

      We have included in our replies to the reviewers a description of the amendments that we will make to our manuscript. Two requested revisions stand out as being unnecessary or cannot be provided within the scope of a revision.

      The first was the request to perform the 5:2 study in older mice. This an interesting suggestion, however, the expense and time needed to maintain mice into old age (e.g >18 months) cannot be provided within the scope of our revision. In addition, given that we report no effect of the 5:2 paradigm on AHN in adolescent (7 week old) and adult (7 month old) mice, there is less justification for such a study in older mice.

      The second request, that we disagree with, was to remove data relating to the GHS-R null mice (see reviewer 2, point 2). The role of ghrelin signalling via its receptor, GHS-R, is a central tenet of our hypothesis. Whilst our findings do not support a role for ghrelin signalling in the context of the 5:2 diet studied, we followed the scientific method to empirically test the stated hypothesis. While critiques of experimental design are welcome, the removal of such data may perpetuate publication bias in favour of positive outcomes and is something we wish to avoid.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript by Veen and colleagues assesses two transcription factors, and makes the novel conclusion that they regulate each other in a manner that is required for photoreceptor regeneration in zebrafish. The work is potentially exciting, because similar findings from zebrafish have found traction in translation to mammals, where regeneration of photoreceptors has surprising promise to treat blindness.

      The authors have been ambitious in there approach to the problem by disrupting these genes in the adult retina, which is the appropriate context required to assess photoreceptor regeneration. Because technologies for conditional gene ablation are not very available in zebrafish, these authors use electroporation of morpholinos to accomplish their goals. Where most researchers have abandoned this very challenging approach, it seems these authors have found some success.

      Together the technical feat and intriguing conclusions combine, in my opinion, to make this paper worthy of serious consideration for publication. I would hate to see it not be made available for public consumption. Its' merits are strong, but some shortcomings in communication and interpretation nevertheless should be addressed. I suggest Major and minor concerns below.

      I suspect that doing further experiments would be asking a lot of the authors at this point, but I point out some possible experiments that would improve the manuscript if my suspicion is wrong. Without further experiments, I suggest much of the writing needs to be carefully qualified and less deterministic.

      Major concerns:

      1. The authors need to quantify impacts of MO without MTZ (or with MTZ on wildtype fish without the nfsb Tg). Alternatively, the interpretation needs to be softened considerably. Observations made include increased proliferation and more PR, but these are not clearly connected by the data in a way that allows you to claim "more regeneration". A plausible alternative is that the MO was protective, and the MTZ did not kill as many PR cells when the genes were knocked down. Moreover, Figure 3 shows that only one of two proliferation markers is increased (how to explain?) and only at one timepoint, so this may be a fluke. I suggest softening the conclusions to state that gene knockdown increased proliferation and led to increased PR abundance, thus implying improved regeneration (and provide the alternative interpretation). E.g. the punchy titles of Figures 3, 5, 7 are not supported by the data; neither is text in Discussion bottom of page 15.
      2. Fig 5 quantification of proliferation is needed if the interpretation is about regeneration (see comment 1). Instead, the conclusions could be reworked to match the data.
      3. I'm unclear on why these experiments couldn't have been completed in mutant zebrafish. Are they not viable?
      4. Sequence and chemistry of the MO knockdown reagents must be provided. If they are similar to previously published MO reagents (several for both gene targets have been published) then this might be used to improve confidence of MO efficacy. Were the MOs modified to facilitate electroporation? The gene targets also must be listed with less ambiguity, e.g. when "prox1" is mentioned, do you mean prox1a? Without these details, the experiments fail to provide enough info to permit replication.
      5. A suggestion to improve the text [no need for new experiments]: The Discussion should address assumptions about MO knockdowns in regards to: a) efficacy, and b) specificity. E.g. (a) future experiments might challenge the efficacy by measuring the abundance of genes that are regulated by prox1 and her6. E.g. (b) future experiments should challenge the specificity of the MO reagents by testing to see if the same result is attained with disparate MO oligos, by phenocopy with CRISPR, performing the work in mutants (I assume rescuing the knockdown by replacing the target gene is not feasible by electroporation, but that would be ideal).
      6. The claim that Prox1 is in PR (Figure 4 title) is not convincing. Does the scRNA-Seq confirm this, and why not invoke this data to clarify more concretely? Figure 4A shows a lot of green prox1 signal, but that is very inconsistent with what is shown in Figure 4G, where no prox1 signal is observed in the PR. On page 12, which relates to this Figure, the authors instead say that Prox1 is detected in PR after injury (a big difference compared to title of Fig4!). Fig4I' shows some signal in the area of the PR, but the overlap of the signals is not convincing and it looks to mostly be adjacent to the zpr1 signal; maybe it is Muller glia or some other cone type, or rod cells. If it is Muller glia or rods, then the interpretation needs to be adjusted. Regardless, it is unclear if this is in LWS cones, which is presumably what regenerates after LWS cone ablation(?)
      7. Figures showing prox1 or Hes1 IHC (Fig 2, 4, 6, 7 & Supps) - how many replicates were evaluated (how many individual fish were assessed) to determine that these IHCs are representative.
      8. Some of the data, i.e. some photomicrographs of IHC, are used repeatedly in separate Figures. I cannot find a comment in the manuscript acknowledging this. Panel F is identical in Figures 3 and 5, and panel 7E is identical to Supp panel S5E. My opinion here is mixed: I think re-using these Figures is marginally ok if it is explicitly and repeatedly described (e.g. in Methods, Results, and Fig Legends), but I also think that if the authors have replicated the experiments sufficiently, then they will surely have some other micrographs to use. My opinion is tipped into grumpy and worried about good data integrity, because in both cases the lines that indicate retinal layers are drawn in different places between the replicated panels; that could happen out of sloppy-ness or instead could be a ploy to help hide the Figure recycling. I prefer to assume the authors are of good intent and have made an error (indeed the panels are all meant to represent the same control treatments) but I would not want the manuscript published without explicitly rectifying this issue. Minimally the replicate micrographs should be explicitly acknowledged. My search for other duplicated panels was not exhaustive.

      Minor points:

      • a) Page numbers and line numbers would make it less work to prepare a constructive critique of this paper. Similarly, the Figures need Figure numbers.
      • b) On the histograms, does each dot represent an individual fish? (i.e. an independent biological replicate).
      • c) It would be lovely to learn that left vs. right eyes were used as internal controls in each case, and then the authors could plot the difference between control & treatment within each individual. Perhaps this would allow normalizations or more powerful statistical tests, and then the PCNA data would be more aligned with the conclusions, for example.
      • d) Figure 5: expected to see quantification of PH3 here, akin to Figure 3.
      • e) P. 6 secondary antibodies probably did not come from ZIRC
      • f) More should be done to acknowledge past papers examining Her6, Hes1 and Prox1 in vertebrate retina.
      • g) I do not see how the final section of the manuscript (beginning with "Insulinoma" to the end of the Discussion is relevant to the paper. A very odd ending to this manuscript. Some sentences (especially beginning the section with a topic sentence) would be need to be added if this writing is to remain.
      • h) The final two sentences of the Abstract were interesting - these ideas are unfortunately not Discussed again later in the manuscript.
      • i) What is the source of the transgenic zebrafish line Tg(lws2:nfsb-mCherry) ? Is it maybe from Wang...Yan 2020 PLOS BIOL (PMID: 32168317)? If yes, it would be ideal to provide an allele number. If no, construction of this line should be described.
      • j) Bottom page 4 says "Two transgenic lines used were crossed" but only one line is mentioned.
      • k) Then on page 7, the text says "Zebrafish line Tg(her4.1:dRFP/gfap:GFP/lws2:nfsb-mCherry) for red cone ablation, ..." which muddies the waters even further.
      • l) When the antibody zpr1 is described, it is mentioned as a "zinc finger" (many instances throughout). This is incorrect, and the words "zinc finger" can be removed.
      • m) It would be useful to state in Methods, and at first occurrence in figure legends, that the antibody ZPR1 labels double cones (the red & green cones), and these make up about half of the cone photoreceptor population. (i.e. not all cones are evaluated in this work).
      • n) Figure 2 desperately needs a panel describing methodological timeline, similar to Fig 1D. It is really hard to figure what happened when (e.g. when did ablation occur? When was the MO delivered?). This also should be described more explicitly in the Methods, which seem quite vague on this point: Electroporated fish went straight into MTZ?
      • o) Throughout the authors refer to injury, e.g. hpi = hours post injury. I don't think this represents the methods very well at all, because they have ablated the cells, not injured them. Injured cells don't regenerate (because they are not dead). This miswording contributes to confusion interpreting the Figures, which are not decipherable as stand-alone items.
      • p) There is a really weird yellow dotted line that spans between and ACROSS adjacent panels in Figure 2. It covers the white line separating panels F' & G', and then again in F" and G".
      • q) Fig 2, it is evident that Hes1 protein is not eliminated so you cannot claim it is "not expressed". It is perhaps reduced in abundance, but signal is still obviously present.
      • r) Title of Figure 6 needs to rewritten: LLPS may be occurring, but until you manipulate both LLPS and Prox1 together, you cannot claim that they act through one another.
      • s) Figure 7 title needs to be rewritten: PR are not quantified here.
      • t) Figure 6: I am deeply incredulous that applying any chemical to zebrafish for only two minutes can alter cell differentiation, except perhaps via toxicity. Perhaps examples of similar impacts can be provided from the literature to make it seem more credible that the mechanism here is LLPS in retinal cells.

      The following minor comments are all captured under the notion that the Figure Legends all need to be re-written by a senior colleague. Figures+legends should be interpretable as stand-alone items. All these Figures fail this minimal standard. Below are some issues, but really I'd suggest starting with a blank slate.<br /> - u) Figure 1 must mention Drosophila. So very very confusing to read this believing it is about zebrafish.<br /> - v) Figure 1 what is "deadpan"?<br /> - w) Fig 2 title, how do you know these progenitors are MG-derived?<br /> - x) Fig 2, define abbreviation MG<br /> - y) Fig 2 title, Hes1 is less abundant, but that might be from alternative mechanisms other than "reduced expression" (e.g. altered PTMs, increased clearance, LLPS, etc)<br /> - z) Fig 3 legend is a jumble of oddity. At least three distinct signals are supposedly labelled in pink(?). separately, What about the mCherry - is it also in pink?<br /> - aa) Most Figures: we know they are micrographs, so you don't need to lead the Description saying "micrographs of...". Instead, describe the logic of the experiment and the overall interpretations.<br /> - bb) All Figures: it is really odd to list the data (averages & variances, including implausible significant digits on each) for every treatment - that is what the histograms are meant to convey.<br /> - cc) Figure 4 should send the reader to the Supplemental so they know that the no-primary control experiment is available.<br /> - dd) Fig 6B legend - explain what the chemicals are meant to do (e.g. "block LLPS").<br /> - ee) Fig 6 define "2m" = 2 minutes?<br /> - ff) Several more abbreviations are not defined: hpi, ONL, etc...

      Significance

      The manuscript by Veen and colleagues assesses two transcription factors, and makes the novel conclusion that they regulate each other in a manner that is required for photoreceptor regeneration in zebrafish. The work is potentially exciting, because similar findings from zebrafish have found traction in translation to mammals, where regeneration of photoreceptors has surprising promise to treat blindness.

      Together the technical feat and intriguing conclusions combine, in my opinion, to make this paper worthy of serious consideration for publication. I would hate to see it not be made available for public consumption. Its' merits are strong, but some shortcomings in communication and interpretation nevertheless should be addressed

    1. Might utilizing scientific methods to collect and analyze nutritional information — while guided by social-scientific frameworks and research practices that explain how power and inequity operate in society — result in new insights on the ways in which nutritional disparities exist within communities? What if we then drew on our knowledge of qualitative methodologies, such as interviews and focus groups, to bring in the voices and lived experiences of people working in these fields or encountering these issues?

      I think this was a great example of how the integration of different perspectives may add up to a unique and specific solution to a problem with the help of its respective expertise. I would also want to incorporate my management information systems degree with public health and a humanist course to help people in different parts of the world to create efficient health systems while also working with companies for their goodwill projects and actually implementing them for the good for equitable or even free healthcare.

    1. Since this book is also about ethics, we should mention that the first thing these women were asked to program on the ENIAC was some calculations to help build thermonuclear bombs. How do you think they might have felt about being asked to do this? The building of those bombs involved many scientists and other professionals along the way, several of whom were not on board with the idea of what their calculations were being used for. This has raised questions about moral responsibility: were the women made complicit in whatever moral wrongs may have come about using calculations they performed using the ENIAC?

      This note added to my understanding including the pictures here and the description of the history of computer language as well as the code entered by the women to generate a series of thoughts. It made me aware of the tremendous advances in computer language and technology today.

    1. Perspectivas en el uso de la ciencia

      "As We May Think" es un ensayo escrito por Vannevar Bush en 1945, en el que se presenta su visión de cómo la tecnología de la información podría mejorar la capacidad de los seres humanos para almacenar, acceder y procesar información. Por ejemplo, este autor describe una máquina de almacenamiento y recuperación de información, a la que llama "Memex", cuyo propósito sería que los usuarios pudiesen tener acceso a grandes cantidades de información de manera rápida y fácil. Me parece importante en Bush que destaca la importancia de la colaboración y el intercambio de información entre expertos en diferentes campos y también argumenta que la tecnología de la información tiene el potencial de ayudar a los expertos en distintos campos a compartir sus conocimientos y trabajar juntos para resolver problemas complejos.

    2. Durante años, los inventos han ampliado los poderes físicos de las personas en lugar de los poderes de su mente. Argumenta que están a la mano los instrumentos que, si se desarrollan adecuadamente, darán a la sociedad acceso y dominio sobre el conocimiento heredado de las épocas. La perfección de estos instrumentos pacíficos, sugiere, debería ser el primer objetivo de nuestros científicos.

      Esto es buenísimo para la innovación de nuevos inventos que pueden beneficiar la humanidad por medio de la imaginación del ser humano pero creo se debe ser limitado debido a la gran imaginación que contiene el ser humano pero dicha imaginación se puede crear ideas buenas, malas y desechables.

    1. Egoism

      I found this part controversial and attractive to me. Altruism seems morally correct. However, Egoism is naturally born in human beings so people tend to make decisions that benefit themselves. While they may defect the society. It is morally wrong to affect society while benefiting yourself. I think there's no right or wrong action when comparing Egoism and Altruism. People would have different ambitions. It is morally wise if people tend to pursue more benefits for society. Also, we are not able to judge people who takes own profits as the most important thing.

    1. Abstract

      This work has been published in GigaByte Journal under a CC-BY 4.0 license (https://doi.org/10.46471/gigabyte.79), and has published the reviews under the same license. These are as follows.

      **Reviewer 1. Xuanmin Guang **

      Han et al. had carried out genome assembly of Aplectana chamaeleonis, analysised the genome’s repeat content and annotated the genome. They descripted the geneset’s function and done a PSMC analysis. The genome is a key source for research, but there are so many mistakes in the manuscript, I suggest the author to revies the manuscript carefully and the grama and content should be re-organized. Some suggestions have been listed below:

      1. In the context part, the first two sentence lacks continuity in logic, please change them.
      2. The author didn’t mention which sequence platform they had used in the context, I think this should be added.
      3. The average sequence length in the table is 496kbp, but the author it as 496Mbp , this is a mistake.
      4. In table 1, why there aren’t any gaps in the scaffold genome?
      5. The author said that “This suggests that the significant expansion of repeating elements is an important manifestation of species differences”. Its unreasonable to get this conclusion only based your genome repeat analysis.
      6. In the text they claim that 12887 function gene had annotated, I want to know how much gene they have annotated? Please add this in the manuscript.
      7. Too many decimal places have been used in the Table2.

      Re-review: The author revised the paper as I concerned in the report and the paper could be accepted now.

      Reviewer 2. Jianbin Wang

      In this manuscript, Hou et al. present a genome assembly for Aplectana chamaeleonis, a parasitic nematode that infects amphibians. They report a genome of ~1 Gb, most of which is composed of repetitive elements. This genome draft is significant as it is the first assembled for this or any Cosmocercidae species. It may provide insights into the evolution of the nematodes – if it is thoroughly compared to other nematode genomes. It may also allow for better species identification than previous morphological methods. While the conclusions on genome size and composition described in the paper appear sound, there are many questions that go unanswered. The reasoning behind why this research was undertaken is not clear. What is the ecological or agricultural and economic impact of the species? How would the genome provide a better understanding of this species? More specific information is also needed to better understand the genome. How many chromosomes does this species have? Is there any cytology to help answer this question? Any notion of sex chromosome vs. autosome? This genome is much bigger than most of the assembled parasitic nematodes. The author did not make any efforts to explain what might contribute to this. Could the big size due to contamination in the samples used? Judging from the images, it does not look very convincing to me how clean the sample was for the genomic DNA extraction. Overall, there is a lack of in-depth data analysis and comparison between this genome and many other available nematode genomes. About the overall presentation and organization of the manuscript, the context is often lacking from results. How do these results compare to related species? How does figure 4/the demographic history fit in to this story? A round of general proofreading needs to be done for grammar, punctuation, capitalization, italics, etc – see below for some specific examples. In the Abstract, the repeat content in the Ascaris genome is 72.45%, and the total length is more than 742 Mb. The math does not add up (1.1 Gb x 72.45% = 797 Mb). Or do you mean the Aplectana genome? Should say total length of repeats. Why is this “Ascaris” genome? Ascaris is a parasite that infects pigs and human. Some sentences need addressing/clarification: Page 1. “and their diversity is also very high, many of which are above the national second-level protected animals” – what is the significance of this/how are these ideas related? Page 2. “Through the characteristics of the genome sequence, it shows that the genome is a highly continuous genome” – need to be more specific with metric and data. Page 4. “In addition, the enrichment of A. chamaeleonis genes in all metabolic pathways was found in twelve metabolic pathways.” – not sure what you are trying to say about the all or 12 pathways. Figure 1. - Images need scalebars. In A, what is the mat of material? For A, crop out area around the worm and enlarge the worm image. In B the worm is dark/shows little contrast or detail. In C, label which image is the head and which is the tail (or specify left vs. right in the legend text). The images in B and C look like they were taken using a cell phone pointed at a computer monitor – are there higher quality images? Table 1. – Why is the data in all four columns the exact same? What is the difference between each column? This appear to be a mistake when preparing the table. Very sloppy and unfortunate! Table 2 – Significant figures on the %s?. Is the “other” category needed (same for Fig2C)? Table 3 – Check text spacing (e.g. % in genome). Figure 3 – Recommend to redo the spacing of figures, increase size of text in each part of this figure. Need to refer to parts of figures in the body/text (Fig 3a vs. 3b vs. 3c). Can 3b be sorted from most number of genes to least? Figure 4 is not referenced in the body text. Consider merging Fig 4 with Fig 3. Figure 4 is lacking a description in the legend – what are the grey lines, definition of LGM? The x-axis scale and orientation are unintuitive – is the present on the left and the past on the right? Past should be on the left. Methods Genomic DNA was purification for Long-reads libraries preparation – should say purified What is the meaning of “The generation we used was 0.17” – what generation is this? and “the mutation rate was 9×10-9” needs units. The sentence “we used the pairwise sequentially Markovian coalescent (PSMC) model to estimate the effective population size of A. chamaeleonis within last million years.” should be moved to the section immediately after its current location.

      Re-review: Overall, the writing has been improved in several places and is somewhat clearer than in the previous draft. These changes are mostly related to the minor concerns raised. However, many questions related to the broader impact of this research and how the new genome compares to other nematode species remain unanswered. The following comments were largely ignored. 1. The reasoning behind why this research was undertaken is not clear. 2. What is the ecological or agricultural and economic impact of the species? How would the genome provide a better understanding of this species? 3. More specific information is also needed to better understand the genome. How many chromosomes does this species have? Is there any cytology to help answer this question? Any notion of sex chromosome vs. autosome? 4. This genome is much bigger than most of the assembled parasitic nematodes. The author did not make an effort to explain what might contribute to this. 5. Overall, there is a lack of in-depth data analysis and comparison between this genome and many other available nematode genomes. How do these results compare to related species? 6. About the overall presentation and organization of the manuscript, the context is often lacking from results. Another round of general proofreading needs to be done for grammar, punctuation, capitalization, italics, etc. – see below for additional specific examples. The authors, not the reviewers, need to make a concerted effort to read and proofread their own manuscript.

      In addition to the big picture points raised above, several other issues that were either brought up last time or are new and need to be addressed: 1. Not sure Table 1 is present the right way. The columns and rows should be reversed, I think. If so, there will be only one column - do you still need a table? 2. “Through the characteristics of the genome sequence, it shows that the genome is a highly continuous genome.” Unclear. The authors mentioned that they have fixed this in their response to the reviewers, but no change was seen in the updated manuscript. 3. “The generation we used was 0.17, and the mutation rate was 9×10-9 [8].” These numbers need units after them. Again, this was addressed in the response but not written out or clarified in the revised text. 4. “In addition, the enrichment of A. chamaeleonis genes in all metabolic pathways was found in twelve metabolic pathways.” Not sure what the authors were trying to say about the all or 12 pathways. Still confusing. 5. Photographs of the worms are still lacking scale bars. 6. Make sure that all genus and species names are italicized (in body text and in Fig.3). 7. Make section heading format is consistent (check capitalization). 8. “The results showed that 91 % of the sequences were compared to Arthropoda (1898/2088) and 7 % were compared to Arthropoda (122/2088).” Both of these say Arthropoda - is that a mistake? Also "compared to" is not the correct word, maybe "similar to"? 9. LGM acronym is defined after the second use of "last glacial period", should appear after the first use. Also, LGM stands for last glacial maximum, not period. This should be corrected.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Thank you for the rapid and favorable reviews of our manuscript entitled “Long-Read Genome Assembly and Gene Model Annotations for the Rodent Malaria Parasite Plasmodium yoelii 17XNL.” We particularly appreciated that both reviewers had substantial, detailed expertise with the sequencing and assembly of Plasmodium genomes, and valued their questions and suggestions to ensure high rigor of our work. We have addressed all of the reviewers’ comments in the revised manuscript, and have provided a point-by-point response to each below.

      Response to Reviewers

      Note: Point-by-point responses are provided in italics below each reviewer comment below. Line numbers referenced in our responses refer to their final line position in the Track Changes version of the manuscript.


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript entitled "Long-Read Genome Assembly and Gene Model annotation for the Rodent Malaria Parasite P. yoelii 17XNL" is a well-written manuscript providing updates and important observations about the genome assembly and annotation of this specific non-lethal isolate. The group overall did a great job showing how the application of newer technologies such as long-read DNA and direct RNA sequencing to generate top-quality genomes to be used as a reference for the community. Here are some comments about the work presented:

      Response: Thank you for your positive feedback and suggestions on how to clarify these findings. We have improved the revised manuscript based on your feedback and suggestions below.

      Major comments: - The authors added several result information across the methods section. Making the text repetitive, since the same is also presented in the results section. Please revise the method section to remove results from this section.

      Response: We agree and have streamlined both the Results and Methods sections to remove redundancy in these descriptions.

      • Some methods are also redundant in the Result section. For example, in line 141-142, the group describe which DNA extraction kit they used (again this is correctly mentioned in the methods section).

      Response*: We agree and have removed minutiae such as these from the Results section. These details remain in the Methods section to ensure reproducibility. *

      • Besides important, the group added several information about method comparison between base call accuracy and sequencing methods. I agree that having this information in the supplemental material is great, but I would be careful to not focus too much on those, since most of the observations are already well-known by the community and focus more in the biological relevance of what is being generated with the newly updated genome.

      Response: The advances in base calling algorithms do make substantial improvements to the Nanopore reads. We have only included a short description of this in the main manuscript and feel this is an appropriate amount of context for the typical reader. Those that love these details and want to dig further can find this content in our supplemental information.

      • The group did a great job generating two versions of the genome, and an updated gene annotation set using long-read sequencing. But the major question is, how about alternative splicing? They mention the use of it (line 350) but I don't see any result about how many alternative transcripts were observed, and if they were differentially detected in different life stages of the sets used for the RNA sequencing. This is a very important result to be added since one of the key pieces of information that long-read RNA sequencing brings for Genome annotation.

      Response: We have now expanded this description in the manuscript to note that 866 genes are predicted to have multiple transcript isoforms (Lines 240-241). Moreover, we have now generated a Supplemental Table 4 that lists these isoforms in the revised manuscript. As we have not conducted further validation of this large number of transcript isoforms, we have left the description at this level.

      • Same observation as above for potential long ncRNAs.

      Response: We agree that lncRNAs are a fascinating aspect of the biology of the parasite, but a proper analysis of this class of RNA is far outside of the scope of this current study. Automatic identification approaches with Nanopore data will likely yield high numbers of false positives, which require manual curation for rigorous annotation. We hope others can use these data to accelerate such studies as well.

      • From what I understand the Hifi run was able to generate a gapless genome assembly and the ONT run did not. What was the final coverage for each? From my experience with P. falciparum genomes, ONT even with the rapid kit was able to generate chromosomal level assemblies if the coverage was >100x (but again, this is not a rule). Add those valuable observations about the depth so the reader can check if other variables in the comparison should be made.

      Response: This is a particularly interesting aspect of not only our datasets, but of other Plasmodium genomes as well. This issue occurs at least in part due to the presence of many repeated elements in the subtelomeric regions. It is important to note that these repeated elements do not resolve into a single haplotype in an assembly due to conflicting information, not due to lack of coverage. For instance, regions may differ by only a few nucleotides that each have significant read support. We are particularly interested in a recent preprint that concludes that P. falciparum harbors extrachromosomal plasmids with these var sequences present (doi.org/10.1101/2023.02.02.526885). *If this observation is supported via peer review, this interpretation could also begin to explain our results with P. yoelii 17XNL as well. *

      • Also be sure that the structural comparisons between the genomes are not the ones used after running ragtag.py. If so, there is a high chance of structural bias in the scaffolded contigs.

      Response: We apologize for the confusion. We did not use ragtag for the PacBio assembly, and all structural and variant comparisons were done using the PacBio assembly. However, we did use ragtag for the Nanopore assembly that is included in this study as an additional resource to our community. These data were not used for variant calling though.

      • How Prokka differed from Braker2 for the Mitochondria/API annotation? This needs to be very well described since prokka is made for prokaryotic organisms and not for eukaryotic ones. And Braker2 uses a custom build dataset for training, which I believe contains known information about MIT/API for Plasmodium species.

      Response: We first applied Braker2 to the organellar genomes and identified only 6 genes in the apicoplast genome and only 2 genes in the mitochondrial genome. Due to their prokaryotic origin, we then tested if Prokka could alleviate this issue. To do so, we applied Prokka to the 17X reference genome and found that it detected all of its annotated organellar genes. Therefore, we also applied Prokka to our Py17XNL genome to annotate the genes found on the apicoplast and mitochondrial genomes. As a final validation check, the gene annotations on these two organellar genomes are effectively identical between 17X and 17XNL. This is consistent with the sequencing results and assemblies that show that the apicoplast genome is identical and the mitochondrial genome differs in a single, notable deletion in 17XNL.

      • Figure 5B, what is the peak observed in the mitochondria? What genes? Repeats?

      Response: What appears to be an inward pointed trough actually reflects the deletion of bases in 17XNL compared to the 17X assembly. We have clarified this in the manuscript on Lines 296-297 and in the legend of Figure 5.

      Minor comments: - For Oxford nanopore sequencing using the ligation kit, did the group check for potential chimeric reads generated by the protocol?

      Response*: We did. We used the adapter trimming software, Porechop, to identify and bin chimeric reads that were eliminated from the dataset. This method is described in the Makefile associated with the manuscript. *

      • Check if all species are italicized (for example, line 187 P. yoelii is not)

      Response: We have italicized this instance of P. yoelii and have reviewed the document to search for any other words that should be italicized.

      • In methods add the parameters for minimap2 for the direct RNA alignment

      Response*: We would encourage readers to view our MakeFile that has all of the commands and parameters used for the bioinformatic work reported here. *

      • For variant calling, I would use a minimum of 10x coverage to make a variant call instead of 5x. Besides looking well reproducible between all checks, I would be careful mainly with the single bp deletions with a such low threshold.

      Response: Read counts for the called variants were generally greater than 20. Moreover, we took these validations a step further and manually curated these variants using the data from multiple sequencing platforms used in this study to ensure high rigor in making these variant calls. We have further clarified this in the revised manuscript.

      • In some parts of the methods, the authors mentioned slight modifications in some protocols (for example, lines 443 and 454), besides well described in the text, could you highlight what were the modifications in the text? This will facilitate many other researchers to understand why those modifications were needed.

      Response: We have clarified these modifications in the revised manuscript. In short, these modifications consisted of: 1) For the HMW gDNA prep kit, an agitation speed of 1500 rpm was used as opposed to the recommended 2000 rpm due to limitations of our instruments. 2) A slow end over end mixing by hand was preferred over using a vertical rotating mixer as yield was consistently greater with this change. 3) For the RNeasy kit, the lysate was passed through a 20-gauge needle for homogenization of the sample. Instead of an on-column DNaseI treatment, the RNA was treated with DNaseI off of the column to promote complete DNA digestion. 4) A second elution from the RNeasy column was performed in order to improve yield.

      • As mentioned in the major, the data analysis method section needs rework to remove results from the text.

      Response: We have revised the manuscript accordingly.

      • The group mentioned that small contigs not mapping to Py17X were discarded. What are those? Repeats? Contamination?

      Response: These contigs were of mouse origin, as P. yoelii was grown in Swiss webster mice in this work. We have clarified this in the revised manuscript on Lines 183-184.

      Reviewer #1 (Significance (Required)):

      This work generated a strong method and resource for a better genome assessment of P. yoelii for the community. As I mentioned in my comments, some more details about the findings such as alternative splicing and lncRNAs may strengthen them even more the publication. I know that comparative analysis between Py17X and XNL is not in the scope here, but more information about it, such as a synteny plot would be great for the community to understand that they can rely on this new reference genome. I've been working with eukaryotic and prokaryotic genomes for more than a decade and I have a lot of experience with all the methods presented. I believe that potentially the depth generated for the ONT data may be one of the factors for not reaching the chromosomal level of this isolate, since HiFI was. The group did a great job on the method description, and I believe that the community will be very happy to incorporate this genome as one of the references for this organism.

      Response: We are thrilled that you value the data and the rigor of our approaches. We also believed that a direct comparison between 17X and 17XNL strains is critical. Because of this, we provided details of this comparison in Figures 5 and 6, as well as in supplemental files. Because our colleagues often use these strains interchangeably, it is important for our community to know what differences are present between the parental 17X and the cloned 17XNL line. While substantial identity exists between the 17X and 17XNL strains, there are many variants between them, including many that affect genes that are known to have essential functions for the parasite. For this reason and more, we believe the true 17XNL genome assembly will be a preferred reference once it is fully integrated into PlasmoDB.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The paper has three distinct parts, 1. Assembly of the P. yoelii yoelii 17XNL 2 Annotation of the genome and adding UTR regions 3. Comparing the sequence of 17XNL with 17X .

      Assembly: The authors present a novel assembly for the P. yoelii yoelii 17XNL genome. They used two different approaches, comparing Oxford Nanopore (ONT) long reads + Illumina DNA with PacBio Hifi. None of the approaches generated a telomer to telomer assembly so sequences from the 17X reference was used to fill in the mssing sequence.

      Response: Please also see the comment from Reviewer 1 and our response. The presence of many repeated elements in the subtelomeric regions leads to the challenges noted here about a telomere-to-telomere assembly, as well. The presence of these elements means that the sequences do not resolve into a single haplotype in an assembly due to conflicting information, not due to lack of coverage. Because of this, we have chosen to harmonize the selected haplotype at these subtelomeric regions with that of 17X, while still acknowledging and providing the complex data associated with the subtelomeric regions.

      Annotation Next, they generated long reads (ONT)and Illumina RNA-Seq to improve the annotation. Although, their annotation is not better than the current P. yoelii 17X reference genome in PlasmoDB, they could predict the UTR regions and alternative splice sites due to the 3' capturing approach and long reads. Having the UTR annotated and potentially having alternative splice sides is useful for the field.

      Response: We agree that the additional gene model annotations for both UTRs and alternative transcript isoforms is a valuable resource to our community. We are working with PlasmoDB currently to make these data readily accessible.

      17XNL - 17X comparison The author compared the 17XNL with the 17X reference. Both genomes were done with Pacbio, and it should be noted that P. yoelii has a GC content of probably ~23% with several homopolymer tracks. Further, the 17XNL genotype was obtained from a 17X culture, so the genomes are expected to be very similar as the author noted in the introduction. The authors found ~2000 differences; some are in genes, but many are indels, which very well could be sequencing errors. Finally, the authors claim that this genome could become relevant for the community as new reference to perform analysis. As their genome is so similar to 17X and they have to show that their annotation is at least as good as the current 17X reference genome (manual curated) and the difference are not due to sequence error in 17X or 17XNL.

      Response: As we describe below, we have taken multiple steps to inspect the quality of the 17X genome assembly (it is very robust), to call variants between strains, and to validate them using our data across multiple sequencing platforms and via manual curation. Because of this, we view these as true variants between the 17X and 17XNL genomes

      Major comments Overall I struggle to see the need for a "NEW" P. yoelii reference. It would be good to state how similar these genomes are - they are basically identical. As the 17XNL is curated manually, it would have made more sense to me to start from that one and then generate the UTR annotation and include splice sides. This could be easily loaded into an alternative Web-apollo track and then merged to the current annotation to be useful to the community.

      Response*: We chose to generate a new reference assembly for 17XNL because the current one is from 2002, remains in >5000 contigs, has gene identifiers that do not align with other current Plasmodium gene models (e.g., PY00204 vs. PY17X_0502200), and historically has had problematic gene models attributed to individual genes. This clean start ensures that users can know the provenance of the underlying data that created the genome assembly and gene models. *

      I wonder if many of the differences the authors found between 17X and the 17XNL reference are true. The authors are correct that some differences between 17X and 17XNL are true. I could not find any evidence of genome polishing with tools like Pilon or ICORN to correct sequencing errors, I wonder if these differences are sequencing errors.

      Response: The PacBio-based assembly received no error correction or polishing. It should be noted that all variants that were called automatically were also manually verified using data from multiple sequencing platforms generated in this study. Moreover, for coding sequences, we imposed a threshold that 80% of all reads at the variant’s location needed to support the variant in order to be considered true. Through these strict thresholds, we eliminated many potential variants that only had support from one sequencing platform. We highlight several variants that were confirmed through multiple datasets in Table 2.

      Did the authors look into the reads of the NCBI - GCA_900002385.2 - assembly? Maybe they could use the underlying Illumina reads if theirs don't have enough coverage. Also, the differences between 17X and 17XNL could be that the reference is wrong. How many pseudo genes did they obtain? Are there more or less than in the current reference?

      To confirm the calls, could you also map the 17XNL reads against the 17X reference and see if they are still true. As the same time, map the 17X illumina reads to see if the reference is correct at this state. When looking at the alignments, it can be seen that many different are in low complexity/repetitive regions.

      Response: We analyzed both their raw and assembled data to compare them with our results, and we determined that the 17X data and assembly were robust and that these difference likely reflect true variance between the strains. The 17X reference has 57 pseudogenes that are annotated as pir, fam-a/c, or others. Overall, there were 1057 pir genes annotated in the 17X genome, whereas we annotated 1048 for our Py17XNL genome. There were 302 fam-a/b genes annotated in the 17X genome, whereas we annotated 301 for our Py17XNL genome. As noted above, we confirmed variant calls using data from multiple sequencing platforms in this study as well as through manual curation.

      The authors sequence their genome with a HiFi Pacbio run and also ONG + DNASeq... but why did they not get 16 chomromes out? For example the current P. yoelii reference was assembled directly into far less pieces than theirs [P. chabaudi assembles into 16 pieces]. Could it be a different read depth or is it the fragment length? Could the authors please comment on that. Also, if there were contigs, why did they fill the sequence with 17X sequence, rather than keeping gaps? So in the end, their sequence is a hybrid, of 17X and 17XNL, right?

      Response: Please see our responses above to both Reviewer 1 and 2 regarding the heterogeneity of the subtelomeric regions that indicate that a single haplotype is not readily called. This is not due to insufficient read depth, but rather we believe it reflects something fascinating about Plasmodium genomes in these regions. A recent preprint (doi.org/10.1101/2023.02.02.526885) provides one possible interpretation for this observation.

      Why do you think you had less coverage of CCS read around the telomer ends? Do you think it is a systematic issue of the PacBio Hifi? Did you see any evidence of Illumina or ONT reads - or could it be that while culturing the telomer ends dropped off?

      Response: See our response above about the challenging nature of the subtelomeric regions of Plasmodium genomes. As above, this is not an issue of coverage per se, but rather of heterogeneous related sequences that are not readily resolved into a single haplotype. In order to minimize the risk of sequencing a genome of a mixture of heterogeneous parasites, we sequenced “Pass 0” parasites received directly from BEI Resources to ensure this genome reflects the established P. yoelii 17XNL clone.

      I realised that the authors used a lot of primary tools. I wonder why they chose that path, as there are several tools to do automatic finishing for long read assemblies: Assemblosis, ARAMIS, MpGAP or ILRA. Especially the last one focuses on Plasmodium genomes. Please comment.

      Response: We initially started our bioinformatic analyses using established tools such as these. Specifically, we first tried Companion and ILRA, but the results were not superior to those we achieved with the workflow we describe in this manuscript, which also provided greater parameter control.

      Also, for the annotation, could it not be better to transfer the manually curated genome annotation with LIFT off or RATT? All these tools are widely used in the generation of reference genomes in the parasitology field. I annotated their sequence with Companion, and although their gene models are good and some of the Companion calls might need improvement, overall, the Companion results look more exact to me.

      Response: Companion was the original tool we used for the generation of gene models. While we found that for a pre-package software platform it performed excellently, we found it to be insufficiently customizable and the results were not sufficiently accurate from our assessment. Additionally, lifting over information always raises the risk of imposing a different perspective on what is truly present. We believe that a high quality, de novo assembly is always preferable, and therefore chose this workflow.

      The code is very well organised, and it was easy to follow. Are you planning to put it on a GitHub repository?

      Response: We appreciate this recognition. We believe clear reporting of the bioinformatics work is critical for rigor and reproducibility. Yes, all of this will also be provided in GitHub to benefit the wider community.

      For the annotation in the attachment, there were two files. I had a look at them and they were quite different. As 17X and this genome are basically identical (Response: The two gff files represent either a Nanopore only or hybrid Nanopore+Illumina-based model. The latter produced a more comprehensive annotation of gene models, which is what we have proceeded with. However, we provided both in case end users find value in the Nanopore only annotation which has a 3’ bias due to the mechanism of how sequencing occurs via this approach.

      We have found meaningful variations in genome sequence that potentially impact biological function (see Discussion). Therefore, we maintain that these genomes are not basically identical and are useful to the malaria research community for these reasons and more.

      It is excellent that the genome is submitted to NCBI. Why are there 18k proteins? Are these the alternative spliced forms?

      Response*: We are not certain how this interpretation might have arisen, as we only have reported 7047 potential transcript isoforms to NCBI based upon our data. *

      Minor The current Py 17X genome in PlasmoDB is a Pacbio assembly (https://plasmodb.org/plasmo/app/record/dataset/TMPTX_pyoeyoelii17X), but not part of the 2014 paper. It was submitted later to NCBI than the paper the authors cite. Also, the current P. berghei Pacbio genome is from Fougère et al. PLoS Pathog 2016;12(11):e1005917.

      Response: We have now made a detailed note about the Py17X PacBio dataset in our revised manuscript on Lines 186-187. Mentions of the current P. berghei genome assembly had already cited the Foug’ere et al. publication.

      I tried to open the supplemental tables, but they were all in pdf rather than excel and split over several pages. Two had missing information, e.g. UTR per gene. From the name of the tables, I had an idea of what they should contain, but for a re-submission, it would be good to have them in the correct format.

      Response: We agree that provision of the PDFs of the supplemental files is not the ideal way to review these analyses. The complete data was also provided in the Excel files provided to Review Commons. We will ensure that the affiliate journal receives the Excel files for completion’s sake.

      To me, the beginning of the results reads a bit like an introduction (the part which sequencing technology to use)

      Response: We agree, and as noted to Reviewer 1 above, we have streamlined this section of the revised manuscript.

      Could you add to the tables: Sequence Coverage of the three technology, how many contigs you had before ordering the contigs and the number of pseudogenes in the annotation?

      Response: This information is now provided in Supplemental Table 3 in the revised manuscript.

      I struggle with the section header line 229-230 that the new sequence is more complete as it is a hybrid assembly with 17X. Alternatively, please explain how the consensus was built.

      Response: We agree and have revised this section header for accuracy.

      The authors correctly state that ONG is great, lines 333ff, but why does it not generate telomer-to-telomer chromosomes in this case? Please discusss.

      Response: Please see our response to this above for remarks made by both Reviewer 1 and 2. We have also added clarifying text in our revised manuscript discussing why this may have occurred.

      Reviewer #2 (Significance (Required)):

      General assessment As mentioned above, I struggle to see this as a strong leap for the malaria community to use this genome, as it is so similar to the current 17X genome, which is manually curated in plasmodb. Response: We agree that it is important to know how similar the genomes of 17X and the cloned 17XNL strain are. It is perhaps even more important to know what the key differences are as well. In this study, we have asked and answered these questions, and identified 2000+ variants between the strains. We have manually curated several of the variants that impact the expression of essential/important genes, and found that biologically meaningful differences exist (see Discussion). Finally, we have also provided additional information on the gene models of 17XNL, including an experimental definition of UTRs and transcript isoforms. Together, we hold that these data will not only match those currently available for 17X, but will exceed them. We are currently working with PlasmoDB to make these data readily accessible to our community.

      Advance The authors should make the comparison of ONT and PacBio HiFi clearer and discuss why the technologies still don't generate telomer-to-telomer sequences. From the biological side, none of the found differences were related to the different phenotype between 17X and 17XNL.

      Response: We have provided these comparisons and all related data to the reader in this manuscript, as well as through public depositories. Please see above for our responses as to why a true telomere-to-telomere assembly is challenging with Plasmodium parasites, and for a recent preprint that might provide an explanation for this. Finally, the phenotypic differences between 17X and 17XNL are variable, which might reflect differences in individual parasite stocks as has been historically seen in the spontaneous development of lethality in multiple laboratories. While we do not find any particular genetic difference correlates with a specific phenotype, these data using the cloned 17XNL parasite available from BEI provides a robust reference with a defined parasite stock.

      Audience: I do agree that adding the UTR sequence will be useful for those working with P. yoelii as a model, or who want to do comparative UTR analysis across species.

      Response: We agree that this additional gene model information will be valuable. We are working with PlasmoDB to make this information readily available and are already integrating it into our ongoing studies.

    1. Author Response

      Reviewer 1 (Public Review):

      Fox, Birman, and Gardner use a previously proposed convolutional neural network of the ventral visual pathway to test the behavioral and physiological impact of an attentional gain spotlight operating on the inputs to the network. They show that a gain modulation that matches the behavioral benefit of attentional cueing in a matching behavioral task, induces changes in the receptive fields (RFs) of the model units, which are consistent with previous neurophysiological reports: RF scaling, RF shift towards the attentional focus, and RF shrinkage around the focus of attention. Ingenious simulations then allow them to isolate the specific impact of these RF modulations in achieving performance improvements. The simulations show that RF scaling is primarily responsible for the improvement in performance in this computational model, whereas RF shift does not induce any significant change in decoding performance. This is significant because many previous studies have hypothesized a leading role of RF shifts in attentional selection. With their elegant approach, the authors show in this manuscript that this is questionable and argue that changes in the shape of RFs are epiphenomena of the truly relevant modulation, which is the multiplicative scaling of neural responses.

      Strengths:

      The use of a multi-layer network that accomplishes visual processing, with an approximate correspondence with the visual system, is a strength of this manuscript that allows it to address in a principled way the behavioral advantage contributed by various attentional neural modulations.

      The simulations designed to isolate the contributions of the various RF modulations are very ingenious and convincingly demonstrate a superior role of gain modulation over RF shifts in improving detection performance in the model.

      We thank the reviewer for these supportive comments.

      Weaknesses:

      There is no mention of a possible specificity of the manuscript conclusions in relation to the type of task to be performed. It is conceivable that mechanisms that are not important for detection tasks are instead crucial for a reproduction task, as in Vo et al. (2017).

      We agree that other behavioral tasks may rely on different attentional mechanisms then the ones we have studied here for detection and discrimination and now specifically point this out in the discussion [379-395].

      The manuscript puts emphasis on the biological plausibility of the model, and some quantitative agreements. But at some important points these comparisons do not appear very consistent:

      1) It is unclear what output of the model at each cortical area is to be compared with neurophysiological data. On the one hand, the manuscript argues that a 1.25 attentional factor is consistent with single-neuron results, but here this factor is applied to the inputs into V1 units. When this modulation goes through normalization in area V1, the output of V1 has a 2x gain. Intuitively, one would think that recordings in V1 neurons would correspond to layer V1 outputs in the model, but this is not the approach taken in the manuscript. This needs clarification. Also, note that the 20-40% gain reported in line 287 corresponds to high-order visual areas (V4 or MT), but not to V1, in the cited references. The quantitative correspondence between gain factors at various processing steps in the model and in the data is confusing and should be clearer.

      We agree that making a one-to-one mapping of gain effects measured in neurophysiology and different layers of the CNN is problematic. We therefore have clarified that the introduction of gain at the earliest stages of processing is meant to study how gain propagates through a complex CNN and has downstream effects [49-52 and 410-447] and we have also also clarified the various uncertainties in making one-to-one mapping from the CNN to neurophysiological measurements of gain [410-447].

      2) The model assumes a gain modulation in the inputs to V1. This would correspond to an attentional gain modulation in LGN unit outputs. There is little evidence of such strong modulation of LGN activity by attention. Also in V1 attentional modulation is small. As stated in Discussion (line 295), there is no reason to favor the current model as opposed to a model where the attentional gain is imposed later on in the visual hierarchy (for example V4). If anything, neurophysiology would be more consistent with this last scenario, given the evidence for direct V4 gain control from frontal eye fields (Moore and Armstrong, Nature 2003). The rationale for focusing on a model that incorporates the attentional spotlight on the inputs to V1 should be disclosed.

      We agree that measurements of gain changes with attention appear larger in later stages of visual processing and do not wish to explicitly link the gain changes imposed at the earliest stages of processing in our CNN observer model with changes in input from LGN as we agree this would be unrealistic. Instead, our goal was to examine how gain changes can propagate through complex neural networks and cause downstream effects on spatial tuning properties and the efficacy of readout. We have substantially re-written the manuscript, in particular the introduction [24-38, 49-52] and discussion [441-447] to better describe this rationale. We also now explicitly discuss how our propagated gain test demonstrates exactly the reviewer’s point - that gain can be injected late in the system, rather than at the earliest stages [274-276, 441-447].

      3) The model chosen is the CORnet-z model, but this model does not include recurrent dynamics within each layer. Recurrent dynamics is a prominent feature in the cortex, and there is evidence indicating that attentional modulations operate differently in feedforward and in recurrent architectures (Compte and Wang, Cerebral Cortex 2006). A specific feature of recurrent models is that the attentional spotlight need not be a multiplicative factor (which is biologically complicated) but an additive term before the ReLU non-linearity, which achieves the expected RF modulations (Compte and Wang, 2006). A model with recurrence thus represents another architecture that links gain and shift in a way that has not been explored in this manuscript, and this may limit the generalization of the conclusions (line 205).

      We appreciate the reviewer pointing us toward the Compte paper and we’ve added a discussion of recurrence as an alternate model [410-423].

      Reviewer 2 (Public Review):

      This manuscript by Fox, Birman, and Gardner combines human behavioral experiments with spatial attention manipulation and computational modeling (image-computable convolutional neural network models) to investigate the computational mechanisms that may underlie improvements in behavioral performance when deploying spatial attention.

      Strengths:

      • The manuscript is clear and the analyses, modeling, and exposition are executed well.

      • The behavioral experiments are carefully conducted and of high quality.

      • The manuscript takes a creative approach to constructing a ”neural network observer model”, that is, coupling an image-computable model to a potential readout mechanism that specifies how the representations might be used for the purposes of behavior. The focused analyses of the model innards (architecture, parameters) provide insight into how different model components lead to the final behavior of the model.

      Thank you for these supportive comments.

      Weaknesses:

      • The overall conclusions and insights gained seem heavily dependent on particular choices and design decisions made in this specific model. In particular, the readout mechanism lacks some critical descriptive details, and it is not clear whether the readout mechanism (512-dimensional representation that reflects summing over visual space) is a reasonable choice. As such, while the computational analyses and results may be correct for this model, it is not clear whether the strong general conclusions are justified. Thus, the results in their current form feel more like exploratory work showing proof of concept of how the issue of attention and underlying computational mechanisms can be studied in a rigorous and concrete computational modeling context, rather than definitive results concerning how attention operates in the visual system.

      Please see below for our response to the issue with readout and conclusions.

      Overall, the work is solidly constructed, but the overall generality and strength of the conclusions require substantial dampening.

    1. These perceptions of too much newsroom attention going towards topics like politics and Coronavirus also reflect younger audiences’ broader desire for diverse news agendas, voices, and perspectives. As we discuss throughout this report, young people – particularly 18–24s – have different attitudes toward how the news is practised: they are more likely than older groups to believe media organisations should take a stand on issues like climate change and to think journalists should be free to express their personal views on social media.

      I think the difference between YOUNG and PAST generations of consumers is that young people grew up on platforms to where you can reach millions of views, and millions of people can consume your beliefs and reporting. I think why past generations may think news is super conservative, is because they have simply grown up being told what to believe, or being relied on newspapers and programs on the television, instead of so many different perspectives we see today. I think 18-24 is the most progressive timeline of adults that this modern world has ever seen, with great perspectives, great focus on real issues, and that is simply because all of them put the effort to organize and share ideas on platforms like twitter, facebook, youtube, reddit, you name it. You see now more than ever for younger groups of activists that they are not just trying to be heard but they are trying to TEACH! And I believe that is the gap between now and then, trying to show people political beliefs and reports, and now it is to teach and help form people on their own opinion or teach people how to analyze certain struggles we face today.

    2. Three years later, we now turn our attention to how young people’s news habits and attitudes have changed amid rising concerns about news distrust and avoidance, increasing public attention to social issues such as climate change and social justice, and the growth of newer platforms such as TikTok and Telegram.

      Honestly, when looking at a perspective like this, even though it is very early in the article, I take a step back and try to remind myself what an app like TikTok is really about. This app is not about "climate change, social justice", though it may be full of videos with tons of views and videos on such topics, the app is simply made for short attention spans, made for constant clicks, its based solely around an ALGORITHM, not serious social change, even though it is a personal anecdote, I do not believe TikTok inspires real change, or real news consumption, I think the algorithm promotes topics that keep you engaged for a solid 10-90 seconds, and you move on to the next one. I am an avid social media user, since the age 12, so from a decade of experience, the wave of attention span is completely forgot about and the idea that real news is consumed solely on incoming apps like tiktok feels very lazy. Just my thoughts, but my personal belief are real political commentators on YouTube, shows on streaming platforms, or daily news cycles on television, is what real habits that are truly being formed deeply in the conscience of this young generation.

    1. Companies use the data they collect in a variety of ways, including tailoring advertisements (ads) to you, marketing, developing or improving services offered within the app, and sharing or selling the data to third-party companies.

      This has happened to me many times. At first, it was a little bit spooky and I would hear people making jokes about how their phone is "listening". But that actually was the reality. I don't even need to vocalize something that I am thinking about for all of the apps I am using to know the content I want to see, when, where, and how. I often think about this in terms of tik tok's famous "for you" page, which is a constant stream of video content specifically tailored "for you" to watch. It truly is scary, and also makes me think about how much bias and closed-mindedness that can create. Though some may think that it opens up the mind to new forms of media that they may not "otherwise watch" because they are being fed the media without having to choose what they're watching, it can actually create more bias. For example, if we think about political opinions, you are likely only being fed information in favor of your party and against your opposing party. But we are never being shown the media in support of the opposing party, or their versions of negativity about the party we associate with. It further deepens the support for our own party and the lack thereof of the opposing party. This argument can go for anything, as we fall deeper into the rabbit holes of our interests.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      The study "Mesenchymal stem cell models reveal critical role of Myc as early molecular event in osteosarcomagenesis" by Akkawi et al shows that BM-MSCs are a foundational tool for study of osteosarcoma. And authors identified Myc and its targets as early molecular events in osteosarcoma formation, using BM-MSCs knocked out of both p53 and Wwox. Although the quality and validity of the data presented in this manuscript are not in question, this reviewer has doubts about the novelty of the findings in this current version of manuscript. It does not clearly indicate what is different from previous findings.

      Response: We thank the reviewer for the kind words regarding our manuscript. Our study revealed that combined deletion of WWOX and p53, two known tumor suppressors in OS, results in early transformation of BM-MSCs that mediates upregulation of Myc. We believe that this notion is original and has not been described before. In addition, we provide evidence that this genetic manipulation has an advantage over p53 deletion alone. More details are provided below.

      1. It has already been reported that p53-/- BM-MSCs are the cells of origin in osteosarcoma development (ref. 21).

      Response: We agree with the reviewer about this notion/fact and we indeed cited this reference. It should be noted that in this study the cell of origin of osteosarcoma was proposed to be BM-MSCs-derived osteogenic progenitors when p53 and Rb are co-deleted. Interestingly, this study did not show whether p53-/- BM-MSCs-derived osteogenic progenitors are able to form osteosarcoma or any other type of sarcomas. In our model BM-MSCs p53-KO alone were not able to develop osteosarcoma tumors as well, even when injected intratibially. Moreover, we showed that co-deletion of Wwox and p53 in BM-MSCs-derived osteogenic progenitors have the ability to form OS at earlier stages at the time point when deletion of p53 alone is still not enough to induce osteosarcomagenesis. Based on our findings and those in the literature, we believe that this combined genetic perturbation is critical for initiation of OS.

      1. Indispensability of Myc in osteosarcomagenesis was revealed (PMID: 12098700).

      Response: It is well known that Myc is a potent oncogene and overexpressed in osteosarcoma (ref. 12, 13, 28 and ref. 29) as well as in many other human malignancies. The referred study (PMID: 12098700; ref 32) which expressed Myc in lymphocytes based their analysis of Myc contribution at the tumor stage (at the time of tumor detection). Our study, on the contrary, shows that upregulation of Myc is an early event in osteogenic-committed BM-MSCs deficient for Wwox and p53 in tumor-free mice. This notion indicates that combined deletion of two important tumor suppressors, WWOX and p53, promotes osteosarcoma through early changes in Myc levels. Our data further show that p53 deletion alone is dispensable for this change to happen at early stages further highlighting the significance of our findings. The referred paper was cited and discussed in the discussion section (ref 32).

      Our data further shed light on WWOX as a key determinant of Myc function placing it as an upstream regulator. To further proof this, we propose in our revision plan to knockout WWOX in yBM SKO cells and/or restore WWOX in yBM DKO cells and determine consequences on Myc levels and activity as well as on tumorigenesis. In addition, we shall perform a ChIP-seq assay for Myc in yBM DKO and compare it to yBM SKO cells to show whether WWOX deletion is indeed results in enhanced chromatin accessibility of Myc.

      1. Osteosarcoma formation of p53flox-OsxCre mice was rescued by Myc-depletion and BM-MSCs from p53flox-OsxCre was shown to upregulate Myc (PMID: 34803166).

      Response: This is a very relevant new study that we regrettably missed to cite so we are grateful for the reviewer to bring this up. In our revised version, we will be citing this important reference and discussing it (ref 33). Although this article shows that p53 deficiency promotes osteosarcomagenesis mediated through oncogenic Myc, which we also showed in our study (figure 6C, D), but the authors did not validate the tumorigenic ability of these cells at this earlier stage of osteosarcomagenesis. The later was showed in our study by injecting yBM-MSCs deficient for p53 intratibially into immunocompromised mice and showed that they lack tumorigenic ability although they display a mild upregulation of Myc (figure S5). On the other hand, co-deletion of Wwox and p53 at this earlier stage resulted in even higher levels of Myc and inducing osteosarcomagenesis at this earlier stage. Therefore, our study provides unforeseen effect of genetic perturbation that promotes OS initiation.

      First of all, the authors need to cite the above papers and compare their findings with them to better clarify the value of their findings.

      Response: Thank you for this comment, we will cite and discuss these valuable studies in our revised version.

      The only one clear finding would be that Wwox functions as a tumor suppressor by repressing Myc function in the absence of p53, but unfortunately, no data have been presented on the mechanism Some additional analysis would be needed to mention it

      Response: As addressed above, our revision plan will include:

      1. Depleting WWOX in yBM SKO and/or restoring WWOX in yBM DKO cells to further prove the tumor suppressor function of WWOX.
      2. Performing ChIP-seq assay for Myc in yBM DKO and compare it to yBM SKO cells to show whether WWOX deletion is indeed results in enhanced chromatin accessibility of Myc.

      In the first place, Myc is upregulated in the absence of both p53 and Wwox, compared in only p53-null situation? Western blotting would be better to show it

      Response: In our revised version we will add a western blotting showing the upregulation of Myc in DKO (WWOX, p53) compared to SKO (p53) yBM cells; This is already added in new Figure 5S C.

      1. In Figure 1D, should separate each panel so that it is clearly visible. What is the blue-colored fluorescence, DAPI? If so, why don't tdTomato positive cells overlap with blue (Figs, 1D, 2C, 4C, 4E?

      Response: In our revised version we added more precise images in all Figures indicating the overlap between DAPI (blue) and tdTomato (Red).

      1. Why was MCM7 chosen among the Myc targets (S Figure 3)? What is the rationale for this?

      Response: Thank you for this important notion. MCM7 is part of the MCMs protein family, that plays and essential role as a helicase and organizing center in DNA replication initiation. Moreover, several studies show the upregulation of MCM7 in several types of cancers among which is osteosarcoma, as cited in our manuscript (Ref. 14, 15, 16). MCM7 is also a direct target of Myc and has been shown to be a druggable target, by SVA as has been presented in Fig 7. Altogether, these facts and observations made us exploring its significance in our mode. In our revision plan, we will also explore other Myc targets through performing ChIP-seq on DKO cells.

      In Figure 5 legend, what does "yBM cells (1.5, 4-months) (n=6)" mean? yBM cells (1.5-months) (n=3) and yBM cells (4-months) (n=3)?

      Response: Thank you for this notion. yBM, at age of 1.5 months or 4-months were collected from tumor free mice and analyzed. In our revised paper, we updated and clarified this in the Figure 5 legend.

      1. In Figure 7B, is there a correlation between MCM7 and Myc protein expression levels?

      Response: Thank you for this comment. In our revised version we added a western blot analysis showing the upregulation of both Myc and MCM7 in yBM-DKO compared to SKO cells; new Figure S5 C. (did the mean 7B upper panne, if so, we have to add this in the updated version).

      Also, do MCM7 and Myc immunopositivites overlap in Figure 7G?

      Response: In our revised version we will perform Myc IHC on same tumor sections. In the meanwhile, we added a western blot analysis showing the inhibitory effect of Simvastatin on both MCM7 and Myc in vitro (new Fig 7B, lower panel, re-blotted for Myc).

      In S Figure 4C, what is 'PC'? What sample was loaded?

      Response: PC refers to positive control for p53 that was used which was in this case HEPG2 cells treated with Nutlin to stabilize p53. p53 antibody used in this plot (IC12-Cell signaling) detects both human and mouse p53. A note was added to Figure legend.

      1. In S Figure 2A, what does 'US' (BM-US) mean? In S Figure 4F, what does 'US' and 'S' (Direct US and Direct S) mean?

      Response: Thank you for this notion. We apologize for not clearly defined these symbols. In our revised version we added clarifications in the legends of these figures. The symbols are as follow: US: unstained BM-control, S: stained BM, Direct: directly collected BM and checked with FACS before culturing.

      1. Overall, this manuscript, there are too many symbols and it is cumbersome. Ex, in S Figure 3, yBM_DKO, Tum_DKO, DKOT, DKO-BMT, etc. All figures should be consistent with the same notation.

      Response: We apologize for this in consistency in using too many symbols. In our revised version we will provide a table with all the symbols that should be consistent all over the manuscript

      Reviewer #1 (Significance):

      Although the quality and validity of the data presented in this manuscript are not in question, this reviewer has doubts about the novelty of the findings in this current version of manuscript. It does not clearly indicate what is different from previous findings, such as;

      1. It has already been reported that p53-/- BM-MSCs are the cells of origin in osteosarcoma development (ref.21).
      2. Indispensability of Myc in osteosarcomagenesis was revealed (PMID: 12098700).
      3. Osteosarcoma formation of p53flox-OsxCre mice was rescued by Myc-depletion and BM-MSCs from p53flox-OsxCre was shown to upregulate Myc (PMID: 34803166).

      First of all, the authors need to cite the above papers and compare their findings with them to better clarify the value of their findings.

      Response: Thank you for your valuable comments, and as we mentioned these important studies were and will be cited and discussed properly in our revised version.

      The only one clear finding would be that Wwox functions as a tumor suppressor by repressing Myc function in the absence of p53, but unfortunately, no data have been presented on the mechanism.

      Response: As stated in our response above, we argue that our observations showing very early transformation of BM-MSCs in combined genetic perturbation of WWOX and p53 is novel. In our revision plan we propose to perform additional ex vivo experiments to prove this notion by performing WWOX deletion in SKO-yBM cells and WWOX restoration in DKO-yBM cells and test consequences on Myc levels/activity and tumorigenicity. To further shed light on the mechanistic outcome of WWOX action in this context, we shall perform Myc ChIP and ChIP seq assays in yBM DKO and compare it to yBM SKO cells to show whether WWOX deletion is indeed results in enhanced chromatin accessibility of Myc [follow up of Fig-I shown above]. These experiments should further strengthen our findings.

      Reviewer #2 (Evidence, reproducibility and clarity):

      In current study, the authors established a mouse model with tdTomato expression under the OSX-controlled double deletions of Wwox and Trp53. Such mouse strain gives a great platform to study the OS development and therapeutic potential. Experiments are clear and convincing. Results are well presented.

      Response: We thank the reviewer for the kind words regarding our manuscript and acknowledging its clarity and validity.

      To better improve this study, few minor suggestions regarding the data are as following:

      1. Some of the legends on figures are too small to read, or in low quality. please change these labels.

      Response: We thank the reviewer for this notion. We apologize for the low quality and inconvenience. In our revised version, we shall provide an improved resolution of legends.

      1. For Figure 7C and D, from 7C, the control WT BM showed clear resistance to the SVA treatment, but in 7D, there is almost no cells in the WT BM group. Data of this group might be missed?

      Response: Thank you for this comment. Cre+WT BM cells (shown in 7D) were unable to form colonies as shown previously in figures 2B and 4B. Fig 7C refers to sensitivity to SVA using MTT assay.

      1. For figure 7G, the difference among MCM7 IHC staining of two groups didn't show as much as the statistical analysis in the right panel. Authors may consider using MCM7 western blot to check its levels after SVA treat.

      Response: Thank you for this notion. We updated the Figure showing a more representative image (new Figure 7G).

      Reviewer #2 (Significance):

      This study uses a transgenic mouse model with tdTomato expressed in combination of loss of p53 and Wwox under the OSX lineage to study early initiation of OS. They found only DKO bone marrow cells can form OS in a subsequent orthotopic mouse model, but not the p53 single KO cells. After compare the RNA-seq from these different cell population, they identified the Myc pathway is the key player to promote OS development, especially the MCM7. Moreover, they tested SVA in treating these BM cells and reveal a therapeutic potential. This animal model is a good platform to study OS, especially at the early stage. Most results are clear and convincing. With the identification of Myc pathway, they further tested the SVA effects on treating these DKO BM. This is an important study and provided meaningful information to the OS, even broad cancer research community.

      Response: We thank the reviewer for his/her supporting comments, and acknowledging the importance of our study.

      However, the significance, or novelty of this work is not sufficient. For instance, SKO BM won't form tumor in the IT injection assays compare to the DKO BM groups, therefore, the involvement of Wwox during the OS tumorigenesis is clear. However, authors didn't explore any potential mechanisms of Wwox function or related signaling behind this observation

      Response: We thank the reviewer for this very important comment. As mentioned previously, response to reviewer 1, our results indicate that combined deletion of two important known tumor suppressors, WWOX and p53, promotes osteosarcoma through early changes in Myc levels. Our data further show that p53 deletion alone is dispensable for this change to happen at early stages further highlighting the significance of our findings. In our revision plan we will do knockout of WWOX in yBM SKO and restore WWOX in yBM DKO cells. In addition, we are currently working to perform ChIP assay for Myc in yBM DKO and compare it to yBM SKO cells to show whether WWOX deletion is indeed results in enhanced chromatin accessibility of Myc.

      And the RNA-seq analysis mostly focus on c-Myc pathway and its downstream targets. Given the well-known relationship of p53, c-myc even RB in the OS, it will be more interesting and attractive to see a clear mechanism of Wwox in this context.

      Response: We thank the reviewer for this suggestion. Indeed, combined deletion of WWOX and p53 resulted in alteration of key cellular pathways involved in OS development. Due to the capacity of the work, we focused here on this important notion showing very early upregulation of Myc in BM-MSCs isolated from DKO cells, but not from SKO cells. Future work can expand use of this model to address relationship with other key pathways and genes.

      Second, since authors took effort to generate this Tomato-DKO mice, it could be clearer if they isolate tdTomato positive cells instead of a mixture of BM, culture them, differentiate them, and perform more assays using these cells. In this way, it will give better clean background for all assays, and may be able to find novel effectors during this OS progression process.

      Response: Thank you for this important suggestion. BM-MSCs cells collected directly from the mouse (Tom+, Sca1+, CD45-) represents a very small and minute population and was used to be cultured for enrichment as was done in previous studies (ref. 33 and 34). So direct collecting this small population and injecting directly to immunocompromised mice is not feasible. Moreover, further validation of the cultured cells used in our study confirmed their mesenchymal identity. We however, propose to try performing in vitro tumorigenic assays on these sorted cells. In our revision plan we suggest performing colony formation assays and soft agar assays to address tumorigenicity of these cells.

      Third, within the text, authors tried to use OB differentiation and some other assays to discuss the OS origin cells, MSC or OB; but didn't get a preferred conclusion. It could be possible to better understand this process with the single cell RNA-seq using these BM from different mice or at different ages

      Response: Thank you for this important point. According to our results we can conclude that BM-MSCs committed to the osteoblast lineage are supposed to be the cell of origin for OS and will be clearly emphasized in our revised version. Preforming a single cell RNA-seq is beyond the scope of this study and can be explored in future studies.

      In general, this is a clean, straightforward study, and they established a very useful model to study OS. But the mechanisms merit is somewhere short

      Response: Thank you for the kind words, as proposed previously our plan to further investigate the mechanism of WWOX regulatory effect on Myc will be addressed using the in vitro assays of WWOX deletion/restoration to SKO/DKO-yBM cells respectively. Moreover, ChIP-seq assay for Myc in yBM DKO and compare it to yBM SKO cells to show whether WWOX deletion is indeed results in enhanced chromatin accessibility of Myc.

      Reviewer #3 (Evidence, reproducibility and clarity):

      Summary: The work describes analysis of an Osx1-Cre p53fl/fl Wwoxfl/fl mouse model compared to an Osx1-Cre p53fl/fl model. The authors have assessed the osteosarcoma inducing potential of cells within the bone marrow by including a tdTom reporter of Cre expression. They conclude that bone marrow mesenchymal cells can give rise to osteosarcoma when transplanted. Based on transcriptomics they contend that upregulation of Myc is important and its target MCM7 can be targeted with simvastatin.

      Major comments: -

      Are the key conclusions convincing?

      The authors can generate tumors in immunocompromised mice upon injecting cells derived from bone marrow flushed. This is not surprising given the data available now about expression of Osx1-Cre

      Response: Thank you for this notion. In our study we showed that yBM cells harboring the deletion of two known important tumor suppressor genes WWOX and p53 collected from tumor free mice are tumorigenic compared to SKO p53 at this earlier stage. This is a novel notion that has not been presented previously. In our revised version, we propose to further study the mechanism of WWOX action on Myc accessibility.

      The analysis of MSCs lacks detail and needs significant more improvement and assessment by FACS using the well-defined criteria for mesenchymal cells that have been developed

      Response: Thank you for this comment. In our revised version we will add another marker (CD11b), in addition to the used markers (CD45/Sca-1 and CD29) which are all well known to define MSCs as mentioned in Ref. 21, 33, 34.

      It is not clear to me from the available information if the authors have used bone marrow cells that were flushed and immediately transplanted or if all cells transplanted have been placed in culture first, adherent cells expanded in culture media favoring survival and proliferation of non-hematopoietic cells and then transplanted- this is important to clarify explicitly as it is important to the significance of the study. If these cells are all used after culture, then the novelty of these studies are questionable as it was demonstrated previously that similar types of cells give rise to OS when transplanted (PMID: 18697945).

      Response: Thank you for this important point. In the referred reference (PMID: 18697945, ref 34) authors used p53/Rb Cre- stromal cells that were cultured in vitro then infected with Ad-Cre and then injected subcutaneously in immunocompromised mice. In our study, we provide evidence that genetically manipulated young BM-MSCs (for Wwox/p53) are tumorigenic when injected intratibially, a more relevant niche for these cells, and this involved upregulation of Myc. In our revised version, we shall provide more mechanistic insights on the functional relationship of WWOX and Myc. Using BM-MSCs cells that are directly collected from the mouse (Tom+, Sca1+, CD45-) is not feasible due to very low percentage of cells which has been also previously reported by many groups (Ref 34). In our revision plan, we also propose to try performing in vitro tumorigenic assays on sorted cells.

      Moreover, in our study we validated that the deletion of p53 alone at this earlier stage is not enough to induce osteosarcomagenesis (which was not shown previously) suggesting additional hits are required for OS formation. Importantly, co-deletion of Wwox and p53 using the same Cre line resulted in the upregulation/higher levels of Myc that promotes osteosarcomagenesis at this earlier stage.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      I think overall the authors are appropriately cautious in interpretation. The points raised above regarding the nature of the cells would need clarification and then the claims reassessed however.

      Response: Thank you for this acknowledgment. According to our results we can conclude that BM-MSCs committed to the osteoblast lineage could be the cell of origin for OS and this will be clearly emphasized in the discussion of our revised version.

      Claims re "metastatic potential" should be significantly reconsidered - the authors present (motility assays) which should be referred to as motility assays. The injection of cells intravenously is a lodgment assay of cells in the venous circulation and does not equate to the process a cancer cell must undergo and survive to metastasis from a primary tumor in an immune competent environment. The claims around these assays should be significantly reconsidered.

      Response: Thank you for this important comment. In our revision plan we will check the lungs of IT injected mice for the presence of lung metastatic nodules (tdTomato positive cells in the lungs).

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      It is not explained or justified what the control cohort in Fig 7F is significantly smaller than the treated cohort. This will affect the statistical analysis and interpretation. There is no statement regarding blinding/randomization (or not) in the in vivo simvastatin experiment - this needs to be added.

      Response: In our revised version we clarified in the materials and methods section clearly the randomization of the mice selection for each group and number of mice in each group.

      The authors should include discussion that these are relatively long latency OS models compared to p53/pRb compound mutants and contrast with previous data from these models where in vitro cultured cells did give rise to OS in vivo after Cre treatment.

      Response: Thank you for this suggestion. In our revised paper, we will include the latency of OS in our model and compare them to p53/Rb, and emphasize that our model tested the tumorigenic ability of SKO yBM cells and showed that they are unable to form osteosarcoma tumors at this early stage compared to DKO yBM cells.

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      If a patient could to be treated based on these data, then the extra experiments to provide a robust preclinical dataset should be provided otherwise significant caution should be stated.

      Response: Our study provides a proof-of-concept showing that our model can be used to screen for drugs that could inhibit OS development. The inhibitory potential of SVA affecting the progression of OS for clinical assessment would certainly need further investigation that goes beyond the scope of this paper.

      • Are the data and the methods presented in such a way that they can be reproduced?

      See comments regarding cells used for injection and blinding. Need to more clearly describe the cells being injected and what was done to them from isolation to injection.

      Response: Thank you for this comment. In our revised version we will add clearly in the materials and methods section the protocol of cell isolation and injection, randomization of the mice selection for each group and number of mice in each group.

      • Are the experiments adequately replicated and statistical analysis adequate? Unclear if appropriate experiment completed in Fig 7F as no justification of different sample sizes is provided nor a statement reblinding/randomization.

      Response: Thank you for this point. In our revised version we will add clearly in the materials and methods section the randomization of the mice selection for each group and number of mice in each group.

      Minor comments:

      • Specific experimental issues that are easily addressable.

      The Osx1-Cre expresses a Cre:GFP fusion - the authors should correlate the GFP and tomato signal.

      Response: We thank the reviewer for this point. We shall perform validation analysis in our revised plan.

      Fig 2A- if 50% of the bone marrow is tdTom positive this is not evident in the image in Fig 1D. Quantify the images in Fig 1D. The tumor images in Fig 2C appear to have only sporadic tdTom positive cells - the authors should explain this further.

      Response: Thank you for this notion, Figure 2A represents BM collected from a tumor bearing mouse which has a higher percentage of tomato positive BM cells compared to tumor free mouse (Fig 1D). Fig 2C is updated

      Need statistics added to all GSEA plots.

      Response: Thank you for this comment, statistics will be added in our revised version.

      4C is very different than 2C - 4C is more consistent with the levels of tomato stated

      Response: Thank you for this notion, the images were updated in our revised version

      • Are prior studies referenced appropriately?

      This seems appropriate

      Response: Thank you for this comment.

      • Are the text and figures clear and accurate?

      Figures are not ideal and could be improved in terms of >clarity and text size

      Response: We apologize for the low quality. In our revised version, we shall provide an improved resolution and accuracy.

      Several sections of text are either contradictory or questionably accurate:

      page 3: Molecular studies of OS are significantly hindered by its genetic complexity and chromosomal instability, which precludes the identification of a single recurrent event associated with OS. Contrasts with the following text: Page 18 - p53 has been extensively shown to play a central role in OS development in both human and mouse models. The data from sequencing of human OS and mouse models and canine data all point to p53 loss as being a central event in OS.

      Response: We apologize for this inaccurate statement. In our revised paper, we revised this statement to reflect the common event of p53 deletion in OS and its significance in osteosarcomagenesis.

      Page 18: High genetic heterogeneity and chromosomal instability limit the early diagnosis of OS and lead to lung metastases and a worse prognosis. I don't understand this statement given that intratumor characteristics are not a determinant of early diagnosis - the patient being aware they have an issue and the clinical follow up determine the rate of diagnosis (and access to healthcare).

      Response: We shall revise this statement to reflect the genetic heterogeneity of OS tumors.

      Page 21: Consistent with their roles as tumor suppressor genes, the combined deletion of WWOX and p53 in Osx1-positive progenitor cells resulted in their transformation and growth advantage. Didn't the authors reach this conclusion from their previous work published in 2016? –

      Response: Thank you for this notion. In our previous paper, we provided evidence that WWOX and p53 loss contributes to more aggressive OS tumor formation. In the current study, we provide evidence about early events contributes to osteosarcomagenesis using our Wwox/p53 model.

      Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      improve figure clarity and text sizes

      Response: We thank the reviewer for this notion; in our revised version, we shall provide an improved resolution, accuracy and clarity.

      Reviewer #3 (Significance):

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      I think this is largely an incremental study which largely confirms previous studies.

      Response: Although our results are consistent with some previously noted observations, we clearly provide unforeseen evidence that links Wwox/p53 in early osteosarcomagenesis and suggest that p53 deletion alone is not enough at this stage; other hits are required as in Wwox/p53 or Rb/p53. The mechanism of how DKO cells (Wwox/p53) results in Myc upregulation is also novel and will be further tested in our revised submission.

      • Place the work in the context of the existing literature (provide references, where appropriate).

      The study is a modest advance over the existing literature.

      Response: We believe that with our revision plan, our paper will provide a significant advancement in the research in osteosarcomagenesis.

      • State what audience might be interested in and influenced by the reported findings.

      This would be relevant to basic sarcoma researchers.

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Generated multiple murine models of OS and characterized and applied them for biological understanding and preclinical studies.

    1. Author Response

      Reviewer #1 (Public Review):

      Rosas et al studied the mechanism/s that enabled carbapenems resistance of a Klebsiella isolate, FK688, which was isolated from an infected patient. To identify and characterize this mechanism, they used a combination of multiple methods. They started by sequencing the genome of this strain by a combination of short and long read sequencing. They show that Klebsiella FK688 does not encode a carbapenemase, and thus looked for other mechanisms that can explain this resistance. They discover that both DHA-1 (located on the mega-plasmid) and an inactivation of the porin OmpK36, are required for carbapenem resistance in this strain. By using experimental evolution, it was shown that resistance is lost rapidly in the absence of antibiotics selection, by a deletion in pNAR1 that removed blaDHA-1. Moreover, their results suggested that it is likely that exposure to other antibiotics selected for the acquisition of the mega-plasmid that carries DHA-1, which then enabled this strain to gain resistance to carbapenemase by a single deletion.

      The major strength of this study is the use of various approaches, to tackle an important and interesting problem.

      The conclusions of this paper are mostly well supported by data, but one aspect is not clear enough. The description of the evolutionary experiment is not clear. I could not find a clear description of the names of the evolved populations. However, the authors describe strains B3 and A2, but their source is not clear. The legends of the relevant figure (Figure 5) are confusing. For example, the text describing panel B is not related to the image shown in this panel. Moreover, it is shown in panel C (and written in the main text) that the OmpK36+ evolved populations had only translucent colonies, so what is the source of B3(o)?

      We appreciate the point and in response have added a panel to Figure 5 (in the revised paper this is now Fig. 5A) to illustrate the evolutionary experiment and specify that there are two lineages (A and B) with 20 replicates each that, after 200 generations of evolution, give rise to populations of which A2 and B3 are the exemplars characterized.

      We have corrected the legends in Figure 5.

      We now explain (sentence starting on Line 197) that the B3 (o) is the single isolate of an opaque colony from lineage B3, it is the only colony that we identified from out of 595 colonies observed in the B3 population. B3(o) was sequenced and analysed as a comparator and has some value in that regard, despite being an anomaly.

      Reviewer #2 (Public Review):

      The authors sequenced a clinical pathogen, Klebsiella FK688, and definitively establish the genetic basis of the carbapenem-resistance phenotype of this strain. They also show that the causal mutations confer reduced fitness under laboratory conditions, and that carbapenem sensitivity readily re-evolves in the lab due to the fitness costs associated with the resistance mutations in the clinical isolate. They also establish that subinhibitory concentrations of ceftazidime select for the otherwise deleterious blaDHA-1 gene. Based on this finding the authors speculate that prior beta-lactam selection faced by the ancestors of Klebsiella FK688 potentiated the evolution of the carbapenem-resistance phenotype of this strain. If this hypothesis is true, then prior history of beta-lactam exposure may generally potentiate the evolution of carbapenem resistance.

      Strengths:

      From a technical perspective, the findings in this paper are solid. In addition, the authors establish a simple genetic basis for carbapenem resistance in a clinical strain, which is a valuable and non-trivial finding (i.e. they show that the CRE phenotype in this strain is not an omnigenic trait distributed over hundreds of loci).

      Weaknesses:

      The main weakness of this paper is that the authors draw overly broad conclusions of a conceptual nature from narrow experimental findings. This could be addressed by drawing more modest and narrow implications from the findings.

      1) The title of this paper is "Treatment history shapes the evolution of complex carbapenem-resistant phenotypes in Klebsiella spp." But they provide no data on the treatment history of the patient from whom this strain was isolated from. Therefore, the authors have no evidence to support their central claim. Indeed, it is completely possible that this strain never faced beta-lactam selection in the past, or that the patient's hypothetical history of betalactamase was irrelevant for the evolution of FK688. First, it is completely possible that this is a hospital-acquired infection, such that the history of this strain is due to selection in other contexts in the hospital that have little to do with the patient's treatment history. Second, it is completely possible that this strain (the chromosome anyway) has no prior history of beta-lactamase selection, and that it acquired the megaplasmid containing blaDHA-1 via conjugation from some other strain. In this second hypothetical scenario, it is possible that the fitness cost of the blaDHA-1 gene is not particularly high in a different source strain, but that it has some cost in the FK688 strain that it was isolated from. And of course, fitness costs in the human host could be very different than fitness costs in the laboratory, where strains are evolving under strong selection for fast growth. And given the benefit of resistance, it's clear that this strain clearly has a strong fitness advantage over faster-growing sensitive strains in the context of the source patient under antibiotic treatment.

      My general point here is that the broad claims made about patient history or prior history shaping the evolution of this strain are largely indefensible because there is no data here to make solid inferences about how prior history shaped the evolution of this strain.

      We appreciate the point and have changed our title and scaled back the strength of our conclusions regarding patient treatment history.

      2) Historical contingency. The authors claim that their work shows how historical contingency shapes the evolution of resistance. One problem with this claim is that it is trivial- this is only a significant claim if the reader believes that prior history is not important in the evolution of antibiotic resistance, which is a straw-man null hypothesis, to mix a couple metaphors. To be more concrete, clearly strain background (prior history) matters-eliminating the plasmid with the resistance gene eliminates resistance. But that is not particularly surprising, given the past 50 years of evolutionary microbiology literature on plasmids and resistance. By contrast to this work, the major contribution of papers that examine the role of historical contingency in evolution (i.e. various Lenski papers) is that those works quantitatively measure the role of history in comparison to other factors (chance, adaptation). Since this work is a deep dive into a single clinical isolate, the data presented here do not and cannot shed light on the role of historical contingency in the emergence of this strain. The authors' claims about the prior history that led to the CRE phenotype are reasonable- but are fundamentally speculative. I have nothing against speculation, as long as it is clear what claims are speculative, and what are concrete implications. But the authors frame these speculative claims as concrete implications of their findings.

      This is a fair point. We have reframed the study to not focus on historical contingency.

      As the reviewer points out, any discussion about historical contingency in the context of evolution is trivial in one sense. One of the reasons that the studies of Lenski and Blount provide new insights into the role of historical evolution because they knew the history of their populations (at, least for the number of generations since the LTEE began), and had a high degree of control and understanding of the growth conditions where the trait evolved. As such, they could go back to time points before the trait evolved, and then repeat the evolution experiment many times, in the exact same environment where the trait originally evolved, and then count how often they observed the evolution of that trait.

      Here we study a clinical isolate, and have less understanding of the evolutionary history of our strain. While we cannot re-evolve carbapenem resistant in the exact same environment experienced by the FK688 strain, we did test the capacity for the wild type, and two possible intermediate genotypes genotypes, to evolve carbapenem resistance in growth media with carbapenem.

      Altogether- we have comprehensive evidence for the genetic cause of carbapenem resistance: the BLA1 plasmid + OmpK36. We showed, by experiment, that it is much more likely for carbapenem resistance to evolve in a FK688 strain that carries the BLA1 plasmid, than in an FK688 strain that did not carry the plasmid even if it had acquired the OmpK36 mutation. We think this not trivial because a significant proportion of all of the carbapenem resistant Klebsiella that have been isolated are non-carbapenemase CRE. Our reconstruction provides a plausible explanation for why non-carbapenemase CRE evolve – because they are evolving from strains that have already been treated with a non-carbapenem beta-lactam drug and have thereby selected for the presence of a beta-lactamase (that is not a carbapenemase).

      So, while we have scaled back the strength of our claims, we do think that our results can provide some insight into how the evolutionary history of a pathogen can shape the molecular path to antibiotic resistance.

      3) The authors claim that "[This work] suggests that the strategic combinations of antibiotics could direct the evolution of low-fitness, drug-resistant genotypes". I suppose this is true, but I also think this is a stretch of an implication given these findings. To be blunt, while I suppose it's better to have costly resistance variants that re-evolve sensitivity than to have low-cost high-resistance strains circulating, I think the patient's family would probably disagree that the evolution of a low-fitness drug-resistant genotype was good or strategic in the clinical context, even if better from a public health perspective. Low-fitness drug-resistant strains are just as lethal under clinical antibiotic concentrations!

      Thank you for the comment, we see how this sentence could be seen as too strong a conclusion and have rewritten the last sentence of the DISCUSSION (line 351):

      “These results show how an individual’s treatment history might shape the evolution of AMR, and should be taken into consideration in order to explain the evolution of non-carbapenemase CRE”

      The authors do show the plausibility of their hypothesis/model that prior beta-lactam selection is sufficient to potentiate the evolution of carbapenem-resistance (by the additional ompK loss-of-function mutation). I think those findings are very nice. But the authors undermine their results by extrapolating too far from their data. Hence, I think narrowing the scope of the implications would improve this paper.

      In addition to narrowing the scope of the implications as written, I also would like to add that there may be other ways of framing this paper (other than historical contingency) that may make the significance of this work more apparent to a broader audience. This may be worth considering during the revision process.

      We have taken these suggestions on board and have re-framed the final sentences of the ABSTRACT, INTRODUCTION and DISCUSSION accordingly. Specifically, we have removed reference to historical contingency and instead have reframed our experiments as providing a genetic and evolutionary explanation for an interesting and concerning cause of antibiotic resistance – non-carbapenemase CRE.

    1. Author Response

      Reviewer #1 (Public Review):

      Weaknesses:

      Gene expression level as a confounding factor was not well controlled throughout the study. Higher gene expression often makes genes less dispensable after gene duplication. Gene expression level is also a major determining factor of evolutionary rates (reviewed in http://www.ncbi.nlm.nih.gov/pubmed/26055156). Some proposed theories explain why gene expression level can serve as a proxy for gene importance (http://www.ncbi.nlm.nih.gov/pubmed/20884723, http://www.ncbi.nlm.nih.gov/pubmed/20485561). In that sense, many genomic/epigenomic features (such as replication timing and repressed transcriptional regulation) that were assumed "neutral" or intrinsic by the authors (or more accurately, independent of gene dispensability) cannot be easily distinguishable from the effect of gene dispersibility.

      We thank the reviewer for this important comment. We totally agree that transcriptomic and epigenomic features cannot be easily distinguished from gene dispensability and do not think that these features of the elusive genes can be explained solely by intrinsic properties of the genomes. Our motivation for investigating the expression profiles of the elusive gene is to understand how they lost their functional indispensability (original manuscript L285-286 in Results). We also discussed the possibility that sequence composition and genomic location of elusive genes may be associated with epigenetic features for expression depression, which may result in a decrease of functional constraints (original manuscript L470-474 in Discussion). Nevertheless, we think that the original manuscript may have contained misleading wordings, and thus we have edited them to better convey our view that gene expression and epigenomic features are related to gene function.

      (P.2, Introduction) This evolutionary fate of a gene can also be affected by factors independent of gene dispensability, including the mutability of genomic positions, but such features have not been examined well.

      (P6, Introduction) These data assisted us to understand how intrinsic genomic features may affect gene fate, leading to gene loss by decreasing the expression level and eventually relaxing the functional importance of ʻelusiveʼ genes.

      (P33, Discussion) Another factor is the spatiotemporal suppression of gene expression via epigenetic constraints. Previous studies showed that lowly expressed genes reduce their functional dispensability (Cherry, 2010; Gout et al., 2010), and so do the elusive genes.

      Additionally, responding to the advices from Reviewers 1 and 2 [Rev1minor7 and Rev2-Major4], we have added a new section Elusive gene orthologs in the chicken microchromosomes in which we describe the relationship between the elusive genes and chicken microchromosomes. In this section, we also argue for the relationship between the genomic feature of the elusive genes and their transcriptomic and epigenomic characteristics. In the chicken genome, elusive genes did not show reduced pleiotropy of gene expression nor the epigenetic features relevant with the reduction, consistently with the moderation of nucleotide substitution rates. This also suggests that the relaxation of the ‘elusiveness’ is associated with the increase of functional indispensability.

      (P27, Elusive gene orthologs in the chicken microchromosomes in Results) Our analyses indicates that the genomic features of the elusive genes such as high GC and high nucleotide substitutions do not always correlate with a reduction in pleiotropy of gene expression that potentially leads to an increase in functional dispensability, although these features have been well conserved across vertebrates. In addition, the avian orthologs of the elusive genes did not show higher KA and KS values than those of the non-elusive genes (Figure 3; Figure 3–figure supplement 1), likely consistent with similar expression levels between them (Figure 5–figure supplement 1) (Cherry, 2010; Zhang and Yang, 2015). With respect to the chicken genome, the sequence features of the elusive genes themselves might have been relaxed during evolution.

      Ks was used by the authors to indicate mutation rates. However, synonymous mutations substantially affect gene expression levels (https://pubmed.ncbi.nlm.nih.gov/25768907/, https://pubmed.ncbi.nlm.nih.gov/35676473/). Thus, synonymous mutations cannot be simply assumed as neutral ones and may not be suitable for estimating local mutation rates. If introns can be aligned, they are better sequences for estimating the mutability of a genomic region.

      We appreciate the reviewer for this meaningful suggestion. As a response, we have computed the differences in intron sequences between the human and chimpanzee genomes and compared them between the elusive and non-elusive genes. As expected, we found larger sequence differences in introns for the elusive genes than for the non-elusive genes. In Figure 2c of the revised manuscript, we have included the distribution of KI, sequence differences in introns between the human and chimpanzee genomes for the elusive and non-elusive genes. Additionally, we have added the corresponding texts to Results and the procedure to Methods as shown below.

      (P11, Identification of human ‘elusive’ genes in Results) In addition, we computed nucleotide substitution rates for introns (KI) between human and chimpanzee (Pan troglodytes) orthologs and compared them between the elusive and non-elusive genes.

      (P11, Identification of human ‘elusive’ genes in Results) Our analysis further illuminated larger KS and KI values for the elusive genes than in the non-elusive genes (Figure 2b, c; Figure 2–figure supplement 1). Importantly, the higher rate of synonymous and intronic nucleotide substitutions, which may not affect changes in amino acid residues, indicates that the elusive genes are also susceptible to genomic characteristics independent of selective constraints on gene functions.

      (P39, Methods) To compute nucleotide sequence differences of the individual introns, we extracted 473 elusive and 4,626 non-elusive genes that harbored introns aligned with the chimpanzee genome assembly. The nucleotide differences were calculated via the whole genome alignments of hg38 and panTro6 retrieved from the UCSC genome browser.

      The term "elusive gene" is not necessarily intuitive to readers.

      We previously published a paper reporting the group of genes that we refer to as ‘elusive genes,’ lost in mammals and aves independently but retained by reptiles, in the gecko genome assembly (Hara et al., 2018, BMC Biology). We initially termed them with a more intuitive name (‘loss-prone genes’) but changed it because one of our peer-reviewers did not agree to use this name. Later on, we have continuously used this term in another paper (Hara et al., 2018, Nat. Ecol. Evol.). In addition, some other groups have used the word ‘elusive’ with a similar intention to ours (Prokop et al, 2014, PLOS ONE, doi: 10.1371/journal.pone.0092751; Ribas et al., 2011, BMC Genomics, doi: 10.1186/1471-2164-12-240). We would appreciate the reviewer’s understanding of this naming to ensure the consistency of our researches on gene loss. In the revised manuscript, we have added sentences to provide a more intuitive guide to ‘elusive genes’,

      (P6, Introduction) We previously referred to the nature of genes prone to loss as ‘elusive’(Hara et al., 2018a, 2018b). In the present study, we define the elusive genes as those that are retained by modern humans but have been lost independently in multiple mammalian lineages. As a comparison of the elusive genes, we retrieved the genes that were retained by almost all of the mammalian species examined and defined them as ‘non-elusive’, representing those persistent in the genomes.

      Reviewer #3 (Public Review):

      Overall, the study is descriptive and adds incremental evidence to an existing body of extensive gene loss literature. The topic is specialised and will be of interest to a niche audience. The text is highly redundant, repeating the same false positive issue in the introduction, methods, and discussion sections, while no clear conclusion or interpretation of their main findings are presented.

      Major comments

      While some of the false discovery rate issues of gene loss detection were addressed in the presented pipeline, the authors fail to test one of the most severe cases of mis-annotating gene loss events: frameshift mutations which cause gene annotation pipelines to fail reporting these genes in the first place. Running a blastx or diamond blastx search of their elusive and non-elusive gene sets against all other genomes, should further enlighten the robustness of their gene loss detection approach

      For the revised manuscript, we have refined the elusive gene set as the reviewer suggested. In the genome assemblies, we have searched for the orthologs of the elusive genes for the species in which they were missing. The search has been conducted by querying amino acid sequences of the elusive genes with tblastn as well as MMSeqs2 that performed superior to tblastn in sensitivity and computational speed. In addition, regarding another comment by Reviewer 3. we have searched for the orthologs by referring to existing ortholog annotations. We used the ortholog annotations implemented in RefSeq instead of those from the TOGA pipeline: both employ synteny conservation. We have coordinated the identified orthologs with our gene loss criteria–absence from all the species used in a particular taxon–and excluded 268 genes from the original elusive gene set. These genes contain those missing in the previous gene annotations used in the original manuscript but present in the latest ones, as well as those falsely missing due to incorrect inference of gene trees. Finally, the refined set of 813 elusive genes were subject to comparisons with the non-elusive genes. Importantly, these comparisons retained the significantly different trends of the particular genomic, transcriptomic, and epigenomic features between them except for very few cases (Table R1 included below). This indicates that both initial and revised sets of the elusive genes reflect the nature of the ‘elusiveness,’ though the initial set contained some noises. We have modified the numbers of elusive genes in the corresponding parts of the manuscript including figures and tables. Additionally, we have added the validation procedures in Methods.

      Table R1. Difference in statistical significances across different elusive gene sets *The other features showed significantly different trends between the elusive and non-elusive genes for all of the elusive gene sets and thus are not included in this table.

      (P38 in Methods) The gene loss events inferred by molecular phylogeny were further assessed by synteny-based ortholog annotations implemented in RefSeq, as well as a homolog search in the genome assemblies (Table S2) with TBLASTN v2.11.0+ (Altschul et al., 1997) and MMSeqs2 (Steinegger and Söding, 2017) referring to the latest RefSeq gene annotations (last accessed on 2 Dec, 2022). This procedure resulted in the identification of 813 elusive genes that harbored three or fewer duplicates. Similarly, we extracted 8,050 human genes whose orthologs were found in all the mammalian species examined and defined them as non-elusive genes.

      The reviewer also suggested us investigating falsely-missing genes due to frameshift mutations (in this case we guess that the reviewer assumed the genome assembly that falsely included frameshift mutations). This requires us to search for the orthologs by revisiting the sequencing reads because the frameshift is sometimes caused by indels of erroneous basecalling. We have selected five elusive genes and searched for the fragments of orthologs in sequencing reads for the species in which they are missing. We have retrieved sequencing reads corresponding to the genome assemblies from NCBI SRA and performed sequence similarity search using the program Diamond against the amino acid sequences of the elusive genes and could not find the frameshift that potentially causes the mis-annotation of the elusive genes.

      Along this line, we noticed that when annotation files were pooled together via CD-Hit clustering, a 100% identity threshold was chosen (Methods). Since some of the pooled annotations were drawn from less high quality assemblies which yield higher likelihoods of mismatches between annotations, enforcing a 100% identity threshold will artificially remove genes due to this strict constraint. It will be paramount for this study to test the robustness of their findings when 90% and 95% identity thresholds were selected.

      cd-hit clustering with 100% sequence identity only clusters those with identical (and sometimes truncated) sequences, and, in the cluster, the sequences other than the representative are discarded. This means that the sequences remain if they are not identical to the other ones. If the similarity threshold is lowered, both identical and highly similar sequences are clustered with each other, and more sequences are discarded. Therefore, our approach that employs clustering with 100% similarity may minimize false positive gene loss.

      While some statistical tests were applied (although we do recommend consulting a professional statistician, since some identical distributions tend to show significantly low p-values), the authors fail to discuss the fact that their elusive gene set comprises of ~5% of all human genes (assuming 21,000 genes), while their non-elusive set represents ~40% of all genes. In other words, the authors compare their sequence and genomic features against the genomic background rather than a biological signal (nonelusiveness). An analysis whereby 1,081 genes (same number as elusive set) are randomly sampled from the 21,000 gene pool is compared against the elusive and non-elusive distributions for all presented results will reveal whether the non-elusive set follows a background distribution (noise) or not.

      Our study aims to elucidate the characteristics of genes that differentiate their fates, retention or loss. To achieve this, we put this characterization into the comparison between the elusive and non-elusive genes. This comparison highlighted clearly different phylogenetic signals for gene loss between elusive and non-elusive genes, allowing us to extract the features associated with the loss-prone nature. The random sampling set suggested by Reviewer may largely consists of the remainders that were not classified by the elusive and non-elusive genes. However, these remainders may contain a considerable number of genes with distinctive phylogenetic signatures rather than the intermediates between the elusive and nonelusive genes: the genes with multiple loss events in more restricted taxa than our criterion, the ones with frequent duplication, etc. Therefore, we think that a comparison of the elusive genes with the random-sampling set does not achieve our objective: the comparison of the clearly different phylogenetic signals.

      We also wondered whether the authors considered testing the links between recombination rate / LD and the genomic locations of their elusive genes (again compared against randomly sampled genes)?

      We have retrieved fine-scale recombination rate data of males and females from https://www.decode.com/addendum/ (Suppl. Data of Kong, A et al., Nature, 467:1099–1103, 2010) and have compared them between the gene regions of the elusive and non-elusive genes. Both comparisons show no significant differences: average 0.829 and 0.900 recombinations/kb for the elusive and non-elusive genes, respectively, p=0.898, for males; average 0.836 and 0.846 recombinations/kb for the elusive and non-elusive genes, respectively, p=0.256, for females).

      Given the evidence presented in Figure 6b, we do not agree with the statement (l.334-336): "These observations suggest that the elusive genes are unlikely to be regulated by distant regulatory elements". Here, a data population of ~1k genes is compared against a data population of ~8k genes and the presented difference between distributions could be a sample size artefact. We strongly recommend retesting this result with the ~1k randomly sampled genes from the total ~21,000 gene pool and then compare the distributions.

      Analogous random sampling analysis should be performed for Fig 6a,d

      As described above, our study does not intend to extract signals from background. To make the comparison objectives clear, we have revised the corresponding sentence as below.

      (P22, Transcriptomic natures of elusive genes in Results) These observations suggest that the elusive genes are unlikely to be regulated by distant regulatory elements compared with the non-elusive genes (Figure 6b).

      We didn't see a clear pattern in Figure 7. Please quantify enrichments with statistical tests. Even if there are enriched regions, why did the authors choose a Shannon entropy cutoff configuration of <1 (low) and >1 (high)? What was the overall entropy value range? If the maximum entropy value was 10 or 100 or even more, then denoting <1 as low and >1 as high seems rather biased.

      To use Figure 7 in a new section in Results, we have added an ideogram showing the distribution of the genes that retain the chicken orthologs in microchromosomes. In response to the comment by Reviewer 2, we have performed statistical tests and found that the elusive genes were significantly more abundant in orthologs in microchromosomes than the non-elusive genes. Furthermore, the observation that the elusive genes prefer to be located in gene-rich regions was already statistically supported (Figure 2f).

      As shown in Figure 5, Shannon’s H' ranged from zero to approximately 4 (exact maximum value is 3.97) and 5 (5.11) for the GTEx and Descartes gene expression datasets, respectively. Although the threshold H'=1 was an arbitrarily set, we think that it is reasonable to classify the genes with high pleiotropy from those with low pleiotropy.

    1. Author Response

      Reviewer #1 (Public Review):

      1) It would be helpful to include some sort of comparison in Fig. 4, e.g. the regressions shown in Fig 3, to indicate to what extent the ICCl data corresponds to the "control range" of frequency tuning.

      Figure 4 was modified to show the frequency range typically found in the ICCls. This range is based on results from Wagner et al., 2007, which extensively surveyed ICCls responses. This modification shows that our ICCls recordings in the ruff-removed owls cover the normal frequency hearing range of the owl.

      2) A central hypothesis of the study is that the frequency preference of the high-frequency neurons is lower in ruff-removed owls because of the lowered reliability caused by a lack of the ruff. Yet, while lower, the frequency range of many neurons in juvenile and ruff-removed owls seems sufficiently high to be still responsive at 7-8 kHz. I think it would be important to know to what extent neurons are still ITD sensitive at the "unreliable high frequencies" even if the CFs are lower since the "optimization" according to reliability depends not on the best frequency of each neuron per se, but whether neurons are less ITD sensitive at the higher, less reliable frequencies.

      The concern regarding the frequency range that elicits responsivity was largely addressed above. Specifically, Figure L1 showing frequency tuning of frontally tuned ICx neurons in ruff-removed owls indicates that while there is some variability of tuning across neurons, there is little responsivity above 6 kHz. In contrast, equivalent analysis in juvenile owls (Figure L3), shows there is much more responsiveness and variability across neurons to high and low frequencies. This evidence supports our hypothesis that the juvenile owl brain is still highly plastic, which facilitates learning during development. Although the underlying data was already reported in Figure 7 of our previously submitted manuscript, we can include Figures L1 and L2, potentially as supplemental figures, if considered useful by editors and reviewers. Nevertheless, this argumentation was further expanded in the revised text (Line 229).

      Figure L1. Frequency tuning of frontally-tuned ICx neurons in ruff-removed owls. Tuning curves are normalized by the max response. Thick black line indicates the average tuning curve. Dashed black line indicates basal response.

      Figure L2. ITD sensitivity across frequencies in ruff-removed owl. Two example neurons shown in a and b. ITD tuning for tones (colored) and broadband (black) plotted by firing rate (non-normalized). Solid colored lines indicate responses to frequencies that are within the neuron’s preferred frequency range (i.e. above the half-height, see Methods), dashed lines indicate frequencies outside of the neuron’s frequency range.

      Figure L3. Frequency tuning of frontally-tuned ICx neurons in juvenile owls. Tuning curves are normalized by the max response. Thick black line indicates the average tuning curve. Dashed black line indicates basal response.

      3) It would be interesting to have an estimate of the time scale of experience dependency that induces tuning changes. Do the authors have any data on this question? I appreciate the authors' notion that the quantifications in Fig 7 might indicate that juvenile owls are already "beginning to be shaped by ITD reliability" (line 323 in Discussion). How many days after hearing onset would this correspond to? Does this mean that a few days will already induce changes?

      While tracking changes induced by ruff-removal over development were outside of the scope of this study, many other studies have assessed experience-dependent plasticity in the barn owl. The recordings in this study were performed approximately 20 days after hearing onset, suggesting that the juveniles had ample time to begin learning. These points were expanded upon in the discussion (Lines 254, 280-283).

      Reviewer #2 (Public Review):

      1) Why is IPD variability plotted instead of ITD variability (or indeed spatial reliability)? The relationship between these measures is likely to vary across frequency, which makes it difficult to compare ITD variability across frequency when IPDs are plotted. Normalizing data across frequencies also makes it difficult to compare different locations and acoustical conditions. For example, in Fig.1a and Fig.1b, the data shown for 3 kHz at ~160 degrees seems quantitatively and visually quite different, but the difference (in Fig.1c) appears to be negligible.

      Justification of why IPD variability is used as an estimate of ITD variability was added to introduction (Lines 55-60), results (Line 100) and methods (Lines 371-374) sections of the manuscript, explaining the fact that because ITD detection is based on phase locking by auditory nerve and ITD detector neurons tuned to narrow frequency bands, responses of ITD detector neurons forwarded to downstream midbrain regions are therefore determined by IPD variability. Additionally, ITD is calculated by dividing IPD by frequency, which makes comparisons of ITD reliability across frequency mathematically uninformative.

      2) How well do the measures of ITD reliability used reflect real-world listening? For example, the model used to calculate ITD reliability appears to assume the same (flat) spectral profile for targets and distractors, which are presented simultaneously with the same temporal envelope, and a uniform spatial distribution of sounds across space. It is therefore unclear how robust the study's results are to violations of these assumptions.

      While we agree that our analysis cannot completely capture real-world listening for the barn owl, a general analysis using similar flat spectral profiles for targets and concurrent sounds provides a broad assessment of reliability of ITD cues. While a full recapitulation of real-world listening is beyond the scope of this study (i.e. recording natural scenes from the ear canals of wild barn owls), we included additional analyses of ITD reliability in Figure 1-figure supplement 1, described above.

      3) Does facial ruff removal produce an isolated effect on ITD variability or does it also produce changes in directional gain, and the relationship between spatial cues and sound location? Although the study considers this issue in some places (e.g. Fig.2, Fig.5), a clearer presentation of the acoustical effects of facial ruff removal and their implications (for all locations, not just those to the front), as well as an attempt to understand how these acoustical changes lead to the observed changes in ITD reliability, would greatly strengthen the study. In addition, Fig.1 shows average ITD reliability across owls, but it would be helpful to know how consistent these measures are across owls, given individual variability in Head-Related Transfer Functions (HRTFs). This potentially has implications for the electrophysiological experiments, if the HRTFs of those animals were not measured. One specific question that is potentially very relevant is whether the facial ruff attenuates sounds presented behind the animal and whether it does so in a frequency-dependent way. In addition, if facial ruff removal enables ILDs to be used for azimuth, then ITDs may also become less necessary at higher frequencies, even if their reliability remains unchanged.

      Additional analysis was conducted to generate representation of changes in directional gain induced by ruff removal, added to new figure (Fig 5). This analysis shows that changes in gain following ruff-removal are largely frequency-independent: there is a de-attenuation of peripherally and rearwardly located sounds, but the highest gain remains for high frequencies in frontal space. There is an additional increase in gain for high frequencies from rearward space, these changes would not explain the changes in frequency tuning we report. As mentioned in new additions to the manuscript, the changes at the most rearward-located auditory spatial locations are unlikely to have an effect on the auditory midbrain. No studies in the barn owl have found neurons in the ICx or optic tectum tuned to >120° (Knudsen, 1982; Knudsen, 1984; Cazettes et al., 2014). In addition, variability of IPD reliability across owls was analyzed and reported in the amended Figure 1, which notes very little changes across owls. In this analysis, we did realize that the file of one of the HRTFs obtained from von Campenhausen et al. 2006 was mislabeled, which explains slight differences in revised Fig 1b. Nevertheless, added analysis of IPD reliability across owls indicates that the pattern in ITD reliability is stable across owls (Fig. 1d,e), which supports our decision to not record HRTFs from owls used in this study. Finally, we added to the discussion that clarifies that the use of ILD for azimuth would not provide the same resolution as ITD would (Lines 295-303). We also do not believe that the use of ILD for azimuth would make “ITDs… less necessary at higher frequencies”, given that the ICCls is still computing ITD at these high frequencies (Fig 4), and that ILDs also have higher resolution at higher frequencies, with and without the facial ruff (Olsen et al, 1989; Keller et al., 1998; von Campenhausen et al., 2006).

      1) It is unclear why some analyses (Fig.5, Fig.7) are focused on frontal locations and frontally-tuned neurons. It is also unclear why neurons with a best ITDs of 0 are described as frontally tuned since locations behind the animal produce an ITD of 0 also. Related to this, in Fig.1, facial ruff removal appears to reduce IPD variability at low frequencies for locations to the rear (~160 degrees), where the ITD is likely to be close to 0. Neurons with a best ITD of 0 might therefore be expected to adjust their frequency tuning in opposite directions depending on whether they are tuned to frontal or rearward locations.

      An extensive explanation was added to the methods detailing why we do not believe the neurons recorded in this study are tuned to the rear. Namely, studies mapping the barn owl’s ICx and optic tectum have not reported neurons tuned to locations >120°, with the number of neurons representing a given spatial location decreasing with eccentricity (Knudsen, 1982; Knudsen, 1984; Cazettes et al., 2014). While we agree that there does seem to be a change in ITD reliability at ~160° following ruff-removal, the result is largely similar to the change that occurs in frontal space (Fig 1b), which is consistent with the ruff-removed head functioning as a sphere. Thus, we wouldn’t expect rearwardly-tuned neurons, if they could be readily found, to adjust their frequency tuning to higher frequencies. Finally, we want to clarify that we focused our analyses on frontally-tuned neurons because frontal space is where we observed the largest change in ITD reliability. Text was added to the Discussion section to clarify this point (Lines 313-321).

      2) The study suggests that information about high-frequency ITDs is not passed on to the ICX if the ICX does not contain neurons that have a high best frequency. However, neurons might be sensitive to ITDs at frequencies other than the best frequency, particularly if their frequency tuning is broader. It is also unclear whether the best frequency of a neuron always corresponds to the frequency that provides the most reliable ITD information, which the study implicitly assumes.

      The concern about ITD sensitivity at non-preferred frequencies was addressed under the essential revision #3, as well as under Reviewer 1’s concerns.

    1. But why do the feeble-minded tend so strongly to become delinquent? The answer may be stated in simple terms. Morality depends upon two things: (a) the ability to foresee and to weigh the possible consequences for self and others of different kinds of behavior; and (b) upon the willingness and capacity to exercise self-restraint. That there are many intelligent criminals is due to the fact that (a) may exist without (b). On the other hand, (b) presupposes (a). In other words, not all criminals are feeble-minded, but all feeble-minded are at least potential criminals. That every feeble-minded woman is a potential prostitute would hardly be disputed by any one. Moral judgment, like business judgment, social judgment, or any other kind of higher thought process, is a function of intelligence. Morality cannot flower and fruit if intelligence remains infantile.

      I like this explanation because it gives some insight as to why feeble-minded people become delinquents. I like that it stated not all criminals are feeble-minded because there are some criminals that are very intelligent. The unabomber for example, Ted Kaczynski was a mathematics professor, and retired to later became a bomb maker. He wasn't caught for a good while which shows that he was intelligent. I think this is important to the history of psychology because we now have developed knowledge that feeble-minded people aren't always criminals, but have some potential of becoming a criminal.

    2. Instead of wasting energy in the vain attempt to hold mentally slow and defective children up to a level of progress which is normal to the average child, it will be wiser to take account of the inequalities of children in original endowment and to differentiate the course of study in such a way that each child will be allowed to progress at the rate which is normal to him, whether that rate be rapid or slow.

      I think this would be the best approach to help children that are slower than "normal" children. I stated earlier that the best way to help children learn is to see how they learn such as being hands on, visual, or auditory learning. Everyone doesn't learn stuff at the exact same time, it may take some people longer to understand the concept than other people. Everyone learns differently. This is important to the history of psychology because it helps us understand that children who are slower than other children can develop intelligence at their own speed. I think another reason this is important to the history of psychology is because we now understand the grade of intelligence is different for everyone, but people can build intelligence at their own progressive rate and that we can't force someone who is slower at learning to be at normal speed.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      We thank the reviewers for their comments and very helpful suggestions to improve the manuscript. All the reviewers address that further confirmation of the causality of activity-induced AMPK activation and AMPK-induced mitochondrial fission and mitophagy regulating dendritic outgrowth in immature neurons would strengthen the significance of this study. We believe that this is the first study demonstrating that AMPK mediates activity-dependent dendritic outgrowth of immature neurons, and that regulation of mitophagy is critical for dendrite development.

      We can perform most of the experimentations and corrections requested by the reviewers. We have already made several revisions and are currently working on additional experiments. All experiments will be finished in several weeks and we expect to submit a full revision by the due date.

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      Reviewer #1-1.- MMP alone is not a good indicator of mitochondrial health. For instance, ATPase inhibitor causes increase in MMP and complex I inhibition diminish MMP and in both cases mitochondrial function is impaired. On the other hand, authors use increased flickering and mitochondrial ROS production as an indicator of enhanced respiration but they could also be used as indicators of mitochondrial dysfunction. Other assays, such as oxygen consumption, are needed to assess the mitochondrial function.

      *Related comments by Reviewer #2-C. In figure 6 it is unclear what is the significance of the TMRM "flickering" parameter quantified and the difference between the control and knockdown condition is small on average. *

      Increase in TMRM flickering and mitochondrial ROS production, which we used as indicators of enhanced mitochondrial respiration, can certainly also be caused by mitochondrial dysfunction. We think it difficult to adopt an oxygen consumption assay in our system, as the transfection efficiency in the primary hippocampal culture is low (~10%). Instead, we plan to assess the mitochondrial function in control and AMPK deficient cells by using an ATP FRET sensor targeted to mitochondria (Mito-ATeam, Imamura et al., PNAS, 2009; Yoshida et al., Methods Mol Biol 2017). Mito-ATeam will be transfected in neurons to compare mitochondrial ATP synthesis in control and AMPK deficient neurons.

      *Reviewer #1-2.- It would be interesting to show a better characterization of the mitophagy flux and to test whether pharmacological or genetic stimulation of mitophagy could revert the effect of AMPK KD on dendritic outgrowth, ultimately linking AMPK, mitophagy and dendritic outgrowth. The latter experiments may be challenging but not impossible, for example see (PMID: 27760312). *

      We understand that it is important to demonstrate more strongly the correlation of the AMPK-induced peripheral fission and subsequent mitophagy of fragmented mitochondria with dendritic outgrowth. We will attempt the suggested experiment to see if induction of autophagy could revert the dendritic hypoplasia by AMPK KD. However, because AMPK deficiency generates elongated mitochondria defective in fission rather than fragmented mitochondria that are failed to undergo mitophagy, we doubt that activating mitophagy will properly remove damaged mitochondria.

      In parallel to the above experiments, we currently analyze if inhibition of mitochondrial fission or mitophagy would phenocopy the hypoplastic dendrites of AMPK-deficient neurons, and if the activation of fission would rescue the phenotypes of AMPK KD, to strengthen the causality of AMPK-dependent fission, autophagy and dendrite outgrowth. So far we have observed that inhibition of mitochondrial fission by MFF knockdown or inhibition of autophagy by bafilomycin treatment strongly suppress dendrite outgrowth. MFF knockdown also leads to the elongation of mitochondria with decreased association of p62-puncta, strikingly reminiscent of AMPK-deficient neurons. Please see attached figures. Completed analyses will be included in the full revision.

      *Reviewer #1-4.- Results clearly indicate that AMPK enhances mitochondrial fission, and that AMPK is necessary for proper dendritic outgrowth. However, as indicated, the role of AMPK-dependent mitochondrial fission in promoting dendritic growth is not well demonstrated. A possible, and not very difficult experiment, would be the expression of non-phosphorylable MFF S155/172 mutant (perhaps is also needed to knock down the endogenous MFF). Use of this mutant would abolish AMPK-dependent mitochondrial fission while preserving its other functions. *

      Related comments by Reviewer #3-3. The authors could further confirm the claim by examining how mutations in Mff and ULK2 which cannot be phosphorylated by AMPK can rescue defects in mitochondrial fission and spine density.

      We will examine if the expression of non-phosphorylable MFF S155/172 mutant would cause defective autophagy and dendritic arbor growth similarly to AMPK KD neurons. In addition, we will test whether MFF S155/172 mutant would inhibit activity-induced mitochondrial fission to strengthen the link between activity-AMPK-MFF-autophagy axis and dendritic outgrowth.

      *Reviewer #1-Minor 2.- It is intriguing that as shown in Fig. 2A, rather than an increase in pAMPK/AMPK at DIV5 seems there is less phosphorylation despite FRET analysis indicate more AMPK activation. On the other hand, most of the blots in Fig. 6 seem to be overexposed. *

      The exposure time of WB in Fig. 2A was adjusted so that all lanes can be compared. We will fix the exposure time.

      Reviewer#2-A. Most of the evidence on the role of AMPKa2 relies on a shRNA-based strategy. The authors have performed this approach with the best practice, including selecting 2 shRNA plasmids for each gene, and performing a rescue experiment with shRNA -resistant cDNA. Yet, it is critical to provide stronger evidence with all the tools available to demonstrate the role of AMPKa2 in dendritic development. This is especially important because the effect reported by the authors is a transient effect: indeed, dendritic development appears abnormal in very young neurons (P5) but largely normal afterwards (P10). Hence one cannot discard a non specific effect on cell viability or sampling effect. The number of neurons counted is fairly low (about 30 neurons per condition) and it is not clear if they come from several independent cultures. It is known that plasmid preparation can impact cell viability and performing the experiment with only one batch of plasmid prep could explain why one plasmid would produce a short-lived effect on cell morphology. Two shRNA constructs are presented in figure S2A but only one is used for morphological experiments quantified in S2D-E with again a very low N number. The specific experiments I would recommend would be to increase the N: at least 25-30 neurons counted per culture, 3 independent cultures, and presenting the results of the two shRNA plasmids for both AMPKa1 and AMPKa2. Furthermore, the immunofluorescence validation of knockdown provided in figure S2B is not really convincing, a nuclear marker is lacking to determine where cells are (it seems that many cells are present in the image, maybe some of them with low AMPKa2 expression as well). A quantification should be provided as well as evidence for shRNA #1 and #2. *

      *

      We thank the reviewer for valuable suggestions to improve our manuscript. All the knockdown analyses were done from three independent experiments using different mouse litters and multiple batches of plasmid prep. N number was low because of a low transfection efficiency in the primary culture. We will repeat experiments and increase the sample number. We will also present results of the two shRNA constructs. We will redo the immunofluorescence for validation of shRNA knockdown and replace Extended Data Fig. 2B which was pointed out as not being clear.

      *Reviewer #2-C. The observation, in vivo, that dendritic development is normal at P10 is intriguing but this reconciles the observation of altered dendritic development with previous studies demonstrating that AMPK knockout has little effect on brain development, as well as previous studies (Mairet-Coello et al. Neuron 2013, Lee et al. Nat commun 2022) targeting AMPKa2 in the hippocampus of AD mouse models by in utero electroporation. This is a critical aspect of the paper and as stated in the discussion, the previous studies only looked at the end product (neuronal morphology appears normal after development) but not the process of neuronal development and maturation. The in vitro experiment offer the possibility to study dendritic development over time in the same population of neurons, either through selected time points, or through time lapse imaging. This would strengthen one of the most original aspect of this work. *

      We thank the reviewer for an important suggestion. We will analyze if dendritic morphology and mitochondria would recover in later stages in culture. However, the dendritic growth defects in AMPK KD neurons are apparently more severe in culture and our preliminary results have shown that dendritic growth defects and mitochondrial elongation persist until 10DIV. We anticipate that AMPK deficiency is complemented by certain compensation mechanisms in vivo that are not present in culture, such as chemical signals or synaptic inputs from correct afferents. We will confirm the recovery of dendritic outgrowth in vivo using an AMPK alpha2 knockout mouse. We will include the results in vivo and in vitro in the revised manuscript.

      * The authors use a FRET probe to witness AMPK activity, and this part raises a lot of questions. A lot of the signal matches the regularly spaced activity peaks suggesting that FRET response is a coincidence detector of calcium waves. Hence, is the FRET signal influenced by intracellular calcium concentration, or changes in pH? To address this question, the proper control would be to use a FRET biosensor with a mutated AMPK phosphorylation site and demonstrating the absence of response to calcium waves. *

      We think it unlikely that the FRET probe detects calcium concentration or pH change, as its kinetics and timing are different from calcium spikes. For confirmation, we will examine a FRET probe lacking phosphorylation sites to negate that calcium waves directly activate the FRET probe.

      * Also, the parameter used for quantification is a so-called "number of FRET peaks over 3 minutes" for which the biological significance is unknown. On average there are 1-2 such "peaks" in control conditions (figure 4). These peaks have low amplitude, sometimes around 0.05-0.1 of the YFP/CFP ratio, which is about what is expected even in AMPKa2 knockdown cells (figure S4C). Are there changes in the baseline of FRET signal? *

      We monitor FRET at 3-5 sec intervals and is set to 3 minutes due to gradual photobleaching. Although the event frequency is 0-4 times per 3 minutes observation, it is nearly absent in AMPK KD (1 small peak in 3 cells out of 40 cells) or activity deprivation, which we consider a significant difference. We have replaced Figure 4B, 4D, 4I, 4J andExtended Data Fig.4E. The basal FRET signal is lowered in AMPK KD cells, but also varies depending on the expression level of the probe. For comparision of the results shown in Figure 4 and Extended Data Fig.4, we have changed the y-axis to the normalized FRET signal {FRET/FRETbaseline} and jRGECO signals (DF/F0) in Fig. 4F, Extended Data Fig.4C, 4D.

      *

      *

      *Finally, given that calcium peaks and AMPK activity peaks overlap, one key observation is the continued presence of calcium peaks upon AMPKa2 knockdown in figure S4D. Yet, the scale for jRGECO1 intensity in figure S4D differs from the scale in figure 4, making it difficult to interpret. It seems that on average the delta (peak-baseline) is 2000 in wild-type cells (figure 4), compared to 500 in AMPKa2 knockdown cells, which suggests a strong reduction in calcium signal amplitude upon knockdown of AMPK. This should be clarified to demonstrate that the FRET probe peaks are really due to AMPK activity. Also, the effect of STO-609 should be added to this figure. *

      We think that the presence of calcium transients in AMPK KD cells supports our conclusion that AMPK is downstream of calcium signaling. The amplitude of calcium spikes was actually lowered in AMPK KD cells. We think it is due to the reduction of the cell size and complexity in KD cells. To negate that AMPK inhibition affects calcium influx, we will examine if acute inhibition by an AMPK inhibitor will suppress only FRET signals but not calcium waves. In addition, we will monitor calcium waves and FRET signals in neurons treated with STO-609 or AICAR. STO-609 and AICAR should decrease and increase FRET signals without affecting calcium influx.

      • Other comments by Reviewer #2*
      • Similarly, the number of events in figure 5F-G is really low. Is a difference between 0.02 in the control group and 0.01 in the knockdown group physiologically relevant?*

      Since p62 puncta contact only a small mitochondrial region, the overlap area of mitochondria with p62 in the total mitochondrial area is small. We will analyze the number of p62 puncta associated with mitochondria per unit dendritic area.

        • Lines 339-350, the authors discuss about a putative regulatory loop involving AMPK dephosphorylation. Since this part of the discussion is based on the FRET signal, the authors should consider if an alternative explanation could be the kinetics of the biosensor dephosphorylation.* We will revise Discussion to argue about alternative possibilities of dynamic oscillation of the FRET signal when we get data from the above experiments.

      *In terms of significance, I would have two major criticisms. The first is that it appears that many of the findings by the authors are redundant with observations of the roles of CAMKK2-AMPK-MFF-ULK1 in AD model mice, see for example the work by Polleux (Mairet-Coello et al. Neuron 2013, Lee et al., Nat commun 2022). As said above, my opinion is that the paper should put more emphasis on the transient effect of AMPK, which would be a novel observation and, as the authors rightfully discuss, a phenotype potentially overlooked in previous studies of AMPK KO mice. The second is that many points in the discussion seem to be over reached and are not entirely supported by the data. As an example lines 298-299 "leading to mitochondrial dysfunction with low respiratory activity" (not addressed in this manuscript), lines 312-313 "multiple signatures of mitochondrial dysfunction such as reduced delta-Psi-m and ROS production" (biological significance of these parameters?), lines 332-334 "AMPK phosphorylation dynamically oscillates in dendrites, depending on Ca2+ influx and CAMKK2 activity, while it is independent of LKB1" (the authors don't study AMPK phosphorylation, and the experimental data has many limitations that need addressing), etc. *

      We thank reviewer’s guidance. We think this is the first study showing AMPK function in dendritic arbor growth in immature neurons before synaptogenesis. We will rewrite the manuscript to emphasize that neuronal activity in immature neurons regulates dendrite formation via AMPK in a short time window during brain development. Discussion will be revised according to the data of the ongoing additional experiments.

      Reviewer#3-1. All these studies are done in invitro neuronal culture modal with transfection of ShRNAs to Knockdown AMPK. An alternative possibility is that authors could use an AMPK Conditional Knockout mouse models Conditional deletion of (AMPKα1/α2 (AMPKα1−/−; AMPKα2F/F; Emx1-Cre) derived neurons for this study.

      We showed the effect of AMPK knockdown in hippocampal neurons in culture and in vivo (Fig. 2). For validation, we also examined CRISPR interference (Extended data Fig.2). We will examine in vivo phenotypes in pyramidal neurons in AMPK alpha2 knockout mice to further validate our observation.

      Description of the revisions that have already been incorporated in the transferred manuscript

      Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty.

      *Reviewer #1-1.- Authors use mitochondrial membrane potential (MMP), MMP flickering and mitochondrial ROS production as indicators of mitochondrial function, but this is not convincing. To analyze MMP, authors use TMRM fluorescence normalized by mitochondria area. This is not correct, using this strategy would mean that a symmetric fission would instantly double MMP and fusion would half MMP. The analysis must be made by tracing ROIs of the same surface in different mitochondria and determining TMRM fluorescence in these ROIs. *

      We have reanalyzed TMRM fluorescence using the method indicated. As a result, TMRM fluorescence show a slight but significant decrease (p=0.0071) in AMPK KD cells. Extended Data Fig. 5C has been replaced accordingly. We thank the Reviewer for kind guidance!

      Reviewer #1-3.- The authors treat neurons with glutamate to support the view that synaptic activity activates AMPK and promotes mitochondrial fission. However, the concentration used (100 mM) may be excitotoxic. Synaptic activity can be induced by electric field stimulation, although this require equipment that may not be available in the authors' lab. Another alternative is network disinhibition with bicuculine or to use lower concentrations of glutamate. In any case, since neurons are immature and may respond differently from mature neurons, it would be worth to verify synaptic activity by analyzing Ca2+ transients.

      *Reviewer #1-Minor 3.- It is necessary more explanation about spontaneous Ca2+ transients in immature cultures. What percentage of neuros experience it? Is it synchronized? *

      *Related comments by Reviewer#2-D. It is well established and thus not surprising that AMPK activity increases in response to synaptic activity. It is more surprising to witness such an effect of activity in very immature neurons, where presumably synapses are sparse and not well developed. For example dendritic segments in Figure 1E and 3A don't have dendritic spines. Western-blot and/or immunofluorescence of synaptic markers with comparison to fully mature neurons would complete figure 1 and make the case whether the reported effects are marginal or a strong driver for dendritic development and AMPK regulation. *

      We thank the reviewers’ point that we failed to emphasize in the original manuscript.

      We focus on AMPK function during activity-dependent dendritic outgrowth in immature neurons before the onset of synaptogenesis. It has been shown that synaptogenesis occurs in dissociated hippocampal cultures between 7-12 DIV (eg, Renger et al., Neuron 2001) and that developing dendrites at 5 DIV are activated by ambient glutamate which is spontaneously released from nearby immature axon terminals and undergo spontaneous Ca2+ transients, and this non-synaptic activity is important for dendritic outgrowth (Andreae and Burrone, Cell Rep 2015). We have observed that Ca2+ transients in individual neurons are variable in frequency and magnitude and are not synchronized in consistent with previous studies. We have performed immunofluorescence with a synaptic marker PSD95 and confirmed that dendritic spines are not yet differentiated and PSD95 is sparsely distributed along the dendritic shaft in DIV5 hippocampal neurons. We describe the nature of Ca2+ transients in the Results more clearly and provide high magnified images and immunofluorescence with a synaptic marker PSD95 of the neurons at DIV5 and DIV13 as a new Fig. 1A. We believe that this is the first indication of AMPK function in non-synaptic neuronal activity during dendritogenesis.

      We have observed induction of mitochondrial fission in neurons treated with 1 µM glutamate. Extended Data Fig. 1E has been replaced accordingly. Since GABA is known to induce depolarization in immature neurons (Soriano et al., PNAS 2008), we would like to exclude bicuculine treatment from this analysis.

      *Reviewer #1-5.- The statistical analysis seems appropriate, but it is confusing that sometimes non-parametric and sometimes parametric tests are used. It is not indicated which test is used to determine normality since the methods section lacks a statistical analysis section.

      *

      We have revised Methods and have described statistical analysis in detail.

      *Reviewer #1-Minor 1.- Authors should double check the analysis shown in Fig. 1A. As it is shown, Ca2+ transients are 2-3% higher than basal, when the video shown in video 1 seem to indicate much more. *

      Thank you for pointing this out. In the original version, the percentile change was erroneously measured across the entire visual field, including areas without neurons. We have replaced Fig. 1B (original Fig. 1A) with reanalyzed data in the proximal region of the apical dendrite.

      *Reviewer #1-Minor 4.- It is interesting that AMPK KD in vivo impairs dendritic architecture at P5, however at P10 the defect seem to be somehow compensated. This result apparently detracts from the relevance of the findings, however last year was published a paper in which in an animal model of Huntington's disease dendritic architecture is delayed during the first week but normalizes thereafter. Despite later normalization in dendritic architecture, this early defect in maturation has effects in adulthood as pharmacological restoration of arborization during the neonatal period suppresses some phenotypes observed in adulthood (PMID: 36137051). I believe that discussing this paper would help the reader to recognize the potential relevance of the findings. *

      *Related comments by Reviewer #2: Nonetheless let aside the technical concern, if their findings hold true, this is an intriguing mechanism. There are interesting parallels to be made with observations of altered morphology and excitability of neurons in Huntington's disease model mice during the first postnatal week. These changes spontaneously reverts and are undetectable in the second week (Braz et al. Science 2022). Thus, precedent suggests that indeed dendritic development can take a slow course, and this study also suggests that this is important later since normalization of abnormal excitability during the first week in HTT mice prevents some of the phenotypes later in life. Here again, an interesting parallel could be made with the known role of AMPK in synaptic loss in AD models. *

      We thank the reviewers for the supportive comments. We will refer this paper and discuss about potential significance of the transient defects in early dendrite morphology in AMPK deficient neurons.

      *Reviewer#2-B. The Crispr method lacks validation which should be provided somehow. The drug-based experiment relies on compound C, a notoriously non specific AMPK inhibitor (see for example Bain et al. Biochem J 2007, or Vogt et al. Cell Signal 2011). Data obtained with Compound C is hard to interpret given the number of kinases that are affected by the drug and should be removed from the manuscript. *

      We have added immunofluorescence images for validation of AMPK deletion by CRISPRi (Extended Data Fig. 2F).

      We think the results of Compound C treatment support our conclusion in combination with KD and CRISPRi, but will delete the results in accordance with this comment.

      • Other comments by Reviewer#2*
      • Figure 5A-C relies on the quantification of fission events that appear very rare (0.4 event per 20 minutes). The difference between the two groups is between 0.1 and 0.2 events on average. Since this was quantified on a fairly low number of cells (N=14), it is hard to appreciate exactly how many events have been observed and the actual physiological relevance. Furthermore individual datapoints should be added to the figure to estimate variability.*

      The number of fission events was counted in mitochondria in a unit length of dendrite of similar diameter, and normalized by the number of mitochondria. The values were thus small as they represent average number of events in one mitochondrion in 20 minutes. We have replaced the Fig. 1K, 3F, 5B and 5C to show the number of fission events in mitochondria included in a unit length of dendrites of similar diameter. Individual data points have been included.

      Reviewer #3-4. Authors showed activity-dependent calcium signaling controls mitochondrial homeostasis and dendritic outgrowth via AMP-activated protein kinase (AMPK) in developing hippocampal neurons do the cortical mitochondria respond the same way as the hippocampal neurons?

      Thank you for the comment. As pyramidal neurons in the cerebral cortex and hippocampus are basically the same origin, it is likely that they share the same signaling. We use hippocampal neurons in this study to perform quantitative analysis of dendritic morphology in the same type of neurons. Primary cultures of cortical neurons contain multiple different cell types, making it difficult to analyze the same cell type.

      4. Description of analyses that authors prefer not to carry out

      Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

      *Reviewer #1: If activity is observed in only a portion of the neurons, taking advantage of the stablished long-term live imaging protocol in the authors' lab, it would be interesting to study in the same culture whether neurons that experience spontaneous activity develop more than those that do not. *

      We prefer not to carry out this analysis, as activity-dependent dendritic growth has already been well described in previous papers. It will take considerable time to observe the number of neurons for analysis of correlation between Ca2+ transients and dendrite morphology. We would like to focus our effort to demonstrate AMPK signaling during activity-dependent dendritic growth.

      Reviewer#3-2. Another technical issue here, most of the experiments are carried out on Neurobasal media, which has a lot of glucose plus substitution of glutamax might be not the perfect conditions for AMPK. Authors could not obtain evidence supporting the regulation of mitochondria biogenesis by PGC1α phosphorylation and expression. This surprise me, if you could reduce the glucose concentration if might change.

      We observed little or no changes in phosphorylation of PGC1alpha by enhancing or suppressing neuronal activity or AMPK activity. As mitochondrial biogenesis is very active in growing neurons, we surmise that PGC1alpha and mitochondrial biogenesis is regulated by multiple mechanisms during neuronal differentiation and AMPK activation/inhibition might not induce visible changes. We agree the reviewer that there is room to seek the conditions under which changes in PGC1alpha can be detected, but we do not see why Neurobasal plus glutamax is not suitable for this study. Multiple papers studying AMPK function in cultured neurons use similar culture media (Sample et al., Mol. Cell. Biol., 2015; Muraleedharan et al., Cell. Rep., 2020; Lee et al., Nat. Commun., 2022). We might see PGC1 phosphorylation by glucose deprivation, as it decreases glycolysis-derived ATP and thereby activates AMPK. Since we focus on AMPK activation by calcium signals, we are afraid that it would be difficult to distinguish AMPK activation by ATP deficiency or calcium signaling in glucose deficient condition. In addition, glucose deprivation would affect neuronal activity (which consumes large amount of ATP) and neuronal differentiation including dendritic outgrowth.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Point-by-point response to reviewer comments

      General statement

      Several studies have previously demonstrated functional links between the death receptors (DR) TRAIL-R1/2 and the Unfolded protein response (UPR). In this manuscript, we describe the previously unrecognized IRE1-dependent dual regulation of the expression of another DR, CD95, and CD95L-induced cell death. Our work therefore adds to the current knowledge on the functional links existing between UPR and DR signaling and provides novel mechanistic insights on a dual regulation involving both transcriptional and post-transcriptional control of the expression of CD95 mRNA expression by IRE1. To demonstrate this, we have used both genetic (overexpression of XBP1s or dominant-negative forms of IRE1) and pharmacologic (IRE1 RNase inhibitor) approaches and cellular models of glioblastoma (GB) and triple-negative breast cancer (TNBC). We show that IRE1 RNase activity promotes CD95 expression and CD95-mediated cell death via the transcription factor XBP1s whilst IRE1 RNase limits CD95 expression and cell death via its ability to cleave RNAs (through RIDD, for Regulated IRE1-dependent decay of RNAs, activity). Furthermore, we report that IRE1-mediated control of CD95 expression is active in vivo, using a model of CD95-mediated fulminant hepatitis in mice. Lastly, we correlate these results to the pathology by showing that CD95 expression is decreased in RIDD high or XBP1s low human GB and TNBC tumors.

      We thank the reviewers for their fair assessment of our manuscript and for their insightful comments. Below, we describe the experiments we plan to carry out to address the reviewers’ comments.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): Summary: Here the authors argue that IRE1 activation has opposite effects on Fas/CD95 expression/stability in a number of contexts, via either RIDD-dependent degradation of Fas mRNA or XBP-1-mediated induction of Fas expression, which led to either increased or decreased sensitivity to Fas-induced apoptosis in a number of settings. Major issues: The study is somewhat preliminary and inconclusive in that it is not clear why the RIDD function of XBP-1 appears to predominate in vitro in the cell lines examined, leading to modest increases in Fas expression levels (Figure 1) when IRE1 DN versus IRE1 WT constructs are overexpressed, which is at odds with the latter part of the paper which suggests that inhibition of RIDD led to reduced Fas expression levels. However, this could be due to supraphysiological levels of IRE1 being expressed under overexpression conditions, leading to confounding results. Similarly, when XBP-1s is overexpressed in vitro (Figure 5) the modest increases in CD95/Fas expression and sensitization to Fas-induced cell death may not be fully representative of what would be observed at physiological levels of XBP-1s activation. The in vivo results obtained using an IRE1 RNase inhibitor (MKC8866) contradict the earlier part of the study (as Fas levels decreased and there was protection from Fas-induced liver toxicity) and this could be due to a multitude of reasons. There is no doubt that impacting on IRE1 activity has interesting effects on CD95/Fas expression, which can be up- or -down-regulated, with consequences for cell death induced via engagement of the latter receptor, however, the manuscript does not offer a lot of clarity on which outcome is the predominant one in the context of engagement of the UPR. I have the following suggestions for improvement.

      We thank the reviewer for this overall positive assessment.

      1. The authors should induce ER stress using Thaps, Brefeldin A and Tunicamycin, and explore the effects of doing this on Fas expression levels in the context of silencing endogenous IRE1, XBP-1 and PERK.

      We do agree with this reviewer that the proposed experiments might further highlight which of the IRE1-dependent control of CD95 expression dominates upon ER stress induction. Therefore, we will perform the requested experiment in the various cell lines already used in the manuscript.

      We propose to evaluate the expression of CD95 (at the mRNA and total protein levels) under ER stress induction (by different ER stressors) upon knock-down of IRE1 or XBP1. Other DRs (TRAIL-R1 and 2) have been shown to be induced by PERK activation and it is also demonstrated that PERK and IRE1 signaling pathways coregulate each other. As such, we also propose to assess whether PERK could also control CD95 expression in this setting.

      1. The authors should explore the effects of silencing of IRE1, XBP-1 and PERK on constitutive Fas expression and the outcome of Fas/CD95-induced apoptosis in cells not experiencing an overt activation of the UPR (i.e. in the absence of Thaps, Brefeldin A or other UPR inducer).

      We thank the reviewer for their suggestion and will perform the requested experiments as proposed.

      1. The specificity of MKC8866 at the concentration used (30 uM) is unclear. What effect does MKCC have on sensitivity towards Fas-induced apoptosis, similar to the type of experimental set up presented in Figure 5A, 5B?

      Regarding the specificity of MKC8866, this drug has been optimized and refined from a family of IRE1-specific endoribonuclease inhibitors initially obtained from a chemical library screen [1-3]. This salicylaldehyde analog has already shown to be effective in multiple cancer models including breast [4, 5] and prostate [2] cancers. We have recently demonstrated its efficacy in a GB mouse model [6]. It is therefore a widely used IRE1 inhibitor, including in the dose range 10-30 mM used in this study (e.g [4, 5]). We therefore do not think it is in the scope of this manuscript to re-assess it specificity. However, we will aim at testing an additional IRE1 inhibitor to assess whether similar effects can be observed on CD95 expression in cells. To do so, we propose to use a novel IRE1 kinase inhibitor developed in the laboratory (DOI: 10.26434/chemrxiv-2022-2ld35 – Accepted iScience) and shown to efficiently blunt IRE1 activity in GB. As also suggested by the reviewer, we will assess whether the use of MKC-8866 can affect CD95L-induced cell death in cell lines.

      1. Similarly, what effects does MKC8866 (at 30 uM) have on key Fas pathway determinants, such as Fas, FLIPL, FLIPs, Caspase-8, FADD, RIPK1, A20, CYLD, cIAP-1, cIAP-2 and Bid? There are many points at which MKC8866 could influence the outcome of Fas receptor engagement beyond the receptor itself.

      In the present manuscript, we have shown that MKC-8866 reduces CD95 expression in mouse liver (IHC depicted in Figure 4B and S3B) in vivo and that, when used at 30 mM in vitro, it prevents the loss of CD95 expression induced by tunicamycin or thapsigargin in U87 cells (Fig 1C-F). We do agree with the reviewer that IRE1 may impact CD95-induced cell death beyond modulating CD95 expression, as also already discussed in the present manuscript. Therefore, and as suggested, we will assess whether MKC-8866, used at 30 mM, also impacts on the basal cellular expression of the various components of CD95 signaling mentioned by this reviewer.

      Minor issues:

      1. For the Fas mRNA cleavage experiments presented in Figure 2, there are no irrelevant control mRNAs to allow the reader to judge whether the effects presented are specific to Fas mRNA or are commonly observed for many mRNAs at these amounts of IRE1 (1 ug, 0.5 ug, which appear high).

      The expression of Fas mRNA was already normalized to GAPDH (which does not seem to vary upon incubation with IRE1). We nevertheless will test the expression of additional “irrelevant” RNAs as suggested by the reviewer.

      Reviewer #1 (Significance (Required)): General assessment: this is an interesting study, as there is little knowledge currently concerning how the UPR influences Fas expression or Fas-dependent outcomes. However, the impact of this work is limited by the overexpression approaches used, which could produce artifactual results, as well as the contradictory message of the study.

      Although we think that the message of the manuscript is indeed complex, the work presented herein does not rely exclusively on overexpression approaches as our genetic-based results are also comforted by the use of pharmacologic inhibitors of IRE1.

      Advance: the advance reported here is relatively modest and limited in scope due to the inconclusive nature of the data presented.

      Audience: this study will be of interests to specialists in the UPR and cell death communities.

      We thank the reviewer for acknowledging the overall novelty of our work. We do hope that the experiments proposed will address her/his remaining concerns.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The authors address here for the first time the connection between CD95, which is known as Fas, and ER stress. The role of another DR, TRAIL-R2 has been already reported, but this is the first study uncovering the link between Cd95 system and ER stress. The study is performed on the high level and supported by all necessary controls. They find the connection between IRE1 and CD95 and show that it might play a role in Cd95 signaling and attenuate CD95-mediated cell death.

      Further, the correlation between CD95 expression and IRE is found in tumors. Importantly the authors find out the connection between XBP1 and CD95 expression, which was not reported to date. Hence, it is a very important and highly essential piece of research.

      We thank the reviewer for these very positive comments and the acknowledging of the novelty and importance of our study.

      However, I would like to clarify the several issues:

      1: Figure 1. Tunicamycin obviously leads to deglycosylation of CD95, which is indicated by the appearance of 35 Kda band. This should be highlighted and commented.

      We agree, this will be commented on in the text.

      1. Figure 2c, d. The piece of mRNA structure, which is synthesized, might have the different secondary structure and might be not cleaved by IRE, accordingly. More detailed comments have to be provided in this regard.

      The model depicted in Figure 2B is a predicted computational secondary structure of CD95 mRNA. In the experiments performed in Figure 2A, C and D the mRNA was extracted from U87 cells prior to incubation with recombinant IRE1 and the resulting products analyzed using RT-qPCR with primers flanking different portions of the CD95 mRNA sequence. For Figures 2C and D, the primers used flank the two sites which were predicted to be cleaved by IRE1 based on previous work from our lab [7]. Even though we cannot exclude that additional sites can be targeted beyond these two, the fact that the amplification of CD95 sequence is reduced in samples pre-incubated with recombinant IRE1 strongly suggests that IRE1 is indeed able to cleave CD95 mRNA in these regions in vitro. We will modify the main text to further explain this point.

      1. Figure 3. Caspase-8-3 western blots show beautiful effects but did authors see some effects further downstream, eg on PARP1 cleavage? Was cell death (not viability) measured as well? Can you comment on this?

      This is absolutely right, we will test PARP-1 cleavage in this setting as suggested. Given the morphology of the cells we observed in the viability experiments, we would expect a similar trend using cell death assays. However, we do agree with the reviewer that this should be proven experimentally, so we will perform these experiments again using cell death assays as a read out.

      1. Did the authors looked at the DISC assembly? Did they detect some differences there?

      No, we did not. We would expect some difference given the impact we have observed on CD95 expression, caspase-8 activation and cell death of expressing dominant negative forms of IRE1, but this of course needs to be actually tested. We are in the process of optimizing CD95 DISC experiments in our lab and we therefore hope to be able to address this reviewer’s comment in a revised version of the manuscript.

      Reviewer #2 (Significance (Required)):

      This is an excellent study. The authors address here for the first time the connection between CD95, which is known as Fas, and ER stress. The role of another DR, TRAIL-R2 has been already reported, but this is the first study uncovering the link between Cd95 system and ER stress. The study is performed on the high level and supported by all necessary controls. This is an important advance for the death receptor field.

      Thank you again for these very positive comments and your insightful appreciation of our work.

      References

      1. Volkmann, K., Lucas, J. L., Vuga, D., Wang, X., Brumm, D., Stiles, C., Kriebel, D., Der-Sarkissian, A., Krishnan, K., Schweitzer, C., Liu, Z., Malyankar, U. M., Chiovitti, D., Canny, M., Durocher, D., Sicheri, F. & Patterson, J. B. (2011) Potent and selective inhibitors of the inositol-requiring enzyme 1 endoribonuclease, J Biol Chem. 286, 12743-55.
      2. Sheng, X., Nenseth, H. Z., Qu, S., Kuzu, O. F., Frahnow, T., Simon, L., Greene, S., Zeng, Q., Fazli, L., Rennie, P. S., Mills, I. G., Danielsen, H., Theis, F., Patterson, J. B., Jin, Y. & Saatcioglu, F. (2019) IRE1α-XBP1s pathway promotes prostate cancer by activating c-MYC signaling, Nat Commun. 10, 323.
      3. Langlais, T., Pelizzari-Raymundo, D., Mahdizadeh, S. J., Gouault, N., Carreaux, F., Chevet, E., Eriksson, L. A. & Guillory, X. (2021) Structural and molecular bases to IRE1 activity modulation, Biochem J. 478, 2953-2975.
      4. Logue, S. E., McGrath, E. P., Cleary, P., Greene, S., Mnich, K., Almanza, A., Chevet, E., Dwyer, R. M., Oommen, A., Legembre, P., Godey, F., Madden, E. C., Leuzzi, B., Obacz, J., Zeng, Q., Patterson, J. B., Jager, R., Gorman, A. M. & Samali, A. (2018) Inhibition of IRE1 RNase activity modulates the tumor cell secretome and enhances response to chemotherapy, Nat Commun. 9, 3267.
      5. Almanza, A., Mnich, K., Blomme, A., Robinson, C. M., Rodriguez-Blanco, G., Kierszniowska, S., McGrath, E. P., Le Gallo, M., Pilalis, E., Swinnen, J. V., Chatziioannou, A., Chevet, E., Gorman, A. M. & Samali, A. (2022) Regulated IRE1α-dependent decay (RIDD)-mediated reprograming of lipid metabolism in cancer, Nat Commun. 13, 2493.
      6. Le Reste, P. J., Pineau, R., Voutetakis, K., Samal, J., Jégou, G., Lhomond, S., Gorman, A. M., Samali, A., Patterson, J. B., Zeng, Q., Pandit, A., Aubry, M., Soriano, N., Etcheverry, A., Chatziioannou, A., Mosser, J., Avril, T. & Chevet, E. (2020) Local intracerebral Inhibition of IRE1 by MKC8866 sensitizes glioblastoma to irradiation/chemotherapy in vivo, 841296.
      7. Voutetakis, K. D., D.; Vlachavas, E-I., Leonidas, DD.; Chevet, E.; Chatzioannou, A. (In preparation) RNA sequence motif and structure in IRE1-mediated cleavage.
    1. Reviewer #2 (Public Review):

      The authors' manuscript has several strengths. First, the authors consider multiple relevant levels of biology including genomics, transcriptomics, structural and functional neuroimaging, cognitive neuroscience, and psychological/environmental factors. Such an approach is often necessary to deconvolute the complexities of psychiatric phenotypes. The authors have taken careful steps to think about potential confounds (e.g., ancestry for PRS) and to try to define their phenotypes (e.g., psychological resilience and biological aging) as best as they can, given the data they have access to from the ABCD study. The manuscript is well written overall.

      My main concerns relate to core assumptions and techniques that underlie the premise of the study. First, while there is comorbidity between AD and MDD, a causal relationship between the two (in either direction) is not established. Though MDD often predates AD, this is to be expected given MDD's high lifetime prevalence (15-20% of the general population) and typical age of onset before age 65. Because AD typically presents late in life (>65 years of age), MDD will, by definition, usually predate AD. While new onset, late life MDD is often the first presenting symptom of AD/Parkinson's disease and other neurodegenerative conditions, it is also not clear that this is the same disorder as idiopathic MDD.

      To this point, two genetic tools can help us determine the biological relationship between MDD/AD, genetic correlation and Mendelian Randomization. Using the data from the MDD PRS used in this analysis, the Supplementary Table 3 from the Howard et al. 2019 paper (https://doi.org/10.1038/s41593-018-0326-7) reveals a genetic correlation of -0.041 between the two. This indicates essentially no strong relationship between the MDD/AD (perhaps even a slightly inverse relationship). Mendelian Randomization studies in addition to the Howard et al paper (https://doi.org/10.1212/WNL.0000000000010463) find no causal role for MDD towards AD and vice versa. Thus, their comorbidity is likely mediated by additional factors. Additionally, while stress contributes to AD pathophysiology, AD is strongly genetic and, given its late onset, it is unclear how genetic risk for AD would meaningfully impact the psychological resilience of a 9 to 10-year-old.

      My second concern is regarding the statement "adolescents at genetic risk for AD/MDD" when describing the sample. Per Howard et al 2019 out-of-sample prediction testing, the MDD PRS used by the authors explains between 1.5-3.2% of the phenotypic variance in MDD when used on a sample such as ABCD. MDD PRS is in its infancy and cannot reliably be used to identify individuals at high risk of MDD given that even individuals in the top 10th percentile of MDD PRS have an odds ratio for depression of only ~2.4. We would expect 90 or so individuals in this cohort to fall into this group leaving significant concerns about statistical power and the potential for false positive discoveries. While the AD PRS is significantly further along compared to MDD because of AD's simpler genetic architecture, the same concerns apply as, outside of APOE, the AD PRS does not capture the majority of phenotypic variance in AD.

      The authors state that they wish to examine the effects of perinatal adversity directly/indirectly on biological aging and then assess the potential effects of biological aging on resilience. The authors use of pubertal age as a measure of accelerated aging is understandable given the data available, though not ideal. There are well validated measures of biological age such as Horvath's epigenetic clock. While advanced pubertal age is technically a form of accelerated aging, the majority of pubertal age as a phenotype is not likely to be explained by perinatal adversity. Rather, a combination of unmeasured variables including genetic variation, dietary factors, environmental exposures (endocrine disrupting chemicals), and obesity that play a substantial role in determining pubertal age. Childhood stress has been shown to have relatively small effects on pubertal age (d = -0.1) (10.1037/bul0000270).

      Lastly, the authors employ the use of an as of yet unpublished technique to map neurotransmitters density to structural data from neuroimaging studies. While this technique is certainly interesting, its face validity is not clear given that many of the receptor-disease associations reported in the original preprint do not line up with what we know about the biology of these disorders from strong human genetics data or current FDA approved treatments. Moreover, the authors mention "Excitation/Inhibition" imbalance but the technique used appears to only include glutamate data from one receptor type, mGluR5. This may not be an adequate measure of E/I imbalance, despite there being a statistically significant finding.

      Measuring both transcriptional output from GWAS loci and gene expression correlates from MRI data is a noisy and challenging prospect. Indeed, recent research has shown poor correlation between gene expression and neurotransmitter receptor density.(https://doi.org/10.1016/j.neuroimage.2022.119671).

      Thus, fundamental aspects of this manuscript including the use of MDD PRS to identify "at risk" individuals, the unclear link between AD and adolescent psychological resilience, the use of prepubertal age as a measure of biological age, and the limited conclusions that can be drawn from the gene expression and receptor density technique limits confidence in the results as presented.

    1. Author Response:

      What is novel here is that we calculated the time-varying retinal motion patterns generated during the gait cycle using a 3D reconstruction of the terrain. This allows calculation of the actual statistics of retinal motion experienced by walkers over a broad range of normal experience. We certainly do not mean to claim that stabilizing gaze is novel, and agree that the general patterns follow directly from the geometry as worked out very elegantly by Koenderink and others.  We spend time describing the terrain-linked gaze behavior because it is essential for understanding the paper. We do not claim that the basic saccade/stabilize/saccade behavior is novel and now make this clearer.

      The other novel aspect is that the motion patterns vary with gaze location which in turn varies with terrain in a way that depends on behavioral goals. So while some aspects of the general patterns are not unexpected, the quantitative values depend on the statistics of the behavior.  The actual statistics require these in situ measurements, and this has not previously been done, as stated in the abstract.

      The measured statistics provide a well-defined set of hypotheses about the pattern of direction and speed tuning across the visual field in humans. Points of comparison in the existing literature are hard to find because the stimuli have not been closely matched to actual retinal flow patterns, and the statistics will vary with the species in question. However, recent advances allow for neurophysiological measurements and eye tracking during experiments with head-fixed running, head-free, and freely moving animals. These emerging paradigms will allow the study of retinal optic flow processing in contexts that do not require simulated locomotion. While the exact the relation between the retinal motion statistics we have measured and the response properties of motion-sensitive cells remains unresolved, the emerging tools in neurophysiology and computation make similar approaches with different species more feasible.

      A more detailed description of the methods including the photogrammetry and the reference frames for the measurements has been added primarily to the Methods section.

      Reviewer #1 (Public Review):

      Much experimental work on understanding how the visual system processes optic flow during navigation has involved the use of artificial visual stimuli that do not recapitulate the complexity of optic flow patterns generated by actual walking through a natural environment. The paper by Muller and colleagues aims to carefully document "retinal" optic flow patterns generated by human participants walking a straight path in real terrains that differ in "smoothness". By doing so, they gain unique insights into an aspect of natural behavior that should move the field forward and allow for the development of new, more principled, computational models that may better explain the visual processing taking place during walking in humans.

      Strengths:

      Appropriate, state-of-the-art technology was used to obtain a simultaneous assessment of eye movements, head movements, and gait, together with an analysis of the scene, so as to estimate retinal motion maps across the central 90 deg of the visual field. This allowed the team to show that walkers stabilize gaze, causing low velocities to be concentrated around the fovea and faster velocities at the visual periphery (albeit more the periphery of the camera used than the actual visual field). The study concluded that the pattern of optic flow observed around the visual field was most likely related to the translation of the eye and body in space, and the rotations and counter-rotations this entailed to maintain stability. The authors were able to specify what aspects of the retinal motion flow pattern were impacted by terrain roughness, and why (concentration of gaze closer to the body, to control foot placement), and to differentiate this from the impact of lateral eye movements. They were also able to identify generalizable aspects of the pattern of retinal flow across terrains by subsampling identical behaviors in different conditions.

      Weaknesses:

      While the study has much to commend, it could benefit from additional methodological information about the computations performed to generate the data shown. In addition, an estimation of inter-individual variability, and the role of sex, age, and optical correction would increase our understanding of factors that could impact these results, thus providing a clearer estimate of how generalizable they are outside the confines of the present experiments.

      Properties of gait depend on the passive dynamics of the body and factors such as leg length and subject specific cost functions which are influenced by image quality and therefore by optical correction. In this experiment all subjects were normal acuity or corrected to normal (with no information regarding their uncorrected vision). This is now noted in the Methods. The goal of the present work was to calculate average statistics over a range of observers and conditions in order to constrain the experience-dependent properties one might see in neurophysiology. We have added between-subjects error bars to Figure 2 and added gaze angle distributions as a function of terrain for individual observers in the Supplementary materials. Figure 4 b and d now show standard errors across subjects. Individual subject plots are shown in the Supplementary materials. For Figure 2, most variability between subjects occurs in the Flat and Bark terrains where one might expect individual choices of energetic costs versus speed and stability etc might come into play. This is supported by our subsequent unpublished work on factors influencing foothold choice. We have also found that leg length determines path choices and thus will influence the retinal motion. Differences between observers are now noted in the text. These individual subject differences should indicate the range of variability that might be expected in the underlying neural properties and perhaps in behavioral sensitivity. Because of the size of our dataset (n=11) it is not feasible to make comparisons of sex or age. There were equal numbers of males and females and age ranged from 24 to 54. Now noted in the Methods section.

      Reviewer #2 (Public Review):

      The goal of this study was to provide in situ measurements of how combined eye and body movements interact with real 3D environments to shape the statistics of retinal motion signals. To achieve this, they had human walkers navigate different natural terrains while they measured information about eyes, body, and the 3D environment. They found average flow fields that resemble the Gibsonian view of optic flow, an asymmetry between upper and lower visual fields, low velocities at the fovea, a compression of directions near the horizontal meridian, and a preponderance of vertical directions modulated by lateral gaze positions.

      Strengths of the work include the methodological rigor with which the measurements were obtained. The 3D capture and motion capture systems, which have been tested and published before, are state-of-the-art. In addition, the authors used computer vision to reconstruct the 3D terrain structure from the recorded video.

      Together this setup makes for an exciting rig that should enable state-of-the-art measurements of eye and body movements during locomotion. The results are presented clearly and convincingly and reveal a number of interesting statistical properties (summarized above) that are a direct result of human walking behavior.

      A weakness of the article concerns tying the behavioral results and statistical descriptions to insights about neural organization. Although the authors relate their findings about the statistics of retinal motion to previous literature, the implications of their findings for neural organization remain somewhat speculative and inconclusive. An efficient coding theory of visual motion would indeed suggest that some of the statistics of retinal motion patterns should be reflected in the tuning of neural populations in the visual cortex, but as is the present findings could not be convincingly tied to known findings about the neural code of vision. Thus, the behavioral results remain strong, but the link to neural organization principles appears somewhat weak.

      We agree, but we think that strengthening the neural links requires future studies. As mentioned above, it is very difficult to relate the measured statistics to existing neurophysiological literature and we have tried to make this clearer in the Discussion (p14, 15, 16). This is because the stimuli chosen are typically arbitrary and not chosen to be realistic examples of patterns consistent with natural motion across a ground plane. Other stimuli are simply inconsistent with self-motion together with gaze stabilization (eg not zero velocity at the fovea). It has also been technically difficult to map cell properties across the visual field. We have made the comparisons we thought were useful. The point of the paper is to provide a hypothesis about the pattern of direction and speed tuning across the visual field. So the challenge for neurophysiology is to show how the observed cell properties vary across the visual field. Note also that the motion patterns will be influenced by the body motion of the animal in question, and because of this we are now collaborating with a group who are attempting to record from monkey MT/MST during locomotion while tracking eyes and body. Similarly we are training neural networks to learn the patterns generated by human gait to develop more specific hypotheses about receptive field properties.

      Reviewer #3 (Public Review):

      Gaze-stabilizing motor coordination and the resulting patterns of retinal image flow are computed from empirically recorded eye movement and motion capture data. These patterns are assessed in terms of the information that would be potentially useful for guiding locomotion that the retinal signals actually yield. (As opposed to the "ecological" information in the optic array, defined as independent of a particular sensor and sampling strategy).

      While the question posed is fundamental, and the concept of the methodology shows promise, there are some methodological details to resolve. Also, some terminological ambiguities remain, which are the legacy of the field not having settled on a standardized meaning for several technical terms that would be consistent across laboratory setups and field experiments.

      Technical limits and potential error sources should be discussed more. Additional ideas about how to extend/scale up the approach to tasks with more complex scenes, higher speed or other additional task demands and what that might reveal beyond the present results could be discussed.

      This issue is addressed in more detail in the Discussion, second paragraph, and also the second last paragraph.

    1. Author Response

      Reviewer #1 (Public Review):

      This work presents a unification model (of sorts) for explaining how the flow of evidence through networks can be controlled during decision-making. The authors combine two general frameworks previously used as neural models of cortical decision-making, dynamic normalization (that implement value encoding via firing activity) and recurrent network models (which capture winner-take-all selection processes) into a unified model called the local disinhibition-based decision model (LDDM). The simple motif of the LDDM allows for the disinhibition of excitatory cells that represent the engagement of individual actions that happens through a recurrent inhibitory loop (i.e., a leaky competing accumulator). The authors show how the LDDM works effectively well at explaining both decision dynamics and the properties of cortical cells during perceptual decision-making tasks.

      All in all, I thought this was an interesting study with an ambitious goal. But like any good study, there are some open issues worth noting and correcting.

      MAJOR CONCERNS

      1. Big picture

      This was a comprehensive and extremely well-vetted set of theoretical experiments. However, the scope and complexity also made the take-home message hard to discern. The abstract and most of the introduction focus on the framing of LDDM as a hybrid of dynamic normalization models (DNM) and recurrent network models (RNMs). This is sold as a unification of value normalization and selection into a novel unified framework. Then the focus shifts to the role of disinhibition in decision-making. Then in the Discussion, the goal is stated as to determine whether the LDDM generates persistent activity and does this activity differ from RNMs. As a reader, it seems like the paper jumps between two high- level goals: 1) the unification of DNM and RNM architectures, and 2) the role of disinhibition. This constant changing makes it hard to focus as the reader goes on. So what is the big picture goal specifically?

      Also, the framing of value normalization and WTA as a novel computational goal is a bit odd as this is a major focus of the field of reinforcement learning (both abstractly at the computational level and more concretely in models of the circuits that regulate it). I know that the authors do not think they are the first to unify value judgements with selection criteria. The writing just comes across that way and should be clarified.

      We thank the Reviewer for their thoughtful consideration of the overall framing of the big picture goals of the paper. Upon reflection, we agree that the paper really centers on the importance of incorporating disinhibition into computational circuit-based models of decision-making. Thus, we have significantly revised the Introduction and Discussion to focus on the theoretical and empirical importance of incorporating disinhibition into computational models of decision-making, and use the integration of value normalization and WTA selection as an example of how disinhibition increases the richness of circuit decision models. Please see the response to recommendations below for more detail on the changes.

      1. Link to other models

      The LDDM is described as a novel unification of value normalization and winner-take-all (WTA) selection, combining value processing and selection. While the authors do an excellent job of referencing a significant chunk of the decision neuroscience literature (160 references!) the motif they end up designing has a highly similar structure to a well-known neural circuit linked to decision-making: the cortico-basal ganglia pathways. Extensive work over the past 20+ years has highlighted how cortical-basal ganglia loops work via disinhibition of cortical decision units in a similar way as the LDDM (see the work by Michael Frank, Wei Wei, Jonathan Rubin, Fred Hamker, Rafal Bogacz, and many others). It was surprising to not see this link brought up in the paper as most of the framing was on the possibility of the LDDM representing cortical motifs, yet as far as I know, there does not exist evidence for such architectures in the cortex, but there is in these cortical-basal ganglia systems.

      We thank the Reviewer for the suggestion to link the LDDM to disinhibition in CBG models; this is indeed an important body of empirical and computational work that we overlooked in the original manuscript. We have now added text to the Discussion to highlight the link between LDDM and these CBL disinhibition models, focusing on how they are conceptually similar and how they differ. Please see our response to recommendations below for a more detailed discussion of the revisions.

      1. Model evaluations

      The authors do a great job of extensively probing the LDDM under different conditions and against some empirical data. However, most of the time there is no "control" model or current state-of-the-art model that the LDDM is being compared against. In a few of the simulation experiments, the LDDM is compared against the DNM and RNM alone, so as to show how the two components of the LDDM motif compare against the holistic model itself. But this component model comparison is inconsistently used across simulation experiments.

      Also, it is worth asking whether the DNM and RNM are appropriate comparison models to vet the LDDM against for two reasons. First, these are the components of the full LDDM. So these tests show us how the two underlying architectural systems that go into LDDM perform independently, but not necessarily how the LDDM compares against other architectures without these features. Second, as pointed out in my previous comment, the LDDM is a more complex model, with more parameters, than either the DNM or RNM. The field of decision neuroscience is awash in competing decision models (including probabilistic attractor models, non-recurrent integrators, etc.). If we really want to understand the utility of the LDDM, it would be good to know how it performs against similarly complex models, as opposed to its two underlying component models.

      We greatly appreciate the Reviewer’s comments on the point of model comparison, which points out that our original manuscript failed to clearly convey a very important difference between the LDDM and the existing RNM(s). In the revision, we now make it clearer that the fundamental difference between the LDDM and the RNMs is the architecture of disinhibition (see the revised Introduction, especially p. 8 lines 164-168). The LDDM is not simply a combination of the DNM model with RNM architecture (a point we may have mistakenly conveyed in the original manuscript): the introduction of disinhibition separates LDDM inhibition into option-selective subpopulations, as opposed to the single pooled inhibition of RNM models. Given this fact, the LDDM predicts unique selectiveinhibition dynamics shown in recent optogenetic and calcium imaging results, a finding inconsistent with the common-pooled and non-selective inhibition assumed in the existing RNMs and many of its variants. Thus, we believe that a comparison between the LDDM and the RNM, which share similar level of complexity and numbers of parameters, is important.

      We also appreciated the Reviewer’s concern about testing the LDDM against alternative models. In order to better connect to the existing literature, we now compare the LDDM to another standard circuit model of decision-making - the leaky competing accumulator (LCA) model. The LCA is a circuit model that captures many of the aspects of perceptual decision-making seen in the mathematical drift diffusion model (DDM), but with a construction that allows for fitting to behavioral data and comparison of underlying unit activities. Please see our response to recommendations below for further detail.

      1. Comparison to physiological data

      I quite enjoyed the comparisons of the excitatory cell activity to empirical data from the Shadlen lab experiments. However, these were largely qualitative in nature. In conjunction with my prior point on the models that the LDDM is being compared against, it would be ideal to have a direct measure of model fits that can be used to compare the performance of different competing "control" models. These measures would have to account for differences in model complexity (e.g., AIC or BIC), but such an analysis would help the reader understand the utility of the LDDM in connecting with empirical data much better.

      We agree with the Reviewer that a quantitative comparison of the match between model neural predictions and empirical neurophysiological data is important. First, we wish to clarify that the model neural predictions are simulated from models fit to the behavioral (choice and RT data), not from fits to the neural activity traces – a point we now clarify in the text. While directly fitting dynamic models (LDDM, RNM, or LCA) to the neurophysiological data is appealing, there are currently several obstacles to this approach. The first problem is the complexity of the dynamic neural traces. Despite the long history of the random-dot motion paradigm, detailed features of the dynamics are still not understood. For example, the stereotyped initial dip after stimulus onset may reflect a reset of the network state to improve signal to noise ratio (Conen and Padoa-Schioppa, 2015) or simply reflect a surround suppression-like lateral inhibition in visual processing. A second problem is that the primary difference between the models is the activity of inhibitory (and disinhibitory) neurons, which are typically not recorded in neurophysiological experiments; thus, there is a lack of empirical data to which to fit the models. In the revision, we clarified that the model fitting to the Roitman & Shadlen data is for behavioral data only, and model unit activity traces are derived from models fit to behavioral data.

      That being said, we agree that a quantitative comparison of model activity predictions is helpful. Because the models are fit not to the neural data but to the behavioral data, rather than using likelihood-based measures like AIC and BIC we used a simple RMSE measure to compare the match between predicted and neural activity patterns (revised Fig. 6E, Fig 6-S4E, Fig 6-S5E). Please see response to recommendations below for details.

      Reviewer #2 (Public Review):

      The aim of this article was to create a biologically plausible model of decision-making that can both represent a choice's value and reproduce winner-take-all ramping behavior that determines the choice, two fundamental components of value- based decision-making. Both of these aspects have been studied and modeled independently but empirical studies have found that single neurons can switch between both of the aspects (i.e., from representing value to winner-take-all ramping behavior) in ways that are not well described by current biological plausible models of decision making.

      The current article provides a thorough investigation of a new model (the local disinhibition decision model; LDDM) that has the goal of combining value representations and winner-takes-all ramping dynamics related to choice. Their model uses biologically plausible disinhibition to control the levels of inhibition in a local network of simulated neurons. Through a careful series of simulation experiments, they demonstrate that their network can first represent the value of different options, then switch to winner-takes-all ramping dynamics when a choice needs to be made. They further demonstrate that their single model reproduces key components of value-based and winner-takes-all dynamics found in both neural and behavioral data. They additionally conduct simulation studies to demonstrate that recurrent excitatory properties in their network produce value-persistence behavior that could be related to memory. They end by conducting a careful simulation study of the influence of GABA agonists that provide clear and testable predictions of their proposed role of inhibition in the neural processes that underlie decision-making. This last piece is especially important as it provides a clear set of predictions and experiments to help support or falsify their model.

      There are overall many strengths to this paper. As the authors note, current network models do not explain both value- based and ramping-like decision-making properties. Their thorough simulation studies and their validation against empirical neural and behavioral data will be of strong interest to neuroscientists and psychologists interested in value- based decision-making. The simulations related to persistence and the GABA-agonist experiments they propose also provide very clear guidelines for future research that would help advance the field of decision-making research.

      Although the methods and model were generally clear, there was a fair amount of emphasis on the role of recurrence in the LDDM, but very little evidence that recurrence was important or necessary for any of the empirical data examined. The authors do demonstrate the importance of recurrence in some of their simulation studies (particularly in their studies of persistence), but these would need to be compared against empirical data to be validated. Nevertheless, the model and thorough simulation investigations will likely help develop more precise theories of value-based decision-making.

      We appreciate the Reviewer’s thoughtful comments. These comments - especially about anatomic recurrence and its relationship to the parameter 𝛼 - inspired us to think more about the uniqueness of the current circuit to others, especially the implications related to the parameters 𝛼 (i.e., self-excitation) and 𝛽 (i.e., local disinhibition). Recurrence is required to drive winner-take-all competition in the standard RNM of decision-making. However, we show here with both analytical and numerical approaches that recurrence helps WTA competition but is not necessary in our model. Instead, the key feature of the LDDM is to utilize disinhibition in conjunction with lateral inhibition to realize winner-take-all competition. That leads to many different predictions of the current model from the existing models, such as selective inhibition and flexible control of dynamics.

      In response to the Reviewer’s points and after careful consideration of the differential equations, we realized that in our model fitting, the 𝛼 parameter fitting to zero does not necessarily mean recurrence should be zero. The 𝛼 parameter shares a lot of similarity to the baseline gain control (parameter BG in our revision), and thus is unidentifiable in the current dataset. In the interest of parsimony, we did not include the parameter BG in the original manuscript, but now include it because it reveals the difficulty of interpreting fit 𝛼 values as simply the level of recurrence.

      Overall, disinhibition (𝛽) in the LDDM is required for WTA activity while recurrence (𝛼) can contribute but is not necessary; however, 𝛼 is theoretically important for generating persistent activity, with the caveat that in the current framework there is an unclear relationship between fit 𝛼 and recurrence. Regardless, we agree that the contribution of 𝛼 to the LDDM framework is worth further testing and examining with future empirical data.

      Reviewer #3 (Public Review):

      Shen et al. attempt to reconcile two distinct features of neural responses in frontoparietal areas during perceptual and value-guided decision-making into a single biologically realistic circuit model. First, previous work has demonstrated that value coding in the parietal cortex is relative (dependent on the value of all available choice options) and that this feature can be explained by divisive normalization, implemented using adaptive gain control in a recurrently connected circuit model (Louie et al, 2011). Second, a wealth of previous studies on perceptual decision-making (Gold & Shadlen 2007) have provided strong evidence that competitive winner-take-all dynamics implemented through recurrent dynamics characterized by mutual inhibition (Wang 2008) can account for categorical choice coding. The authors propose a circuit model whose key feature is the flexible gating of 'disinhibition', which captures both types of computation - divisive normalization and winner-take-all competition. The model is qualitatively able to explain the 'early' transients in parietal neural responses, which show signatures of divisive normalization indicating a relative value code, persistent activity during delay periods, and 'late' accumulation-to-bound type categorical responses prior to the report of choice/action onset.

      The attempt to integrate these two sets of findings by a unified circuit model is certainly interesting and would be useful to those who seek a tighter link between biologically realistic recurrent neural network models and neural recordings. I also appreciate the effort undertaken by the authors in using analytical tools to gain an understanding of the underlying dynamical mechanism of the proposed model. However, I have two major concerns. First, the manuscript in its current form lacks sufficient clarity, specifically in how some of the key parameters of the model are supposed to be interpreted (see point 1 below). Second, the authors overlook important previous work that is closely related to the ideas that are being presented in this paper (see point 2 below).

      1) The behavior of the proposed model is critically dependent on a single parameter 'beta' whose value, the authors claim, controls the switch from value-coding to choice-coding. However, the precise definition/interpretation of 'beta' seems inconsistent in different parts of the text. I elaborate on this issue in sub-points (1a-b) below:

      1a). For instance, in the equations of the main text (Equations 1-3), 'beta' is used to denote the coupling from the excitatory units (R) to the disinhibitory units (D) in Equations 1-3. However, in the main figures (Fig 2) and in the methods (Equation 5-8), 'beta' is instead used to refer to the coupling between the disinhibitory (D) and the inhibitory gain control units (G). Based on my reading of the text (and the predominant definition used by the authors themselves in the main figures and the methods), it seems that 'beta' should be the coupling between the D and G units.

      1b). A more general and critical issue is the failure to clearly specify whether this coupling of D-G units (parameterized by 'beta') should be interpreted as a 'functional' one, or an 'anatomical' one. A straightforward interpretation of the model equations (Equations 5-8) suggests that 'beta' is the synaptic weight (anatomical coupling) between the D and G units/populations. However, significant portions of the text seem to indicate otherwise (i.e a 'functional' coupling). I elaborate on this in subpoints (i-iii) below:

      (1b-i). One of the main claims of the paper is that the value of 'beta' is under 'external' top-down control (Figure 2 caption, lines 124-126). When 'beta' equals zero, the model is consistent with the previous DNM model (dynamic normalization, Louie et al 2011), but for moderate/large non-zero values of 'beta', the network exhibits WTA dynamics. If 'beta' is indeed the anatomical coupling between D and G (as suggested by the equations of the model), then, are we to interpret that the synaptic weight between D-G is changed by the top-down control signal within a trial? My understanding of the text suggests that this is not in fact the case. Instead, the authors seem to want to convey that top-down input "functionally" gates the activity of D units. When the top-down control signal is "off", the disinhibitory units (D) are "effectively absent" (i.e their activity is clamped at zero as in the schematic in Fig 2B), and therefore do not drive the G units. This would in- turn be equivalent to there being no "anatomical coupling" between D and G. However when the top-down signal is "on", D units have non-zero activity (schematic in Fig 2B), and therefore drive the G units, ultimately resulting in WTA-like dynamics.

      (1b-ii). Therefore, it seems like when the authors say that beta equals zero during the value coding phase they are almost certainly referring to a functional coupling from D to G, or else it would be inconsistent with their other claim that the proposed model flexibly reconfigures dynamics only through a single topdown input but without a change to the circuit architecture (reiterated in lines 398-399, 442-444, 544-546, 557-558, 579-590). However, such a 'functional' definition of 'beta' would seem inconsistent with how it should actually be interpreted based on the model equations, and also somewhat misleading considering the claim that the proposed network is a biologically realistic circuit model.

      (1b-iii). The only way to reconcile the results with an 'anatomical' interpretation of 'beta' is if there is a way to clamp the values of the 'D' units to zero when the top-down control signal is 'off'. Considering that the D units also integrate feed- forward inputs from the excitatory R units (Fig 2, Equations 1-3 or 5-8), this can be achieved either via a non-linearity, or if the top-down control input multiplicatively gates the synapse (consistent with the argument made in lines 115-116 and 585-586 that this top-down control signal is 'neuromodulatory' in nature). Neither of these two scenarios seems to be consistent with the basic definition of the model (Equations 1-3), which therefore confirms my suspicion that the interpretation of 'beta' being used in the text is more consistent with a 'functional' coupling from D to G.

      We thank the reviewer for pointing out this confusion. We apologize that the original illustrations (Fig. 2A) and the differential equations in Methods (Eqs. 5-8) did not convey very well our ideas. 𝛽 is intended to reference the coupling from R to D, not a change in the weights between D and G units. We realize there was some confusion on this part due to inconsistency between our original figures, text, and supplementary material.

      Given the lack of clarity in the previous version as well as the Reviewer’s questions, we now emphasize that 𝛽 represents a functional coupling between the R and D neurons. The biological assumption of the disinhibitory architecture is built based on recent findings that VIP neurons in the cortex always inhibit other neighboring inhibitory cells, such as SST and PV neurons, and consequently disinhibit the neighboring primary neurons (e.g., Fu et al., 2014; Karnani et al., 2014, 2016). We did not see evidence in the literature of fast-changing (anatomic) connections between VIP and SST/PV. However, there is evidence that the responsiveness of VIP neurons to excitatory neurons can be modulated by changing the concentrations of neuromodulators, such as acetylcholine and serotonin (Prönneke et al., 2020). While the stereotype of neuromodulator action is slow dynamics, recent findings show that for example basal forebrain cholinergic neurons respond to reward and punishment with surprising speed and precision (18 ± 3ms) (Hangya et al., 2015) to modulate arousal, attention, and learning in the neocortex. Given the large number of studies that identify long-term projections and neuromodulatory inputs to VIP neurons (e.g., Pfeffer et al., 2013; Pi et al., 2013; Alitto & Dan, 2013; Tremblay et al., 2016), we believe that it will be more plausible to assume the connection weights between R and D in our case is quickly modulated within a trial.

      To clarify this issue in the revised manuscript, we made the following corrections:

      1. We repositioned the 𝛽 parameter in Fig. 2A between the connection from R to D, to align the description of 𝛽 modulating R to D in the main text.

      2. We modified the differential equations 5-8 (now numbered as Eqs. 28-32) in Methods (pp. 61) to include the disinhibitory unit D as an independent control from the inhibitory unit I, in order to be consistent with the disinhibitory D units in LDDM. Such a change makes tiny differences in the model predictions (please see dynamics simulated after the change in Fig. 2-figure supplement 1B).

      3. We updated the neural circuit motif in Fig. 2 -figure supplement 1A accordingly.

      2) The main contribution of the manuscript is to integrate the characteristics of the dynamic normalization model (Louie et al, 2011) and the winner-take-all behavior of recurrent circuit models that employ mutual inhibition (Wang, 2008), into a circuit motif that can flexibly switch between these two computations. The main ingredient for achieving this seems to be the dynamical 'gating' of the disinhibition, which produces a switch in the dynamics, from point-attractor-like 'stable' dynamics during value coding to saddle-point-like 'unstable' dynamics during categorical choice coding. While the specific use of disinhibition to switch between these two computations is new, the authors fail to cite previous work that has explored similar ideas that are closely related to the results being presented in their study. It would be very useful if the authors can elaborate on the relationship between their work and some of these previous studies. I elaborate on this point in (a-b) below:

      2a) While the authors may be correct in claiming that RNM models based on mutual inhibition are incapable of relative value coding, it has already been shown previously that RNM models characterized by mutual inhibition can be flexibly reconfigured to produce dynamical regimes other than those that just support WTA competition (Machens, Romo & Brody, 2005). Similar to the behavior of the proposed model (Fig 9), the model by Machens and colleagues can flexibly switch between point-attractor dynamics (during stimulus encoding), line-attractor dynamics (during working memory), and saddle-point dynamics (during categorical choice) depending on the task epoch. It achieves this via a flexible reconfiguration of the external inputs to the RNM. Therefore, the authors should acknowledge that the mechanism they propose may just be one of many potential ways in which a single circuit motif is reconfigured to produce different task dynamics. This also brings into question their claim that the type of persistent activity produced by the model is "novel", which I don't believe it is (see Machens et al 2005 for the same line-attractor-based mechanism for working memory)

      We thank the Reviewer for pointing out the conceptual similarities between the LDDM and the Machens Romo Brody model, and now include a discussion of the link between the two early in the revised Discussion (p. 38, lines 826-837). Please see response to recommendations below for a more detailed discussion of this point.

      2b) The authors also fail to cite or describe their work in relation to previous work that has used disinhibition-based circuit motifs to achieve all 3 proposed functions of their model - (i) divisive normalization (Litwin-Kumar et al, 2016), (ii) flexible gating/decision making (Yang et al, 2016), and working memory maintenance (Kim & Sejnowski,2021)

      The Reviewer notes several relevant papers, and we have now discussed them and their relationship to the LDDM in a revised Discussion section (pp. 35-36). Please see response to recommendations below for a more details.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We would like to thank the reviewers for their extensive review of our manuscript and constructive criticism. We have attempted to address the points raised in the reviewer's comments and have performed additional experiments and have edited the text of the manuscript to explain these points. Please see below, our point-by-point response to the reviewer’s comments. In the submitted revised manuscript, some figure numbers have changed from the prior reviewed version.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this MS, Mrj - a member of the JDP family of Hsp70 co-chaperones was identified as a regulator of the conversion of Orb2A (the Dm ortholog of CPEB) to its prion-like form.

      In drosophila, Mrj deletion does not cause any gross neurodevelopmental defect nor leads to detectable alterations in protein homeostasis. Loss of Mrj, however, does lead to altered Orb2 oligomerization. Consistent with a role of prion-like characteristics of Orb2 in memory consolidation, loss of Mrj results in a deficit in long-term memory.

      Aside from the fact that there are some unclarities related to the physicochemical properties of Orb2 and how Mrj affects this precisely, the finding that a chaperone could be important for memory is an interesting observation, albeit not entirely novel.

      In addition, there are several minor technical concerns and questions I have that I feel the authors should address, including a major one related to the actual approach used to demonstrate memory deficits upon loss of Mrj.

      Reviewer #1 (Significance (Required)):

      Figure 1 (plus related Supplemental figures): • There seem to be two isoforms of Mrj (like what has been found for human DNAJB6). I find it striking to see that only (preferentially?) the shorter isoform interacts with Orb2. For DNAJB6, the long isoform is mainly related to an NLS and the presumed substrate binding is identical for both isoforms. If this is true for Dm-Mrj too, the authors could actually use this to demonstrate the specificity of their IPs where Orb2 is exclusively non-nuclear?

      According to Flybase, Mrj has 8 predicted isoforms of which four are of 259 amino acids (PA, PB, PC, and PD), 3 are of 346 amino acids (PE, PG, and PH) and one is of 208 amino acids (PF) length (Supplementary data 1). We isolated RNA from flyheads and used this in RT-PCR experiments to check which Mrj isoforms express in the brain. Since both the 346 amino acid (1038 nucleotide long) and 259 amino acids (777 nucleotides long) form, which we refer to as the long and middle isoform, has the same N and C terminal sequences we used the same primer pair for this, but on RT-PCR the only amplicon we got corresponds to the 259 amino acid form. For the 208 amino acids (624 nucleotides long) form we designed a separate forward primer and attempted to amplify this using RT-PCR but were unable to detect this isoform also. This data is now presented in Supplemental Figure 4B. Since the only isoform detected from fly head cDNA corresponded to the 259 amino acid form, we think this is the predominant isoform of Mrj expressing in Drosophila and this is what is in our DnaJ library and what we have used in all our experiments here. This is also the same isoform described in previous papers on Drosophila Mrj (Fayazi et al, 2006; Li et al, 2016b). For this 259 amino acid Mrj isoform, we see its expression in both the nucleus and cytoplasm (Supplemental Figure 4C). As the long 346 AA fragment was undetectable in the brain, it was not feasible to address the reviewer’s point of using the long and short forms of Mrj for IP with Orb2. However, we have performed IP of human CPEB2 (hCPEB2) with the long and short isoforms of human DnaJB6 and have detected interaction of hCPEB2 with both the long and short isoforms of DnaJB6 (Supplemental Figure 6E).

      • I would be interested to know a bit more about the other 5 JDPs that are interactors with Orb2: are the human orthologs of those known? It is striking that these other 5 JDPs interact with Orb2 in Dm (in IPs) but have no impact on Sup35 prion behavior. Importantly, this does not imply they may not have impact on the prion-like behavior of other Dm substrates, including Dm-Orb2.

      We have performed BlastP analysis of CG4164, CG9828, CG7130, DroJ2, and Tpr2 protein sequences against Human proteins. Based on this we have listed the highest-ranking candidate identified here for each of these genes.

      Drosophila Gene

      Human gene

      Query cover

      Percent identity

      E value

      CG4164

      dnaJ homolog subfamily B member 11 isoform 1

      98 %

      62.96%

      2e-150

      CG9828

      dnaJ homolog subfamily A member 2

      92%

      39.41%

      3e-84

      CG7130

      dnaJ homolog subfamily B member 4 isoform d

      56%

      69.44%

      2e-30

      Tpr2

      dnaJ homolog subfamily C member 7 isoform 1

      93%

      46.22%

      6e-139

      DroJ2

      dnaJ homolog subfamily A member 4 isoform 2

      98%

      60.60%

      2e-169

      In the context of the chimeric Sup35-based assay where Orb2A’s Prion-like domain (PrD) is coupled with the C-terminal domain of Sup35, the only protein which could convert Orb2A PrD-Sup35 C from its non-prion state to prion state was Mrj. Within the limitations of this heterologous-system based assay, the other 5 DnaJ domain proteins as well as the Hsp70’s were unable to convert the Orb2A PrD from its non-prion to prion-like state. What these other 5 interacting JDP proteins are doing through their interaction with Orb2A and if they are even expressing in the Orb2 relevant neurons will need to be tested separately and will be the subject of our future studies.

      • The data in panels H, I indeed suggest that Mrj1 alters the (size of) the oligomers. It would be important to know what is the actual physicochemical change that is occurring here. The observed species are insoluble in 0.1 % TX100 but soluble in 0.1% SDS, which suggest they could be gels, but not real amyloids such as formed by the polyQ proteins that require much higher SDS concentrations (~2%) to be solubilized. This is relevant as Mrj1 reduces polyQ amyloidogenesis whereas is here is shown to enhance Orb2A oligomerization/gelidification. In the same context, it is striking to see that without Mrj the amount of Orb2A seems drastically reduced and I wonder whether this might be due to the fact that in the absence of Mrj a part of Orb2A is not recovered/solubilized due to its conversion for a gel to a solid/amyloid state? In other words: Mrj1 may not promote the prion state, but prevents that state to become an irreversible, non-functional amyloid?

      On the reviewer’s point to address what is the actual physicochemical change occurring here, we will need to develop methods to purify the Orb2 oligomers in significant quantities to examine and distinguish if they are of gel or real amyloid-like nature. Currently, within the limitations of our ongoing work, this has not been possible for us to do and we can attempt to address this in our future work. Cryo-EM derived structure of endogenous Orb2 oligomers purified from a fly head extract from 3 million fly heads, made in the TritonX-100 and NP-40 containing buffer, the same buffer as what we have used here for the first soluble fraction, showed these oligomers as amyloids (Hervas et al, 2020). If the oligomers extracted using 0.1% and 2% SDS are structurally and physicochemically different, within the limitations of our current work, had not been possible to address.

      The other point raised by the reviewer is, if in the absence of Mrj (in the context of Figure 4 of our previously submitted manuscript), a part of Orb2 is not solubilized due to us using a lower 0.1% SDS for extraction. To address this, we attempted to see how much of leftover Orb2 is remaining in the pellet after extraction with 0.1 % SDS. Towards this, according to the reviewers’ suggestion, we used a higher 2% SDS containing buffer to resuspend the leftover pellet after 0.1% SDS extraction, and post solubilisation ran all the fractions in SDD-AGE. We did this experiment with both wild-type and Mrj knockout fly heads. Under these different extractions, we first observed while there is more Orb2 in the soluble fraction (Triton X-100 extracted) of Mrj knockout, this amount is reduced in both the 0.1% SDS solubilized and 2% SDS solubilized fractions. So, even though there is leftover Orb2 after 0.1% SDS extraction, which can be extracted using 2% SDS, this amount is reduced in Mrj knockout. The other observation here is the Orb2 extracted using 2% SDS shows a longer smear in comparison to the 0.1% SDS extracted form suggesting a possibility of more and higher-sized oligomers present in this fraction. Since we do not have the exact physicochemical characterization of these oligomers detected with 0.1% and 2% SDS-containing buffer, we are not differentiating them by using the terms gels and real amyloids, but refer to them as 0.1% SDS soluble Orb2 oligomers and 2% SDS soluble Orb2 oligomers. Overall, our observations here suggest in absence of Mrj, both of these kinds of Orb2 oligomers are decreased and so Mrj is most likely promoting the formation of Orb2 oligomers. It is possible that the 0.1% SDS soluble Orb2 oligomers gradually accumulate and undergo a further transition to the 2% SDS soluble Orb2 oligomers, so if in absence of Mrj, the formation of the 0.1% SDS soluble Orb2 oligomers is decreased, the next step of formation of 2% SDS soluble Orb2 oligomers also be decreased. This data is now presented in Figure 5H, I and J).

      On the other possibility raised by the reviewer that Mrj can prevent the oligomeric state of Orb2 to become an irreversible non-functional amyloid, we think it is still possible for Mrj to do this but this could not be tested under the present conditions.

      • It may be good for clarity to refer to the human Mrj as DNAJB6 according to the HUGO nomenclature. Also, the first evidence for its oligomerization was by Hageman et al 2010.

      We have now changed mentions of human Mrj to DNAJB6. We apologize for missing the Hageman et al 2010 reference and have now cited this reference in the context of Mrj oligomerization.

      • It is striking to see that Mrj co-Ips with Hsp70AA, Hsp70-4 but not Hsp70Cb. The fact that interactions were detected without using crosslinking is also striking given the reported transient nature of J-domain-Hsp70 interactions Together, this may even suggest that Mrj-1 is recognized as a Hsp70 substrate (for Hsp70AA, Hsp70-4 but not Hsp70Cb) rather than as a co-chaperone. In fact, a variant of Mrj-1 with a mutation in the HPD motif should be used to exclude this option.

      In IP experiments we notice Mrj interacts with Hsp70Aa and Hsc70-4 but not with Hsc70-1 and Hsc70Cb. In our previously submitted manuscript, we realized we made a typo on the figure, where we referred to Hsp70Aa as Hsc70Aa. We have corrected this in the current revised manuscript. On the crosslinking point raised by the reviewer, we reviewed the published literature for studies of immunoprecipitation experiments which showed an interaction between DnaJB6 and Hsp70. We noted while one of the papers (Kakkar et al, 2016) report the use of a crosslinker in the experiment which showed an interaction between GFP-Hsp70 and V5-DnaJB6, in another two papers the interaction between endogenous Mrj and endogenous Hsp/c70 (Izawa et al, 2000) and Flag-Hsp70 and GFP-DnaJB6 (Bengoechea et al, 2020) could be detected without using any crosslinker. Our observations of detecting the interaction of Mrj with Hsp70Aa and Hsc70-4 in the absence of a crosslinker are thus similar to the observations reported by (Izawa et al, 2000; Bengoechea et al, 2020).

      On the point of if Mrj is a substrate for Hsp70aa and Hsc70-4 and not a co-chaperone, we feel in the context of this manuscript, since we are focussing on the role of Mrj in the regulation of oligomerization of Orb2 and in memory, the experiment with HPD motif mutant is probably not necessary here. However, if the reviewers suggest this experiment to be essential, we can attempt this experiment by making this HPD motif mutant.

      • The rest of these data reconfirm nicely that Mrj/DNAJB6 can suppress polyQ-Htt aggregation. Yet note that in this case the oligomers that enter the agarose gel are smaller, not bigger. This argues that Mrj is not an enhancer of oligomerization, but rather an inhibitor of the conversion of oligomers to a more amyloid like state.

      Figure 2 and Supplemental Figure 4 discuss the effect of Mrj on Htt aggregation. We have used 2 different Htt constructs here. For Figure 2I, we used only Exon1 of Htt with the poly Q repeats. Here in SDD-AGE, for the control lane, we see the Htt oligomers as a smear for the control. For Mrj, we see only a small band at the bottom which can be interpreted most likely as either a monomer or as small oligomers since we do not observe any smear here. However, for the 588 amino acid fragment of HttQ138 in the SDD-AGE we do not see a difference in the length of the smear but in the intensity of the smear of the Htt oligomers (Supplemental Figure 4E). Based on this we are suggesting in presence of Mrj, there are lesser Htt oligomers. On the point of Mrj is not an enhancer of oligomerization, but rather an inhibitor of the conversion of oligomers to a more amyloid-like state, our experiments with the Mrj knockout show reduced Orb2 oligomers (both for 0.1% and 2% SDS soluble forms), suggesting Mrj plays a role in the conversion of Orb2 to the oligomeric state. If Mrj inhibits the conversion of oligomers to a more amyloid-like state, this is possible but we couldn’t test this hypothesis here. However, for Htt amyloid aggregates, previous works done by other labs with DnaJB6 as well as our experiments with Mrj suggest this as a likely possibility.

      Figure 3: • The finding that knockout of DNAJB6 in mice is embryonic lethal is related to a problem with placental development and not embryonic development (Hunter et al, 1999; Watson et al, 2007, 2009, 2011) as well recognized by the authors. Therefore, the finding that deletion of Dm-Mrj has no developmental phenotype in Drosophila may not be that surprising.

      We agree with the reviewer’s point that DNAJB6 mutant mice have a problem with placental development. However, one of the papers cited here (Watson et al, 2009) suggests DNAJB6 also plays a crucial role in the development of the embryo independent of the placenta development defect. The mammalian DNAJB6 mutant embryos generated using the tetraploid complementation method show severe neural defects including exencephaly, defect in neural tube closure, reduced neural tube size, and thinner neuroepithelium. Due to these features seen in the mice knockout, and the lack of such developmental defects in the Drosophila knockout, we interpreted our findings in Drosophila as significantly different from the mammals.

      • It is a bit more surprising that Mrj knockout flies showed no aggregation phenotype or muscle phenotype, especially knowing that DNAJB6 mutations are linked to human diseases associated with aggregation (again well recognized by the authors). However, most of these diseases are late-onset and the phenotype may require stress to be revealed. So, while important to this MS in terms of not being a confounder for the memory test, I would like to ask the authors to add a note of caution that their data do not exclude that loss of Mrj activity still may cause a protein aggregation-related disease phenotype. Yet, I also do think that for the main message of this MS, it is not required to further test this experimentally.

      We agree with the reviewer and have added this suggestion in the discussion that loss of Mrj may still result in a protein aggregation-related disease phenotype, probably under a sensitized condition of certain stresses which is not tested in this manuscript.

      Figure 4:

      • IPs were done with Orb2A as bite and clearly pulled down substantial amounts of GFP-tagged Mrj. For interactions with Orb2B, a V5-tagged Mrj was use and only a minor fraction was pulled down. Why were two different Mrj constructs used for Arb2A and Orb2b?

      In the previously submitted manuscript, we have used HA-tagged Mrj (not V5) for checking the interaction with full-length Orb2B tagged with GFP. This was done with the goal of using the same Mrj-HA construct as that used in the initial Orb2A immunoprecipitation experiment. Since this has raised some concern as in the IPs to check for interaction between truncated Orb2A constructs (Orb2A325-GFP and Orb2AD162-GFP) and Mrj (Mrj-RFP), we used a different GFP and RFP tag combination. To address this, we have now added the same tag combinations for the IPs (Mrj-RFP with Orb2A-GFP and Orb2B-GFP). In these immunoprecipitation experiments where Mrj-RFP was pulled down using RFP Trap beads, we were able to detect positive interaction with GFP-tagged Orb2A and Orb2B. This data is now added in Figure 4F and 4I. We also have added the IP data for interaction between Mrj-HA and untagged Orb2B in Figure 4K, similar to the combination of initial experiment between Mrj-HA and untagged Orb2A.

      • In addition, I think it would be important what one would see when pulling on Mrj1, especially under non-denaturing conditions and what is the status of the Orb2 that is than found to be associated with Mrj (monomeric, oligomeric and what size).

      We have now performed IP from wild-type fly heads using anti Mrj antibody and ran the immunoprecipitate in SDS-PAGE and SDD-AGE followed by immunoblotting them with anti-Orb2 antibody. Our experiments suggest that immunoprecipitating endogenous Mrj brings down both the monomeric and oligomeric forms of Orb2. This data is now added in Figure 4L, M and N.

      • This also relates to my remark at figure 1 and the subsequent fractionation experiments they show here in which there is a slight (not very convincing) increase in the ratio of TX100-soluble and insoluble (0.1% SDS soluble) material. My question would be if there is a remaining fraction of 0.1% insoluble (2% soluble) Orb2 and how Mrj affects that? As stated before, this is (only) mechanistically relevant to understanding why there is less oligomers of Orb2 in terms of Mrj either promoting it or by preventing it to transfer from a gel to a solid state. The link to the memory data remains intriguing, irrespective of what is going on (but also on those data I do have several comments: see below).

      We have addressed this in response to the reviewer’s comments on Figure 1. We find in both wild type and Mrj knockout fly heads, there are Orb2 oligomers that can be detected using 0.1% SDS extraction and with further extraction with 2% SDS. The 2% SDS soluble Orb2 oligomers were previously insoluble during 0.1% SDS-based extraction. However, the amounts of both of these oligomers are reduced in Mrj knockout fly heads. Since we do not have the physicochemical characterization of both of these kinds of oligomers, we are not using the terms gel or solid state here but referring to these oligomers as 0.1% SDS soluble Orb2 oligomers and 2% SDS soluble Orb2 oligomers. We speculate that the 0.1% SDS soluble Orb2 oligomers over time transition to the 2% SDS soluble Orb2 oligomers. As in the absence of Mrj in the knockout flies, both the 0.1% SDS soluble and 2% SDS soluble Orb2 oligomers are decreased, this suggests Mrj is promoting the formation of Orb2 oligomers. On the reviewer’s point, if Mrj can prevent the transition from 0.1% SDS soluble to 2% SDS soluble Orb2 oligomers, we think it is possible for Mrj to both promote oligomerization of Orb2 as well as prevent it from forming bigger non-functional oligomers, but the second point is not tested here. The relevant data is now presented in Figure 5H, I and J.

      • I also find the sentence that "Mrj is probably regulating the oligomerization of endogenous Orb2 in the brain" somewhat an overstatement. I would rather prefer to say that the data show that Mrj1 affects the oligomeric behavior/status of Orb2.

      Based on the reviewer’s suggestion we have now changed the sentence to Mrj is probably regulating the oligomeric status of Orb2

      Figure 5:

      • To my knowledge, the Elav driver regulates expression in all neurons, but not in glial cells that comprise a significant part of the fly heads/brain. The complete absence of Mrj in the fly-heads when using this driver is therefore somewhat surprising. Or do we need to conclude from this that glial cells normally already lack Mrj expression?

      On driving Mrj RNAi with Elav Gal4, we did not detect any Mrj in the western. We attempted to address the glial contribution towards Mrj’s expression we used a Glia-specific driver Repo Gal4 line to drive the control and Mrj RNAi line and performed a western blot using fly head lysate with anti-Mrj antibody. In this experiment, we did not observe any difference in Mrj levels between the two sets. As the Mrj antibody raised by us works in western blots but not in immunostainings, we could not do a colocalization analysis with a glial marker. However, we used the Mrj knockout Gal4 line to drive NLS-GFP and performed immunostainings of these flies with a glial marker anti-Repo antibody. Here we see two kinds of cells in the brain, one which have GFP but no Repo and the other where both are present together. This suggest that while Glial cells have Mrj but probably majority of Mrj in the brain comes from the neurons. We also found a reference where it was shown that Elav protein as well as Elav Gal4 at earlier stages of development expresses in neuroblasts and in all Glia (Berger et al, 2007). So, another possibility is when we are driving Mrj RNAi using Elav Gal4, this knocks down Mrj in both the neurons as well as in the glia. This coupled with the catalytic nature of RNAi probably creates an effective knockdown of Mrj as seen in the western blot. This data is now added in Supplementary Figure 5G and H.

      • Why not use these lines also for the memory test for confirmation? I understand the concerns of putative confounding effects of a full knockdown (which were however not reported), but now data rely only on the mushroom body-specific knockdown for the 201Y Gal4 line, for which the knockdown efficiency is not provided. But even more so, here a temperature shift (22oC-30oC) was required to activate the expression of the siRNA. For the effects of this shift alone no controls were provided. The functional memory data are nice and consistent with the hypothesis, but can it be excluded that the temperature shift (rather than the Mrj) knockdown has caused the memory defects? I think it is crucial to include the proper controls or use a different knockdown approach that does not require temperature shifts or even use the knockout flies.

      We have now performed the memory experiments with Mrj knockout flies. Our experiments show at 16 and 24-hour time points Mrj knockout flies have significantly reduced memory in comparison to the control wildtype. This data is now added in Figure 6B.

      Figure 6:

      The finding of a co-IP between Rpl18 and Mrj (one-directional only) by no means suffices to conclude that Mrj may interact with nascent Orb2 chains here (which would be the relevant finding here). The fact that Mrj is a self-oligomerising protein (also in vitro, so irrespective of ribosomal associations!), and hence is found in all fractions in a sucrose gradient, also is not a very strong case for its specific interaction with polysomes. The finding that there is just more self-oligomerizing Orb2A co-sedimenting with polysomes in sucrose gradients neither is evidence for a direct effect of Mrj enhancing association of Orb2A with the translating ribosomes even though it would fit the hypothesis. So all in all, I think the data in this figure and non-conclusive and the related conclusions should be deleted.

      We have now performed the reverse co-IP between Rpl18-Flag and Mrj-HA using anti-HA antibody and could detect an interaction between the two. This data is now added in Supplementary Figure 6A.

      To address if Mrj is a self-oligomerizing protein that can migrate to heavier polysome fractions due to its size, we have loaded recombinant Mrj on an identical sucrose gradient as we use for polysome analysis. Post ultra-centrifugation we fractionated the gradients and checked if Mrj can be detected in the fraction numbers where polysomes are present. In this experiment, we could not detect recombinant Mrj in the heavier polysome fractions (data presented in Supplementary Figure 6B). Overall, our observations of Mrj-Rpl18 IPs, the presence of cellularly expressed Mrj in polysome fractions, and the absence of recombinant Mrj from these fractions, suggest a likelihood of Mrj’s association with the translating ribosomes.

      On the reviewer’s point of us concluding Mrj may interact with nascent Orb2 chains, we have not mentioned this possibility in the manuscript as we don’t have any evidence to suggest this. We have also added a sentence: This indicates that in presence of Mrj, the association of Orb2A with the translating ribosomes is enhanced, however, if this is a consequence of increased Orb2A oligomers due to Mrj or caused by interaction between polysome-associated Orb2A and Mrj will need to be tested in future.

      Based on these above-mentioned points, we hope we can keep the data and conclusions of this section.

      Overall, provided that proper controls/additional data can be provided for the key experiments of memory consolidation, I find this an intriguing study that would point towards a role of a molecular chaperone in controlling memory functions via regulating the oligomeric status of a prion-like protein and that is worthwhile publishing in a good journal.

      However, in terms of mechanistical interpretations, several points have to be reconsidered (see remarks on figure 1,4); this pertains especially to what is discussed on page 13. In addition, I'd like the authors to put their data into the perspective of the findings that in differentiated neurons DNAJB6 levels actually decline, not incline (Thiruvalluvan et al, 2020), which would be counterintuitive if these proteins are playing a role as suggested here in memory consolidation.

      We have addressed the comments on Figures 1 and 4 earlier. We have also added new memory experiment’s data with the Mrj knockout in Figure 6.

      We have attempted to put the observations with Drosophila Mrj in perspective to observations in Thiruvalluvan et al, on human DnaJB6 in the discussions as follows:

      Can our observation in Drosophila also be relevant for higher mammals? We tested this with human DnaJB6 and CPEB2. In mice CPEB2 knockout exhibited impaired hippocampus-dependent memory (Lu et al, 2017), so like Drosophila Orb2, its mammalian homolog CPEB2 is also a regulator of long-term memory. In immunoprecipitation assay we could detect an interaction between human CPEB2 and human DnaJB6, suggesting the feasibility for DnaJB6 to play a similar role to Drosophila Mrj in mammals. However, as the human DnaJB6 level was observed to undergo a reduction in transitioning from ES cells to neurons, (Thiruvalluvan et al, 2020) how this can be reconciled with its possible role in the regulation of memory. We speculate, such a reduction if is happening in the brain will occur in a highly regulatable manner to allow precise control over CPEB2 oligomerization only in specific neurons where it is needed and the reduced levels of DnaJB6 is probably sufficient to aid CPEB oligomerization Alternatively, there may be additional chaperones that may function in a stage-specific manner and be able to compensate for the decline in levels of DNAJB6.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: The manuscript describes the role of the Hsp40 family protein Mrj in the prion-like oligomerization of Orb2. The authors demonstrate that Mrj promotes the oligomerization of Orb2, while a loss in Mrj diminishes the extent of Orb2 oligomerization. They observe that while Mrj is not an essential gene, a loss in Mrj causes deficiencies in the consolidation of long-term memory. Further, they demonstrate that Mrj associates with polysomes and increases the association of Orb2 with polysomes.

      Major comments: None

      Minor comments:

      1. In the section describing the chaperone properties of Mrj in clearing Htt aggregates (Fig 2), the legend describes that "Mrj-HA constructs are more efficient in decreasing Htt aggregation compared to Mrj-RFP". It would be helpful to add Mrj-RFP to the quantification in Fig 2G to know exactly the difference in efficiency. Is there an explanation for why the 2 constructs behave differently?

      We have added the quantitation of Htt aggregates in presence of Mrj-RFP in the revised version (Data presented in Figure 2G). While the efficiency of Mrj-RFP to decrease Htt aggregates is significantly less in comparison to Mrj-HA, it is still significantly better in comparison to the control CG7133-HA construct. It is possible, due to the tagging of Mrj with a larger tag (RFP), this reduces its ability to decrease the Htt aggregates in comparison to the construct where Mrj is tagged with a much smaller HA tag.

      Figs A, B, C, G need to have quantification of the percentage of colocalization with details about the number of cells quantified for each experiment.

      We have now added the intensity profile images and colocalization quantitation (pearson’s coefficient) in the Supplemental Figure 5A and B. This quantitation is done from multiple ROI’s taken from at 4-6 cells.

      In Fig 6 B, C, F, G it would be helpful to label the 40S, 60S and 80S peaks in the A 254 trace.

      We have now labeled the 80S, and polysome peaks in the Figure 7B, C, F and G. We could not separate the 40S and 60S peaks in the A254 trace.

      It's interesting that Mrj has opposing functions with regard to aggregation when comparing huntingtin with Orb2. From the literature presented in the discussion, it appears as though chaperones including Mrj have an anti-aggregation role for prions. It would be helpful to have more discussion around why, in the case of Orb2, this is different. The discussion states that "The only Hsp40 chaperone which was found similar to Mrj in increasing Orb2's oligomerization is the yeast Jjj2 protein" - this point needs elaboration, as well as a reference.

      In the discussions section we have now added the following speculations on this:

      One question here is why Mrj behaves differently with Orb2 in comparison to other amyloids. Orb2 differs from other pathogenic amyloids in its extremely transient existence in the toxic intermediate form (Hervás et al, 2016). For the pathogenic amyloids, since they exist in the toxic intermediate form for longer, Mrj probably gets more time to act and prevent or delay them from forming larger aggregates. For Orb2, Mrj may help to quickly transition it from the toxic intermediate state, thereby helping this state to be transient instead of longer. An alternate possibility is post-transition from the toxic intermediate state, Mrj stabilizes these Orb2 oligomers and prevents them from forming larger aggregates. This can be through Mrj interacting with Orb2 oligomers and blocking its surface thereby preventing more Orb2 from assembling over it. Another difference between the Orb2 oligomeric amyloids and the pathogenic amyloids is in the nature of their amyloid core. For the pathogenic amyloids, this core is hydrophobic devoid of any water molecules, however for Orb2, the core is hydrophilic. This raises another possibility that if the Orb2 oligomers go beyond a certain critical size, Mrj can destabilize these larger Orb2 aggregates by targeting its hydrophilic core.

      On the Jjj2 point raised by the reviewer, we have added the (Li et al, 2016a) reference now and elaborated as:

      The only Hsp40 chaperone which was found similar to Mrj in increasing Orb2’s oligomerization is the yeast Jjj2 protein. In Jjj2 knockout yeast strain, Orb2A mainly exists in the non-prion state, whereas on Jjj2 overexpression the non-prion state could be converted to a prion-like state. In S2 cells coexpression of Jjj2 with Orb2A lead to an increase in Orb2 oligomerization (Li et al, 2016a). However, Jjj2 differs from Mrj, as when it is expressed in S2 cells, we do not detect it to be present in the polysome fractions.

      The Jjj2 polysome data is now presented in Supplementary Figure 6C.

      Reviewer #2 (Significance (Required)):

      General assessment:

      Overall, the work is clearly described and the manuscript is very well-written. The motivation behind the study and its importance are well-explained. I only have minor comments and suggestions to improve the clarity of the work. The study newly describes the interaction between the chaperone Mrj and the translation regulator Orb2. The experiments that the screen for proteins that interact with Orb2 and promote its oligomerization are very thorough. The experiments that delve into the role of Mrj in protein synthesis are a good start, and need to be explored further, but that is beyond the scope of this study.

      Advance: The study describes a new interaction between the chaperone Mrj and the translation regulator Orb2. The study is helpful in expanding our knowledge of prion regulators as well factors that affect memory acquisition and consolidation.

      Audience: This paper will be of most interest to basic researchers.

      My expertise is in Drosophila genetics and neuronal injury.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The manuscript submitted by Desai et al. identifies a chaperone of the Hsp40 family (Mrj) that binds Orb2 and modulates its oligomerization, which is critical for Orb2 function in learning and memory in Drosophila. Orb2 are proteins with prion-like properties whose oligomerization is critical for their function in the storage of memories. The main contribution of the article is the screen of Hsp40 and Hsp70-family proteins that bind Orb2. The authors show IP results for all the candidates tested, including those that bind Fig. 1) and those that don't (Supp Fig 3). There is also a figure devoted to examining the interaction of Mrj with polyglutamine models (Htt). They also generate a KO mutant that is viable and shows no gross defects or protein aggregation. Lastly, they show that the silencing of Mrj in the mushroom body gamma neurons results in weaker memories in a courtship paradigm. Although the data is consistent and generally supportive of the hypothesis, key details are missing in several areas, including controls. Additionally, the interpretation of some results leaves room for debate. Overall, this is an ambitious article that needs additional work before publication.

      Specific comments:

      1. General concern over the interpretation of IP experiments and colocalization. These experiments don't necessarily reflect direct interactions. They are consistent with direct interaction but not the only explanation for a positive IP or colocalization.

      This paper is centred on the interaction between Orb2A and Mrj, which we have detected using immunoprecipitation. The reviewer’s concern here is, this experiment is not able to distinguish if this can be a direct protein-protein interaction or if the two proteins are part of a complex.

      To address this concern we have used purified recombinant protein-based pulldowns. Our experiments with purified protein pulldowns (GST tagged Mrj from E.coli with Orb2A from E.coli or Orb2A-GFP from Sf9 cells) suggest Orb2A and Mrj can directly interact amongst themselves. This data is now presented in Figure 1J and K.

      The Huntingtin section has a few concerns. The IF doesn't show all controls and the quantification is not well done in terms of what is relevant. A major problem is the interpretation of Fig 2F. The idea is that Mrj prevents the aggregation of Htt, which is the opposite of what is observed with Orb2. The panel actually shows a large Htt aggregate instead of multiple small aggregates. This has been reported before in Drosophila and other systems with different polyQ models. Mrj and other Hsp40 and Hsp70 proteins modify Htt aggregation, but in an unexpected way. This affects the model shown in Fig. 6H. Lastly, Fig 2H and 2I show very different level of total Htt.

      In Figure 2F of the previously submitted manuscript, we have shown representative images of HttQ103-GFP cells coexpressing with a control DnaJ protein CG7133-HA and Mrj-HA. In Figure 2G we quantitated the number of cells showing aggregates within the population of doubly transfected cells. On the reviewer’s point of figure 2F showing large Htt aggregates instead of multiple small aggregates, we do not see a large Htt aggregate in presence of Mrj in this figure, the pattern looks diffused here and very different from the control CG7133 where the aggregates are seen. We have performed the same experiment with a different Htt construct (588 amino acids long fragment) tagged with RFP, and here also we notice in presence of Mrj, the aggregates are decreased and the expression pattern looks diffused (Supplementary Figure 4E, 4F).

      If the comment on large Htt aggregates in presence of Mrj is concerning figure 2E, here we show Mrj-RFP to colocalize with the Htt aggregates. Here, even though Mrj-RFP colocalizes with Htt aggregates, it rescues the Htt aggregation phenotype as in comparison to the control CG7133, the number of cells with Htt aggregates is still significantly less here. We have added this quantitation of rescue by Mrj-RFP in the revised manuscript now. The observation of colocalization of Mrj-RFP with Htt aggregates is similar to previous reports of chaperones rescuing Htt aggregation and yet showing colocalization with the aggregates. Both Hdj-2 and Hsc70 suppress Htt aggregation and yet were observed to colocalize with Htt aggregates in the cell line model as well as in nuclear inclusions in the brain (Jana et al, 2000). In a nematode model of Htt aggregation, DNJ-13 (DnaJB-1), HSP-1 (Hsc70), and HSP-11 (Apg-2) were shown to colocalize with Htt aggregates and yet decrease the Htt aggregation (Scior et al, 2018). Hsp70 was also found to colocalize with Htt aggregates in Hela cells (Kim et al, 2002).

      Regarding Figures 2H and 2I, while figure 2H is of an SDS-PAGE to show no difference in the levels of monomeric HttQ103 (marked with *) in presence of Mrj and the control CG7133, figure 2I is for the same samples ran in an SDD-AGE where reduced amount of Htt oligomers as seen with the absence of a smear in presence of Mrj. The apparent difference in Htt levels between 2H and 2I is due to the detection of Htt aggregates/oligomers in the SDD-AGE which are unable to enter the SDS-PAGE and hence undetected. In Supplementary Figure 4E, similar experiments were done with the longer Htt588 fragment and here we notice in the SDD-AGE reduced intensity of the smear made up of Htt oligomers, again suggesting a reduction in Htt aggregates. Thus our results are not in contradiction to previous studies where Mrj was found to rescue Htt aggregate-associated toxicity.

      Endogenous expression of Mrj using Gal4 line: where else is it expressed in the brain / head and in muscle. Fig 3G shows no muscle abnormalities but no evidence is shown for muscle expression. It is nice that Fig 3E and F show no abnormal aggregates in the Mrj mutant, but this would be maybe more interesting if flies were subjected to some form of stress.

      We have now added images of the brain and muscles to show the expression pattern of Mrj. Using Mrj Gal4 line and UAS- CD8GFP, we noticed enriched expression in the optic lobes, mushroom body, and olfactory lobes. We also noticed GFP expression in the larval muscles and neuromuscular junction synaptic boutons. This data is now presented in Supplementary Figure 5C, D, E and F.

      On the reviewer’s point of subjecting the Mrj KO flies to some form of stress, we have not performed this. We have added in the discussions a note of caution, that loss of Mrj may still result in a protein aggregation-related disease phenotype, probably under a sensitized condition of certain stresses which is not tested in this manuscript.

      Fig. 5B shows no Mrj detectable from head homogenates in flies silencing Mrj in neurons with Elav-Gal4. It would be nice if they could show that ONLY neurons express Mrj in the head. Also noted, Elav-Gal4 is a weak driver, so it is surprising that it can generate such robust loss of Mrj protein

      We have used an X chromosome Elav Gal4 driver to drive the UAS-Mrj RNAi line and here we could not detect Mrj in the western. To address the reviewer’s point on the glial contribution towards expression of Mrj, we used a Glial driver Repo Gal4 to drive Mrj RNAi. In this experiment, we did not detect any difference in Mrj levels between the control and the Mrj RNAi line (presented now in Supplementary Figure 5G). We also used the Mrj knockout Gal4 line to drive NLS-GFP and immunostained these using a glial marker anti-Repo antibody. Here, we were able to detect cells colabelled by GFP as well as Repo, suggesting Mrj is likely to be present in the glial cells (presented now in Supplementary Figure 5H). We also looked in the literature and found a reference where it was shown that Elav protein as well as Elav Gal4 at earlier stages of development expresses in neuroblasts and in all Glia (Berger et al, 2007). So, another possibility is when we are driving Mrj RNAi using Elav Gal4, this knocks down Mrj in both the neurons as well as in the glia.

      Fig 4-Colocalization of Orb2 with Mrj lacks controls. The quantification could describe other phenomena because the colocalization is robust but the numbers shown describe something else.

      We have now added the intensity profile and colocalization quantitation (pearson’s coefficient) in Supplemental Figure 5A and B. This quantitation is done from multiple ROI’s taken from 4-6 cells. Also, to suggest the interaction of Orb2 isoforms with Mrj, we are not depending on colocalization alone and have used immunoprecipitation experiments to support our observations.

      Fly behavior. The results shown for Mrj RNAi alleles is fine but it would be more robust if this was validated with the KO line AND rescued with Mrj overexpression.

      We have now performed memory assays with the Mrj knockout. Our experiments showed Mrj knockouts to show significantly decreased memory in comparison to wild-type flies at 16 and 24-hour time points (presented in Figure 6B). We have not been able to make an Mrj Knockout-UAS Mrj recombinant fly, most likely due to the closeness of the two with respect to their genomic location in second chromosome.

      Minor comments:

      Please, revise minor errors, there are several examples of words together without a space.

      We have identified the words without space and have corrected them now.

      Intro: describe the use of functional prions. Starting the paragraph with this sentence and then explaining what prion diseases are is a little confusing. Also "prion proteins" can be confusing because the term refers to PrP, the protein found in prions.

      We have now altered the introduction and have described functional prions.

      Results, second subtitle in page 5. This sentence is quite confusing using prion-like twice

      We have now changed the heading to “Drosophila Mrj converts Orb2A from non-prion to a prion-like state.”

      Page 6: "conversion from non-prion to prion-like form...". This can be presented differently. Prion-like properties are intrinsic, proteins don't change from non-prion to prion-like. They may be oligomeric or monomeric or highly aggregated but the prion-like property doesn't change

      We agree with the reviewer's point of Prion-like properties are intrinsic, but the protein might or might not exist in the prion-like state or confirmation. When we are using the term conversion from non-prion to prion-like form we mean to suggest a conformational conversion leading to the eventual formation of the oligomeric species. We also noted the terminology of non-prion to prion-like state change is used in several papers (Satpute-Krishnan & Serio, 2005; Sw & Yo, 2012; Uptain et al, 2001).

      Scale bars and text are too small in several figures

      We have now mentioned in the figure legends the size of the scale bars. For several images we have made the scale bars also larger.

      Not sure why Fig 4C is supplemental, seems like an important piece of data.

      We have kept this data in the supplemental data as we performed this experiment with recombinant protein which is tagged with 6X His and we are not sure if this high degree of oligomerization/aggregation of recombinant Mrj and further precipitation over time, happens inside the cells/ brain.

      Intro to Mrj KO in page 7 is too long. Most of it belongs in the discussion

      We have now moved the portions on mammalian DNAJB6 which were earlier in the results section to the discussions section.

      Change red panels in IF to other color to make it easier for colorblind readers.

      We have now changed the red panels to magenta. We apologize for our figures not being colorblind friendly earlier.

      The discussion is a little diffuse by trying to compare Orb2 with mammalian prions and amyloids and yeast prions.

      We looked into the functional prion data and couldn’t find much on chaperone mediated regulation of these. Also, we felt comparing with the amyloids and yeast prions brings out the contrast with respect to the Mrj mediated regulatory differences between the two.

      Reviewer #3 (Significance (Required)):

      This is a paper with a broad scope and approaches. The paper describes the role of Mrj in the oligomerization of Orb2 by protein biochemistry techniques and determine the role of loss of Mrj in the mushroom bodies in fly behavior.

      The audience for this content is basic research and specialized. The role of Mrj in Orb2 aggregation and function sheds new light on the mechanisms regulating the function of this protein involved in a novel mechanism of learning and memory.

      References:

      Bengoechea R, Findlay AR, Bhadra AK, Shao H, Stein KC, Pittman SK, Daw JA, Gestwicki JE, True HL & Weihl CC (2020) Inhibition of DNAJ-HSP70 interaction improves strength in muscular dystrophy. J Clin Invest 130: 4470–4485

      Berger C, Renner S, Lüer K & Technau GM (2007) The commonly used marker ELAV is transiently expressed in neuroblasts and glial cells in the Drosophila embryonic CNS. Dev Dyn 236: 3562–3568

      Fayazi Z, Ghosh S, Marion S, Bao X, Shero M & Kazemi-Esfarjani P (2006) A Drosophila ortholog of the human MRJ modulates polyglutamine toxicity and aggregation. Neurobiol Dis 24: 226–244

      Heinrich SU & Lindquist S (2011) Protein-only mechanism induces self-perpetuating changes in the activity of neuronal Aplysia cytoplasmic polyadenylation element binding protein (CPEB). Proc Natl Acad Sci U S A 108: 2999–3004

      Hervás R, Li L, Majumdar A, Fernández-Ramírez MDC, Unruh JR, Slaughter BD, Galera-Prat A, Santana E, Suzuki M, Nagai Y, et al (2016) Molecular Basis of Orb2 Amyloidogenesis and Blockade of Memory Consolidation. PLoS Biol 14: e1002361

      Hervas R, Rau MJ, Park Y, Zhang W, Murzin AG, Fitzpatrick JAJ, Scheres SHW & Si K (2020) Cryo-EM structure of a neuronal functional amyloid implicated in memory persistence in Drosophila. Science 367: 1230–1234

      Izawa I, Nishizawa M, Ohtakara K, Ohtsuka K, Inada H & Inagaki M (2000) Identification of Mrj, a DnaJ/Hsp40 family protein, as a keratin 8/18 filament regulatory protein. J Biol Chem 275: 34521–34527

      Jana NR, Tanaka M, Wang G h & Nukina N (2000) Polyglutamine length-dependent interaction of Hsp40 and Hsp70 family chaperones with truncated N-terminal huntingtin: their role in suppression of aggregation and cellular toxicity. Hum Mol Genet 9: 2009–2018

      Kakkar V, Månsson C, de Mattos EP, Bergink S, van der Zwaag M, van Waarde MAWH, Kloosterhuis NJ, Melki R, van Cruchten RTP, Al-Karadaghi S, et al (2016) The S/T-Rich Motif in the DNAJB6 Chaperone Delays Polyglutamine Aggregation and the Onset of Disease in a Mouse Model. Mol Cell 62: 272–283

      Kim S, Nollen EAA, Kitagawa K, Bindokas VP & Morimoto RI (2002) Polyglutamine protein aggregates are dynamic. Nat Cell Biol 4: 826–831

      Li L, Sanchez CP, Slaughter BD, Zhao Y, Khan MR, Unruh JR, Rubinstein B & Si K (2016a) A Putative Biochemical Engram of Long-Term Memory. Curr Biol 26: 3143–3156

      Li S, Zhang P, Freibaum BD, Kim NC, Kolaitis R-M, Molliex A, Kanagaraj AP, Yabe I, Tanino M, Tanaka S, et al (2016b) Genetic interaction of hnRNPA2B1 and DNAJB6 in a Drosophila model of multisystem proteinopathy. Hum Mol Genet 25: 936–950

      Liebman SW & Chernoff YO (2012) Prions in yeast. Genetics 191: 1041–1072

      Lu W-H, Yeh N-H & Huang Y-S (2017) CPEB2 Activates GRASP1 mRNA Translation and Promotes AMPA Receptor Surface Expression, Long-Term Potentiation, and Memory. Cell Rep 21: 1783–1794

      Prusiner SB (2001) Neurodegenerative Diseases and Prions. New England Journal of Medicine 344: 1516–1526

      Satpute-Krishnan P & Serio TR (2005) Prion protein remodelling confers an immediate phenotypic switch. Nature 437: 262–265

      Scior A, Buntru A, Arnsburg K, Ast A, Iburg M, Juenemann K, Pigazzini ML, Mlody B, Puchkov D, Priller J, et al (2018) Complete suppression of Htt fibrilization and disaggregation of Htt fibrils by a trimeric chaperone complex. EMBO J 37: 282–299

      Si K (2015) Prions: what are they good for? Annu Rev Cell Dev Biol 31: 149–169

      Si K, Choi Y-B, White-Grindley E, Majumdar A & Kandel ER (2010) Aplysia CPEB can form prion-like multimers in sensory neurons that contribute to long-term facilitation. Cell 140: 421–435

      Si K, Lindquist S & Kandel ER (2003) A neuronal isoform of the aplysia CPEB has prion-like properties. Cell 115: 879–891

      Sw L & Yo C (2012) Prions in yeast. Genetics 191

      Thiruvalluvan A, de Mattos EP, Brunsting JF, Bakels R, Serlidaki D, Barazzuol L, Conforti P, Fatima A, Koyuncu S, Cattaneo E, et al (2020) DNAJB6, a Key Factor in Neuronal Sensitivity to Amyloidogenesis. Mol Cell 78: 346-358.e9

      Uptain SM & Lindquist S (2002) Prions as protein-based genetic elements. Annu Rev Microbiol 56: 703–741

      Uptain SM, Sawicki GJ, Caughey B & Lindquist S (2001) Strains of [PSI(+)] are distinguished by their efficiencies of prion-mediated conformational conversion. EMBO J 20: 6236–6245

      Watson ED, Mattar P, Schuurmans C & Cross JC (2009) Neural stem cell self-renewal requires the Mrj co-chaperone. Dev Dyn 238: 2564–2574

      Wickner RB (2016) Yeast and Fungal Prions. Cold Spring Harb Perspect Biol 8: a023531

      Wickner RB, Edskes HK, Maddelein ML, Taylor KL & Moriyama H (1999) Prions of yeast and fungi. Proteins as genetic material. J Biol Chem 274: 555–558

      Wickner RB, Masison DC, Edskes HK & Maddelein ML (1996) Prions of yeast, [PSI] and [URE3], as models for neurodegenerative diseases. Cold Spring Harb Symp Quant Biol 61: 541–550

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this MS, Mrj - a member of the JDP family of Hsp70 co-chaperones was identified as a regulator of the conversion of Orb2A (the Dm ortholog of CPEB) to its prion-like form.

      In drosophila, Mrj deletion does not cause any gross neurodevelopmental defect nor leads to detectable alterations in protein homeostasis. Loss of Mrj, however, does lead to altered Orb2 oligomerization. Consistent with a role of prion-like characteristics of Orb2 in memory consolidation, loss of Mrj results in a deficit in long-term memory.

      Aside from the fact that there are some unclarities related to the physicochemical properties of Orb2 and how Mrj affects this precisely, the finding that a chaperone could be important for memory is an interesting observation, albeit not entirely novel.

      In addition, there are several minor technical concerns and questions I have that I feel the authors should address, including a major one related to the actual approach used to demonstrate memory deficits upon loss of Mrj.

      Significance

      Figure 1 (plus related Supplemental figures):

      • There seem to be two isoforms of Mrj (like what has been found for human DNAJB6). I find it striking to see that only (preferentially?) the shorter isoform interacts with Orb2. For DNAJB6, the long isoform is mainly related to an NLS and the presumed substrate binding is identical for both isoforms. If this is true for Dm-Mrj too, the authors could actually use this to demonstrate the specificity of their IPs where Orb2 is exclusively non-nuclear?
      • I would be interested to know a bit more about the other 5 JDPs that are interactors with Orb2: are the human orthologs of those known? It is striking that these other 5 JDPs interact with Orb2 in Dm (in IPs) but have no impact on Sup35 prion behavior. Importantly, this does not imply they may not have impact on the prion-like behavior of other Dm substrates, including Dm-Orb2.
      • The data in panels H,I indeed suggest that Mrj1 alters the (size of) the oligomers. It would be important to know what is the actual physicochemical change that is occurring here. The observed species are insoluble in 0.1 % TX100 but soluble in 0.1% SDS, which suggest they could be gels, but not real amyloids such as formed by the polyQ proteins that require much higher SDS concentrations (~2%) to be solubilized. This is relevant as Mrj1 reduces polyQ amyloidogenesis whereas is here is shown to enhance Orb2A oligomerization/gelidification. In the same context, it is striking to see that without Mrj the amount of Orb2A seems drastically reduced and I wonder whether this might be due to the fact that in the absence of Mrj a part of Orb2A is not recovered/solubilized due to its conversion for a gel to a solid/amyloid state? In other words: Mrj1 may not promote the prion state, but prevents that state to become an irreversible, non-functional amyloid?

      Figure 2 (plus related Supplemental figures):

      • It may be good for clarity to refer to the human Mrj as DNAJB6 according to the HUGO nomenclature. Also, the first evidence for its oligomerization was by Hageman et al 2010.
      • It is striking to see that Mrj co-IPs with Hsp70AA, Hsp70-4 but not Hsp70Cb. The fact that interactions were detected without using crosslinking is also striking given the reported transient nature of J-domain-Hsp70 interactions Together, this may even suggest that Mrj-1 is recognized as a Hsp70 substrate (for Hsp70AA, Hsp70-4 but not Hsp70Cb) rather than as a co-chaperone. In fact, a variant of Mrj-1 with a mutation in the HPD motif should be used to exclude this option.
      • The rest of these data reconfirm nicely that Mrj/DNAJB6 can suppress polyQ-Htt aggregation. Yet note that in this case the oligomers that enter the agarose gel are smaller, not bigger. This argues that Mrj is not an enhancer of oligomerization, but rather an inhibitor of the conversion of oligomers to a more amyloid like state.

      Figure 3:

      • The finding that knockout of DNAJB6 in mice is embryonic lethal is related to a problem with placental development and not embryonic development (Hunter et al, 1999; Watson et al, 2007, 2009, 2011) as well recognized by the authors. Therefore, the finding that deletion of Dm-Mrj has no developmental phenotype in Drosophila may not be that surprising.
      • It is a bit more surprising that Mrj knockout flies showed no aggregation phenotype or muscle phenotype, especially knowing that DNAJB6 mutations are linked to human diseases associated with aggregation (again well recognized by the authors). However, most of these diseases are late-onset and the phenotype may require stress to be revealed. So, while important to this MS in terms of not being a confounder for the memory test, I would like to ask the authors to add a note of caution that their data do not exclude that loss of Mrj activity still may cause a protein aggregation-related disease phenotype. Yet, I also do think that for the main message of this MS, it is not required to further test this experimentally.

      Figure 4:

      • IPs were done with Orb2A as bite and clearly pulled down substantial amounts of GFP-tagged Mrj. For interactions with Orb2B, a V5-tagged Mrj was use and only a minor fraction was pulled down. Why were two different Mrj constructs used for Arb2A and Orb2b?
      • In addition, I think it would be important what one would see when pulling on Mrj1, especially under non-denaturing conditions and what is the status of the Orb2 that is than found to be associated with Mrj (monomeric, oligomeric and what size).
      • This also relates to my remark at figure 1 and the subsequent fractionation experiments they show here in which there is a slight (not very convincing) increase in the ratio of TX100-soluble and insoluble (0.1% SDS soluble) material. My question would be if there is a remaining fraction of 0.1% insoluble (2% soluble) Orb2 and how Mrj affects that? As stated before, this is (only) mechanistically relevant to understanding why there is less oligomers of Orb2 in terms of Mrj either promoting it or by preventing it to transfer from a gel to a solid state. The link to the memory data remains intriguing, irrespective of what is going on (but also on those data I do have several comments: see below).
      • I also find the sentence that "Mrj is probably regulating the oligomerization of endogenous Orb2 in the brain" somewhat an overstatement. I would rather prefer to say that the data show that Mrj1 affects the oligomeric behavior/status of Orb2.

      Figure 5:

      • To my knowledge, the Elav driver regulates expression in all neurons, but not in glial cells that comprise a significant part of the fly heads/brain. The complete absence of Mrj in the fly-heads when using this driver is therefore somewhat surprising. Or do we need to conclude from this that glial cells normally already lack Mrj expression?
      • Why not use these lines also for the memory test for confirmation? I understand the concerns of putative confounding effects of a full knockdown (which were however not reported), but now data rely only on the mushroom body-specific knockdown for the 201Y Gal4 line, for which the knockdown efficiency is not provided. But even more so, here a temperature shift (22oC-30oC) was required to activate the expression of the siRNA. For the effects of this shift alone no controls were provided. The functional memory data are nice and consistent with the hypothesis, but can it be excluded that the temperature shift (rather than the Mrj) knockdown has caused the memory defects? I think it is crucial to include the proper controls or use a different knockdown approach that does not require temperature shifts or even use the knockout flies.

      Figure 6:

      The finding of a co-IP between Rpl18 and Mrj (one-directional only) by no means suffices to conclude that Mrj may interact with nascent Orb2 chains here (which would be the relevant finding here). The fact that Mrj is a self-oligomerising protein (also in vitro, so irrespective of ribosomal associations!), and hence is found in all fractions in a sucrose gradient, also is not a very strong case for its specific interaction with polysomes. The finding that there is just more self-oligomerizing Orb2A co-sedimenting with polysomes in sucrose gradients neither is evidence for a direct effect of Mrj enhancing association of Orb2A with the translating ribosomes even though it would fit the hypothesis. So all in all, I think the data in this figure and non-conclusive and the related conclusions should be deleted.

      Overall, provided that proper controls/additional data can be provided for the key experiments of memory consolidation, I find this an intriguing study that would point towards a role of a molecular chaperone in controlling memory functions via regulating the oligomeric status of a prion-like protein and that is worthwhile publishing in a good journal.

      However, in terms of mechanistical interpretations, several points have to be reconsidered (see remarks on figure 1,4); this pertains especially to what is discussed on page 13. In addition, I'd like the authors to put their data into the perspective of the findings that in differentiated neurons DNAJB6 levels actually decline, not incline (Thiruvalluvan et al, 2020), which would be counterintuitive if these proteins are playing a role as suggested here in memory consolidation.

    1. Reviewer #3 (Public Review):

      In empirical data, the dependence of microbial diversity on environmental temperature can take multiple different functional forms, while the previous theory has not established a clear understanding of when the temperature-dependence of diversity should take a particular form, and why. The authors seek to understand what forms are possible, and when they will occur, via analysis of the feasibility (i.e. positivity) of Lotka-Volterra equation solutions. This is combined with an assumption for the way that species' growth rates depend on temperature, along with an assumption for the way species interaction rates depend on temperature. Together, this completely specifies the form of the Lotka-Volterra equations, and whether all species in the model can coexist indefinitely at a given temperature, or whether only a lower-diversity subset can persist.

      The overall goal is valuable, and the overall approach of using this classic model of species interactions is justifiable. My main question marks relate to the way the conditions on feasibility (i.e. when all species will have positive equilibria), whether and when we need to consider the stability of these feasible solutions, and finally how general the way in which model parameters are specified to depend on temperature. I will expand on these three issues below. A more minor issue is that the authors set up this problem with extensive reference to the interaction of consumers and resources, referencing previous approaches that explicitly model these. Since resources are not explicitly present in the Lotka-Volterra formalism, it would be helpful to have a clearer justification for the authors' rationale in choosing this kind of model.

      (1) Conditions on growth and interaction rates for feasibility and stability. The authors approach this using a mean field approximation, and it is important to note that there is no particular temperature dependence assumed here: as far as it goes, this analysis is completely general for arbitrary Lotka-Volterra interactions.

      However, the starting point for the authors' mean field analysis is the statement that "it is not possible to meaningfully link the structure of species interactions to the exact closed-form analytical solution for [equilibria] 𝑥^*_𝑖 in the Lotka-Volterra model.

      I may be misunderstanding, but I don't agree with this statement. The time-independent equilibrium solution with all species present (i.e. at non-zero abundances) takes the form

      x^* = A^{-1}r

      where A is the inverse of the community matrix, and r is the vector of growth rates. The exceptions to this would be when one or more species has abundance = 0, or A is not invertible. I don't think the authors intended to tackle either of these cases, but maybe I am misunderstanding that.

      So to me, the difficulty here is not in writing a closed-form solution for the equilibrium x^*, it is in writing the inverse matrix as a nice function of the entries of the matrix A itself, which is where the authors want to get to. In this light, it looks to me like the condition for feasibility (i.e. that all x^* are positive, which is necessary for an ecologically-interpretable solution) is maybe an approximation for the inverse of A---perhaps valid when off-diagonal entries are small. A weakness then for me was in understanding the range of validity of this approximation, and whether it still holds when off-diagonal entries of A (i.e. inter-specific interactions) are arbitrarily large. I could not tell from the simulation runs whether this full range of off-diagonal values was tested.

      As a secondary issue here, it would have been helpful to understand whether the authors' feasible solutions are always stable to small perturbations. In general, I would expect this to be an additional criterion needed to understand diversity, though as the authors point out there are certain broad classes of solutions where feasibility implies stability.

      (2) I did not follow the precise rationale for selecting the temperature dependence of growth rate and interaction rates, or how the latter could be tested with empirical data, though I do think that in principle this could be a valuable way to understand the role of temperature dependence in the Lotka-Volterra equations.

      First, as the authors note, "the temperature dependence of resource supply will undoubtedly be an important factor in microbial communities"

      Even though resources aren't explicitly modeled here, this suggests to me that at some temperatures, resource supply will be sufficiently low for some species that their growth rates will become negative. For example, if temperature dependence is such that the limiting resource for a given species becomes too low to balance its maintenance costs (and hence mortality rate), it seems that the net growth rate will be negative. The alternative would be that temperature affects resource availability, but never such that a limiting resource leads to a negative growth rate when a taxon is rare.

      On the other hand, the functional form for the distribution of growth rates (eq 3) seems to imply that growth rates are always positive. I could imagine that this is a good description of microbial populations in a setting where the resource supply rate is controlled independently of temperature, but it wasn't clear how generally this would hold.

      Secondly, while I understand that the growth rate in the exponential phase for a single population can be measured to high precision in the lab as a function of temperature, the assumption for the form of the interaction rates' dependence on temperature seems very hard to test using empirical data. In the section starting L193, the authors seem to fit the model parameters using growth rate dependence on temperature, but then assume that it is reasonable to "use the same thermal response for growth rates and interactions". I did not follow this, and I think a weakness here is in not providing clear evidence that the functional form assumed in Equation (4) actually holds.

    1. Dynamic Scoping Modern programming languages implement global and/or module scope and/or lexical scope. A name defined globally is available everywhere in the code. A name given module scope is only directly accessible within that module (and may be available outside if qualified with the module name). A name defined with lexical scope is available inside the current lexical block and (typically) the blocks it encloses. All three of these are statically defined: the meaning of a variable name can be determined at compilation time. In the past, languages such as Perl also offered dynamic scope. This looks a little like lexical scope, except the names defined in a block are available not just in that block but also in all the functions invoked by that block, and functions invoked below them, and so on. The scope is only determined at runtime: the name exists for the duration of the block that defines it, and it exists in all functions executed during that time. As you can imagine, this was both powerful and widely abused: it’s hard to know just what a name means when its definition depends on the execution flow. This is one reason we don’t often see dynamic scoping in current languages. Unison’s abilities are a form of dynamic scoping. However, they overcome many of the issues with previous kinds of dynamic scoping because they are fully type safe. You cannot accidentally use a name injected freom a higher context, and you always know where every name comes from.

      This is really fascinating! In some ways this makes me think of React's context which enables passing data deeply down a component tree.

    1. . They were for the most partcut from extensive typescripts of his, other copies of which stillexist. Some few were cut from typescripts which we have notbeen able to trace and which it is likely that he destroyed but forthe bits that he put in the box.

      In Zettel, the editors indicate that many of Wittgenstein's zettels "were for the most part cut from extensive typescripts of his, other copies of which still exist." Perhaps not knowing of the commonplace book or zettelkasten traditions, they may have mistook the notes in his zettelkasten as having originated in his typescripts rather than them having originated as notes which then later made it into his typescripts!

      What in particular about the originals may have made them think it was typescript to zettel?

    1. Author Response

      Reviewer #1 (Public Review):

      Part 1: Type 2 deiodinase

      Table I is supposed to clarify and summarize the results but brings confusion. The text says that table I supports the claim that "in the cerebellum, Luc-mRNA was lower in the Ala92-Dio2 mice" whereas figure 1G does not show any difference. It is unclear whether Table I and figure 1 report the same data, and what the statistical tests are actually addressing (effect of genotype vs effect of treatment, whereas what matters here is only the interaction between genotype and treatment). Overall, it is not acceptable to present quantitative data without giving numbers, standard deviation, p-value, etc. as in Table I.

      Thank you. We agree with the reviewer. We intended to minimize the amount of data presented, which was already very large, and therefore only presented the ratios of thr/alaDio2 and which created confusion. This part was removed from the new version of the MS.

      Also, evaluating T3 signaling by only looking at the luc reporter and the Hprt housekeeping gene is not always sufficient (many T3 responsive genes can be found in the literature and more than one housekeeping gene should be used as a reference).

      Thank you. The advantage of using the THAI mouse is that the Luciferase reporter gene is driven by a promoter that is only sensitive to T3, which is not the case for any other T3-responsive responsive gene. The Hprt housekeeping signal was stable among the samples, and the differences observed were not caused by differences in the housekeeping gene expression. This part was removed from the new version of the MS.

      Another important weakness is that the wild-type mice have a proline at position 92. Why not include them? In absence of structural prediction, one wonders whether the mouse models are relevant to the human situation and whether the absence of the proline reduces the enzymatic activity when substituted for an Ala or Thr. This might have been addressed in previous work, but the authors should explain.

      The position 92 in DIO2 is occupied by Thr in humans. Its Km(T4) is indistinguishable from mouse Dio2 which has a Pro in the position 92 (4nM vs. 3.1nM) [PMID 8754756; PMID: 10655523]. Humans also carry an Ala in position 92. Comparing the two human alleles is the purpose of the study.

      Experiment 2: Ala92-Dio2 Astrocytes Have Limited Ability to Activate T4 to T3

      Here, the authors use primary cell cultures from different areas of the brain to measure the in vitro conversion of T4 to T3 by Dio2. They find that hippocampus astrocytes are less active, notably if they come from Ala92-Dio2 mice.

      This part has the following weaknesses:

      • This result correlates with the results from Fig 1F however the difference between Ala92-Dio2 and Thr92-Dio2 is significant in vitro, but not in vivo.

      From a deiodinase perspective, TH signaling in vivo depends on the presence of D2 (expressed in glial cells) and D3 (expressed in neurons), whereas in vitro it only depends on D2. In fact, D2 and D3 are known for a reciprocal regulation to preserve TH signaling [PMID: 33123655]. Thus, it is conceivable that the differences observed between the two models are explained by the intrinsic differences in the models.

      What matters is not the activity/astrocytes, but the total activity of the brain area, which depends on the number of astrocytes x individual activity. This is not measured.

      We respectfully disagree with the reviewer. The total D2 activity in a brain area depends fundamentally on the number of astrocytes in that area and on the intrinsic activity of the enzyme. The reviewer is suggesting that having an area denser in astrocytes expressing a catalytically less active D2 preserves a normal local T3 production. This is unlikely to be the case because we have no evidence that the density of astrocytes is different in Ala-DIo2 mice. Please keep in mind that the intimate relationship between astrocytes and neurons is what defines the microenvironment that surrounds the neuron. By separating astrocytes from neurons we are able to measure T3 production that is occurring in the neuronal microenvironment and show that cells obtained from AlaDio2 mouse produce less T3.

      • What the authors called 'primary astrocytes' is an undefined mixed population of glial cells, (including radial glial cells, stem cells, ependymal cells, progenitor cells, etc...) that proliferated differentially for more than a week in culture, among which an unknown ratio expresses Dio2. The cellular model is thus poorly characterized, and the interpretation must be prudent.

      • Again, wild-type mice are not included.

      Thank you. We now include a reference to illustrate the types and percentages of cells present in our cultures. Given that the study is to compare the Thr92 and the Ala92 alleles, which are both present in humans, we did not believe it was necessary to include them here. Please note (as explained above) the Km(T4) for Thr92 and Pro92-Dio2 is indistinguishable.

      Part 2: Neuronal response to T3 Involves MCT8 and Retrograde TH transport

      The authors next move to primary neuronal cultures, prepared from the fetal cortex which they grow in the microfluidic chamber to study axonal transport. This is a surprising move: the focus is not on Dio2 anymore, but on the MCT8 transporter, which is known in humans to play an important role to transfer TH into the brain. It is expressed mainly in glia, but also in neurons. They study the influence of endosomes and type 3 deiodinase on the trafficking and metabolism of TH.

      Thank you.

      It would be useful to perform an experiment, in which radioactive T3 is introduced in the "wrong" side of the chamber, in an attempt to detect a possible anterograde transport. This would address the possibility that Mct8 also promotes efflux and control so that the chamber is not leaking.

      Thank you. To satisfy the reviewer, we have conducted three new experiments adding 125IT3 in the MC-CS. The first experiment verified that the T3 transport in the cortical neurons also occurs anterogradely. The second experiment showed that the anterograde transport depends on mct8. The third experiment shows that D3 activity in the neuronal soma is limiting the amount of T3 transported along axons. We have included a new paragraph in the results section describing these experiments (Line 154 to 167), and a new supplementary figure (Figure 3—figure supplement 3). We have also discussed these new findings. Line 383 to 386. In every experiment, we have controlled for the possibility of leaking using one device without neurons that received radioactive T3. After 24 and 72h samples from the opposite side were obtained but did not contain any radioactive T3. We refer the reviewer to figure 1, where this is explained.

      The authors use sylichristin as an inhibitor of Mct8, to demonstrate that transport is Mct8 dependent. They do not provide indications or references that would clearly indicate that this drug is a fully selective antagonist of Mct8 (but not of Oatp1c1, Mct10, Lat1, Lat2, etc., the other TH transporters). A good alternative would be to use Mct8 KO mice as controls.

      Thank you. We refer the reviewer to reference 27 [J. Johannes et al., Silychristin, a Flavonolignan Derived from the Milk Thistle, Is a Potent Inhibitor of the Thyroid Hormone Transporter MCT8. Endocrinology 157, 1694-1701 (2016)] clearly indicating that Silychristin has a remarkable specificity toward MCT8. While using mct8 KO is interesting, it would have prevented us from testing some of our hypotheses. Being able to selectively inhibit Mct8 either in the MC-CS or in the MC-AS was a clear advantage. For example, pls see the experiment in which we add T3 in the MC-AS and the silychristin in the MC-CS (Fig. 3F). Here, we discovered new roles of mct8, such as its involvement in the release of T3 from the endosomes (line 228 to 231).

      The B27 used in primary neuronal culture might contain TH. This is not easy to know, but at least some batches do.

      Thank you. While the neurons were cultured in B27, all experiments were performed in cells incubated with neurobasal only (B27 was removed 24 earlier). This was not clear in the initial version, where there was only a vague reference in the legend of figure 3F. Now, this has been explained in the footnote of figure 3 and in line 207.

      The presence of astrocytes, probably expressing Mct8 and Dio2 is inevitable in primary neuronal cultures, and is not mentioned, but might interfere with TH metabolism.

      Thank you. We were aware that, under normal conditions, primary neuronal culture contains 25% of astrocytes. This was however minimized/eliminated by 2-day culture with the anti-mitotic cytosine arabinoside, which restricts astrocytes and microglia to <0.01 in this type of culture. This was explained in the initial version of the manuscript in the material and methods section (lines x to x) and supported with reference 53 (reference 57 in the previous version).

      Part 3: T3 Transport Triggers Localized TH Signaling in the Mouse Brain

      The authors return to in vivo experiments, implanting T3 crystals, labeled or not with radioactive iodine. They do so in the hypothalamus, where they address the retrograde transport of TH in TRH neurons, and in the cortex, looking for contralateral transport. These data are the most difficult to interpret. - First, T3 is hydrosoluble and would probably migrate without active transport.

      Thank you. Please note that at no point we characterized the T3 transport “active transport”, which by definition is an ATP-dependent process. Please note that to address the issue raised by the reviewer “migrate without active transport”, in both experimental approaches, we included controls to assess the random diffusion of T3.

      In hypothalamic studies, we used the (i) cerebral cortex and (ii) the lateral hypothalamus, a region that is immediately adjacent to the PVN. Neither region exhibit an axonal connection to the median emminence. The results, in both cases, show that the presence of radioactive T3 in the control areas was minimal when compared to the PVN (Fig. 5C).

      In the cerebral cortical studies, we included ipsi- and contra-lateral hypothalamic measurements that served as controls given the absence of a connection between the cortex and the hypothalamus. Accordingly, T3 signaling was not detected in any of the control regions (Fig. 6C previous version; now figure 5). Thus, these controls indicate that it is unlikely that the results could be explained by “migrate without active transport” of T3.

      • The authors do not demonstrate that these specific neuronal populations contain Mct8, and that these observations are connected to the previous in vitro observation (which used cortical neurons prepared from the fetus).

      Thank you. In the previous version, we did not make it abundantly clear that the EM pictures in Fig. 3D-G (previous version; now figure 2 D-G) were from neurons in the mouse motor cortex (this information is now explained in lines 149 to 151), which is where we inserted the T3 crystals. In addition, we have done more histological work on the brain M1 (cortex) of adult mice and found that many neurons in the M1 express D3 and Mct8—lines 433-434 and Figure 5 G-K (along with histological studies showing the specificity of the ab against D3 Fig S6).

      The possibility that astrocytes are involved, as reported in the literature, is not considered.

      • Here again, using Mct8KO mice would greatly help to interpret the data. In particular, the experiments with cold T3 involve a 48h delay which is very long in comparison to the 30 minutes required for long-distance transfer of radioactive T3.

      Thank you. We are unsure about the question posed by the reviewer. We are wondering how would astrocytes play a role in inter-hemispheric transport of T3? Given that astrocytes are not known to project across long distances, we have not considered this possibility. We agree that using the Mct8KO mouse could have provided supporting evidence of the role played by Mct8 in this process, but please keep in mind that the Mct8KO mouse does not have or exhibits a very mild brain phenotype, indicating that during development compensatory mechanisms have occurred that obviate the function of the transporter. This compensatory mechanism most likely involved Oatp1c1, given that only the double Mct8 and Oatp1c1 KO mouse develops a significant phenotype. This consideration directed us to the utilization of sylycristin, the highly selective Mct8 inhibitor, which disrupts the Mct8 pathway in a mouse that developed normally.

      The two approaches used to demonstrate neuronal T3 transport in vivo are fundamentally different. The hypothalamus experiments employed radioactive T3, whereas T3 crystals were used in the cerebral cortex. The first approach studied T3 transport whereas the second studied downstream T3 effects, logically requiring more time. The solid T3 implant requires time to release T3 and activate gene expression. In the original paper that utilized T3 implants in the rodent brain, samples were processed after 4 days. (Dyess et al. 1988 Endo; PMID 3139393)

      Discussion

      Considering the diversity of questions that are addressed in the study, it is not surprising that the discussion is not covering all aspects. The authors implicitly consider that their conclusions can be extended to all neurons, while they use in their experiments a variety of different populations coming from either the fetal cortex, hippocampus, adult cortex, or hypothalamus. The claim that they discovered a mechanism applying to all neurons is not supported by the data.

      Thank you. We agree with the reviewer: the high number of neuronal subtypes might include different mechanisms in T3 transport. Our studies involved cortical (central) and dorsal root ganglia (peripheral) neurons in vitro and cortical and hypothalamic neurons in vivo. Thus we think that the described mechanism is not confined to specific neuronal subtypes. The discussion has been modified accordingly (lines 402 to 411).

      Moreover, we have done immunofluorescence studies to characterize the neurons present in the MC-CS better. We have found that all the neurons residing in the MC-CS are excitatory, expressing the vesicular glutamate transporter 1 (Vglut1). But no neurons were expressing GAD67, a marker for inhibitory neurons Figure 5—figure supplement 5). This is supported by the fact that during the mouse's brain development, the embryonic days 14.5 to 17.5 is the birth date of layer 4 and 2/3 excitatory neurons (PMID: 34163074). These neurons are migrating and have not extended their cellular processes, making them more likely to survive the isolation protocol from the cortex. On the other hand, the neurons (mostly excitatory) already residing in the cortex may have expanded their processes and changed their morphology, making them less capable of surviving the isolation process.

      Some highly relevant literature is not cited. In particular:

      • Mct8 KO mice do not have marked brain hypothyroidism (PMID: 24691440) which at least suggests that the pathway discovered by the authors can be efficiently compensated by alternative pathways.

      We agree with the reviewer. As mentioned above, a compensatory mechanism triggered during development “compensates” for the inactivation of Mct8. That, however, does not mean that mct8 is not critically important. We have added that limitation to the discussion (lines 342); ref 46.

      • Dio3 KO only increases T3 signaling in a few brain areas and only in the long term (PMID: 20719855).

      Thank you. That is now included in the ms; ref 25.

      • Anterograde transport of T3 has been reported for some brainstem neurons (PMID: 10473259).

      Thank you. This was our mistake, indeed. We had worked on several versions of the manuscript that included references to her seminal work but unfortunately deleted it from the final version. This is now included in refs 48 and 49.

      Reviewer #2 (Public Review):

      Salas-Lucia et al. investigated two main questions: whether the Thr92Ala-DIO2 mutation impairs brain responsiveness to T4 therapy under hypothyroidism induction and the mechanisms of neuronal retrograde transport of T3. They find that the Thr92Ala-DIO2 mutation reduces T4-initiated T3 signaling in the hippocampus, but not in other brain regions. Using neurons cultured in microfluidic chambers, they further describe a novel mechanism for retrograde transport of T3 that depends on MCT8 and endosomal loading (possibly protecting T3 from D3-mediated cytosolic degradation) and microtubule retrotransport. Finally, they present evidence of retrograde transport of T3 through hypothalamic projections and interhemispheric connections in vivo. The main novelty of this study is the delineation of the mechanism of T3 retrograde transport in neurons. This is interesting from the cell biology perspective. The notion of impaired hippocampal T3 signaling is relevant for the cognitive outcomes of hypothyroidism and its associated therapy.

      Thank you.

      Although the data are exciting and relevant for the community, some issues need to be addressed so that conclusions are more clearly justified by data:

      1) The title and the abstract mean that dissecting this novel mechanism of T3 retrograde transport may help improve cognition or brain responsiveness in patients taking T4 or L-T3 therapy. However, how initial results (Figs 1 and 2) connect to later data is not essentially clear. For example, do Thr92Ala-DIO2 mice present altered retrograde transport of T3? Would stimulation of retrograde transport in Thr92Ala-DIO2 mice rescue neurological phenotypes? Can the authors address this experimentally?

      Thank you. These are all interesting points raised by the reviewer. However, the three reviewers felt that a connection between the studies in astrocytes and the studies in neurons was missing, and complained about the disjoint nature of the manuscript. To satisfy the reviewers we removed from the MS the experiments with astrocytes and DIO2 polymorphism, and focused on the neuronal transport of T3.

      2) Although the authors present in vivo evidence of retrograde T3 transport in the hypothalamus and motor cortex, given the select susceptibility of the hippocampus to hypothyroidism, it would be especially interesting to test whether this mechanism also happens in a hippocampal circuit (CA3-CA1 Schaffer collaterals, mossy fibers or perforant pathway).

      Thank you. We agree that this would be interesting, but technically challenging. Nonetheless, we intend to study this in the future.

      3) Table 1 should present the raw values for Ala92-DIO2 mice and treatments instead of only displaying the direction of change and statistical significance. From Panels 1E-J, it is unclear if Thr92Ala-DIO2 mice or treatments caused any real change in brain regions other than the hippocampus.

      Thank you. These experiments were removed from the new version of the MS.

      4) The authors put forward the notion that a rapid nondegradative endosome/lysosome incorporation protects T3 from D3 degradation in the cytosol. Their experiments with pharmacological modulation of MCT8, lysosomes, and microtubules are in this direction. However, they do not represent an unequivocal demonstration of this mechanism. Therefore, the authors should be more cautious in their interpretation and discuss the limitations of their approaches.

      Thank you. The manuscript was edited to reflect these important points.

      Reviewer #3 (Public Review):

      Initially, Salas-Lucia et al examined the effect of deiodinase polymorphism on thyroid hormone-medicated transcription using a transgenic animal model and found that the hippocampus may be the region responsible for altered behavior. Then, by changing to topic completely, they examined T3 transport through the axon using a compartmentalized microfluid device. By using various techniques including an electron microscope, they identified that T3 is uptaken into clathrin-dependent, endosomal/non-degradative lysosomes (NDLs), transported in the axon to reach the nucleus and activate thyroid hormone receptor-mediated transcription.

      Although both topics are interesting, it may not be appropriate to deal with two completely different topics in one paper. By deleting the topic shown in Table 1, Figure 1, and Figure 2, the scope of the manuscript can be more clear.

      Thank you. We did as suggested by the reviewer. These studies were removed from the present version of the ms.

      Their finding showing that triiodothyronine is retrogradely transported through axon without degradation by type 3 deiodinase provides a novel pathway of thyroid hormone transport to the cell nucleus and thus can contribute greatly to increasing our understanding of the mechanisms of thyroid hormone action in the brain.

      Thank you.

    1. Author Response

      Reviewer #2 (Public Review):

      In their study the authors aimed to investigate the dissemination of Enterobacterales plasmids between geographically and temporally restricted isolates recovered from different niches, such as human blood stream infections, livestock, and wastewater treatment works. By using a very strict similarity threshold (Mash distance < 0.0001) the authors identified so-called groups of near-identical plasmids in which plasmids from different genera, species, and clonal background co-clustered. Also, 8% of these groups contained plasmids from different niches (e.g., human BSI and livestock) while in 35% of these cross-niche groups plasmids carried antimicrobial resistance (AMR) genes suggesting recent transfer of AMR plasmids between these ecological niches.

      Next, the authors set-out to examine the wider plasmid population structure by clustering plasmids based on 21-mer distributions capturing both coding and non-coding plasmid regions and using a data-driven threshold to build plasmid networks and the Louvain algorithm to detect the plasmid clusters. This yielded 247 clusters of which almost half of the clusters contained BSI plasmids and plasmids from at least one other niche, while 21% contained plasmids carrying AMR genes. To further assess cross-niche plasmids similarities, the authors performed an additional plasmid pangenome-like analysis. This highlighted patterns of gain and loss of accessory plasmid functions in the background of a conserved plasmid backbone.

      By comparing plasmid core gene or plasmid backbone phylogenies with chromosome core gene phylogenies, the authors assessed in more detail the dissemination of plasmids between humans and livestock. This indicated that, at least for E. coli, AMR dissemination between human and livestock-associated niches is most likely not the result of clonal spread but that plasmid movement plays an important role in cross-niche dissemination of AMR.

      Based on these data the authors conclude that in Enterobacterales plasmid spread between different ecological niches could be relatively common, even might be occurring at greater rates than estimated, as signatures of near-identity could be transient once plasmids occupy and adept to a different niche. After such a host jump, subsequent acquisition, and loss of parts of the accessory plasmid gene content, as a result of plasmid evolution after inter-host transfer, may obscure this near-identity signature. As stated by the authors, this will raise challenges for future One Health-based genomic studies.

      Strengths

      The article is well written with a clear structure. The authors have used for their analysis a comprehensive collection of more than 1500 whole genome sequenced and fully assembled isolates, yielding a dataset of more than 3600 fully assembled plasmids across different bacterial genera, species, clonal backgrounds, and ecological niches. A strong asset of the collection, especially when analyzing dissemination of AMR contained on plasmids, is that isolates were geographically and temporally restricted. Bioinformatic analyses used to discern plasmid similarity are beyond state-of-the-art. The conclusions about dissemination of plasmids between genera, species, clonal background and across ecological niches are well supported by the data. Although conclusions about inter-host plasmid dissemination patterns may have been drawn before, this is to my knowledge the first time that patterns of dissemination of plasmids have been studied at such a high-level of detail in such a well selected dataset using so many fully assembled genomes.

      Weaknesses

      One conclusion that is not entirely supported by the data is the general statement in the discussion that "cross-niche plasmid in not driven by clonal lineages". From the tanglegram, displaying the low congruence between the plasmid and chromosome core gene phylogeny in E. coli, this conclusion is probably valid for E. coli, but this not necessarily means that this is also the case for the other Enterobacterales genera and species included in this study. For these other genera, the data supporting this conclusion are not given, probably because total number of isolates for certain genera were low, or because certain niches were clearly underrepresented in certain genera.

      Thank you for reviewing our manuscript.

      We agree that this statement in the conclusion was too general, and have adapted it (lines 407-409):

      “By examining plasmid relatedness compared to bacterial host relatedness in E. coli, we demonstrated that plasmids seen across different niches are not necessarily associated with clonal lineages”

      In the limitations section of the Discussion, we have also referenced this specifically as a limitation (lines 422-424):

      “Although we evaluated four bacterial genera, 72% (1,044/1,458) of our sequenced isolates were E. coli, and so our analyses and findings are particularly focused on this species.”

      Furthermore, the BSI as well as the livestock niches were analyzed as single niches while the BSI niche included both nosocomial and community-derived BSI isolates and the Livestock niche included samples from different livestock-related hosts. Given the fact that a substantial number of plasmids were available from cattle, sheep, pigs, and poultry, it would be interesting to see whether particular livestock hosts were more frequently found in the cross-niche plasmid clusters than other livestock hosts and whether the BSI plasmids in these cross-niche clusters were predominantly of community or nosocomial origin.

      We agree that analyses which distinguish between nosocomial/community acquired BSI isolates would be interesting further work, but are beyond the scope of this study. Our analysis of the BSI/livestock cross-niche near-identical plasmid groups details the livestock hosts involved (lines 144-154). Briefly, of the n=8 BSI/livestock cross-niche groups, these involved

      • pig/poultry (1/8)

      • poultry (1/8)

      • pig (2/8)

      • sheep (3/8)

      • cattle/pig/poultry (1/8)

      We have added a note of explanation in the methods to explain how the distance threshold we use for near-identical clustering is maximally conservative at small plasmid sizes (a single SNP produces a new plasmid cluster) but remains highly conservative (tens of SNPs) at large plasmid sizes.

      We have carefully considered the point about whether particular hosts were more frequently found in cross-niche plasmid clusters. However, we do not think it is easy to infer whether a particular livestock host is represented more frequently in these cross-niche events than would be expected from chance, given the low density of the sampling.

      We have reorganised the paragraph in lines 144-154 to provide more clarity on the groups’ niches.

      “Sharing between BSI and livestock-associated isolates was supported by 8/17 cross-niche groups (n=45 plasmids). Of these, n=3/8 groups contained BSI/sheep plasmids: one group contained mobilisable Col-type plasmids, the remaining two groups contained conjugative FIB-type plasmids. Of these, one group contained plasmids carrying the AMR genes aph(3'')-Ib, aph(6)-Id, blaTEM-1, dfrA5, sul2, and the other group contained plasmids carrying the MDR efflux pump protein robA (see Materials and Methods). A further n=2/8 groups contained BSI/pig mobilisable Col-type plasmids, of which one group other carried the AMR genes aph(3'')-Ib, aph(6)-Id, dfrA14, and sul2. Lastly, n=1/8 groups contained BSI/poultry non-mobilisable Col-type plasmids, n=1/8 contained BSI/pig/poultry/influent non-mobilisable Col-type plasmids, and n=1/8 contained BSI/cattle/pig/poultry/influent mobilisable Col-type plasmids.”

      We have also added this as a limitation in the discussion (lines 424-426):

      “Additionally, we did not sample livestock-associated niches densely enough to explore individual livestock types (cattle/pigs/poultry/sheep) sharing plasmids with BSI isolates (see Appendix 1 Fig. 9).”

      We have already recognised that our culture methods may have affected our sensitivity to detect Klebsiella spp. isolates in the livestock/environmental samples – we have expanded on this to explicitly highlight that this may have affected our capacity to evaluate Klebsiella-associated plasmids (lines 443-444):

      “This limited our ability to study the epidemiology of livestock Klebsiella plasmids.”

    1. Author Response

      Reviewer #1 (Public Review):

      We would like to thank reviewer #1 for her helpful comments and would like to respond to these as follows:

      1) “Editing efficiencies were variable (99% to 0%) depending on the species, being worst for L. major.”

      It is true that the editing efficiency was different in each species and worst for L. major. However, it is important to note that these efficiencies varied not only for each species but also amongst genes and especially chosen sgRNA sequences. Variations in efficiency across sgRNAs targeting the same gene and locus is a common problem in any CRISPR approach. We made this clearer in our revised manuscript (line 670 – 673).

      2) “The use of premature termination codons also clearly raises issues for false positives and negatives, especially as there is no evidence for nonsense-mediated mRNA decay in Leishmania.”

      We have now included in our revised manuscript that it is currently unclear whether a classical nonsense-mediated decay pathway is present in Leishmania or not. If such a pathway would be present, mutant mRNAs in which a termination codon is present within the normal open reading frame would be removed (Clayton, Open Biology 2019; Delhi et al., PLoS One 2011). But if not, remaining N-terminal protein parts could be functional and may lead to false positive and negative results. However, as reviewer #2 pointed out, this may also provide extra information about functional domains of the targeted protein and highlights that our tool can not only be used to create functional null mutants by inserting premature STOP codons but also to pursue targeted mutagenesis screens (line 674 - 683).

      3) “There are already two genome-wide screening options for Leishmania, so the advantages and disadvantages of the method proposed here need to be discussed in a much more detailed and balanced way.”

      We have revised our manuscript to include in our introduction (line 36 - 73) and discussion (line 658 - 697) a better comparison of all potential tools for genome-wide screening in Leishmania, including RNAi, bar-seq and base editing screening. We highlight why we think that base editing has unique advantages.

      4) “In the "LeishGEM" project (http://www.leishgem.org) all Leishmania mexicana genes will be knocked out and each KO will be bar-coded. At the end, 170 pooled populations of 48 bar-coded mutants will be publicly available. The only real reason the authors of the current paper give for not using this approach is that it is labour-intensive. However, LeishGEM is funded and underway, with several centres involved, so that argument is weak.”

      In our original manuscript we gave multiple reasons why we think that the LeishGEdit method, which is being used for the LeishGEM screen and has been developed by the lead author of our here presented study, has clear disadvantages compared to base editing.

      As written in our original manuscript (line 709 – 716): “However, for a bar-seq screen, each barcoded mutant needs to be created individually by replacing target genes with drug selectable marker cassettes (20,21), making them extremely labour intensive and most likely “one-offs” on a genome-wide scale. Furthermore, aneuploidy in some Leishmania species can be a major challenge for gene replacement strategies as multiple rounds of transfection or isolation of clones may be required to target genes on multi-copy chromosomes. Using gene replacement approaches it is also not feasible to study multi-copy genes that have copies on multiple chromosomes. These are major disadvantages of bar-seq screening.”

      Therefore, we still think that the main disadvantage of bar-seq screening is that it is labour-intensive as each mutant needs to be created individually. The fact that LeishGEM requires five years and several research centres to knockout all genes in just one Leishmania species is proof for this argument.

      However, to clarify our position about this further, we have listed other disadvantages of the LeishGEM screen, including difficulties of sharing mutant pools between labs, possible problems in expanding mutant pools without losing uniformity, no ability to change the composition of generated pools and limited ability to distinguish between technical failures and essentiality. If any of these problems would occur, it would require a de novo generation of barcoded mutants and therefore this is an extremely labour-intensive method for large-scale screening. We also added that bar-seq screens are not feasible in Leishmania species that display extreme cases of aneuploidy, such as L. donovani (line 59 – 73).

      Despite all these disadvantages of the LeishGEdit approach for the LeishGEM project, there are of course also clear advantages, which we also point out in our introduction (line 52 – 55).

      5) “There is also a preprint describing RNAi for functional analysis in Leishmania braziliensis.”

      Although our original manuscript included the pre-print about RNAi screening in Leishmania braziliensis already (line 706-709), we understand that this deserves a stronger discussion. We have therefore highlighted now RNAi as a possible tool for genome-wide screening in selected Leishmania species in our revised introduction (line 36 - 43). However, we also argue that RNAi approaches are at the moment only available to Leishmania of the Viannia subgenus and that RNAi activity greatly varies between the species (line 36 – 43 and 665 - 669). In addition, we discuss that the use of RNAi genome-wide screens is much less specific, as usually randomly sheared genomic DNA is used to generate RNAi libraries (line 687 - 689). Since the pre-print is now published, we have replaced the pre-print publication with the peer-reviewed one.

      Reviewer #2 (Public Review):

      We would like to thank reviewer #2 for helpful comments and would like to respond to those as follows:

      1) “Line 482 - the authors wrote 'As expected, the proportion of cells showing a motility phenotype in the IFT88 targeted L. infantum population decreased further' Why is this result expected? Presumably, this is due to the fact that cells without a functional IFT system lack flagella and grow slower so can be outcompeted by faster-growing mutants. This speaks to the major caveat highlighted by the authors in the discussion and the final small-scale screen. In a population of cells, those with deleterious mutations in an essential gene or one whose disruption results in slower growth will be outcompeted by cells in which a non-deleterious mutation has occurred, which feeds into the issue of timing.”

      As the reviewer highlighted himself, deleterious mutations that result in slower growth will be outcompeted by cells in which a non-deleterious mutation has occurred. We have stated that the complete deletion of IFT88 in Leishmania mexicana has been shown to have reduced doubling time (Beneke et al., PLoS Pathogens 2019) and are therefore most likely outcompeted from the pool (line 529 – 532 and 767 - 769).

      2) “The authors show with CRK3 this process of non-deleterious mutants outcompeting deleterious mutants does result in a detectable drop in the number of parasites with specific CRK3 guides but not in those with IFT88. Is this due to the fact that the outgrowth of the non-deleterious IFT88 mutants occurs rapidly or that the mutation of the targets in IFT88 was ineffective? The data presented in Figure 5 shows that for some species at least a mutation of the IFT88 gene was possible. This might mean that for certain genes the outgrowth occurs within the first 12 days after transfections so will not be seen using this approach, without a wider study, which is beyond the scope of this manuscript it will be difficult to know.”

      As we stated in our discussion, we did not test IFT88 guides individually in L. mexicana. Therefore, the editing rate observed for the IFT88 guides in L. major and L. infantum (Fig. 5) may differ from the editing rate in L. mexicana, which is the species we used for the pooled transfection screen. It is therefore difficult to conclude why IFT88 was not depleted from the pool. This may be due to lower guide activity in L. mexicana or rapid selection of non-deleterious mutations (line 769 - 774). We are therefore planning to further optimize our system by streamlining the editing efficiency and eliminating species-specifics effects (line 735 - 745). As the reviewer highlighted, this is beyond the scope of this study.

      However, the reviewer raises a fair point about the exact timing of isolating DNA from pools, which might influence when exactly parasites with a deleterious mutation are depleted from the pool. This may differ between guides and may even be gene specific. We have added this point to our discussion (776 - 780).

      3) “The authors highlight that this base editing approach will leave potentially functional regions of the NT of proteins, which is true and may mean genes are missed. However, this may also provide extra information about the protein's function/domain structure if STOP codons in certain positions showed an effect on function whereas those in others don't.”

      We thank reviewer #2 for pointing out that functional parts of truncated proteins following base editing may actually allow to draw additional conclusions. We have included this in the manuscript (681 - 683).

    1. Author Response

      Reviewer #1 (Public Review):

      This umbrella review aims to synthesize the results of systematic reviews of the impact of the COVID-19 pandemic on various dimensions of cancer care from prevention to treatment. This is a challenging endeavor given the diversity of outcomes that can be assessed in cancer care.

      Search and review methods are good and are in line with recommendations for umbrella reviews. Perhaps one weakness of the search strategy was that only one database (Pubmed) was searched. The search strategy appears adequate, though perhaps some more search terms related to reviews and cancer could have been included. It is therefore possible that some reviews may have been missed by the search strategy.

      It is challenging to perform a good umbrella review that yields novel insights, as it is difficult to combine results from different reviews which themselves combine results from different studies with different methodologies. However, I think perhaps one of the main weaknesses of this study is that it is not clear to me what is the core objective of the umbrella review, and how analyses relate to that core objective. In other words, I do not understand based on the introduction what new information the authors are hoping to learn from their umbrella review that could not be learned from reading the individual systematic reviews, beyond a vague objective of "synthesizing" the literature. Because of this, it is not very clear to me how the data extracted and the analysis fits into the larger objectives, and what the new knowledge generated by this review is. Based on the reported results, it would appear that one of the main goals is to assess the quality of systematic reviews and of the underlying studies in the reviews, but it is hard to tell. I think there are potentially important insights this review could tell us, but the message and implications of current evidence remain for me a little confused in the current manuscript.

      We thank the reviewer for the encouraging remarks on our work, and for the useful feedback. We have now addressed all concerns as outline below.

      Reviewer #2 (Public Review):

      This umbrella review summarizes the results of systematic reviews about the impact of the COVID-19 pandemic on cancer care. PRISMA checklist is used for reporting. The literature search was performed in PubMed and systematic reviews published until November 29th, 2022 were included. The quality of included systematic reviews was appraised using the AMSTAR-2 tool and data were reported descriptively due to the high heterogeneity of 45 included studies. Based on the results of this paper, regardless of the low quality of included evidence, COVID-19 affected cancer care in many ways including delay and postponement of cancer screening, diagnosis, and treatment. Also, patients with cancer had been affected psychologically, socially, and financially during the COVID-19 pandemic.

      The main limitation of the current study is that the authors have searched only one database, which might have missed some relevant systematic reviews. Also, most of the included reviews in this paper had low and medium methodological quality.

      We thank the reviewer for this excellent remark. Guideline on umbrella reviews suggest PubMed, reference screening and an additional bibliographic database for an optimal database combination for searching systematic reviews (Goossen K et al. 2020). To follow the guidelines, and considering the specialized focused on COVID-19, in addition to Pubmed and reference screening, we also performed a search in the WHO COVID-19 Database. Furthermore, we revised the search strategy in Pubmed to include mesh terms. The search was performed by a specialized librarian with experiences in systematic review searches. Overall, we retrieve 485 new references, and found 6 new studies that met out inclusion criteria to be included in final analysis. We have now revised the manuscript to reflect the above changes, and also highlighted this as a strength of our work. In addition, we added the new detailed search strategy in the supplemental material.

    1. Author Response

      Reviewer #2 (Public Review):

      In this manuscript, the authors use an embedding of human olfactory perceptual data within a graph neural network (which they term principal odor map, or POM). This embedding is a better predictor of a diverse set of olfactory neural and behavior data than methods that use chemical features as a starting point to create embeddings. The embedding is also seen to be better for comparison of pairwise similarities (distances of various sorts) - the claim is that proximity of pairs of odors in the POM is predictive of their similarity in neural data from olfactory receptor neurons.

      A major strength of the paper is the conceptualization of the problem. The authors have previously described a graph neural net (GNN) to predict verbal odor descriptors from molecular features (here, a 2019 preprint is cited, but a newer related one in 2022 describing the POM is not cited). They now use the embedding created by that GNN to predict similarities in large and diverse datasets in olfactory neuroscience (which the authors have curated from published work). They show that predictions from POM are better than just generic chemical features. The authors also present an interesting hypothesis that the underlying latent structure discovered by the GNN relates to metabolic pathway proximity, which they claim accounts for the success in the prediction of a wide range of data (insect sensory neuron responses to human behavior). In addition to the creativity of the project, the technical aspects, are sound and thorough.

      There are some questions about the ideas, and the size of the effects observed.

      1) The authors frame the manuscript by invoking an analogy to other senses, and how naturalstatistics affect what's represented (and how similarity is defined). However, in vision or audition, the part of the world that different animals "look at" can be very different (different wavelengths, different textures and spatial frequencies, etc). It is still unresolved why any given animal has the particular range of reception it has. Each animal is presumably adapted for its ecological niche, which can have different salient sensory features. In vision, different animals pick different sound bandwidths or EM spectra. Therefore, it is puzzling to think that all animals will somehow treat chemicals the same way.

      Our assumption (an assumption of the broader interpretation, not of the analyses themselves) that all terrestrial animals have a correlated odor environment is certainly only true for some values of “correlated”. One could imagine, for example, that some animals are able to exploit food energy sources that humans cannot (for example, plants with high cellulose content), and that they might therefore be adapted to smell metabolic signatures of such plants, whereas humans would not be so adapted. This seems quite reasonable and there are probably many such examples. In future work they might be used to test the theory directly: representations might be more likely to differ across species on tasks when the relevant ecological niches are non-overlapping. We have updated the discussion to propose such future tests. However, it is also apparent that the odor environment overall is nonetheless highly correlated across species. Recent work (Mayhew et al, PNAS) showed that nearly all molecules that pass simple mass transport requirements (that should apply to all mammals, at the least) are likely to have an odor to humans, so it seems unlikely that the “olfactory blind spots” are intrinsically large.

      2) The performance index could be made clearer, and perhaps raw numbers shown beforeshowing the differences from the benchmark (Mordred molecular descriptor). For example, can we get a sense of how much variance in the data does it explain, what percent of the hold-out tests does it fit well, etc.?

      The performance index in Figure 1 is required to compare across different types of tasks, which are in turn dictated by the nature of the data (e.g. continuous vs categorical). Regression tasks yields an R2 value and categorical tasks yield an AUROC. We normalized and placed these on a single scale in order to show all of the tasks clearly together. We have added a table to the shared code (from link in Methods section, go to predictive_performance/data/dataset_performance_index_raw.csv) that shows the original (non-normalized) values, for both the POM and the benchmark(s) across multiple seeds and various metrics with the model hyper-parameters that generate the best performance.

      3) The "fitting" and predictions are in line with how ML is used for classification and regression inlots of applications. The end result is a better fit (prediction), but it's not actually clear whether there are any fundamental regularities or orders identified. The metabolic angle is very intriguing, but it looks like Mordred descriptor does a very good job as well (extended figure 5 [now Figure 2-figure supplement 5]). Is it possible to show the relation between metabolic distance and Mordred distance in Figure 2c? In fact, even there, cFP distance looks very well correlated with metabolic distance (we are talking about r= 0.9 vs r = 0.8). This could simply be due to a slightly nonlinear mapping between chemical similarity and perceptual similarity (which was used to get POM distance).

      We show additional “showdown” comparisons between metabolic distance, POM distance, and alternative distance metrics in the new Figure 2-figure supplement 3 and Figure 2-figure supplement 4. Indeed, the Mordred descriptors perform well; after all, metabolic reactants and products must be at least somewhat structurally related. But POM (derived only from human perceptual data) outperforms it significantly. Visual inspection of Figure 2c also reveals that the dispersion of structural distances (at a given metabolic distance) is just much higher than the dispersion of POM distances. This won’t change if one uses a non-linear curve fit, as it is a property of the data itself.

      It’s also worth noting while r=0.8 and r=0.9 might seem close, in terms of variance unexplained (1 - r2) they are approximately two-fold different. Reducing the unexplained variance by half seems like a meaningful difference. Alternatively, if one simulates scatter plots with correlation r=0.8 vs r=0.9, it is apparent that the latter is simply a much tighter relationship.

      4) How frequent are such examples shown in Fig 2d? Pentenal and pentenol are actually verysimilar in many ways, and it may be that Tanimoto distance is not a great descriptor of chemical similarity. cFP edit distance is quite small, just like metabolic distance. The thiol example on the right is much better. Also, even in Fig 2C POM vs metabolic distance, the lowest metabolic distances have large variations in the POM values - so there too, metabolic reactions that create very different molecules in 1 step can vary widely in POM distance as well.

      We agree that Tanimoto distance is not perfect. We were unable to find a measure of structural distance that agreed with human intuitions about “structural distance” in all cases; indeed that intuition is often generated by an understanding of odor/flavor characteristics of function in metabolic networks, which would beg the question! To answer the question about the frequency of examples like the ones shown in Figure 2d, we created a new density map (Figure 2-figure supplement 4) showing the number of one-step metabolite pairs for a given range of POM vs cFP edit/Tanimoto distance. We found >25 pairs of metabolites in the same “small POM distance” and “large structural distance” quadrant from which we found the original examples shown in Figure 2d..

      5) A major worry is that Mordred descriptors are doing fine, and POM offers only a smallimprovement (but statistically significant of course). Another way to ask this question is this: if you plot pairwise correlation/distance of pairs of odors from POM against that for Mordred, how correlated does this look? My suspicion is that it will be highly correlated.

      It will look highly correlated (as shown in the new Figure 2-figure supplement 3). The reason is that metabolic reactions cannot make arbitrary transformations to molecules (the reactants must have some structural relationship to the products) or similarly that olfactory receptors (in any species) cannot have arbitrary tuning – at the end of the day receptors mostly bind to similar-looking classes of molecules. As stated above, we believe that the improvement here is not just statistically significant but meaningful – a 2-fold drop in unexplained variance is large – and that it is important to identify principles by which the nervous system can be tuned, above and beyond the physical constraints imposed by basic rules of chemistry.

      Also, the metabolic distances that we constructed from available data are themselves noisy, since not all metabolic pathways and the compounds that compose them are known, which places an upper bound on the correlation that we could have obtained. Despite that, we still found a correlation of r>0.9.

      6) The co-occurrence in mixtures and close POM distance may arise from the way theembedding was done - with perceptual descriptors used as a key variable. Humans may just classify molecules that occur in a mixture as similar just from experiencing them together. Can the authors show that these same molecules in Fig 4d,e have very similar representations in neural data from insects or mice?

      We have added a new Figure 4-figure supplement 1 to show this. One constraint is that the neural datasets must contain molecules that are also in the natural substance datasets used in Figure 4. In all cases where the data is sufficient to be powered to test the hypothesis (i.e. more than five co-occuring pairs of molecules in essential oil), we observe an effect in the predicted direction.

    2. Reviewer #2 (Public Review):

      In this manuscript, the authors use an embedding of human olfactory perceptual data within a graph neural network (which they term principal odor map, or POM). This embedding is a better predictor of a diverse set of olfactory neural and behavior data than methods that use chemical features as a starting point to create embeddings. The embedding is also seen to be better for comparison of pairwise similarities (distances of various sorts) - the claim is that proximity of pairs of odors in the POM is predictive of their similarity in neural data from olfactory receptor neurons.

      A major strength of the paper is the conceptualization of the problem. The authors have previously described a graph neural net (GNN) to predict verbal odor descriptors from molecular features (here, a 2019 preprint is cited, but a newer related one in 2022 describing the POM is not cited). They now use the embedding created by that GNN to predict similarities in large and diverse datasets in olfactory neuroscience (which the authors have curated from published work). They show that predictions from POM are better than just generic chemical features. The authors also present an interesting hypothesis that the underlying latent structure discovered by the GNN relates to metabolic pathway proximity, which they claim accounts for the success in the prediction of a wide range of data (insect sensory neuron responses to human behavior). In addition to the creativity of the project, the technical aspects, are sound and thorough.

      There are some questions about the ideas, and the size of the effects observed.

      1. The authors frame the manuscript by invoking an analogy to other senses, and how natural statistics affect what's represented (and how similarity is defined). However, in vision or audition, the part of the world that different animals "look at" can be very different (different wavelengths, different textures and spatial frequencies, etc). It is still unresolved why any given animal has the particular range of reception it has. Each animal is presumably adapted for its ecological niche, which can have different salient sensory features. In vision, different animals pick different sound bandwidths or EM spectra. Therefore, it is puzzling to think that all animals will somehow treat chemicals the same way.

      2. The performance index could be made clearer, and perhaps raw numbers shown before showing the differences from the benchmark (Mordred molecular descriptor). For example, can we get a sense of how much variance in the data does it explain, what percent of the hold-out tests does it fit well, etc.?

      3. The "fitting" and predictions are in line with how ML is used for classification and regression in lots of applications. The end result is a better fit (prediction), but it's not actually clear whether there are any fundamental regularities or orders identified. The metabolic angle is very intriguing, but it looks like Mordred descriptor does a very good job as well (extended figure 5). Is it possible to show the relation between metabolic distance and Mordred distance in Figure 2c? In fact, even there, cFP distance looks very well correlated with metabolic distance (we are talking about r= 0.9 vs r = 0.8). This could simply be due to a slightly nonlinear mapping between chemical similarity and perceptual similarity (which was used to get POM distance).

      4. How frequent are such examples shown in Fig 2d? Pentenal and pentenol are actually very similar in many ways, and it may be that Tanimoto distance is not a great descriptor of chemical similarity. cFFP edit distance is quite small, just like metabolic distance. The thiol example on the right is much better. Also, even in Fig 2C POM vs metabolic distance, the lowest metabolic distances have large variations in the POM values - so there too, metabolic reactions that create very different molecules in 1 step can vary widely in POM distance as well.

      5. A major worry is that Mordred descriptors are doing fine, and POM offers only a small improvement (but statistically significant of course). Another way to ask this question is this: if you plot pairwise correlation/distance of pairs of odors from POM against that for Mordred, how correlated does this look? My suspicion is that it will be highly correlated.

      6. The co-occurrence in mixtures and close POM distance may arise from the way the embedding was done - with perceptual descriptors used as a key variable. Humans may just classify molecules that occur in a mixture as similar just from experiencing them together. Can the authors show that these same molecules in Fig 4d,e have very similar representations in neural data from insects or mice?

    1. Author Response

      Reviewer #1 (Public Review):

      Collins et al use mesoscopic two-photon imaging to simultaneously record activity from basal forebrain cholinergic or noradrenergic axons in several distant regions of the dorsal cortex during spontaneous behavior in head-fixed awake mice. They find that activity in axons from both neuromodulatory systems is closely correlated with measures of behavioral state, such as whisking, locomotion and face movements. While axons were globally correlated with these behavioral state-related metrics across the dorsal cortex, they also find evidence of behavioral state independent heterogenous signals.

      The use of simultaneous multiarea optical recordings across a large extent of dorsal cortex with single axon resolution for studying the coherence of neuromodulatory afferents across cortical areas is novel and addresses important questions regarding neuromodulation in the neocortex. The manuscript is clearly written, the data is well presented and, for the most part, carefully analyzed. Parts of the manuscript confirm previous results on the influence of behavioral state on norepinephrine and acetylcholine cortical afferents. However, the observation that these modulations are globally broadcasted to the dorsal cortex while behavioral state independent heterogenous signals are also present in these axons is novel and important for the field.

      While the evidence for a behavioral state driven global modulation of activity in both neuromodulatory systems is quite clear, I have concerns that the apparent heterogeneity in axonal responses might be driven by movement-induced artifacts. Moreover, even in the case that the heterogeneity in calcium activity across axons is confirmed, it might not be driven by differences in spiking activity across neuromodulatory axons as concluded, but by other mechanisms that are not explicitly discussed or considered.

      1) Motion artifacts are always a concern when imaging from small structures in behaving animals. This issue is addressed in the manuscript in Fig 2A-C by comparing axonal responses to "autofluorescent blebs that did not have calcium-dependent activity" (line 1011). Still, as calcium-dependent activity and motion artifacts can both be locked to behavioral variables the "bleb" selection criterion seems biased and flawed with a circular logic. "Blebs" presenting motion-induced changes in fluorescence that may pass as neural activity will be wrongly excluded when from the "bleb" control group using this criterion. This will result in an underestimation of the extent of the contamination of the GCaMP signals by movement-induced artifacts. This potential confound might generate apparent heterogeneity across axons and regions as some axons and some cortical areas might be more prone to movements artifacts than others.

      Thank you for the suggestion. We agree that motion artifacts are a reasonable concern. We rigorously addressed this concern by introducing non-calcium-dependent mCherry into cholinergic cortical axons and demonstrating that motion cannot explain our results (see Fig. 2F, Fig. 4H,L,P, Fig. 4 - figure supplement 1G, Video 3, and response above). These axons were chosen for analysis based solely on their ability to be imaged, in a manner identical to that of GCaMP6s containing axons.

      We agree that the observed evidence of heterogeneity is not as clear as the evidence of a common signal. We now carefully present our evidence. Heterogeneity may arise from variations in activity between single axons that is not explained by a common signal such as behavioral state. Heterogeneity could also be signaled by variations in correlated activity between axons. We now address these two possibilities in our manuscript. Our new analysis reveals that the correlated activity between axons is as expected for axons that are variably correlated to a common signal, such as behavioral state. Although we do find some evidence of correlation outside this common signal, we are not able to discern if this is related to imaging axon segments that are part of the same axon, or if it truly represents an independent signal. This is now stated in the text. On the other hand, strong variations in axonal activity from trial to trial that appear to be separate from the common signal is also prevalent. We now point out this variation as a possible source of heterogeneity. Since we do not know the source or meaning of this heterogeneous activity, we discuss only the possibility that it may hold behaviorally relevant information in these modulatory systems.

      2) In the case that the heterogeneity is indeed due to differences in calcium activity, it might be not due to modularity in spiking activity within the LC or the BF as interpreted and discussed in the manuscript. As calcium signaling in axons not only relates to spiking activity but can also reflect presynaptic modulations, the observed heterogeneity might be due to local action of presynaptic modulators in a context of global identical broadcasted activity. The current dataset does not allow distinguishing which of the two different mechanisms underlies the observed signal heterogeneity.

      It is true that our data set is unable to determine whether presynaptic modulations contribute to any observed heterogeneity. We have adjusted our interpretation of heterogeneity throughout the manuscript and have specifically addressed this comment in the discussion by presenting the possibility that a global signal could be locally modulated.

      Reviewer #3 (Public Review):

      Acetylcholine and Norepinephrine are two of the most powerful neuromodulators in the CNS. Recently developments of new methods allow monitoring of the dynamic changes in the activity of these agents in the brain in vivo. Here the authors explore the relationship between the dynamic changes in behavioral states and those of ACh and NE in the cortex. Since neuromodulatory systems cover most of the cortical tissue, it is essential to be able to monitor the activity of these systems in many cortical areas simultaneously. This is a daunting task because the axons releasing NE and ACh are very thin. To my knowledge, this study is the first to use mesoscopic imaging over a wide range of the cortex at the single axon resolution in awake animals. They find that almost any observable change in behavioral state is accompanied by a transient change in the activity of cortical ACh and NE axonal segments. Whisking is significantly correlated with ACh and NE. The authors also explore the spatial pattern of activity of ACh and NE axons over the dorsal cortex and find that most of the dynamics is synchronous over a wide spatial scale. They look for deviation from this pattern (which I will discuss later). Lastly, the authors monitor the activity of cortical interneurons capable of releasing ACh.

      Comments:

      1) On a broad overview, I find the discussion of behavioral states, brain states, and neuromodulation states quite confusing. To begin with, I am not convinced by the statement that "brain states or behavioral states change on a moment-to-moment basis." I find that the division of brain activity into microstates (e.g., microarousal) is counterproductive. After all, at the extreme, going along this path, we might eventually have an extremely high dimensional space of all neuronal activity, and any change in any neuron would define a new brain state. Similarly, mice can walk without whisking, can whisk without walking, can walk and whisk, are all these different behavioral states? And if so, are they all associated with different brain states? And if so, are they all associated with different brain states? Most importantly, in the context of this manuscript, one would expect that different states (brain, behavior) would be associated with at least four potential states of the ACh x NE system (high ACh and High NE, High ACh and Low NE, etc.). However, the reported findings indicate that the two systems are highly synchronized (or at least correlated), and both transiently go on with any change from a passive state to an active state. Therefore, the manuscript describes a rather confined relationship of the neuromodulation systems with the rather rich potential of brain and behavioral states. Of course, this is only my viewpoint, and the authors are not obliged to accept it, but they should recognize that the viewpoint they take for granted is not shared by all and consider acknowledging it in the manuscript.

      We thank this reviewer for this thoughtful comment. While it is clear that animals do in fact exhibit distinct and clear brain and behavioral states (e.g. sleep, waking, grooming, still, walking, etc.), it is beyond the scope of the present manuscript to attempt to tackle this complex field - rather, we refer the reader to a recent review that we have published on this important topic (McCormick, Nestvogel, and He 2020). We agree that properly delineating brain and behavioral states is of great importance, as it could significantly impact experimental design and interpretation of results. Since all of the relevant substates that a mouse may exhibit have not yet been determined, we decided to use changes in whisking and walking behaviors to differentiate between distinct behavioral states owing to: 1) historical use of these measures in behavioral and neural states in head-fixed mice, 2) relative ease of measurement of these variables, 3) a clearly observable relationship with cholinergic and noradrenergic activity with these measures of behavior, and, arguably most importantly, 4) assumed relevance to the animal (Musall et al. 2019; Reimer et al. 2016; Salkoff et al. 2020; Stringer et al. 2019).

      Our manuscript seeks to simply relate the activity of cholinergic and noradrenergic axons across the dorsal surface of the cortex in comparison to these commonly used measures of spontaneous behavior in head-fixed mice to discern to what relative degree there are common, global signals in these two modulatory systems and how they relate to changes in the measured behaviors. Somewhat surprisingly, previous studies have found that neural activity throughout the dorsal cortex of mice is strongly related to movements of the face and body as well as behavioral arousal (Stringer et al. 2019; Musall et al. 2019; Salkoff et al. 2020). Here we determine to what degree these commonly used measures of “state” are already reflected in the GCaMP6s activity of cholinergic and noradrenergic axons (and local cortical interneurons).

      We agree with the interpretation that our results suggest a confined relationship between spontaneous cholinergic and noradrenergic activity in the cortex within the spontaneous behaviors that we observe. We, by no means, mean to suggest that this confined relationship is the only relationship cholinergic and noradrenergic systems exhibit to each other or to behavior. It seems very likely that in the wide variety of behavior exhibited by freely moving mice in their lifetime, there are times in which the activity of cholinergic and noradrenergic systems exhibit a radically different relationship to each other and to behavior. We simply cannot know this without experimental examination. We now mention this possibility in the discussion and give a few appropriate references.

      2) Most of the manuscript (bar one case) reports nearly identical dynamics of ACh and NE. Is that a principle? What makes these systems behave so similarly? Why have two systems that act nearly the same? Still, if there is a difference, it is the time scale of the ACh compared to the NE. Can the authors explain this difference or speculate what drives it?

      Perhaps one of the most striking findings in recent years from examination of mouse brain activity is the prominence and prevalence of a general signal in nearly all neural systems that relates to movement and arousal of the animal (Stringer et al. 2019; Salkoff et al. 2020). Here we report that this signal is also strongly present within the cholinergic and noradrenergic systems. Perhaps this is unsurprising, since everywhere one looks, one finds this global signal. However, we feel that understanding the presence and nature of this large signal is critical to deciphering behavior-related signals in these systems in the future. We discuss this point in the discussion. The one difference we did find is in the more transient nature of NE axonal activity versus both behavior and cholinergic axon activity. We now speculate on this difference in the discussion.

      3) Whisker activity explains most strongly the neuromodulators dynamics, but pupil dilation almost does not (in contrast to many previous reports including reports of the same authors). If I am not mistaken, this was nearly ignored in the presentation of the results and the discussion section. Could the author elaborate more on what is the reason for this discrepancy?

      We apologize for the misleading presentation of our results. In Fig. 3C and D it is clear that pupil diameter is highly coherent with both cholinergic and noradrenergic axon activity, as published previously. In the present study, this coherence peaks at 0.4 to 0.5 for both. In our previous study (Reimer et al. 2016), the cholinergic activity also peaked in coherence at low frequencies at around 0.4 to 0.5 (Reimer et al., Fig. 1H) while the noradrenergic activity coherence peaked at 0.6 to 0.7. The present study was not optimized for pupil diameter examination, since we kept the light levels as low as possible (resulting in low dynamic range of pupil dilations since they were nearly always enlarged to near maximum) in order to increase the S/N of cortical axon activity. We now mention these similarities and differences and caveats in the manuscript. An additional important point is that the kinetics of pupil diameter changes are slow in comparison to whisker movements, reducing the ability of pupil dilation to accurately track changes in axonal activity at frequencies greater than approximately 0.2 Hz (Fig. 2 - figure supplement 2). This is now mentioned in the text.

      4) I find the question of homogenous vs. heterogenous signaling of both the ACh and NE systems quite important. It is one thing if the two systems just broadcast "one bit" information to the whole brain or if there are neuromodulation signals that are confined in space and are uncorrelated with the global signal. However, the way the analysis of this question is presented in the manuscript is very difficult to follow, and eventually, the take-home message is unclear. The discussion section indicates that the results support that beyond a global synchronized signal, there is a significant amount of heterogeneous activity. I think this question could benefit from further analysis. I suggest trying to demonstrate more specific examples of axonal ROIs where their activity is decorrelated with the global signal, test how consistent this property is (for those ROIs), and find a behavioral parameter that it predicts.

      Also, in the discussion part, I am missing a discussion of the potential mechanism that allows this heterogeneity. On the one hand, an area may receive NE/ACh innervation from different BF/LC neurons, which are not completely synchronized. But those neurons also innervate other areas, so what is the expected eventual pattern? Also, do the results support neuromodulation control by local interneuron circuits targeting the axons (as is the case with dopaminergic axons in the Basal Ganglia)?

      Our results clearly demonstrate a robust global signal that is common across cholinergic and noradrenergic axons which is related to behavioral state. We have less strong, but still present, evidence for a heterogeneous signal in addition to this global signal. This evidence is based largely upon the large variation in activities in different axon segments during behavioral events that appear similar. This result suggests that the axon segments we monitored do not all act as if they are members of the same axon. We now discuss the strong evidence for the global signal present in our data, and leave open the possibility of a heterogeneous signal whose mechanisms and importance remains to be determined.

      5) The axonal signal seems to be very similar across the cortex. I am not sure this is technically possible, but given that NE axons are thin and non-myelinated and taking advantage of the mesoscopic scale, could the author find any clue for the propagation of the signal on the rostral to caudal axis?

      We were unable to detect propagation across the cortical sheet and believe this is beyond the scope of the present study.

      6) While the section about local VCIN is consistent with the story, it is somehow a sidetrack and ends the manuscript on the wrong note. I leave it to the authors to decide but recommend them to reconsider if and where to include it. Unfortunately, the figure attached was on a very poor resolution, and I could not look into the details, so I am afraid that I could not review this section properly.

      We believe this adds to the manuscript and therefore have decided to include this data.

    2. Reviewer #3 (Public Review):

      Acetylcholine and Norepinephrine are two of the most powerful neuromodulators in the CNS. Recently developments of new methods allow monitoring of the dynamic changes in the activity of these agents in the brain in vivo. Here the authors explore the relationship between the dynamic changes in behavioral states and those of ACh and NE in the cortex. Since neuromodulatory systems cover most of the cortical tissue, it is essential to be able to monitor the activity of these systems in many cortical areas simultaneously. This is a daunting task because the axons releasing NE and ACh are very thin. To my knowledge, this study is the first to use mesoscopic imaging over a wide range of the cortex at the single axon resolution in awake animals. They find that almost any observable change in behavioral state is accompanied by a transient change in the activity of cortical ACh and NE axonal segments. Whisking is significantly correlated with ACh and NE. The authors also explore the spatial pattern of activity of ACh and NE axons over the dorsal cortex and find that most of the dynamics is synchronous over a wide spatial scale. They look for deviation from this pattern (which I will discuss later). Lastly, the authors monitor the activity of cortical interneurons capable of releasing ACh.

      Comments:<br /> 1. On a broad overview, I find the discussion of behavioral states, brain states, and neuromodulation states quite confusing. To begin with, I am not convinced by the statement that "brain states or behavioral states change on a moment-to-moment basis." I find that the division of brain activity into microstates (e.g., microarousal) is counterproductive. After all, at the extreme, going along this path, we might eventually have an extremely high dimensional space of all neuronal activity, and any change in any neuron would define a new brain state. Similarly, mice can walk without whisking, can whisk without walking, can walk and whisk, are all these different behavioral states? And if so, are they all associated with different brain states? Most importantly, in the context of this manuscript, one would expect that different states (brain, behavior) would be associated with at least four potential states of the ACh x NE system (high ACh and High NE, High ACh and Low NE, etc.). However, the reported findings indicate that the two systems are highly synchronized (or at least correlated), and both transiently go on with any change from a passive state to an active state. Therefore, the manuscript describes a rather confined relationship of the neuromodulation systems with the rather rich potential of brain and behavioral states. Of course, this is only my viewpoint, and the authors are not obliged to accept it, but they should recognize that the viewpoint they take for granted is not shared by all and consider acknowledging it in the manuscript.<br /> 2. Most of the manuscript (bar one case) reports nearly identical dynamics of ACh and NE. Is that a principle? What makes these systems behave so similarly? Why have two systems that act nearly the same? Still, if there is a difference, it is the time scale of the ACh compared to the NE. Can the authors explain this difference or speculate what drives it?<br /> 3. Whisker activity explains most strongly the neuromodulators dynamics, but pupil dilation almost does not (in contrast to many previous reports including reports of the same authors). If I am not mistaken, this was nearly ignored in the presentation of the results and the discussion section. Could the author elaborate more on what is the reason for this discrepancy?<br /> 4. I find the question of homogenous vs. heterogenous signaling of both the ACh and NE systems quite important. It is one thing if the two systems just broadcast "one bit" information to the whole brain or if there are neuromodulation signals that are confined in space and are uncorrelated with the global signal. However, the way the analysis of this question is presented in the manuscript is very difficult to follow, and eventually, the take-home message is unclear. The discussion section indicates that the results support that beyond a global synchronized signal, there is a significant amount of heterogeneous activity. I think this question could benefit from further analysis. I suggest trying to demonstrate more specific examples of axonal ROIs where their activity is decorrelated with the global signal, test how consistent this property is (for those ROIs), and find a behavioral parameter that it predicts. Also, in the discussion part, I am missing a discussion of the potential mechanism that allows this heterogeneity. On the one hand, an area may receive NE/ACh innervation from different BF/LC neurons, which are not completely synchronized. But those neurons also innervate other areas, so what is the expected eventual pattern? Also, do the results support neuromodulation control by local interneuron circuits targeting the axons (as is the case with dopaminergic axons in the Basal Ganglia)?<br /> 5. The axonal signal seems to be very similar across the cortex. I am not sure this is technically possible, but given that NE axons are thin and non-myelinated and taking advantage of the mesoscopic scale, could the author find any clue for the propagation of the signal on the rostral to caudal axis?<br /> 6. While the section about local VCIN is consistent with the story, it is somehow a sidetrack and ends the manuscript on the wrong note. I leave it to the authors to decide but recommend them to reconsider if and where to include it. Unfortunately, the figure attached was on a very poor resolution, and I could not look into the details, so I am afraid that I could not review this section properly.

    1. Author Response

      Reviewer #1 (Public Review):

      In this study, the authors aim to identify the cell state dynamics and molecular mechanisms underlying melanocyte regeneration in zebrafish. By analyzing thousands of single-cell transcriptomes over regeneration in both wild-type and Kit mutant animals, they provide thorough and convincing evidence of (1) two paths to melanocyte regeneration and (2) that Kit signaling, via the RAS/MAPK pathway, is a key regulator of this process. Finally, the authors suggest that another proliferative subpopulation cells, expressing markers of a separate pigment cell type, constitute an additional population of progenitors with the ability to contribute to melanocytes. The data supporting this claim are not as convincing, and the authors failed to show that these cells did indeed differentiate into melanocytes. Despite the challenges of describing this third cell state, this study offers compelling new findings on the mechanisms of melanocyte regeneration and provides paths forward to understanding why some animals lack this capacity.

      The majority of the main conclusions are well supported by the data, but one claim, in particular, should be revisited by the authors.

      (1) Provided evidence that the aox5(hi)mitfa(lo) population of cells contributes to melanocyte regeneration is inconclusive and somewhat circumstantial. First, the transcriptional profiles of these cells are much more consistent with the xanthophore lineage. Indeed, xanthophores have been shown to express mitfa (in embryos in Parichy, et al. 2003 (PMID: 10862741), and in post-embryonic cells in Saunders, et al. 2019). Second, while the authors address this possibility in Supplemental figure 7 by showing that interstripe xanthophores fail to divide following melanocyte ablation, they fail to account for the stripe-resident xanthophores/xanthoblasts. The presence and dynamics of aox5+ stripe-resident xanthophores/xanthoblasts are detailed in McMenamin, et al., 2014 (PMID: 25170046) and Eom, et al., 2015 (PMID: 26701906). Without direct evidence that the symmetrically-dividing, aox5+ cells measured in this study do indeed differentiate into melanocytes, it is more likely that these cells are a dividing population of xanthophores/xanthoblasts. The authors should revise their claims accordingly.

      We agree with the editor and reviewers that the identities of the mitfa+aox5hi cells and the interplay between these cells and the mitfa+aox5lo cells is a fascinating, and originally unexpected, aspect of this manuscript. The issue, as we see it, is whether mitfa+aox5hi cells that arise via cell division during regeneration are multipotent pigment cell progenitors or ‘cryptic’ xanthophores. The experiments we have performed to address this ambiguity have not worked for technical reasons, so we have tempered text in the relevant Results and Discussion sections to leave both options open. We have backed off from calling these cells progenitors but have included additional data showing that they (i.e. the mitfa+aox5hi subpopulation of cells that we believe are daughters of mitfa+aox5hi cycling cells) express multiple markers associated with multipotent pigment cell progenitors that have been characterized in developing zebrafish. Our expanded Discussion is as follows:

      “Heterogeneity may also be evident by the additional mitfa+aox5hi G2/M adj subpopulation that likely arises via cell divisions during regeneration. There are reasons to think that this could be a progenitor subpopulation. Firstly, these cells arose in response to specific ablation of melanocytes. Secondly, this subpopulation expresses markers that are associated with multipotent pigment progenitors cells found during development (Budi, et al., 2011; Saunders, et al., 2019). Thirdly, although this subpopulation expresses aox5 and some other markers associated with xanthophores, we showed that differentiated xanthophores are not ablated by the melanocyte-ablating drug neocuproine and this mitfa+aox5hi subpopulation does not make new pigmented xanthophores following neocuproine treatment. However, current observations cannot definitively determine the potency and fates adopted by these cells. One possibility is that these cells are indeed progenitors that arise through cell divisions, are in an as yet undefined way lineally related to MP-0 and MP-1 subpopulations, and ultimately give rise to new melanocytes during additional rounds of regeneration. Given their expression of markers associated with multipotent pigment cell progenitors, these cells could be multipotent but fated toward the melanocyte lineage following melanocyte-specific ablation. However, we cannot exclude the possibility that these cells are another cell type. For example, there is a type of partially differentiated xanthophores that populate adult melanocyte stripes (McMenamin, et al., 2014). At least some of these cells arise from embryonic xanthophores that transitioned through a cryptic and proliferative state (McMenamin, et al., 2014). That the descendants remain partially differentiated could indicate that they are in more of a xanthoblast state and maintain proliferative capacity (Eom, et al., 2015). It is possible that some or all of the cells in question are melanocyte stripe-resident, partially-differentiated xanthophores that arise: a) from cell divisions that are triggered by loss of interactions with melanocytes or, b) simply to fill space that is vacated due to melanocyte death. Such causes for partially-differentiated xanthophore divisions have not been documented, but nonetheless this possibility must be considered given the mitfa and aox5 expression and proliferative potential of these cells. Transcriptional profiles of ‘cryptic’ xanthophores are not available to help clarify the nature of these cells. Lastly, the relationship between adult progenitor populations – MP-0, MP-1 and, potentially, mitfa+aox5hi G2/M adj – and other progenitors present at earlier developmental stages is unclear and could be defined through additional long-term lineage tracing studies. In particular, previous examinations of pigment cell progenitors in developing zebrafish have identified dorsal root ganglion-associated pigment cell progenitors in larvae that contribute to adult pigmentation patterns (Singh, et al., 2016; Dooley, et al., 2013; Budi, et al., 2011). It is possible that these cells give rise to the adult progenitors we have identified. The further alignment of cell types that have been observed in vivo and cell subpopulations defined through expression profiling is a necessary route for understanding the complex relationship between stem and progenitor cells in development, homeostasis, and regeneration.”

      (1) At line 140, it is noted that Xanthophores are pteridine-producing, but they also get their yellow color from carotenoids (especially in adults). This should be noted as well, especially since the authors display the xanthophore marker, scarb1, which plays a key role in xanthophore carotenoid coloration.

      [Mapping expression levels onto UMAP space for scarb1 and perhaps other markers of xan, irid, or proliferation would be helpful as a supplement to the dot plot in Fig 1 and could help to clarify the transcriptomic signature of mitfa+ aox5-hi cells and plausibility of the model that they are an McSC population. -Parichy]

      We thank the reviewer for the suggestion, and we have changed the text to include the carotenoid coloration facts of xanthophores as follows:

      “aox5 is expressed in differentiated xanthophores, a pteridine- and carotenoid-producing pigment cell type of zebrafish, and in some undifferentiated pigment progenitor cells”

      Additionally, we have also added a new Figure Supplement to Figure 1 (Figure 1 – figure supplement 3) with feature plots demonstrating the expression of xanthophore markers scarb1 and bco2b, iridophore markers lypc and cdh11, and proliferation markers pcna and mki67. As noted above, there is some heterogeneity within the large grouping of mitfa+aox5hi cells. Whereas some markers associated with xanthophores are broadly expressed in this grouping (e.g. scarb1), others have more restricted expression (e.g. bco2b). The heterogeneity could reflect multiple differentiation states of xanthophores, multiple types of differentiated xanthophores, xanthophore progenitors and/or less fate-restricted pigment cell progenitors that cluster in this grouping.

      (2) The authors should provide the list of genes that comprise their cluster signatures (line 252) as part of the supplementary tables.

      We have now included a table of genes in the cluster signatures. The Supplementary Table is called “Supplementary File 2.”

      (3) The authors should more clearly describe how they performed lineage tracing (line 339). Additionally, for the corresponding figure 4E, the authors should list the number of cells traced. The source data only contains calculated percentages rather than counts for each type of differentiation. My understanding is that the number listed in the figure legend is the number of fish (i.e. n = 4), but this should be clarified as well.

      [A supplementary figure of labeled cells is important here with enough context to show that cells can be re-identified unambiguously. Additionally note that "lineage tracing" will typically be assumed to mean single-cell labeling and tracking, so if that is not the case for these experiments it would be preferable to use an alternative descriptor. -Parichy]

      We have included additional detail in our revised manuscript. In Figure 4E we now include the number of cells imaged and have included a breakdown of the raw numbers in the Source Data. We have also included Supplementary Animations as examples of the single-cell tracing that we perform through serial imaging.

      Additionally, the point about using ‘lineage tracing’ is well taken. We have replaced this with ‘serial imaging’ through the text.

      (4) Line 321, the authors list the mean regeneration percentages for the kita and kitlga(lf) mutants, but these differences are not significantly different according to Figure 4B. By listing the means (which should be noted), the authors seem to be highlighting the differences but then do not comment on them. The description and integration of this result into the main text should be clarified.

      We have changed the wording in the text to clarify that the mean percentage is being listed. We have also reworded the text to de-emphasize the mean percentage difference between kita(lf) and kitlga(lf) mutants, instead highlighting that their defects are similar. In the figure legend we have clarified that the mean percentage regeneration is being shown.

      (5) In Figure 6E, the RNA-velocity result is not particularly consistent with the authors' claims. Visually, the arrows seem fairly randomly directed. The data in 6B, showing gene expression associated with the S phase and G2/M phase much more clearly convey the directionality of the loop (S phase, followed by G2/M). I suggest that the authors weaken their claim about the RNA-velocity result or remove it altogether and focus on the cell cycle-related gene expression signatures.

      We thank the reviewer for their careful eye here. We have decided to remove the RNA-velocity result previously displayed in Figure 6E. As the reviewer points out the results are more clearly demonstrated by Figure 6B.

    1. Author Responses

      Reviewer #1 (Public Review):

      This work aimed at investigating how a BMI decoding performance is impacted by changing the conditions under which a motor task is performed. They recorded motor cortical activity using multielectrode arrays in two monkeys executing a finger flexion and extension task in four conditions: normal (no load, neutral wrist position), loaded (manipulandum attached to springs or rubber bands to resist flexion), wrist (no load, flexed wrist position) or both (loaded and flexed wrist). They found, as expected, that BMI decoders trained and tested on data sets collected during the same conditions performed better at predicting kinematics and muscle activity than others trained and tested across conditions. They also report that the performance of monkeys a BMI task involving the online control of a virtual hand was almost unaffected by changing either the actual manipulandum conditions as above or switching between decoders trained from data collected under different conditions. As for the neuronal activity, they found a mix of changes across task contexts. Interestingly, a principal component analysis revealed that activity in each context falls within well-aligned manifolds, and that the context-dependent variance in neuronal activity strongly correlated to the amplitude of muscle activity.

      Strengths

      The current study expands on previous findings about BMI decoders generalizability and contributes scientifically in at least three important ways.

      First, their results are obtained from monkeys performing a fine finger control task with up to two degrees of freedom. This provides a powerful setting to investigate fine motor control of the hand in primates. The authors use the accuracy of BMI decoders between data sets as a measure of stationarity in the neuronsto-fingers mapping, which provides a reliable assessment. They show that changes in wrist angle or finger load affect the relationship between cortical neurons and otherwise identical movements. Interestingly, this result holds up for both kinematics and muscle activity predictions, albeit being stronger for the latter.

      Second, their results confirming that neuronal activity recorded during different task conditions lies effectively within a common manifold is interesting. It supports prior observations, but in the specific context of finger movements.

      Third, the dPCA results provide interesting and perhaps unexpected information about the fact that amplitude of muscle activity (or force) is clearly present in the motor cortical activity. This is possibly one of the most interesting findings because extracting a component from neural activity that can related robustly to muscle activity across context would provide great benefits to the development of BMIs for functional electrical stimulation.

      Overall, the analyses are well designed and the interpretation of the results is sound.

      Weaknesses

      I found the discussion about the possible reasons why offline decoders are more sensitive to context than online decoders very interesting. Nonetheless, as the authors recognize, the possibility that the BMI itself causes a change in context, "in the plant", limits their interpretation. It could mean for the monkeys to switch from one suboptimal decoder to another, causing a ceiling effect occluding generalization errors.

      Overall, several new and original results were obtained through these experiments and analyses. Nonetheless, I found it difficult to extract a clear unique and strong take-home message. The study comes short of proposing a new way to improve BMIs generalizability or precisely identifying factors that influence decoders generalizability.

      We thank the reviewer for the positive comments. Relating these results to BMI design and interpreting the adaptation to contexts during online trials comprised a bulk of the essential revisions from the eLife editorial staff. More details can be found in common response #2 and essential revisions #1-3. To summarize, we added an analysis of neural activity during online trials to provide insight into how the monkeys were adapting. We have expanded the discussion of online adaptation, as detailed in essential revision #2. We also expanded discussion of how both the online and offline results might affect BMI design, as detailed in essential revision #3.

      Reviewer #2 (Public Review):

      The authors motivate this study by the medical need to develop brain-machine interfaces (BMIs) to restore lost arm and hand function, for example through functional electrical stimulation. More specifically, they are interested in developing BMI decoding algorithms that work across a variety of "contexts" that a BMI user would encounter out in the real world, for example having their hand in different postures and manipulating a variety of objects. They note that in different contexts, the motor cortex neural activity patterns that produce the desired muscle outputs may change (including neurons' specific relationship to different muscles' activations), which could render a static decoder trained in a different context inaccurate.

      To test whether this potential challenge is indeed the case, this study tested BMI control of virtual (onscreen) fingers by two rhesus macaques trained to perform 1 or 2 degree-of-freedom non-grasping tasks either by moving their fingers, or just controlling the virtual finger kinematics with neural activity. The key experimental manipulations were context shifts in the form of springs on the fingers or flexion of the wrist (or both). BMI performance was then evaluated when these context changes were present, which builds on this group's previous demonstration of accurate finger BMI without any context shifts.

      The study convincingly shows the aforementioned context shifts do cause large changes in measured firing rates. When neural decoding accuracy (for both muscle and position/velocity) is evaluated across these context changes, reconstruction accuracy is substantially impaired. The headline finding, however, is that that despite this, BMI performance is, on aggregate, not substantially reduced. Although: it is noteworthy that in a second experiment paradigm where the decoder was trained on the spring or wrist-manipulated context and tested in a normal context, there were quite large performance reductions in several datasets as quantified by multiple performance measures; this asymmetry in the results is not really explored much further. The changes in neural activity due to context shifts appear to be relatively modest in magnitude and can be fit well as simple linear shifts (in the neural state space), and the authors posit that this would make it feasible (in future work) to find context-invariant neural readouts that would result in more robust muscle activity decoders.

      An additional novel contribution of this study is showing that these motor cortical signals support quite accurately decode muscle activations during non-prehensile finger movements (and also that the EMG decoding was more negatively affected by context shifts than kinematics decoding); previous work decoded finger kinematics but not these kinetics. Note that this was demonstrated with just one of the two monkeys (the second did not have muscle recordings).

      This is a rigorous study, its main results are well-supported, and it does not make major claims beyond what the data support.

      One of its limitations is that while the eventual motivating goal is to show that decoders are robust across a variety of tasks of daily living, only two specific types of context shifts are tested here, and they are relatively simple and potentially do not result in as strong a neural change as could be encountered in realworld context shifts. This is by no means a major flaw (simplifying experimental preparations are a standard and prudent way to make progress). But the study could point this out a bit more prominently that their results do not preclude that more challenging context shifts will be encountered by BMI users, and this study in its current form does not indicate how strong a perturbation the tested context shifts are relative to the full possible range of hand movement context shifts that would be encountered during human daily living activities.

      A second limitation is that while the discrepancy between large offline decoding performance reduction and small online performance reduction are attributed to rapid sensorimotor adaptation, this process is not directly examined in any detail.

      Third, the assessment of how neural dynamics change in a way that preserves the overall shape of the dynamics is rather qualitative rather than quantitative, and that this implementation of a more contextagnostic finger BMI is left for future work.

      We thank the reviewer for the positive comments. We agree that the paper could discuss how this work impacts a wider range of movements and we now include more discussion to that point as detailed in the responses to feedback below. We also acknowledge that the paper did not directly examine online adaptation and we have now included an analysis aimed at answering how the monkeys adapted to the context changes during online tasks.

      Reviewer #3 (Public Review):

      In this manuscript the authors ask whether finger movements in non-human primates can be predicted from neural activity recorded from the primary motor cortex. This question is driven by an ultimate goal of using neural decoding to create brain-computer interfaces that can restore upper limb function using prosthetics or functional electrical stimulation systems. More specifically, since functional use of the hand (real or prosthetic) will ultimately require generating very different grasp forces for different objects, these experiments use a constant set of finger kinematics, but introduce different force requirements for the finger muscles using several different techniques. Under these different conditions (contexts), the study examines how population neural activity changed and uses decoder analyses to look at how these different contexts affect offline predictions of muscle forces and finger kinematics, as well as the animals' ability to use different decoders to control 1 or 2-DOF online. In general, the study found that when linear models were trained on one context from offline data, they did not generalize well to the other context. However, when performance was tested online (monkeys controlling a virtual hand in real time using neural activity related to movement of their own hands) with a ReFIT Kalman filter, the animals were able to complete the task effectively, even with a decoder trained without the springs or wrist perturbation. The authors show data to support the idea that neural activity was constrained to the same manifold in the different contexts, which enabled the animals to rapidly change their behavior to achieve the task goals, compared to the more complex requirement of having to learn entirely new patterns of neural activity. This work takes studies that have been conducted for upper-limb movements and extends them to include hand grasp, which is important for creating decoders for brain-computer interfaces. Finally, the authors show using dPCA can extract features during changes in context that may be related to the activity of specific muscles that would allow for improved decoders.

      Strengths

      The issue of hand control, and how it compares to arm control, is an important question to tackle in sensorimotor control and in the development of brain-computer interfaces. Interestingly, the experiments use two very different ways of changing the muscle force requirements for achieving the same finger movements; springs attached to a manipulandum and changes in wrist posture. Using both paradigms the decoder analysis clearly shows that linear models trained without any manipulation do not predict muscle forces or finger kinematics well, clearly illustrating the limitations of common linear decoders to generalize to scenarios that might encompass real grasping activities that require forceful interactions. Using a welldescribed real-time decoder (ReFIT Kalman Filter), the authors show that this performance decrease observed offline is easily overcome in online testing. The metrics used to make these claims are welldescribed, and the likely explanations for these findings are described well. A particular strength of this manuscript is that, at least for these relatively simple movements and contexts, a component of neural activity (identified using dPCA) is identified that is significantly modulated by the task context in a way that sensibly represents the changes in muscle activity that would be required to complete the task in the new contexts. We thank the reviewer for the positive comments.

      Weaknesses

      The differences between exemplar data sets and comprehensively tested contexts was difficult to follow. There are many references to how many datasets or trials were used for a particular experiment, but overall, this is fragmented across the manuscript. As a result, it is difficult to assess how generalizable the results of the manuscript were across time or animal, or whether day-to-day variations, or the different data collection schedules had an effect.

      Thank you for the comment, we have added in the number of sessions in results in multiple places throughout the paper. For example, starting line 274 in the results:

      "During these 10 sessions the context changes were tested 15 times: four times for the wrist context, seven times for the spring context, and four times for the combined wrist and spring context."

      The introduction allocates a lot of space to discussing the concepts of generating (computing) movements as opposed to representing movements and relates this to ideas of neural dynamics. The distinction between these as described in the introduction is not very clear, nor is it clear what specific hypothesis this leads to for these experiments. Further, this line of thinking is not returned to in the discussion, so the contribution of these experiments to ideas raised in the introduction are unclear.

      Thank you for the comment, we have written a new paragraph relating these results to the concept of generating movement. Starting line 452 of the discussion:

      "During the offline tasks, many channels changed neural activity with context, with 20.9% to 61.7% of tuned SBP channels modulating activity with context (Table I). The magnitude of these shifts were relatively small, especially when compared to the large changes in required muscle activation (Figure 2D-E), with weak trends to require greater activation for resisted flexion and lesser for assisted extension (Figure 7B-C). Additionally, the neural manifolds underlying movements in each context were well-aligned (Figure 7D). Using dPCA we found that while a large proportion of neural variance was explained by dPCA components that did not change with context, a significant proportion of the neural variance is associated with components that are context-dependent (Figure 8B). Visually, the context components are shifting the trajectories without changing the overall shape and the shift in neural activity is strongly correlated with muscle activations in new contexts (Figure 8C). This agrees with other studies which found lower variance activity may be related to the actual motor commands (Gallego et al., 2018; Russo et al., 2018; Saxena et al., 2022)."

      The complexity of the control that was possible in this task (1 or 2 DOF finger flexion/extension) was low. Further, the manipulations that were used to control context were simple and static. Both these factors likely contribute to the finding that there was little change in the principal angles of the high-variance principal components. While this is not a criticism of the specific results presented here, the simplicity of the task and contexts, contrasted with the complexity of hand control more generally, especially for even moderately dexterous movements, makes it unclear how well the finding of stable manifolds will scale. On a related point, it is unclear whether the feature, identified using dPCA, that could account for changes in muscle activity, could be robustly captured in more realistic behaviors. It is stated that future work is needed, but at this point, the value of identifying this feature is highly speculative.

      Thank you for the comment, we have included more discussion to relate these results to decoder development in general as described in essential revision #3 from the editor.

      The maintained control in online BMI trials could also be explained by another factor, which I don't think was explicitly described by either of the two suggestions. Prism goggle experiments introduce a visual shift can be learned quickly, and some BCI experiments have introduced simple rotations in the decoder output (e.g. Chase et. al. 2012, J Neurophys). This latter case is likely similar in concept to in-manifold perturbations. Regardless, the performance can be rapidly rescued by simply re-aiming, which is a simple behavioral adaptation. In a 1DOF or 2DOF control case like used in these experiments, with constant visual feedback on performance, the change in context could likely be rapidly learned by the animals, maybe even within a single trial. In other words, the high performance in the online case may be a consequence of the relatively simple task demands, and the simple biomechanical solution to this problem (push harder). What is the expectation that the results seen in these experiments would be relevant to more realistic situations that require grasp and interaction?

      Thank you for the suggestion, we agree that the quick adaptation is likely related to re-aiming. To this end, we have included a re-aiming analysis, as described in essential revisions #1 and #2 from the editor and common response #2, to look into the quick adjustment.

      Some of the figures were difficult to read and the captions contained some minor incorrect information. The primary purpose of some of the figures was not immediately clear from the caption. For example, the bar plots in Figures 5 and 6 were very small and difficult to read. This also made distinguishing the data from the two different animals challenging.

      Thank you for the comments, multiple figures have been edited to increase legibility and a review of text has been done to fix errors and improve interpretability.

      There is no specific quantification of the data in Figures 4D and 5D. In Figure 4D it seems apparent that the vast majority of the points are below the unity line. But, it remains unclear, particularly in Figure 5D whether the correlations between the two contexts truly are different or not in a way that would allow conclusive statements.

      Thank you for the comments, Figure 4D has been moved to the supplement and 5D has now been replaced by figures analyzing the neural activity patterns during the online task.

    1. Think about looking at a piece of abstract art or a computer’s circuit board for the first time. With little knowledge, you may find it difficult to explain what you’re seeing — which would also make it difficult for you to answer the question, “What’s your perspective on this artwork or circuit board?” You have a perspective, you just don’t have the tools to talk about it meaningfully. With a little help, however (e.g., acquired knowledge from a teacher, friend, YouTube, and so on), you can learn to identify and talk about certain brushstrokes and the color palette (or reflow and silkscreen layering). Consequently, your perspective will be enriched, and you’ll be more articulate when you share that perspective with others.

      Reading this example of how our perspectives change as we gain knowledge enlightens many things for me. It helped me reflect on everything that changes our perspectives as we learn more about it. For example looking at a math problem that seems difficult at first but once you learn how to solve it, it's not so difficult with the knowledge you learned and eventually your perspective on the math problem has changed to viewing it as an easy problem.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      I summarise the major findings of the work below. In my opinion the range and application of approaches has provided a broad evidence base that, in general, supports the authors conclusions. However, there are, in my opinion, particular failures to utilise and communicate this evidence. The manuscript may be much improved with attention in the following areas. In each case I will give general criticism with a few examples, but the principals of my comments could be applied throughout the work.

      1) Insufficient quantification. The investigation combines various sources of qualitative data (EM, fluorescence microscopy, western blotting) to generate a reasonably strong evidence base. However, the work is over-reliant on representative images and should include more quantification from repeat experiments. When there are multiple fluorescence micrographs with intensity changes (not necessarily just representative images) (e.g. Figure 1 or 2) the authors should consider making measurements of these. Also the VLP production assays, which are assessed by western blotting would particularly benefit from a quantitative assessment (either by densitometry or, if samples remain, ELISA/similar approach).

      We have performed quantification of immunofluoresence, western blotting and VLP experiments from existing data. These quantification are presented in our revised manuscript. An overview of new quantification is shown below:

      Data shown

      Quantification now shown in

      Method

      Analysis

      Figure 1A

      Supp F1C

      IF

      HAE (-/+ SARS-CoV-2)

      • Tetherin total fluorescence intensity

      Figure 1D

      Supp F1E

      IF

      HeLa+ACE2 (-/+ SARS-CoV-2 )

      • Tetherin total fluorescence intensity

      Figure 2C

      Supp F2B

      IF

      A549+ACE2 (-/+ SARS-CoV-2)

      • Tetherin total fluorescence intensity

      Figure 2G

      Supp F2D

      IF

      T84 (-/+ SARS-CoV-2)

      • Tetherin total fluorescence intensity

      Supp F4A

      Supp F4B

      IF

      HeLa + ss-HA-Spike transients (-/+ HA stained cells) - Tetherin total fluorescence intensity

      Figure 4D

      Supp F4E

      IF

      HeLa + TetOne ss-HA-Spike stables (-/+ Dox)

      • Tetherin total fluorescence intensity

      Figure 4F

      Supp F4G

      W blot

      HeLa + TetOne ss-HA-Spike stables (-/+ Dox)

      – Tetherin abundance

      Figure 4G

      Supp F4I

      W blot – lysates

      Spike VLP experiments

      – tetherin abundance

      Figure 4G

      Supp F4J

      W blot - VLPs

      Spike VLP experiments

      • N-FLAG abundance

      Figure 6A

      Supp F7A

      W blot – lysates

      ORF3a VLP experiments

      – tetherin abundance

      Figure 6A

      Supp F7B

      W blot - VLPs

      ORF3a VLP experiments

      • N-FLAG

      For immunofluoresence anaysis, the mean, standard deviation, number of cells analysed and number of independent experiments are shown in the updated figure legends. Statistical analysis is also detailed in figure legends. Methods for the quantificaiton of fluoresence intensity is included in the Methods section.

      Densitometry was performed on western blots and VLP experiments as suggested. The mean, standard devisation and number of independent expreiments analysed are expressed in figure legends. Methods for densityometry quantification is now included.

      2) Insufficient explanation. I found some of the images and legends contained insufficient annotation and/or description for a non-expert reader to appreciate the result(s). Particularly if the authors want to draw attention to features in micrographs they should consider using more enlarged/inset images and annotations (e.g. arrows) to point out structures (e.g DMVs etc.). This short coming exacerbates the lack of quantification.

      Additional detail has been provided to the figure legends, and we have updated several figures to draw attention to features in micrographs. Black arrowheads have been added to Figures 1E, 2D, 2H to highlight plasma membrane-associated virions, and asterisks to highlight DMVs in Figures 1E, 2D and Supplemental Figures 2C, 2E. Similarly, typical Golgi cisternae are highlighted by white arrowheads micrographs in Figure 2E. These figure legends have also been modified to highlight these additions.

      3) Insufficient exploration of the data. I had a sense that some aspects of the data seem unconsidered or ignored, and the discussion lacks depth and reflection. For example the tetherin down-regulation apparent in Figures 1 and 2 is not really explained by the spike/ORF3a antagonism described later on, but this is not explicitly addressed.

      We have made changes throughout the manuscript, but the discussion especially has been modified. We now discuss the ORF3a data in more depth, discuss possible mechanisms by which ORF3a alone enhances VLP release, and discuss our ORF7a data in context to previous reports.

      The discussion has been updated to now include a better description of our data, and additional writing putting our work in to context with previously published work. See discussion section of revised manuscript.

      Also, Figure 6 suggests that ORF3a results in high levels of incorporation of tetherin in to VLPs, but I don't think this is even described(?). The discussion should also include more comparison with previous studies on the relationship between SARS-2 and tetherin.

      We have added a section to discuss how ORF3a may enhance VLP release,

      ‘We found that the expression of ORF3a enhanced VLP independently of its ability to relocalise tetherin (Figure 6A). This may be due to either the ability of ORF3a to induce Golgi fragmentation [38] which facilitates viral trafficking [39], or due to enhanced lysosomal exocytosis [37]. Tetherin was also found in VLPs upon co-expression with ORF3a (Figure 6A) which may also indicate to enhanced release via lysosomal exocytosis [37].

      The secretion of lysosomal hydrolases has been reported upon expression of ORF3a [31] and whilst this may in-part be due to enhanced lysosome-plasma membrane fusion, our data highlights that ORF3a impairs the retrograde trafficking of CIMPR (Supplemental Figures 6B, 6F, 6G), which may similarly increase hydrolase secretion.’ – (Line 625-654).

      The discussion has been developed to compare the relationship between SARS-CoV-2 and tetherin in previous studies,

      ‘SARS-CoV-1 ORF7a is reported to inhibit tetherin glycosylation and localise to the plasma membrane in the presence of tetherin [18]. We did not observe any difference in total tetherin levels, tetherin glycosylation, ability to form dimers, or surface tetherin upon expression of either SARS-CoV-1 or SARS-CoV-2 ORF7a (Figures 4A, 4B, 4C).

      Others groups have demonstrated a role for ORF7a in sarbecovirus infection and both SARS-CoV-1 and SARS-CoV-2 virus lacking ORF7a show impaired virus replication in the presence of tetherin [18,41]. A direct interaction between SARS-CoV-1 ORF7a and SARS-CoV-2 ORF7a and tetherin have been described [18,41], although the precise mechanism(s) by which ORF7a antagonises tetherin remains enigmatic. We cannot exclude that ORF7a requires other viral proteins to antagonise tetherin, or that ORF7a antagonises tetherin via another mechanism. For example, ORF7a can potently antagonise IFN signalling [42] which would impair tetherin induction in many cell types.’ – (Line 667-704).

      I have no minor comments on this draft of the manuscript.

      Reviewer #1 (Significance (Required)):

      Tetherin, encoded by the BST2 gene, is an antiviral restriction factor that inhibits the release of enveloped viruses by creating tethers between viral and host membranes. It also has a capacity for sensing and signalling viral infection. It is most widely understood in the context of HIV-1, however, there is evidence of restriction in a wide variety of enveloped viruses, many of which have evolved strategies for antagonising tetherin. This knowledge informs on viral interactions with the innate immune system, with implications for basic virology and translational research.

      This study investigates tetherin in the context of SARS-CoV-2. The authors use a powerful collection of tools (live virus, gene knock out cells, recombinant viral and host expression systems) and a variety of approaches (microscopy, western blotting, infection assays), which is, itself, a strength. The study provides evidence to support a series of conclusions: I) BST2/tetherin restricts SARS-CoV-2 II) SARS-CoV-2 ablates tetherin expression III) spike protein can modestly down-regulate tetherin IV) ORF3A dysregulates tetherin localisation by altering retrograde trafficking. These conclusions are broadly supported by the data and this study make significant contributions to our understanding of SARS-CoV-2/tetherin interactions.

      My enthusiasm is reduced by, in my opinion, a failure of the authors to fully quantify, explain and explore their data. I expect the manuscript could be significantly improved without further experimentation by strengthening these aspects.

      This manuscript will be of interest to investigators in virology and/or cellular intrinsic immunity. Given the focus on SARS-CoV-2 it is possible/likely that it will find a slightly broader readership.

      I have highly appropriate skills for evaluating this work being experienced in virology, SARS-CoV-2, cell biology and microscopy.

      We wish to thank Reviewer #1 for their comments which have helped us to improve the quality of our revised manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      BST2/tetherin can restrict the release and transmission of many enveloped viruses, including coronaviruses. In many cases, restricted viruses have developed mechanisms to abrogate tetherin-restriction by expressing proteins that antagonize tetherin; HIV-1 Vpu-mediated antagonism of tetherin restriction is a particularly well studied example. In this paper, Stewart et al. report their studies of the mechanism(s) underlying SARS-CoV-2 antagonism of tetherin restriction. They conclude that Orf3a is the primary virally encoded protein involved and that Orf3a manipulates endo-lysosomal trafficking to decrease tetherin cycling and divert the protein away from putative assembly sites.

      Major comments:- In my view some of the claims made by the authors are not fully supported by the data. For example, the bystander effect discussed in line 162 may suggest that infected cells can produce IFN but does not 'indicate' that they do

      This text has now been edited,

      ‘The levels of tetherin in uninfected HAE cells is lower than observed in uninfected neighbours in infected wells demonstrating that infected HAE cells are able to generate IFN to act upon uninfected neighbouring cells, enhancing tetherin expression.’ - (Lines 163-172).

      Most of the EM images show part of a cell profile, so statements such as (line 192) 'virus containing tubulovesicular organelles were often polarised towards sites of significant surface-associated virus' should be backed up with appropriate images, or indicated as 'not shown', or removed (the observation is not so important for this story). Line 196, DMVs can't be seen in these micrographs.

      The statement 'virus containing tubulovesicular organelles were often polarised towards sites of significant surface-associated virus' has been removed. The micrographs in Figure 1E have been re-cropped, and image iii replaced with an image showing DMVs and budding virions. Plasma membrane-associated virions are highlighted by black arrowheads, DMVs by black asterisks, and intracellular virion by a white arrow.

      Line 391, I can't see much change in CD63 distribution.

      CD63 reproducibly appears clustered towards the nuclei in ORF3a expressing cells, whilst CD63 positive puncta are abundant in the periphery of mock cells. CD63 puncta are also larger, and the staining of CIMPR and VPS35 also appears to be associated with larger organelles. We have amended the text to now read,

      ‘Expression of ORF3a also disrupted the distribution of numerous endosome-related markers including CIMPR, VPS35, CD63, which all localised to larger and less peripheral puncta (Supplemental Figure 6B), and the mixing of early and late endosomal markers’ - (Line 469).

      Quantification of the diameter of CD63 puncta indicate that they are larger in ORF3a expressing cells than in mock cells. Mock cells - 0.71μm (SD; 0.19), ORF3a - 1.15μm (SD;0.35). At least 75 organelles per sample, from 10 different cells. We have not included this data as we do not wish to labor this point but are happy to include this quantification if required to do so.

      Line 321, the authors show that ORF7a does not affect tetherin localization, abundance, glycosylation or dimer formation, but they don't show that it doesn't restrict SARS-CoV-2. Can they be sure that epitope tagging this molecule does not abrogate function (or the functions of any of the other tagged proteins for that matter), or that ORF7a works in conjunction with one of the other viral proteins?

      We are careful in the manuscript not to claim that ORF7a has no effect on tetherin. Our data indicate that ‘ORF7a does not directly influence tetherin localisation, abundance, glycosylation or dimer formation’ - (Line 361-362).

      We were unable to reproduce an effect of ORF7a on tetherin glycosylation. Our data conflicts with that presented by Taylor et al, 2015, where ORF7a impaired tetherin glycosylation and ORF7a localised to the plasma membrane in tetherin expressing cells. The experiments performed by Taylor et al used HEK293 cells and ectopically expressed tagged tetherin. The differences in results may be attributed to the differences between cell lines or due to differences between endogenous or ectopic / tagged tetherin.

      The study by Taylor et al uses SARS-CoV-1 ORF7a-HA from Kopecky-Bromberg et al., 2007 (DOI: 1128/JVI.01782-06), where the -HA tag is positioned at the C-terminus. Our ORF7a-FLAG constructs have a C-terminal epitope tag. While we cannot exclude the possibility that tagged proteins may act differently from untagged ones, the differences between our findings and previous work appear unlikely to be due to epitope tags.

      Our manuscript states that although we cannot find any effect of ORF7a on tetherin localisation, abundance, glycosylation, or dimer formation, we cannot exclude that ORF7a impacts tetherin by another mechanism. For example, ORF7a has been found to antagonise interferon responses. Tetherin is abundantly expressed in HeLa cells and expression does not require induction through interferon. None of our experiments above would be impacted by interferon antagonism yet this could impact other cell types besides infection in vivo. These possibilities may explain the reported differential impact of ORF7a by different labs. An addition comment has been added to the discussion to reflect this,

      ’We cannot exclude that ORF7a requires other viral proteins to antagonise tetherin, or that ORF7a antagonises tetherin via another mechanism. For example, ORF7a potently antagonises IFN signalling [38], which would impair tetherin induction in many cell types. - (Line 701-704).

      Note - Reference 38 has been added to the manuscript – Xia et al., Cell Reports DOI: 10.1016/j.celrep.2020.108234

      In the ORF screen, a number of the constructs are expressed at low level, is it possible they [the authors] are missing something?

      Some of the ORFs expressed in the miniscreen appear poorly expressed. We accept that in the use of epitope tagged constructs expression levels of individual viral proteins may impact upon a successful screen. However, this screen was performed to identify any potential changes in tetherin abundance or localisation, and the screen did successfully identify ORF3a, which we were able to follow-up and verify.

      Line 376, the authors refer to ORF3a being a viroporin. A recent eLife paper (doi: 10.7554/eLife.84477; initially published in BioRxiv) refutes this claim and builds on other evidence that ORF3a interacts with the HOPS complex. The authors should at least mention this work, especially in the discussion, as it would seem to provide a molecular mechanism to support their conclusions.

      This paper had not been peer reviewed at the time of our initial submission. We have now included the following text,

      ‘SARS-CoV-2 ORF3a is an accessory protein that localises to and perturbs endosomes and lysosomes [29]. It may do so by acting either as a viroporin [30] or by interacting with, and possibly interfering with the function of VPS 39, a component of the HOPS complex which facilitates tethering of late endosomes or autophagosomes with lysosomes [29,31]. Given ORF3a likely impairs lysosome function, the observed increased….’ - (Lines 444-449).

      Fig 3, the growth curves illustrated in Fig3 C and D do not have errors bars; how many times were these experiments repeated?

      These experiments require more repeats to include error bars. Infection and plaque assay (Figure 3C, 3D) are currently ongoing and we plan to complete them in the next 6-8 weeks and include them in the finalised manuscript.

      In the new experiments, infections will additionally be performed at MOI 0.01, in addition to the previous MOIs (1 and 5).

      Line 396, the authors show increased co-localization with LAMP1. As LAMP1 is found in late endosomes as well as lysosomes, they cannot claim the redistributed tetherin is specifically in lysosomes.

      We have altered the text to now say:

      ‘The ORF3a-mediated increase in tetherin abundance within endolysosomes could be due to defective lysosomal degradation.’ - (Line 475).

      There seems to be a marked difference in the anti-rb555 signal in the 'mock' cells in panels 5H and Suppl 6E. Is there a good reason for this, or does this indicate variability between experiments?

      Antibody uptake experiments in Figure 5H and Supp Figure 6E were performed and acquired on different days. Relatively low levels of signal are available in these antibody uptake experiments, and the disperse labelling seen in the mocks does not aid this.

      Fig 6a, why is there negligible VLP release from cells lacking BST2 and ORF3a-strep? How many times were these experiments performed? Is this a representative image? I think it confusing to refer to the same protein by two different names in the same figure (i.e. BST2 and tetherin). Do the authors know how the levels of ORF3a expressed in cells in these experiments compares to those seen in infected cells?

      We have changed the blot in Figure 6A for one with clearer FLAG bands. Three independent experiments were performed for Figure 6A. Quantification of VLPs is now included in Supplemental Figure 7B.

      We have changed ‘Bst2’ to ‘tetherin’ in all previous figures relating to protein; Figure 4G, Figure 6A, B, C.

      We have no current information to compare ORF3a levels in these experiments versus in infected cells. We can investigate quantifying this if necessary.

      My final point is, perhaps, the trickiest to answer, but nevertheless needs to be considered. As far as we know, SARS-CoV-2 and at least some other coronaviruses, bud into organelles of the early secretory pathway, often considered to be ERGIC. In the experiments shown here the authors provide evidence that ORF3a can influence tetherin recycling, but the main way of showing this is through its increased association with endocytic organelles. Do the authors have any evidence that Orf3a reduces tetherin levels in the ERGIC or whether the tetherin cycling pathway(s) involve the ERGIC?

      This is an interesting point, and as the reviewer concedes, this is tricky to answer. Expression of ORF3a causes the redistribution or remodeling of various organelles (Figures 1E, 2D, 2F, Supp Figures 2C, 2E, 3E, 6B, 6C, 6D). We have been unable to test the direct involvement of ERGIC, despite attempts with a number of commercial antibodies. Given the huge rearrangements of organelles during SARS-CoV-2 infection, it is unclear exactly what will happen to the distribution of ERGIC.

      Minor comments: Line 53, delete 'shell' its redundant and confusing when the authors have said coronaviruses have a membrane.

      Deleted.

      Line 61, delete 'the'

      Deleted.

      Line 72, delete 'enveloped'; coronaviruses already described as enveloped viruses (line 53)

      Deleted.

      Lines 93 - 100, lop-sided discussion of the viral life cycle; this paragraph is mostly about entry, which is not relevant to this paper, and does not really deal with the synthesis and assembly side of the cycle.

      We have now added the following text,

      ‘….liberating the viral nucleocapsid to the cytosol of the cell. Upon uncoating, the RNA genome is released into the host cytosol and replication-transcription complexes assemble to drive the replication of the viral genome and the expression of viral proteins. Coronaviruses modify host organelles to generate viral replication factories - so-called DMVs (double-membrane vesicles) that act as hubs for viral RNA synthesis [10]. SARS-CoV-2 viral budding occurs at ER-to-Golgi intermediate compartments (ERGIC) and newly formed viral particles traffic through secretory vesicles to the plasma membrane where they are released to the extracellular space.’ - (Lines 95-104).

      Line 103, why are the neighbouring cells 'naive'?

      ‘naïve’ removed.

      Line 112 - 113, delete last phrase; tetherin is described as an IFN stimulated gene in line 111; to be accurate, the beginning of the sentence should be 'Tetherin is expressed from a type 1 Interferon stimulated gene ...'

      Amended.

      Line 118 - 119, should say 'For tetherin-restricted enveloped viruses' as not all enveloped viruses are restricted by tetherin.

      Amended.

      Line 131, coronaviruses are not the only family of tetherin-restricted viruses that assemble on intracellular membranes, e.g. bunyaviruses.

      This has been modified and now reads,

      ‘In order for tetherin to tether coronaviruses, tetherin must be incorporated in the virus envelope during budding which occurs in intracellular organelles.’ - (Lines 133-135).

      Line 192, there is no EM data in Supplemental Fig 1C.

      This has now been removed.

      Line 251, 'a synchronous infection event' should be 'synchronous infection' as there will be multiple infection events.

      This has been changed.

      Page 13 (and elsewhere), unlike Southern, 'Western' should not have a capital letter, except at the start of a sentence.

      These have been updated throughout the manuscript (Lines 183, 341, 3549, 356, 392, 509, 763, 1330, 1399).

      Lines 330 and 352, can the authors quantitate S protein-induced reduction in cell surface tetherin rather than using the somewhat subjective 'mild'?

      These are now changed to,

      ‘Transient transfection of cells with ss-HA-Spike caused a 32% decrease in tetherin as observed by immunofluorescence (Supplemental Figure 4A, 4B), with…’ – (Line 370).

      ‘To explore whether the Spike-induced tetherin downregulation altered virus release, we performed experiments with virus like particles (VLPs) in HEK293T …’ – (Line 399).

      Line 379, OFR, should be ORF.

      Yes, changed.

      Line 448, 'Tetherin retains the ability' - did it ever loose it?

      This has been rephrased to,

      ‘Tetherin has the ability to restrict a number of different enveloped viruses that bud at distinct organelles.’ - (Line 547).

      Line 451, 'luminal' is confusing in this context.

      This has been modified to,

      ‘Tetherin forms homodimers between opposing membranes (e.g., plasma membrane and viral envelope) that are linked via disulphide bonds.’ - (Line 549).

      Line 453, the process of virus envelopment is likely to be more than a 'single step'

      This now reads,

      ‘…virus during viral budding, which occurs in modified ERGIC organelles.’ - (Line 552).

      Line 457, in my view the notion that Vpu abrogation of tetherin restriction is just due to redistribution of tetherin to the TGN is somewhat simplistic and disregards a lot of other work.

      We have removed mention of mechanisms of tetherin antagonism by other viruses. The key point we wish to make here is that tetherin is lost from the budding compartment. This now reads,

      ‘Many enveloped viruses antagonise tetherin by altering its localisation and removing it from the respective site of virus budding.’ – (Line 552-553).

      Line 472, what is meant by 'resting states'?

      This should have been ‘in the absence of stimulation’ and have now been re-written,

      ‘Tetherin is an IFN-stimulated gene (ISG) [13], and many cell types express low levels of tetherin in the absence of stimulation.’ - (Line 577).

      Line 1204, how were 'mock infected cells .......... infected'?

      This has now been re-written,

      ‘Differentiated nasal primary human airway epithelial (HAE) cells were embedded to OCT….’ - (Line 1385).

      Reviewer #2 (Significance (Required)):

      This study builds on published work supporting the notion that SARS-Cov-2 ORF3a is an antagonist for the restriction factor tetherin. Importantly, it provides insights to the the mechanism of ORF3a mediated tetherin antagonism, specifically to ORF3a inhibits tetherin cycling, diverting the protein to lysosomes and away from compartment(s) where virions assemble. Overall, the authors provide good supporting evidence for these conclusions, however there are issues that the authors need to address.

      We wish to thank Reviewer #2 for their insightful comments and suggestions for improving this work.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Restriction factors are major barriers against viral infections. A prime example is Tetherin (aka BST2), which is able to physically tether budding virions to the plasma membrane preventing release of the infectious particles. Of note, tetherin has broad anti-viral activity and has been established as a crucial innate immune defense factor against HIV, IAV, SARS-CoV-2 and other important human pathogens. However, successful viruses like SARS-CoV-2 evolved strategies to counteract restriction factors and promote their replication. Important restriction factors, such as tetherin, may often be targeted by multiple viral strategies to ensure complete suppression of their anti-viral activities by the pathogen. Of note, it was previously published that the accessory protein ORF7a of SARS-CoV-2 binds to (Petrosino et al, Chemistry Europe, 2021) and antagonizes it (Martin-Sancho et al, Molecular Cell, 2021). Previous data on SARS-CoV also revealed that ORF7a promotes cleavage of tetherin (Taylor et al, 2015, J Virol). In this manuscript, the authors show that tetherin restricts SARS-CoV-2 by tethering virions to the plasma membrane and propose that tetherin is targeted by two proteins of SARS-CoV-2. Whereas the Spike protein promotes degradation of tetherin, the accessory protein ORF3a redirects tetherin away from newly forming SARS-CoV-2 virions. While the overall findings that both S and ORF3a are additionally targeting tetherin is both novel and intriguing, additional evidence is needed to support this. In addition, the authors show that in their experimental setups ORF7a does not induce cleavage of tetherin. This is in direct contrast to previously published data both on SARS-CoV(-1) and -2 (Taylor et al, 2015, J Virol; Petrosino et al, Chemistry Europe, 2021; Martin-Sancho et al, Molecular Cell, 2021). From my point of view that needs further experimental confirmation. While the authors state that the impact of Spike on tethrin is mild, the experiments should still allow the conclusion whether there is a (mild) effect or not. The mechanism of ORF3a is fortunately more robustly assessed and provides some novel insights. Unfortunately, the whole manuscript suffers from a striking lack of quantifications. In addition, it is not clear whether and how many times experiments were repeated to the same results. Overall, the data in this manuscript seem very speculative and preliminary and thus do not support the authors conclusions.

      Major:

      Much of the data seems like it was only done once. As I am sure that this is a writing issue, please clearly state how many times the individual assays were repeated, provide the quantification graphs and appropriate statistics. Some experiments may need additional quantification and confirmation by other methods to be convincing.

      Quantification is provided throughout the revised manuscript. Figure legends have also been updated to provide information on quantification and statistical analysis.

      For example, Figure 1A, C and D: Please quantify the levels of tetherin and use an alternative readout, e.g. Western blotting of infected cells.

      Quantification has been performed and included in our revised manuscript in Supplemental Figures 1C, 1E. Tetherin is not shown in Figure 1C.

      A table is provided (above) to highlight the additional quantification.

      Figure 2A: Please quantify.

      We are not sure we understand this point. The western blot shown in Figure 2A demonstrates the ectopic expression of ACE2 in our A549 cell line. A549 cells have been used by many labs to study SARS-CoV-2 infection, but express negligible ACE2.

      Fig 3A: Please show and confirm successful tetherin KO in the cell lines that are used not only in microscopy.

      A new blot is now shown in Figure 3A, including a blot demonstrating tetherin loss in both KO lines.

      Figure 4C: Please quantify

      Currently flow cytometry experiments have been performed twice each and this is now detailed in the figure legends. The data shown in each panel is representative and the data has been explored using analogous approaches. For example, Figure 4C is complemented by Figures 4A and 4B, Figures 4E is complemented by 4D and 4F. We do not feel that repeating these flow cytometry analysis will significantly improve the manuscript.

      Figure 4D: Please quantify the effects are not obvious from the images provided.

      Quantification is now provided in Supplemental Figure 4E.

      Figure 4E, F Please provide a quantification of multiple independent repeats, the claimed differences are neither striking nor obvious.

      Quantification of 4F is now provided in Supplemental Figure 4G. Tetherin levels were quantified to be reduced by 25% (SD: 8%) by addition of Doxycycline and induction of ss-HA-Spike. Information for quantification is provided in figure legends.

      Figure 5A: Please quantify

      These experiments have currently been performed twice and this is now described in the figure legends. Data shown is representative. We can perform one more repeat of these experiments to quantify if neccessary, but do not feel it will significantly alter the manuscript.

      Figure 3C and D: At timepoint 0 the infection input levels are different. The initial infection levels have to be the same to draw the conclusion that tetherin KO affects virion release and not the initial infection efficiency. Can the authors either normalize or ensure that the initial infection is the same in all conditions and that variations in the initial infection efficiency do not correlated with the impact of tetherin on replication/release ? How often were those experiments repeated? Are the marginal differences in infectious titre significant? Overall the impact of tetherin on SARS-CoV-2 is very underwhelming but that may be due to efficient viral tetherin-counteraction strategies. Why is the phenotype inverted at 72 h?

      Equal amounts of virus, as measured by plaque-forming units (PFU), were used for both HeLa cell lines and thus at 0 hpi the variation seen is within the parameters of the assay used. It remains possible that tetherin affects virus entry but this is unlikely and this assay was not designed to investigate that effect.

      Growth curve assays are currently being repeated using an MOI of 0.01, 1 and 5. We are removing the 72 hpi sample from future experiments. At this time point, we find that the extensive cell death caused by viral replication (especially at higher MOIs) makes it difficult to accurately separate the released from intracellular fractions and conclusions cannot be accurately drawn from the data.

      Additional repeats of these experiments are in progress and will be included in the finalised manuscript.

      Figure 4B and C: Can the authors provide an explanation why SARS-CoV ORF7a is not inducing cleavage/removes glycosylation of tetherin. To show that the assays work, an independent positive control needs to be included. The FACS data in C is unfortunately not quantified.

      See above comments (Reviewer #2) regarding discussion on ORF7a. Additional text has been included to discuss ORF7a data,

      ‘SARS-CoV-1 ORF7a is reported to inhibit tetherin glycosylation and localise to the plasma membrane in the presence of tetherin [18]. We did not observe any difference in total tetherin levels, tetherin glycosylation, ability to form dimers, or surface tetherin upon expression of either SARS-CoV-1 or SARS-CoV-2 ORF7a (Figures 4A, 4B, 4C).

      Others groups have demonstrated a role for ORF7a in sarbecovirus infection and both SARS-CoV-1 and SARS-CoV-2 virus lacking ORF7a show impaired virus replication in the presence of tetherin [18,41]. A direct interaction between SARS-CoV-1 ORF7a and SARS-CoV-2 ORF7a and tetherin have been described [18,41], although the precise mechanism(s) by which ORF7a antagonises tetherin remains enigmatic. We cannot exclude that ORF7a requires other viral proteins to antagonise tetherin, or that ORF7a antagonises tetherin via another mechanism. For example, ORF7a can potently antagonise IFN signalling [42] which would impair tetherin induction in many cell types.’ – (Line 667-704).

      Fig 4G: The rationale and result of this experiment are not clear.

      The rationale for Spike VLP experiments is explained at Line 403. Given that Spike caused a reproducible decrease in cellular tetherin, we examined whether this downregulation was sufficient to antagonise tetherin and increase VLP yield.

      Fig 6: What is the benefit of doing the VLP assays as opposed to genuine virus experiments? To me it rather seems to be making the data unnecessarily complex. Again, no quantifications or repeats are provided.

      VLPs are used to separate the budding and release process from the replication process of RNA viruses. VLPs have been used in a number of SARS-CoV (DOI: 1002/jmv.25518) and HIV-1 (DOI: https://doi.org/10.1186/1742-4690-7-51) studies to analyse the impact of tetherin (and tetherin mutants) on release.

      VLP experiment quantification are now included throughout.

      Minor: Fig 1D: How do the authors explain the mainly intracellular Spike staining?

      We do not understand this point. Spike staining is intracellular, whether expressed alone or in the context of infected cells.

      Please add statistical analyses on the data e.g. Fig. 3 C and D

      Additional repeats of these experiments are in progress and will be included in the finalised manuscript.

      Fig. 4B and F: Why do the annotated sizes of tetherin differ between the blots?

      Figures 4B and 4F are run in non-reduced and reduced conditions respectively. In order to best show the dimer deficient C3A-Tetherin, blots are typically run in non-reduced conditions to exemplify dimer formation and to highlight any defects in dimer formation. The rest of the blots in the manuscript are run in denaturing conditions to aid blotting of other proteins. (Lines 957-958) and now (Lines 1356-1357).

      Fig. 5A: What is ORF6a? Do the authors mean ORF6?

      Yes, this has been changed.

      An MOI of 1 is NOT considered a low or relevant MOI. Can the authors either rephrase or repeat experiments with an actual low or relevant MOI i.e. 0.01 ?

      We are currently repeating these experiments and are including MOIs of 0.01, 1 and 5.

      Why were the cell models switched between Figure 1 and 2 and essentially the same experiments repeated?

      HeLa cells express high levels of tetherin at steady state, whilst A549 cells require IFN stimulation. HeLa cells demonstrate that tetherin downregulation occurs via an IFN-independent manner. A549 and T84 cells are more physiologically relevant cell types for SARS-CoV-2 infection. These points are stated in Lines 230 and 261.

      The manuscript may benefit a lot from streamlining and removing unessential deviations from the main message (e.g. discussions why multistep/single step growth curves are used/not relevant; why are they shown if the authors conclude that a single step is not relevant?). The discussion is extremely lengthy and does not provide sufficient discussion of the presented data.

      The multistep/single step growth curve text will be adapted, but it will be re-written after additional infection experiments.

      We have removed from the Discussion a small section discussing ORF7a mutants, given that the emphasis of our manuscript is not on ORF7a.

      We have also removed a small section describing the rearrangements of intracellular organelles by SARS-CoV-2 as it does not directly relate to the central message of our manuscript.

      According to my opinion, the current manuscript does not provide significant advancement for the field. While the intention was to update and expand our existing knowledge about tetherin restriction by SARS-CoV-2, the experiments do not support this yet. However, the general premise and approach/concept of the manuscript would be appealing to a broader audience. I especially like the notion that multiple proteins of SARS-CoV-2 could synergistically counteract an important innate immune defense factor, tetherin. My expertise is on SARS-CoV-2 and the interplay between the virus and host cell restriction factors.

      Reviewer #3 (Significance (Required)):

      According to my opinion, the current manuscript does not provide significant advancement for the field. While the intention was to update and expand our existing knowledge about tetherin restriction by SARS-CoV-2, the experiments do not support this yet. However, the general premise and approach/concept of the manuscript would be appealing to a broader audience. I especially like the notion that multiple proteins of SARS-CoV-2 could synergistically counteract an important innate immune defense factor, tetherin. My expertise is on SARS-CoV-2 and the interplay between the virus and host cell restriction factors.

      We thank Reviewer#3 for their comments and suggestions for improving this work.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      BST2/tetherin can restrict the release and transmission of many enveloped viruses, including coronaviruses. In many cases, restricted viruses have developed mechanisms to abrogate tetherin-restriction by expressing proteins that antagonize tetherin; HIV-1 Vpu-mediated antagonism of tetherin restriction is a particularly well studied example. In this paper, Stewart et al. report their studies of the mechanism(s) underlying SARS-CoV-2 antagonism of tetherin restriction. They conclude that Orf3a is the primary virally encoded protein involved and that Orf3a manipulates endo-lysosomal trafficking to decrease tetherin cycling and divert the protein away from putative assembly sites.

      Major comments:

      In my view some of the claims made by the authors are not fully supported by the data. For example, the bystander effect discussed in line 162 may suggest that infected cells can produce IFN but does not 'indicate' that they do. Most of the EM images show part of a cell profile, so statements such as (line 192) 'virus containing tubulovesicular organelles were often polarised towards sites of significant surface-associated virus' should be backed up with appropriate images, or indicated as 'not shown', or removed (the observation is not so important for this story). Line 196, DMVs can't be seen in these micrographs. Line 391, I can't see much change in CD63 distribution.

      Line 321, the authors show that ORF7a does not affect tetherin localization, abundance, glycosylation or dimer formation, but they don't show that it doesn't restrict SARS-CoV-2. Can they be sure that epitope tagging this molecule does not abrogate function (or the functions of any of the other tagged proteins for that matter), or that ORF7a works in conjunction with one of the other viral proteins? In the ORF screen, a number of the constructs are expressed at low level, is it possible they are missing something?

      Line 376, the authors refer to ORF3a being a viroporin. A recent eLife paper (doi: 10.7554/eLife.84477; initially published in BioRxiv) refutes this claim and builds on other evidence that ORF3a interacts with the HOPS complex. The authors should at least mention this work, especially in the discussion, as it would seem to provide a molecular mechanism to support their conclusions.

      Fig 3, the growth curves illustrated in Fig3 C and D do not have errors bars; how many times were these experiments repeated?

      Line 396, the authors show increased co-localization with LAMP1. As LAMP1 is found in late endosomes as well as lysosomes, they cannot claim the redistributed tetherin is specifically in lysosomes.

      There seems to be a marked difference in the anti-rb555 signal in the 'mock' cells in panels 5H and Suppl 6E. Is there a good reason for this, or does this indicate variability between experiments?

      Fig 6a, why is there negligible VLP release from cells lacking BST2 and ORF3a-strep? How many times were these experiments performed? Is this a representative image? I think it confusing to refer to the same protein by two different names in the same figure (i.e. BST2 and tetherin). Do the authors know how the levels of ORF3a expressed in cells in these experiments compares to those seen in infected cells?

      My final point is, perhaps, the trickiest to answer, but nevertheless needs to be considered. As far as we know, SARS-CoV-2 and at least some other coronaviruses, bud into organelles of the early secretory pathway, often considered to be ERGIC. In the experiments shown here the authors provide evidence that ORF3a can influence tetherin recycling, but the main way of showing this is through its increased association with endocytic organelles. Do the authors have any evidence that Orf3a reduces tetherin levels in the ERGIC or whether the tetherin cycling pathway(s) involve the ERGIC?

      Minor comments:

      Overall, the manuscript should be carefully edited to ensure the text reads clearly. A few examples of thing that need to be fixed are:-

      Line 53, delete 'shell' its redundant and confusing when the authors have said coronaviruses have a membrane.

      Line 61, delete 'the'

      Line 72, delete 'enveloped'; coronaviruses already described as enveloped viruses (line 53)

      Lines 93 - 100, lop-sided discussion of the viral life cycle; this paragraph is mostly about entry, which is not relevant to this paper, and does not really deal with the synthesis and assembly side of the cycle.

      Line 103, why are the neighbouring cells 'naive'?

      Line 112 - 113, delete last phrase; tetherin is described as an IFN stimulated gene in line 111; to be accurate, the beginning of the sentence should be 'Tetherin is expressed from a type 1 Interferon stimulated gene ...'

      Line 118 - 119, should say 'For tetherin-restricted enveloped viruses' as not all enveloped viruses are restricted by tetherin.

      Line 131, coronaviruses are not the only family of tetherin-restricted viruses that assemble on intracellular membranes, e.g. bunyaviruses.

      Line 192, there is no EM data in Supplemental Fig 1C.

      Line 251, 'a synchronous infection event' should be 'synchronous infection' as there will be multiple infection events

      Page 13 (and elsewhere), unlike Southern, 'Western' should not have a capital letter, except at the start of a sentence.

      Lines 330 and 352, can the authors quantitate S protein-induced reduction in cell surface tetherin rather than using the somewhat subjective 'mild'?

      Line 379, OFR, should be ORF.

      Line 448, 'Tetherin retains the ability' - did it ever loose it?

      Line 451, 'luminal' is confusing in this context.

      Line 453, the process of virus envelopment is likely to be more than a 'single step'

      Line 457, in my view the notion that Vpu abrogation of tetherin restriction is just due to redistribution of tetherin to the TGN is somewhat simplistic and disregards a lot of other work.

      Line 472, what is meant by 'resting states'?

      Line 1204, how were 'mock infected cells .......... infected'?

      Significance

      This study builds on published work supporting the notion that SARS-Cov-2 ORF3a is an antagonist for the restriction factor tetherin. Importantly, it provides insights to the the mechanism of ORF3a mediated tetherin antagonism, specifically to ORF3a inhibits tetherin cycling, diverting the protein to lysosomes and away from compartment(s) where virions assemble. Overall, the authors provide good supporting evidence for these conclusions, however there are issues that the authors need to address.

    1. Reviewer #2 (Public Review):

      In this manuscript, Roberts et al. present XTABLE, a tool to integrate, visualise and extract new insights from published datasets in the field of preinvasive lung cancer lesions. This approach is critical and to be highly commended; whilst the Cancer Genome Atlas provided many insights into cancer biology it was the development of accessible visualisation tools such as cbioportal that democratised this knowledge and allowed researchers around the world to interrogate their genes and pathways of interest. XTABLE is trying to do this in the preinvasive space and should certainly be commended as such. We are also very impressed by the transparency of the approach; it is quite simple to download and run XTABLE from their Gitlab account, in which all data acquisition and analysis code can be easily interrogated.

      We would however strongly advocate deploying XTABLE to a web-accessible server so that researchers without experience in R and git can utilise it. We found it a little buggy running locally and cannot be sure whether this is due to my setup or the code itself. Some issues clearly need development; Progeny analysis brings up a warning "Not working for GSE109743 on the server and not sure why". GSEA analysis does not seem to work at all, raising an error "Length information for genome hg38 and gene ID ensGene is not available". In such relatively complex software, some such errors can be overlooked, as long as the authors have a clear process for responding to them, for example using Gitlab issue reporting. Some acknowledgement that this is an ongoing development would be helpful.

      The authors discuss some very important differences between the datasets in the text. Most notably they differ in endpoints and in the presence of laser capture. We would advocate including some warning text within the XTABLE application to explain these. For example, the "persistent/progressive" endpoint used in Beane et al (next biopsy is the same or higher grade) is not the same as the "progressive" endpoint in Teixeira et al (next biopsy is cancer); samples defined as "persistent/progressive" may never progress to cancer. This may not be immediately obvious to a user of XTABLE who wishes to compare progressive and regressive lesions. Similarly, the use of laser capture is important; the authors state that not using laser capture has the advantage of capturing microenvironment signals, but differentiating between intra-lesional and stromal signals is important, as shown in the Mascaux and Pennycuick papers. The authors cannot do much about the different study designs, but as the goal is to make these data more accessible We think some brief description of these issues within the app would help to prevent non-expert users from drawing incorrect conclusions.

      The authors themselves illustrate this clearly in their analysis of CIN signatures in progression potential. They observe that there is a much clearer progressive/regressive signal in GSE108124 compared to GSE114489 and GSE109743. This does not seem at all surprising, since the first study used a much stricter definition of progression - these samples are all about to become cancer whereas "progressive" samples in GSE109743 may never become cancer - and are much enriched for CIN signals due to laser capture. Their discussion states "CIN scores as a predictor of progression might be limited to microdissected samples and CIS lesions"; you cannot really claim this when "progression" in the two cohorts has such a different meaning. To their credit, the authors do explain these issues but they really should be clearly spelled out within the app.

      We are not sure we agree with their analysis of CDK4/Cyclin-D1 and E2F expression in early lesions. The authors claim these are inhibited by CDKN2A and therefore are markers of CDKN2A loss of function. But these genes are markers of proliferation and can be driven by a range of proliferative processes. Histologically, low-grade metaplasias and dysplasias all represent proliferative epithelium when compared to normal control, but most never become cancer. It is too much of a leap to say that these are influenced by CDKN2A because that gene is inactivated in LUSC; do the authors have any evidence that this gene is altered at the genomic level in low-grade lesions?

      Overall this tool is an important step forwards in the field. Whilst we are a little unconvinced by some of their biological interpretations, and the tool itself has a few bugs, this effort to make complex data more accessible will be greatly enabling for researchers and so should be commended. In the future, we would like to see additional molecular data integrated into this app, for example, the whole genome and methylation data mentioned in line 153. However, we think this is an excellent start to combining these datasets.

    2. Author Response:

      Reviewer #1 (Public Review):<br /> <br /> Roberts et al have developed a tool called "XTABLE" for the analysis of publicly available transcriptomic datasets of premalignant lesions (PML) of lung squamous cell carcinoma (LUSC). Detection of PMLs has clinical implications and can aid in the prevention of deaths by LUSC. Hence efforts such as this will be of benefit to the scientific community in better understanding the biology of PMLs.

      The authors have curated four studies that have profiled the transcriptomes of PMLs at different stages. While three of them are microarray-based studies, one study has profiled the transcriptome with RNA-seq. XTABLE fetches these datasets and performs analysis in an R shiny app (a graphical user interface). The tool has multiple functionalities to cover a wide range of transcriptomic analyses, including differential expression, signature identification, and immune cell type deconvolution.

      The authors have also included three chromosomal instability (CIN) signatures from literature based on gene expression profiles. They showed one of the CIN signatures as a good predictor of progression. However, this signature performed well only in one study. The authors have further utilised the tool XTABLE to identify the signalling pathways in LUSC important for its developmental stages. They found the activation of squamous differentiation and PI3K/Akt pathways to play a role in the transition from low to high-grade PMLs

      The authors have developed user-friendly software to analyse publicly available gene expression data from premalignant lesions of lung cancer. This would help researchers to quickly analyse the data and improve our understanding of such lesions. This would pave the way to improve early detection of PMLs to prevent lung cancer.

      Strengths:

      1. XTABLE is a nicely packaged application that can be used by researchers with very little computational knowledge.<br /> 2. The tool is easy to download and execute. The documentation is extensive both in the article and on the GitLab page.<br /> 3. The tool is user-friendly, and the tabs are intuitively designed for successive steps of analysis of the transcriptome data.<br /> 4. The authors have properly elaborated on the biological interest in investigating PMLs and their clinical significance.

      Weaknesses:

      The article is focused on the development and the utility of the tool XTABLE. While the tool is nicely developed, the need for a tool focussing only on the investigation of PMLs is not justified. Several shiny apps and online tools exist to perform transcriptomic analysis of published datasets. To list a few examples - i) http://ge-lab.org/idep/ ; ii) http://www.uusmb.unam.mx/ideamex/ ; iii) RNfuzzyApp (Haering et al., 2021); iv) DEGenR (https://doi.org/10.5281/zenodo.4815134); v) TCC-GUI (Su et al., 2019). While some of these are specific to RNA-seq, there are plenty of such shiny apps to perform both RNA-seq and microarray data analysis. Any of these tools could also be used easily for the analysis of the four curated datasets presented in this article. The authors could have elaborated on the availability of other tools for such analysis and provided an explanation of the necessity of XTABLE. Since 3 of the 4 datasets they curated are from microarray technology, another good example of a user-friendly tool is NCBI GEO2R. This is integrated with the NCBI GEO database, and the user doesn't need to download the data or run any tools. iDEP-READS (http://bioinformatics.sdstate.edu/reads/) provide an online user-friendly tool to download and analyse data from publicly available datasets. Another such example is GEO2Enrichr (https://maayanlab.cloud/g2e/). These tools have been designed for non-bioinformatic researchers that don't involve downloading datasets or installing/running other tools.

      Two of these tools (IDEP and TCC-GUI) were reviewed in a literature review covering 20 Shiny apps performed two years ago prior to work on XTABLE starting. Three of the suggested tools (IDEP, RNFuzzyApp, TCC-GUI) are for processing only RNA-seq datasets. IDEAMEX appears to be for RNA-seq data only and is severely limited in its downstream analysis capabilities. DEGenR appears to handle microarray datasets and features an option to retrieve data directly from GEO. However, it appears to be based on GEO2R (with additional downstream analyses) where it automatically logtransforms already log-transformed data and unlike GEO2R, you do not have the option to not apply a log-transformation. A refreshed literature search focusing on microarray datasets highlighted three additional tools. iGEAK which hasn’t been updated in three years and seems to have compatibility issues running on new Windows and Mac machines. sMAP, an upcoming Shiny app for microarray data published in bioRxiv on 29 May 2022. MAAP which has the same issue of log-transforming already log-transformed data. iDEP-READS does not list the datasets used in XTABLE. GEO2Enrichr appears to require the counts table and experimental design in one file, performs a “characteristic direction” DEG test and outputs enriched pathways. These apps require not just downloading of datasets but reformatting and renaming of expression data files and creation of additional files for setting up the DEG analysis which is not practical for the number of samples we have (122, 63, 33, 448) even if these apps handled microarray data. XTABLE also incorporates AUC metrics, which is appropriate given the number of samples in each dataset and tool known for adequately controlling FDR, which is not seen in other apps as well as emphasis on individual gene results and interrogation.

      A new paragraph on the discussion section (lines 361-370) of the discussion addresses the potential use of existing applications instead of XTABLE

      Secondly, XTABLE doesn't provide a solution to integrate the four datasets incorporated in the tool. One can only analyse one dataset at a time with XTABLE. The differences in terms of methodology and study design within these four datasets have been elaborated on in the article. However, attempts to integrate them were lacking.

      We repeatedly considered different strategies of integrating the analysis of the four datasets and we always reached the conclusion that it was hardly going to offer any advantage, or that it might be counterproductive.

      Integration can occur at multiple levels. One possibility is to carry out the same analysis (e.g. expression of a given gene in two groups of samples) in all datasets. Since the design and methodologies of the four studies differ substantially (different stages, different definitions of progression status, etc), a unique stratification for all datasets is not possible. Moreover, interrogating the four datasets simultaneously would slow the analysis, with no significant advantage in terms of speed. Another possibility is the integration of results in the same output. For instance, obtain a single chart with the expression of a given gene in multiple subgroups of the four datasets. We think that the results from each cohort should be kept separately and then compared with a similar analysis from other datasets due to differences in design. Scientifically, this is the best way to proceed as it avoids confusions.

      Nevertheless, XTABLE allows the export of data for further analysis. The user can use this option to integrate data using other applications or statistical packages.

      We do understand the attractiveness of integration between the four datasets is and we seriously considered it. But there is a fine balance between user-friendliness, flexibility, and scientific rigour. We think that XTABLE achieves this balance. Increasing integration of datasets might lead to error and wrong conclusions due to biological and methodological differences between studies. We believe that comparing analyses obtained independently from the four cohorts is the most sensible way to proceed.

      We propose to discuss these aspects accordingly.

      The integrative analysis of two or more datasets has been discussed in a new paragraph (382-391)

      The tool also lacks the flexibility for users to add more datasets. This would be helpful when there are more datasets of PMLs available publicly.

      This was also a permanent topic for discussion while designing XTABLE. Creating a tool that could be used to analyse other cohorts of precancerous lesions, while maintaining the ease of use was certainly a challenge. We had to adapt XTABLE to the characteristics of each one of the four databases: specific stratification criteria, different nomenclatures for the different sample types, etc. Designing a shiny app that can be adapted to other present or future datasets without the need of changing the code is simply not practical.

      The flexibility that these other Shiny apps incorporate to analyse any RNA-seq dataset requires the contrasts used for the differentially expressed gene analysis be manually defined. IDEP requires an experimental design file where sample names in the counts file must match exactly the sample names in this experimental design file and pre-processing visualisation is limited to the first 100 samples. RNFuzzyApp is similar but we could not format the experimental design file in a way that did not result in the app crashing upon upload. TCC-GUI requires all the sample names to be renamed to the contrast group with the addition of the replicate number. Apps that allow datasets to be uploaded do not have a practical or easy way to set up the DEG analysis of more than a couple dozen samples.

      Future versions of XTABLE can be updated to include additional curated PML datasets that would enhance hypothesis generation upon request. Importantly, the code is freely available and can be modified by other scientists to add their cohorts of interest, although we agree that a high level of expertise in coding will be needed. We propose to add these considerations to the text.

      The possibilities of expansion of XTABLE to new databases are discussed in lines 392-398

      Understanding the biology of PML progression would require a multi-omics approach. XTABLE analyses transcriptome data and lacks integration of other omics data. The authors mention the availability of data from whole exome, methylation, etc from the four studies they have selected. However, apart from the CIN scores, they haven't integrated any of the other layers of omics data available.

      Only one dataset (GSE108104) contains whole-exome sequencing and methylation data. We considered that a multi-omics approach in XTABLE would result in an overcomplicated application. As far as early detection and biomarker discovery is concerned, transcriptomic data is the most interesting parameter.

      Also discussed in lines 382-391

      Lastly, the authors could have elaborated on the limitations of the tool and their analysis in the discussion.

      We propose to raise these limitations accordingly in the discussion.

      See above.

      Reviewer #2 (Public Review):

      In this manuscript, Roberts et al. present XTABLE, a tool to integrate, visualise and extract new insights from published datasets in the field of preinvasive lung cancer lesions. This approach is critical and to be highly commended; whilst the Cancer Genome Atlas provided many insights into cancer biology it was the development of accessible visualisation tools such as cbioportal that democratised this knowledge and allowed researchers around the world to interrogate their genes and pathways of interest. XTABLE is trying to do this in the preinvasive space and should certainly be commended as such. We are also very impressed by the transparency of the approach; it is quite simple to download and run XTABLE from their Gitlab account, in which all data acquisition and analysis code can be easily interrogated.

      We would however strongly advocate deploying XTABLE to a web-accessible server so that researchers without experience in R and git can utilise it. We found it a little buggy running locally and cannot be sure whether this is due to my setup or the code itself. Some issues clearly need development; Progeny analysis brings up a warning "Not working for GSE109743 on the server and not sure why". GSEA analysis does not seem to work at all, raising an error "Length information for genome hg38 and gene ID ensGene is not available". In such relatively complex software, some such errors can be overlooked, as long as the authors have a clear process for responding to them, for example using Gitlab issue reporting. Some acknowledgement that this is an ongoing development would be helpful.

      We thank the reviewer for these comments. We will inspect the code to address those warnings, implement a system for issue reporting, and add the acknowledgements suggested by the reviewer. Regarding the deployment of XTABLE to a web-accessible server, this could present a challenge in the long term as computing resources need to be allocated for years and the economic cost involved.

      The code has been inspected to remove the warning and errors pointed out by the reviewer.

      The authors discuss some very important differences between the datasets in the text. Most notably they differ in endpoints and in the presence of laser capture. We would advocate including some warning text within the XTABLE application to explain these. For example, the "persistent/progressive" endpoint used in Beane et al (next biopsy is the same or higher grade) is not the same as the "progressive" endpoint in Teixeira et al (next biopsy is cancer); samples defined as "persistent/progressive" may never progress to cancer. This may not be immediately obvious to a user of XTABLE who wishes to compare progressive and regressive lesions. Similarly, the use of laser capture is important; the authors state that not using laser capture has the advantage of capturing microenvironment signals, but differentiating between intra-lesional and stromal signals is important, as shown in the Mascaux and Pennycuick papers. The authors cannot do much about the different study designs, but as the goal is to make these data more accessible We think some brief description of these issues within the app would help to prevent non-expert users from drawing incorrect conclusions.

      The authors themselves illustrate this clearly in their analysis of CIN signatures in progression potential. They observe that there is a much clearer progressive/regressive signal in GSE108124 compared to GSE114489 and GSE109743. This does not seem at all surprising, since the first study used a much stricter definition of progression - these samples are all about to become cancer whereas "progressive" samples in GSE109743 may never become cancer - and are much enriched for CIN signals due to laser capture. Their discussion states "CIN scores as a predictor of progression might be limited to microdissected samples and CIS lesions"; you cannot really claim this when "progression" in the two cohorts has such a different meaning. To their credit, the authors do explain these issues but they really should be clearly spelled out within the app.

      This is a very good point. We will add the warning text about the differences between studies regarding the definition of progression potential and the differences and sample processing (LCM or o not) so that the user is permanently aware of the differences between cohorts.

      A new tab (Dataset) has been added table with the methodologies used in each of each study, and the differences in progression status definitions. Additionally, we emphasized these differences in the main text of the manuscript (lines 296-300 and 403-409).

      We are not sure we agree with their analysis of CDK4/Cyclin-D1 and E2F expression in early lesions. The authors claim these are inhibited by CDKN2A and therefore are markers of CDKN2A loss of function. But these genes are markers of proliferation and can be driven by a range of proliferative processes. Histologically, low-grade metaplasias and dysplasias all represent proliferative epithelium when compared to normal control, but most never become cancer. It is too much of a leap to say that these are influenced by CDKN2A because that gene is inactivated in LUSC; do the authors have any evidence that this gene is altered at the genomic level in low-grade lesions?

      We are grateful for this comment. There is currently not evidence that CDKN2A mutations occur in low-grade lesions and therefore, we cannot argue that the of CDK4/Cyclin-D1 and E2F expression signature are the result of CDKN2A inactivation in low-grade lesions. We propose to modify the text to introduce these caveats to our conclusion an make our interpretations more accurate.

      We have modified the discussion (lines 443-454) to address the interpretation of our results regarding the connection between CDKN2A inactivation and the CDK4/cyclin-D1 and E2F signatures. We now focus our conclusions on the pathway itself and we mention Cyclin-D1 and CDKN2A alterations as a potential modulator of the changes in the pathway, but leaving the discussion open to other drivers.

      Overall this tool is an important step forwards in the field. Whilst we are a little unconvinced by some of their biological interpretations, and the tool itself has a few bugs, this effort to make complex data more accessible will be greatly enabling for researchers and so should be commended. In the future, we would like to see additional molecular data integrated into this app, for example, the whole genome and methylation data mentioned in line 153. However, we think this is an excellent start to combining these datasets.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their enthusiasm for our work and constructive feedback.

      Below please find our point-by-point response to the comments:

      Reviewer #1 -Key conclusions that were less convincing: -RhoA and NMII are in the title as mechanistic downstream regulators of CaM, but the results in Fig 8 call into question the role of RhoA. Why does RhoA activation not influence cell size and circularity? Can you overexpress MRLC-GFP and inhibit Rho and restore the wt phenotype? The role of NMII is also not clear - why does overexpressing MLCK-CA not have a phenotype but overexpressing MLCK downstream target MRLC show the phenotype? Are there any alternative pathways to regulate MRLC? It's not being discussed or described in 6D's schematic.

      Response: Rho activation usually leads to the formation of more stress fibers and therefore does not lead to decreased cell size and increased circularity observed in MFN2 KO. The phenotype is restored by either ROCK or MLCK knockdown. We have discussed in the main text that the formation of PAB requires both RhoA and NMII activation under restricted spatiotemporal control.

      MRLC has three major regulators (Ikebe & Hartshorne, 1985; Isotani et al., 2004). As we discussed, MLCK and ROCK phosphorylate MRLC at either Ser19 or Ser19 and Thr18. MRLC is dephosphorylated and inactivated by its phosphatase MLCP. We tried to knock down MLCP in wt MEF cells but failed to see any cell morphology changes (data not shown).

      We were also surprised to see MRLC-GFP overexpression with Rho Activator can phenocopy PAB, but “MLCK-CA + Rho Activator” failed to. We believe it is because MLCK-CA constitutively over-activates a broad range of downstream effectors while overexpressing MRLC mimics endogenous activation or NMII alone. Also, only a proportion of cells acquired PAB structure under Rho Activator and MRLC overexpression, which indicates PAB formation also requires specific spatiotemporal controls.

      *Rewrite for clarity -The role of ER/mito contacts in the system was unclear (since ER/mito contacts were not observed nor evaluated directly). *

      __Response: __We have included additional data to measure ER/mito contacts in MEFs. Our result is consistent with numerous previous reports that MFN2 regulates ER/mito contacts. The data is now included in Fig. S3.

      * -What role does focal adhesions have on PAB formation or any part of the model - There were results showing larger focal adhesions in the MFN2-/- cells, but not sure how this fits in with the bigger picture of contractility and PABs, and focal adhesions were not in the model in Figure 5.*

      __Response: __Focal adhesion and actomyosin are tightly coupled, and our work focuses on the actin network. Our model did not include FAs since FA is not a significant focus in this study.

      * -Whether regulating calcium impacts PAB formation*

      __Response: __Calcium likely regulates PAB formation. We have shown PAB cell percentage decreases in mfn2-/- with ER-mito tethering contrast in Fig. S3.

      -The role of PABs in migration is also unclear - can you affect PAB formation or get rid of PABs and quantify cell migration?

      __Response: __Our data suggest that PAB formation and cell migration are inversely correlated. Since PAB results from a contractile actin band on the cell periphery, its role in defective cell spreading and migration is expected. We demonstrated that MLCK and ROCK knockdown reduced PABs and rescued cell spreading.

      -It was hard to understand the correlation between the membrane tension of MFN2-/- cells and their ability to spread on softer substrates. How does this result fit in with the overarching model?

      __Response: __Reduced membrane tension is presumably associated with decreased cell spreading. Softer substrates attenuate the mechanical force on focal adhesion proteins and the actin cytoskeleton (Burridge & Chrzanowska-Wodnicka, 2003; Pelham & Wang, 1997; Wong et al., 2015), which is required for focal adhesion maturation. As a result, softer substrates can reduce the over-contraction in the MFN2 KO cells. The results support the model that MFN2 KO cells have enhanced cell contraction on the substrates dependent on substrate interaction and force transduction on focal adhesions. Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      __Response: __We have removed the MFN2-related disease from the introduction to focus the paper more on cell biology in vitro.

      *-There were a number of findings that did not seem to fit in with the paper, or they were included, but were not robustly described nor quantified. As an example, Paxillin-positive focal adhesions were evaluated in MFN2-/- and with various pharmacological approaches, but there is no quantification with respect to size, number, or distribution of focal adhesions, despite language in the main text that there are differences between conditions. *

      __Response: __We have quantified the focal adhesions in the KO cells, and the data is now included in Fig. 5. We used the actin distribution to quantify the “PAB”; therefore, FAs are not a significant focus of this study.

      *Also, the model was presented in figure 5, and then there were several pieces of data presented afterward, some that are included in the model (myosin regulation), and some that are not in the model (membrane tension, TFM, substrate stiffness, etc). * __Response: __Membrane tension and substrate stiffness dependence are physical properties of the cell. The model focuses on the molecular mechanism that leads to PAB formation.

      The stiffness/tension figure was not clear to me, and it was difficult to make sense of the data since one would predict that an increase in actomyosin contractility at the cortex would lead to higher membrane tension, not lower, and then how membrane tension relates to spreading on soft matrices is also unclear.

      __Response: __The result was surprising to us initially. However, the MFN2 KO cells have increased actomyosin contractility only at the cell-substrate interface but not throughout the entire cell cortex. A less spread cell would have a more relaxed membrane and display a lower membrane tension, consistent with our observation. Softer matrices reduce cell contractility at the cell-substrate interface, which allows MFN2 KO cells to relax and spread better. We have emphasized in the discussion of our manuscript that MFN2 KO cells have an increase in actomyosin contractility only at the cell-substrate interface.

      The manuscript seems like an amalgamation of different pieces of data that do not necessarily fit together into a cohesive story, so the authors are encouraged to either remove these data, or shore them up and weave them into the narrative.

      __Response: __We respectfully disagree with the reviewer since the cell morphology, actin structure, substrate interaction, and cell mechanics are tightly correlative and provide a complete picture of the role of MFN2 in regulating cell behavior.

      * Request additional experiments 1. -The imaging used for a lot of the quantification (migration, circularity) is difficult to resolve. The cells often look like they are not imaged in the correct imaging plane. It would be helpful to have better representative images such that it is clear how the cells were tracked and how cell periphery regions of interest were manually drawn. Focal adhesions should also be shown without thresholding.*

      __Response: __We used a TIRF microscope and imaged a Z stack. Imaris was used to combine the Z-stack images. The images in the manuscripts are now of the lowest stack with background subtraction.

      2. -For most of the quantification, it appears that the experiment was only performed once and that a handful of cells were quantified. The figure legend indicates the number of cells (often reported as >12 cells or >25 cells), but the methods indicate that high content imaging was performed, and so the interpretation is that these experiments were only performed once. Biological replicates are required. If the data do represent at least 3 biological replicates, then more cell quantification is required (12 or 25 cells in total would mean quantifying a small number of cells per replicate).

      Response: __We quantified more cells and indicated the number of cells quantified in the figure legends. The experiments are with three biological replicates.__

      * 3. -Mitochondrial morphology and quantification should be performed in the MFN2 knockdown and rescue lines.*

      __Response: __Mitochondrial morphology is well characterized in the Mfn2 KO and rescue MEF cells (Chen et al., 2003; Naon et al., 2016; Samanas et al., 2020). We observed a similar phenotype using mito-RFP to label mitochondrial structure (Fig. S1).

      4.-Many of the comparisons throughout the figures is between MFN2 knockdown and MFN2 knockdown plus rescue or genetic/pharmacological approaches, but a comparison that is rarely made is between wildtype and experimental. These comparisons could be useful in comparing partial rescues and potential redundancies with the other mitofusin.

      Response: We have included the WT in our assays (Fig. 2-6). We also confirmed that MFN1 could not rescue the MFN2 defects (Fig. 2). We observed partial and complete rescue in different assays. It would be difficult to conclude whether the phenotype is due to the redundancies with the other mitofusin because not all cells are rescued at the endogenous level.

      * 5. -For the mito/ER tethering experiments, it is important to show that ER/mito contacts are formed and not formed in the various conditions with imaging approaches.*

      __Response: __We adopted a previously established method to quantify ER-mitochondria contacts with the probe SPLICSL (Cieri et al., 2017; Vallese et al., 2020). Our results align with previous reports that Mfn2-null MEFs displayed significantly decreased ER-mitochondria contacts. MFN2 re-expression or ER-mitochondria tethering structure restored the contacts. (Fig. S3).

      * 6. -For some of the pharmacological perturbations, it would be helpful to show that the perturbation actually led to the expected phenotype - as an example, in cells treated with different concentrations of A23187, what are the intracellular calcium levels and how do these treatments influence PAB formation? This aspect should be generally applied across the study - when a modification is made, that particular phenotype should first be evaluated, before dissecting how the perturbation affects downstream phenotypes.*

      __Response: __We selected a collection of well-characterized inhibitors broadly used in the literature for pharmacological perturbations. For example, numerous studies used A23187 treatment to raise intracellular calcium to examine related actin cytoskeleton changes (Carson et al., 1994; Goldfine et al., 1981; Shao et al., 2015). We titrated the drugs in WT in preliminary experiments and observed similar phenotypes. (data not shown). We then use the same concentration to treat the MFN2 KO cells. Overall, we use pharmacological perturbations as supporting evidence. We use genetics (knockdown or overexpression) to validate our results.

      7. -In Figures 4 and 5, the thresholding approaches in the images make the focal adhesions difficult to resolve, and therefore it is difficult to determine the size. As described above, these metrics should be defined and quantified.

      __Response: __We used a TIRF microscope and imaged a Z stack. Imaris was used to combine the Z-stack images. The images in the manuscripts are now of the lowest stack with background subtraction.

      8. -What is a PAB? How is it defined? What metrics make a structure a PAB versus regular cortical actin - are there quantifiable metrics? In figure 8, there are some structures that are labelled as a PAB, but some aren't (as an example, the left panel in 8b is a PAB, but the right panel in 8A is not, but they look the same), so a PAB should be defined with quantifiable measures, and then applied to the entire study.

      __Response: __We developed an algorism to quantify PAB cells. We first used the ImageJ plugin FiloQuant (Jacquemet et al., 2019) to identify the cell border and cytoskeleton, then used our custom algorism to quantify the percentage of actin in the cell border area. The cellular circularity is also calculated at the same time. If the cell contains more than 50% actin in the cell border area, and the circularity is higher than 0.6, we then count it as a “PAB” cell (Fig. S2).

      -As described above, why does RhoA activation not influence cell size and circularity? Can you overexpress MRLC-GFP and inhibit Rho and restore the wildtype phenotype?

      __Response: __Rho activation usually leads to the formation of more stress fibers and therefore does not lead to decreased cell size and increased circularity observed in MFN2 KO.

      We are sorry that we don’t understand the rationale of this experiment proposed by the reviewer. ROCK inhibition restored the wildtype phenotype in MFN2 KO cells (Fig.7). Figure 8 is to create the MFN2 KO phenotype in WT cells, which requires both Rho and MRLC overactivation.

      * 10 Are the data and the methods presented in such a way that they can be reproduced? -We appreciate that the authors quantified many parameters, although some quantifications were missing. There are some missing methods - how was directionality quantified, was migration quantified by selecting the approximate center of cells using MTrackJ or were centroids quantified by outlining cells, for instance. Also, given that some of the phenotypes were somewhat arbitrarily assigned (ie. what constitutes a PAB?), it may be difficult to reproduce these approaches and interpret data appropriately.*

      __Response: __We have clarified directionality quantification methods and other details. We used MTrackJ to track cell migration. And as we mentioned above, we came up with a customized algorithm to quantify PAB cells, which shows the critical effectors in a more quantifiable way.

      * 11. Are the experiments adequately replicated and statistical analysis adequate? -Unfortunately, while the approximate number of cells was reported, the number of biological replicates were not reported, and therefore, the experimental information and statistical analyses are not adequate.*

      __Response: __We have quantified more cells and indicated the number of repeats and cells quantified in the figure legend. Minor comments: Specific experimental issues that are easily addressable.

      * 12. - for some of the graphs - mostly about calcium levels - fold change is shown, but raw values should also be included to determine whether the basal levels of calcium are different across the conditions.*

      __Response: __Delta F/F0 is the standard method to normalize dye loading in cells for calcium concentration measurements (Kijlstra et al., 2015; Zhou et al., 2021).

      13. - scale bars in every panel should also help make the points clearer.

      __Response: __We have added scale bars in all panels.

      * Reviewer #2 (Evidence, reproducibility and clarity (Required)): 1. Fig. 1A: The Mfn1 Western Blot is not of publication quality. Moreover, quantitation is necessary.*

      __Response: __We performed additional western blots, changed the representative images, and quantified the level of knockdown and overexpression (Fig.2 and 7). We did not quantify the WB in Fig.1A since it was to confirm that the Mfn1-/- or Mfn2-/- were knock-out cell lines.

      2. Fig. 1B (as well as Fig. 2G and others): the date do not reflect cellular size but instead spread cellular area.

      __Response: __We thank the reviewer for this suggestion. We have changed all similar descriptions to “Spread Area” in the main text and figures.

      3. Fig. 1C, D: Mfn1-null MEFs appear to be more spindle-shaped than wt cells, yet their circularity tends to be elevated. Do the authors have an explanation?

      __Response: __The circularity of Mfn1-/- MEFs has a slight increase but is not significant compared to the wt cells. As we observed, Mfn1-/- MEFs have fewer protrusions than wt, which may contribute to the slight increase in its circularity (Fig. 5C). However, this is not the focus of this study.

      * 4. Fig. 2A: The Mfn1 levels in Mfn2-/- + Mfn2 are lower than Mfn2-/-? Does this imply a crosstalk between Mfn1 and Mfn2 expression.*

      __Response: __We agree with the reviewer that a compensatory change in MFN1 expression might happen in Mfn2-/- + MFN2 MEFs. Previous research also indicated crosstalk between MFN1 and MFN2 expression (Sidarala et al., 2022).

      * 5. Fig. 2H: The authors should provide co-staining of mitochondria and Mfn2 as well.*

      __Response: __Co-staining of mitochondria and MFN2 in Mfn2-/- MEFs or rescue lines has been done in numerous previous studies (Chen et al., 2003; Naon et al., 2016; Samanas et al., 2020). In this work, we transfected our cells with mito-RFP and showed mitochondria changes in Mfn2-/- and rescue MEF cells (Fig. S1G).

      6. Fig. 4D-E: Western blots are not of publication quality. Looking at the blots provided in Fig. 4D, the reviewer is not convinced with the quantitative data shown in Fig. 4E. For instance, the intensity of pCaMKII band for "vec" does not look 3x higher than that of "+MFN2", whereas that of "+MFN2" looks much higher than that of WT.

      __Response: __We have performed additional western blots and changed the representative images.

      * 7. Fig. 5C: The authors should stain for vinculin, which are present in mature FAs only, rather than paxillin which are present in all FAs. This would strengthen the authors' conclusions. Also, FA size should be quantified.*

      __Response: __We have quantified FA size in Figure 5. The maturity of FAs is not a major focus of this study. It is likely that most FAs here are mature since they are connected with stress fibers.

      * 8. Fig. 6C - Why does the background have a grid and appear grey in color? Also, the cell interior appears in different colors in the different images. The authors should take a z-stack of images and provide the raw image files.*

      __Response: __We used a TIRF microscope and imaged a Z stack. Imaris was used to combine the Z-stack images. The images in the manuscripts are now of the lowest stack with background subtraction.

      9. Fig 7C: The MLCII Western blot is not of publication quality, and may affect the quantification provided in Fig. 7D.

      __Response: __We have performed additional western blots and changed the representative images.

      * 10. Fig 8: Do cell treatments with Rho Activator and MLCK-CA also impair migration velocity similar to Mfn2-null cells?*

      __Response: __Our data indicated that Rho Activator and MRLC induced the “PAB” structure seen in MFN2 KO cells. It is likely that cell migration is impaired here. Spatiotemporal regulation of Rho Activation is important to cell migration, it is known that Rho overactivation can significantly inhibit cell migration (Nobes & Hall, 1999). Showing Rho Activator and MLCK-CA will reduce cell migration will not add new knowledge to the cell migration field. However not all cell migration defects are associated with the PAB. We, therefore, focused on PAB quantification in this figure.

      11. Fig. 9A: The authors claim that wt cells have actin bundles that protrude against the membrane while Mfn2-null cells do not. This does not look convincing as the Mfn2-null actin bundle seem to be pushing against the membrane at the bottom of the image. No quantification is provided. It is unclear what conclusion can be drawn from the super-resolution images.

      __Response: __We used super-resolution imaging to demonstrate the details of the peripheral actin band (PAB) structure. We have used two boxes to enlarge the regions where membrane parallel actin structures are predominant. The quantification of PAB is provided in other figures.

      12. Suppl. Fig. 5C: The authors should take images using a confocal microscope for cells with Flipper-TR construct, eliminate the background and the cell center to only consider the cell periphery. Nikon TE2000 does not seem to be a confocal microscope.

      Response: __The amount of Flipper-TR that MEF cells can take in was limited. With the current signal-to-noise ratio, complete background elimination is not feasible. A confocal microscope is not necessary for Flipper-TR FLIM imaging (García-Calvo et al., 2022). __* Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      General Comments (Major) 1. Data Presentation and analysis: The data analysis would benefit from using a method such as Super-Plots to show the data from separate biological repeats and to use N numbers that represent the number of biological repeats rather than the number of cells analysed. Please see the following reference for a suggestion on how to analyse the data: Lord SJ, Velle KB, Mullins RD, Fritz-Laylin LK. SuperPlots: Communicating reproducibility and variability in cell biology. J Cell Biol. 2020 Jun 1;219(6):e202001064. doi: 10.1083/jcb.202001064. PMID: 32346721; PMCID: PMC7265319.*

      __Response: __We have changed the dot colors to show data from separate biological repeats.

      • Another general comment is that many of the experiments show analysis of very few numbers of cells or maybe only one field of view in a microscopy quantification experiment. This seems unusually low - for example, in Figure 1E only 6 cells have been analysed. It seems like more could have been done and if statistical analysis like we suggest in 1 above is used, this might reveal that some of the differences are less significant than the authors think/report. This is important, as cells are noisy and it is unusual to have such high significance for experiments like cell migration and other parameters unless a lot of measurements are made. In Figure 1G, it appears that only one field of view has been used to quantify the data. We routinely use 3-5 fields of view to get a representative sample of what the cells are doing.*

      __Response: __We have quantified more cells and indicated the number of repeats and cells quantified in the figure legend.

      • Some of the micrographs appear to be missing scale bars- e.g. Fig. 2H, Fig. 8*

      __Response: __We have included scale bars in the lower right corner of all panels.

      * 4. In the cell tracking experiments, only some of the cells in the images appear to have been tracked. How were the tracked cells chosen? Normally, we would track every cell to avoid bias in selection.*

      __Response: __We tracked all the cells in the view at the beginning of the experiments.

      5. The western blot images do not show the molecular size of the bands. Show ladder position

      __Response: __We have added bands to show molecular weights.

      6. Mostly the graphs show individual data points, which is good, but in some cases only a bar is shown- it would be nice to have individual points overlaid on the bars- e.g. Figure 1I, 4E, 5B, 5E, 7B, 7D

      __Response: __We have updated the graph to show individual points.

      7. Many of the confocal images look very processed- they have no background and have a hazy black halo around the cell. I am not familiar with the type of processing that was done and I worry that the images are only showing a masked and processed version of the actual data. The authors need to explain what processing they have done and probably also to provide the unprocessed images in a supplementary figure or dataset for readers to see. The methods description is inadequate as it only says Image J was used to process the data.

      __Response: __We used a TIRF microscope and imaged a Z stack. Imaris was used to combine the Z-stack images. The images in the manuscripts are now of the lowest stack with background subtraction.

      * Individual comments on Figures: Figure 1: See general comments above- consider to use Superplots, more cells and more fields of view in quantifications. Show experimental points in bar graph.*

      __Response: __We have quantified more cells and used super plots to display the data. The number of repeats and cells quantified are indicated in the figure legend.

      Figure 2: In 2E, the colours are very similar for two of the experiments so it is difficult to distinguish them- e.g. the MFN1 vs MFN2 rescue data both appear dark blue. Response: We have changed the color for MFN1 rescue to distinguish the two samples better.

      In 2H are the magnifications really the same for the WT as the +DOX and -DOX? The cell in the WT looks huge. Is this representative? Also, the phalloidin stain looks very spotty on the WT. This seems unusual.

      __Response: __The images are of the same scale. The Mfn2-/- MEFs are smaller, and DOX-induced MFN2 expression can only partially rescue the cell size.

      * Figure 3: Not many cells were analysed in 3B, especially the zero time point.*

      __Response: __We have quantified more cells.

      * Please define +T, we assume it is the tether construct, but it is not defined*

      __Response: __We defined T as a tether in the main text and the figure legend. In 3F, how were the tracked cells chosen?

      __Response: __We tracked all the cells in the view at the beginning of the experiments.

      * Figure 4:* 4B: Why have they not tested FK506 and STO609 on the WT cells?

      __Response: __We focused on understanding the MFN2 KO phenotype. Since neither FK506 nor STO609 altered the MFN2 KO phenotype, we did not include them in the WT group.

      4C: How were the tracked cells chosen?

      __Response: __We tracked all the cells in the view at the start of the experiments.

      4D-E: The blot doesn't look representative of the quantification- were the numbers normalised to vinculin? The difference between WT and vector looks too large to be real if the amounts were normalised to the vinculin, as vinculin is increased in vector. Likewise, the pCAMKII looks to be substantially decreased from the +MFN2, but this is not what the quantification shows.

      __Response: __We have performed additional western blots and changed the representative images.

      Figure 5: 5B- please clarify which ratio is shown here. I assume it is the ratio of RhoA-GTP vs RhoA between the zero and 4 minute time points.

      __Response: __Yes, we have clarified this point in the figure legend.

      5C- These images appear to have a mask around the cell. It is hard to tell where the edge of the cell really is- what sort of processing was used? Especially for the paxillin staining, why is there no cytoplasm shown? Is this because the image is in TIRF?

      __Response: __We used a TIRF microscope and imaged a Z stack. Imaris was used to combine the Z-stack images. The images in the manuscripts are now of the lowest stack with background subtraction.

      Figure 6: Figure 6C- the blebbistatin treated cell looks very large- is this representative?

      __Response: __Yes, Blebbistatin-treated cells are larger (Fig. 6A).

      Figure 7: Fig 7C- The lanes for MLCII are all run together- is this from a different gel? Is this quantification accurate?

      __Response: __We have performed additional western blots and changed the representative images.

      Fig. 7F- What is the % level of knockdown achieved?

      __Response: __The level of knockdown is labeled on the figure.

      Figure 8: Fig 8A,B- does the scale bar represent all of the images in these two panels?

      __Response: __Yes, the figure legend is updated to clarify this point.

      Fig 8C,D- Superplots would be helpful here.

      __Response: __We have used super plots to display the data.

      Supplementary Data: The OCR data do not add much and are not discussed much in the manuscript. Perhaps they could be omitted.

      __Response: __Our OCR data ruled out the possibility of metabolic regulation. Since MFN2 is a mitochondria protein with its typical functions in metabolic pathways, we cannot omit its influence on metabolism here. As we observed, shMLCK enhanced OCR, shROCK reduced OCR, and both knock-down rescued cell morphology and motility. We believe that PAB formation is independent of MFN2’s function in metabolic regulation.

      Figure S3- The figure label doesn't match the manuscript test- was fibrinogen or collagen used?

      __Response: __We tried cover glass alone, collagen, and fibronectin-coated glass. The PAB formation is independent of these extracellular substrates. We did not try fibrinogen because MEF cell is reported to prefer fibronectin (Lehtimäki et al., 2021).

      Reference

      Burridge, K., & Chrzanowska-Wodnicka, M. (2003). FOCAL ADHESIONS, CONTRACTILITY, AND SIGNALING. Http://Dx.Doi.Org/10.1146/Annurev.Cellbio.12.1.463, 12, 463–519. https://doi.org/10.1146/ANNUREV.CELLBIO.12.1.463

      Carson, S. D., Perry, G. A., & Pirruccello, S. J. (1994). Fibroblast Tissue Factor: Calcium and Ionophore Induce Shape Changes, Release of Membrane Vesicles, and Redistribution of Tissue Factor Antigen in Addition to Increased Procoagulant Activity. Blood, 84(2), 526–534. https://doi.org/10.1182/BLOOD.V84.2.526.526

      Chen, H., Detmer, S. A., Ewald, A. J., Griffin, E. E., Fraser, S. E., & Chan, D. C. (2003). Mitofusins Mfn1 and Mfn2 coordinately regulate mitochondrial fusion and are essential for embryonic development. The Journal of Cell Biology, 160(2), 189. https://doi.org/10.1083/JCB.200211046

      Cieri, D., Vicario, M., Giacomello, M., Vallese, F., Filadi, R., Wagner, T., Pozzan, T., Pizzo, P., Scorrano, L., Brini, M., & Calì, T. (2017). SPLICS: a split green fluorescent protein-based contact site sensor for narrow and wide heterotypic organelle juxtaposition. Cell Death & Differentiation 2018 25:6, 25(6), 1131–1145. https://doi.org/10.1038/s41418-017-0033-z

      García-Calvo, J., López-Andarias, J., Maillard, J., Mercier, V., Roffay, C., Roux, A., Fürstenberg, A., Sakai, N., & Matile, S. (2022). HydroFlipper membrane tension probes: imaging membrane hydration and mechanical compression simultaneously in living cells. Chemical Science, 13(7), 2086–2093. https://doi.org/10.1039/D1SC05208J

      Goldfine, S. M., Schroter, E. H., & Izzard, C. S. (1981). Calcium-dependent shortening of fibroblasts induced by the ionophore, A23187. Journal of Cell Science, 50(1), 391–405. https://doi.org/10.1242/JCS.50.1.391

      Ikebe, M., & Hartshorne, D. J. (1985). Phosphorylation of Smooth Muscle Myosin at Two Distinct Sites by Myosin Light Chain Kinase*. Journal of Biological Chemistry, 260, 10027–10031. https://doi.org/10.1016/S0021-9258(17)39206-2

      Isotani, E., Zhi, G., Lau, K. S., Huang, J., Mizuno, Y., Persechini, A., Geguchadze, R., Kamm, K. E., & Stull, J. T. (2004). Real-time evaluation of myosin light chain kinase activation in smooth muscle tissues from a transgenic calmodulin-biosensor mouse. Proceedings of the National Academy of Sciences of the United States of America, 101(16), 6279–6284. https://doi.org/10.1073/PNAS.0308742101

      Kijlstra, J. D., Hu, D., Mittal, N., Kausel, E., van der Meer, P., Garakani, A., & Domian, I. J. (2015). Integrated Analysis of Contractile Kinetics, Force Generation, and Electrical Activity in Single Human Stem Cell-Derived Cardiomyocytes. Stem Cell Reports, 5(6), 1226. https://doi.org/10.1016/J.STEMCR.2015.10.017

      Lehtimäki, J. I., Rajakylä, E. K., Tojkander, S., & Lappalainen, P. (2021). Generation of stress fibers through myosin-driven reorganization of the actin cortex. ELife, 10, 1–43. https://doi.org/10.7554/ELIFE.60710

      Naon, D., Zaninello, M., Giacomello, M., Varanita, T., Grespi, F., Lakshminaranayan, S., Serafini, A., Semenzato, M., Herkenne, S., Hernández-Alvarez, M. I., Zorzano, A., De Stefani, D., Dorn, G. W., & Scorrano, L. (2016). Critical reappraisal confirms that Mitofusin 2 is an endoplasmic reticulum-mitochondria tether. Proceedings of the National Academy of Sciences of the United States of America, 113(40), 11249–11254. https://doi.org/10.1073/PNAS.1606786113/SUPPL_FILE/PNAS.201606786SI.PDF

      Nobes, C. D., & Hall, A. (1999). Rho GTPases Control Polarity, Protrusion, and Adhesion during Cell Movement. The Journal of Cell Biology, 144(6), 1235. https://doi.org/10.1083/JCB.144.6.1235

      Pelham, R. J., & Wang, Y. L. (1997). Cell locomotion and focal adhesions are regulated by substrate flexibility. Proceedings of the National Academy of Sciences of the United States of America, 94(25), 13661. https://doi.org/10.1073/PNAS.94.25.13661

      Samanas, N. B., Engelhart, E. A., & Hoppins, S. (2020). Defective nucleotide-dependent assembly and membrane fusion in Mfn2 CMT2A variants improved by Bax. Life Science Alliance, 3(5). https://doi.org/10.26508/LSA.201900527

      Shao, X., Li, Q., Mogilner, A., Bershadsky, A. D., & Shivashankar, G. V. (2015). Mechanical stimulation induces formin-dependent assembly of a perinuclear actin rim. Proceedings of the National Academy of Sciences of the United States of America, 112(20), E2595–E2601. https://doi.org/10.1073/PNAS.1504837112/SUPPL_FILE/PNAS.1504837112.SM03.AVI

      Sidarala, V., Zhu, J., Levi-D’Ancona, E., Pearson, G. L., Reck, E. C., Walker, E. M., Kaufman, B. A., & Soleimanpour, S. A. (2022). Mitofusin 1 and 2 regulation of mitochondrial DNA content is a critical determinant of glucose homeostasis. Nature Communications 2022 13:1, 13(1), 1–16. https://doi.org/10.1038/s41467-022-29945-7

      Vallese, F., Catoni, C., Cieri, D., Barazzuol, L., Ramirez, O., Calore, V., Bonora, M., Giamogante, F., Pinton, P., Brini, M., & Calì, T. (2020). An expanded palette of improved SPLICS reporters detects multiple organelle contacts in vitro and in vivo. Nature Communications, 11(1). https://doi.org/10.1038/S41467-020-19892-6

      Wong, S. Y., Ulrich, T. A., Deleyrolle, L. P., MacKay, J. L., Lin, J. M. G., Martuscello, R. T., Jundi, M. A., Reynolds, B. A., & Kumar, S. (2015). Constitutive activation of myosin-dependent contractility sensitizes glioma tumor-initiating cells to mechanical inputs and reduces tissue invasion. Cancer Research, 75(6), 1113–1122. https://doi.org/10.1158/0008-5472.CAN-13-3426

      Zhou, W., Hsu, A. Y., Wang, Y., Syahirah, R., Wang, T., Jeffries, J., Wang, X., Mohammad, H., Seleem, M. N., Umulis, D., & Deng, Q. (2021). Mitofusin 2 regulates neutrophil adhesive migration and the actin cytoskeleton. Journal of Cell Science, 133(17). https://doi.org/10.1242/JCS.248880/VIDEO-11

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This study explores the importance of MFN2, a known endoplasmic reticulum-mitochondria linker protein, in cell motility and actin-myosin organization. It is known that the mitofusin proteins MFN1 and MFN2 can tether the mitochondria to the ER and are connected with calcium regulation in muscle and non-muscle cell types. Calcium flux mediated by mitofusins has previously been implicated in apoptosis and ER stress. In this study, the authors show that loss or depletion of MFN2 (but not MFN1) can lead to aberrant calcium increase in the cytoplasm and trigger actin-myosin reorganisation. They show that the actin and myosin changes are linked with activation of RhoA and also that they can be suppressed by inhibiting myosin light chain phosphorylation/ activation. They also show that cells with reduced MFN2 are softer (using atomic force microscopy), which agrees with activation of RhoA and contractility. If cells are plated on softer substrata, it partially compensates for the over-activation of RhoA.

      In many places, the data support the claims made, but in several places the experiments could be made more convincing or be more clearly presented. Some of the experiments appear to have been repeated only 1-2 times and very few cells or fields of view have been analysed.

      General Comments (Major)

      1. Data Presentation and analysis: The data analysis would benefit from using a method such as Super-Plots to show the data from separate biological repeats and to use N numbers that represent the number of biological repeats rather than the number of cells analysed. Please see the following reference for a suggestion on how to analyse the data: Lord SJ, Velle KB, Mullins RD, Fritz-Laylin LK. SuperPlots: Communicating reproducibility and variability in cell biology. J Cell Biol. 2020 Jun 1;219(6):e202001064. doi: 10.1083/jcb.202001064. PMID: 32346721; PMCID: PMC7265319.
      2. Another general comment is that many of the experiments show analysis of very few numbers of cells or maybe only one field of view in a microscopy quantification experiment. This seems unusually low - for example, in Figure 1E only 6 cells have been analysed. It seems like more could have been done and if statistical analysis like we suggest in 1 above is used, this might reveal that some of the differences are less significant than the authors think/report. This is important, as cells are noisy and it is unusual to have such high significance for experiments like cell migration and other parameters unless a lot of measurements are made. In Figure 1G, it appears that only one field of view has been used to quantify the data. We routinely use 3-5 fields of view to get a representative sample of what the cells are doing.
      3. Some of the micrographs appear to be missing scale bars- e.g. Fig. 2H, Fig. 8
      4. In the cell tracking experiments, only some of the cells in the images appear to have been tracked. How were the tracked cells chosen? Normally, we would track every cell to avoid bias in selection.
      5. The western blot images do not show the molecular size of the bands.
      6. Mostly the graphs show individual data points, which is good, but in some cases only a bar is shown- it would be nice to have individual points overlaid on the bars- e.g. Figure 1I, 4E, 5B, 5E, 7B, 7D
      7. Many of the confocal images look very processed- they have no background and have a hazy black halo around the cell. I am not familiar with the type of processing that was done and I worry that the images are only showing a masked and processed version of the actual data. The authors need to explain what processing they have done and probably also to provide the unprocessed images in a supplementary figure or dataset for readers to see. The methods description is inadequate as it only says Image J was used to process the data.

      Individual comments on Figures:

      Figure 1: See general comments above- consider to use Superplots, more cells and more fields of view in quantifications. Show experimental points in bar graph.

      Figure 2: In 2E, the colours are very similar for two of the experiments so it is difficult to distinguish them- e.g. the MFN1 vs MFN2 rescue data both appear dark blue. In 2H are the magnifications really the same for the WT as the +DOX and -DOX? The cell in the WT looks huge. Is this representative? Also, the phalloidin stain looks very spotty on the WT. This seems unusual.

      Figure 3: Not many cells were analysed in 3B, especially the zero time point. Please define +T, we assume it is the tether construct, but it is not defined In 3F, how were the tracked cells chosen?

      Figure 4: 4B: Why have they not tested FK506 and STO609 on the WT cells? 4C: How were the tracked cells chosen? 4D-E: The blot doesn't look representative of the quantification- were the numbers normalised to vinculin? The difference between WT and vector looks too large to be real if the amounts were normalised to the vinculin, as vinculin is increased in vector. Likewise, the pCAMKII looks to be substantially decreased from the +MFN2, but this is not what the quantification shows.

      Figure 5: 5B- please clarify which ratio is shown here. I assume it is the ratio of RhoA-GTP vs RhoA between the zero and 4 minute time points. 5C- These images appear to have a mask around the cell. It is hard to tell where the edge of the cell really is- what sort of processing was used? Especially for the paxillin staining, why is there no cytoplasm shown? Is this because the image is in TIRF?

      Figure 6: Figure 6C- the blebbistatin treated cell looks very large- is this representative?

      Figure 7: Fig 7C- The lanes for MLCII are all run together- is this from a different gel? Is this quantification accurate? Fig. 7F- What is the % level of knockdown achieved?

      Figure 8: Fig 8A,B- does the scale bar represent all of the images in these two panels? Fig 8C,D- Superplots would be helpful here.

      Supplementary Data:

      The OCR data do not add much and are not discussed much in the manuscript. Perhaps they could be omitted. Figure S3- The figure label doesn't match the manuscript test- was fibrinogen or collagen used?

      Significance

      The main novelty here appears to be the connection between excess cytoplasmic calcium, MFN2 loss and RhoA/myosin activation. This is interesting and a useful addition to the literature. It is unclear what the significance is perhaps, as increased cytoplasmic calcium is likely to cause multiple effects in addition to these. So, this effect may be a side-effect of uncoupling the ER and the mitochondria. Nonetheless, it is important to know about this and it will likely inform future studies on the mitofusins.

      This will be of interest to basic researchers studying mitochondria function and the connections between signaling and mitochondria function.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Wang et al use a combination of cell biological and biochemical approaches to show that Mitofusin2 (MFN2) - a protein typically known to regulate mitochondrial morphology - regulates cell morphology by regulating calcium levels and downstream cell contractility players. They show that cells deficient in MFN2 exhibit increased intracellular calcium levels, and overactivation of myosin II, leading to increased cell contractility. Furthermore, MFN2-deficient cells exhibit an F-actin ring around the cell periphery (which they call PABs). The study takes advantage of both pharmacological and genetic perturbations, as well a variety of assays to support many of their findings; however, it is unclear how these findings are related to each other. Furthermore, MFN2-related disease was raised a few times in the abstract and throughout the manuscript, but it's unclear how the findings in the paper relate to disease states, both in terms of the cell biology, as well as the model that was used (MEFs). This reviewer applauds the authors for exploring MFN2 function outside of its conventional role in mitochondria; but it was difficult to parse through the findings to resolve a mechanistic explanation for how MFN2 affect cell behavior, and what role, if any, PABs have on biological function. While it is important to dissect mitochondrial-independent functions for MFN2, given the whole scale changes in cells in MFN2-deficient cells, and the fact that there is a metabolic phenotype, it is difficult to know how many of the observed phenotypes are downstream of perturbed mitochondrial function versus on cytoskeletal dynamics directly.

      Major comments:

      Are the key conclusions convincing?

      • Key conclusions that were convincing:
        • MFN2-/- cells exhibit decreased cell velocity, decreased cell size, increased cell circularity, and increased intracellular calcium
        • modifying the levels of calcium has an effect on cell circulariy.
        • MFN2-/- cells exhibit increased activation of contractility players
      • Key conclusions that were less convincing:
        • RhoA and NMII are in the title as mechanistic downstream regulators of CaM, but the results in Fig 8 call into question the role of RhoA. Why does RhoA activation not influence cell size and circularity? Can you overexpress MRLC-GFP and inhibit Rho and restore the wt phenotype? The role of NMII is also not clear - why does overexpressing MLCK-CA not have a phenotype but overexpressing MLCK downstream target MRLC show the phenotype? Are there any alternative pathways to regulate MRLC? It's not being discussed or described in 6D's schematic.
        • The role of ER/mito contacts in the system was unclear (since ER/mito contacts were not observed nor evaluated directly).
        • What role does focal adhesions have on PAB formation or any part of the model - There were results showing larger focal adhesions in the MFN2-/- cells, but not sure how this fits in with the bigger picture of contractility and PABs, and focal adhesions were not in the model in Figure 5.
        • Whether regulating calcium impacts PAB formation
        • The role of PABs in migration is also unclear - can you affect PAB formation or get rid of PABs and quantify cell migration?
        • It was hard to understand the correlation between the membrane tension of MFN2-/- cells and its ability to spread on softer substrates. How does this result fit in with the overarching model?

      Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      • There were a number of findings that did not seem to fit in with the paper, or they were included, but were not robustly described nor quantified. As an example, Paxillin-positive focal adhesions were evaluated in MFN2-/- and with various pharmacological approaches, but there is no quantification with respect to size, number, or distribution of focal adhesions, despite language in the main text that there are differences between conditions. Also, the model was presented in figure 5, and then there were several pieces of data presented afterward, some that are included in the model (myosin regulation), and some that are not in the model (membrane tension, TFM, substrate stiffness, etc). The stiffness/tension figure was not clear to me, and it was difficult to make sense of the data since one would predict that an increase in actomyosin contractility at the cortex would lead to higher membrane tension, not lower, and then how membrane tension relates to spreading on soft matrices is also unclear. The manuscript seems like an amalgamation of different pieces of data that do not necessarily fit together into a cohesive story, so the authors are encouraged to either remove these data, or shore them up and weave them into the narrative.

      Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      • The imaging used for a lot of the quantification (migration, circularity) is difficult to resolve. The cells often look like they are not imaged in the correct imaging plane. It would be helpful to have better representative images such that it is clear how the cells were tracked and how cell periphery regions of interest were manually drawn. Focal adhesions should also be shown without thresholding.
      • For most of the quantification, it appears that the experiment was only performed once and that a handful of cells were quantified. The figure legend indicates the number of cells (often reported as >12 cells or >25 cells), but the methods indicate that high content imaging was performed, and so the interpretation is that these experiments were only performed once. Biological replicates are required. If the data do represent at least 3 biological replicates, then more cell quantification is required (12 or 25 cells in total would mean quantifying a small number of cells per replicate).
      • Mitochondrial morphology and quantification should be performed in the MFN2 knockdown and rescue lines.
      • Many of the comparisons throughout the figures is between MFN2 knockdown and MFN2 knockdown plus rescue or genetic/pharmacological approaches, but a comparison that is rarely made is between wildtype and experimental. These comparisons could be useful in comparing partial rescues and potential redundancies with the other mitofusin.
      • For the mito/ER tethering experiments, it is important to show that ER/mito contacts are formed and not formed in the various conditions with imaging approaches
      • For some of the pharmacological perturbations, it would be helpful to show that the perturbation actually led to the expected phenotype - as an example, in cells treated with different concentrations of A23187, what are the intracellular calcium levels and how do these treatments influence PAB formation? This aspect should be generally applied across the study - when a modification is made, that particular phenotype should first be evaluated, before dissecting how the perturbation affects downstream phenotypes.
      • In Figures 4 and 5, the thresholding approaches in the images make the focal adhesions difficult to resolve, and therefore it is difficult to determine the size. As described above, these metrics should be defined and quantified.
      • What is a PAB? How is it defined? What metrics make a structure a PAB versus regular cortical actin - are there quantifiable metrics? In figure 8, there are some structures that are labelled as a PAB, but some aren't (as an example, the left panel in 8b is a PAB, but the right panel in 8A is not, but they look the same), so a PAB should be defined with quantifiable measures, and then applied to the entire study.
      • As described above, why does RhoA activation not influence cell size and circularity? Can you overexpress MRLC-GFP and inhibit Rho and restore the wildtype phenotype?

      Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      • These suggested experiments described above will take a substantial amount of time and money, as it appears that some experiments were only performed once, and therefore many of these experiments might need to be performed 2-3 more times. Also, experiments showing that addition of drugs lead to expected outcomes prior to analyzing downstream phenotypes will also require a significant amount of time. It is hard to predict how long it will take, but we would guess, 6-8 months?

      Are the data and the methods presented in such a way that they can be reproduced?

      • We appreciate that the authors quantified many parameters, although some quantifications were missing. There are some missing methods - how was directionality quantified, was migration quantified by selecting the approximate center of cells using MTrackJ or were centroids quantified by outlining cells, for instance. Also, given that some of the phenotypes were somewhat arbitrarily assigned (ie. what constitutes a PAB?), it may be difficult to reproduce these approaches and interpret data appropriately.

      Are the experiments adequately replicated and statistical analysis adequate?

      • Unfortunately, while the approximate number of cells was reported, the number of biological replicates were not reported, and therefore, the experimental information and statistical analyses are not adequate.

      Minor comments:

      Specific experimental issues that are easily addressable. - for some of the graphs - mostly about calcium levels - fold change is shown, but raw values should also be included to determine whether the basal levels of calcium are different across the conditions. - scale bars in every panel should also help make the points clearer.

      Are prior studies referenced appropriately?

      • Yes

      Are the text and figures clear and accurate?

      • Some of the data in the figures were unclear - see above for more info.

      Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      • See above for more info.

      Significance

      Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      • There is a growing field of mitochondrial biology and how it relates to cell migration. This paper examines the function of a key mitochondrial morphology regulator, MFN2, and dissects a role for MFN2 in migration at the level of cytoskeletal regulation. We think that this is interesting, and that it's clear that MFN2 has multiple functions in the cell, but the phenotypes are so pleiotropic that it's difficult to parse out mechanistic understanding. The authors also describe a new actin architecture - a structure that they refer to as PABs - but there is no indication that PABs form in other cell types or tissues, or in other contexts, so it is unclear whether PABs are an important structure or an artifact of the system. Furthermore, part of the motivation of the work seems to be to understand MFN-related pathologies, but using a MEF system does not necessarily allow for that. One way to strengthen this part of the manuscript is to potentially use disease-relevant MFN2 mutations and determine downstream effects on cell morphology and migration.

      State what audience might be interested in and influenced by the reported findings.

      • This work would appeal to card-carrying cell biologists.

      Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      • cell migration; actin; mitochondria
    1. What is the relationship between protocols and agency? Do protocols assume or require a set of participating agents with autonomy or free-will?

      Initial thoughts — review later I mean, if I had to pull in some Bandura, it's bi-drectional determinism? Right? So it's influencing behaviour as an environmental factor that could also be done by thinking?

      If I think about Csikszentmihalyi in Good Business on culture as a game, perhaps rules are to games what protocols are to culture? If culture is a set of norms that keep you from anomie / entropy and make spaces for alienation, then the agency of the individual may be developed over time (control over consciousness) that may allow for greater expression over agency to follow or not follow protocols. In this sense, protocols would be the default, and intentionally not following protocols (probabilistically not by chance) might require agency? That is if we are following the definition that good protocols have the Schelling point or become default and are almost invisible untill they break.

      Bureaucracy may be an example of a deeply frustrating protocol?

    1. Author Response

      Reviewer #1 (Public Review):

      Demographic inference is a notoriously difficult problem in population genetics, especially for non-model systems in which key population genetic parameters are often unknown and where the reality is always a lot more complex than the model. In this study, Rose et al. provided an elegant solution to these challenges in their analysis of the evolutionary history of human specialization in Ae. aegypti mosquitoes. They first applied state-of-the-art statistical phasing methods to obtain haplotype information in previously published mosquito sequences. Using this phased data, they conducted cross-coalescent and isolation-with-migration analyses, and they innovatively took advantage of a known historical event, i.e., the spread of Ae. aegypti to South America, to infer the key model parameters of generation time and mutation rate. With these parameters, they were able to confirm a previous hypothesis, which suggests that human specialists evolved at the end of the African Humid Period around 5,000 years ago when Ae. aegypti mosquitoes in the Sahel region had to adapt to human-derived water storage as their breeding sites during intense dry seasons. The authors further carried out an ancestry tract length analysis, showing that human specialists have recently introgressed into Ae. aegypti population in West African cities in the past 20-40 years, likely driven by rapid urbanization in these cities.

      Given all the complexities and uncertainties in the system, the authors have done outstanding jobs coming up with well-informed research questions and hypotheses, carrying out analyses that are most appropriate to their questions, and presenting their findings in a clear and compelling fashion. Their results reveal the deep connections between mosquito evolution and past climate change as well as human history and demonstrate that future mosquito control strategies should take these important interactions into account, especially in the face of ongoing climate change and urbanization. Methodologically, the analytical approach presented in this paper will be of broad interest to population geneticists working on demographic inference in a diversity of non-model organisms.

      In my opinion, the only major aspect that this paper can still benefit from is more explicit and in-depth communication and discussion about the assumptions made in the analyses and the uncertainties of the results. There is currently one short paragraph on this in the discussion section, but I think several other assumptions and sources of uncertainties could be included, and a few of them may benefit from some quantitative sensitivity analyses. To be clear, I don't think that most of these will have a huge impact on the main results, but some explicit clarification from the authors would be useful.

      Below are some examples:

      Thank you very much for your kind words and your feedback! We have expanded our discussion of assumptions and uncertainties – we have responded to each point below:

      1) Phasing accuracy: statistical phasing is a relatively new tool for non-model species, and it is unclear from the manuscript how accurate it is given the sample size, sequencing depth, population structure, genetic diversity, and levels of linkage disequilibrium in the study system. If authors would like to inspire broader adoption of this workflow, it would be very helpful if they could also briefly discuss the key characteristics of a study system that could make phasing successful/difficult, and how sensitive cross-coalescent analyses are to phasing accuracy.

      We agree that this is an important topic to expand on. We have clarified as follows:

      Results, Page 4, last paragraph: “Over 95% of prephase calls had maximal HAPCUT2 phred-scaled quality scores of 100 and prephase blocks (i.e. local haplotypes) were 728bp long on average (interquartile range 199-1009bp). We then used SHAPEIT4.2 to assemble the prephase blocks into chromosome-level haplotypes, using statistical linkage patterns present across our panel of 389 individuals (25).”

      Discussion, Page 8, last paragraph: “Overall linkage disequilibrium is relatively low in Ae. aegypti, dropping off quickly over a few kilobases and reaching half its maximum value within about 50kb (37); this is likely sufficient for assembling shorter, high-confidence prephase blocks into longer haplotypes in many cases. However, phase-switch errors may be common across longer distances – potentially affecting inferences in the most recent time windows. Nevertheless, the similar results we obtain using different proxy populations (and thus different input haplotype structures) for human-specialist and generalist lineages (see Figure S1) suggest that our results are robust to potential mistakes in long-range haplotype phasing.”

      Discussion, Page 9, paragraph 2: “Here, we take advantage of a continent-wide set of genomes, combined with read-based prephasing and population-wide statistical phasing to develop a phasing panel that should enable future studies in Ae. aegypti with a lower barrier to entry. The same approach may work for other study organisms with similar population genomic properties; high levels of diversity are helpful for prephasing and at least moderate levels of linkage disequilibrium are important for the assembly of prephase blocks.”

      2) Estimation of mutation rate and generation time: the estimation of these importantparameters is made based on the assumption that they should maximize the overlap between the distribution of estimated migration rate and the number of enslaved people crossing the Atlantic, but how reasonable is this assumption, and how much would the violation of this assumption affect the main result? Particularly, in the MSMC-IM paper (Wang et al. 2020, Fig 2A), even with a simulated clean split scenario, the estimated migration rate would have a wide distribution with a lot of uncertainty on both sides, so I believe that the exact meaning and limitations of such estimated migration rate over time should be clarified. This discussion would also be very helpful to readers who are thinking about using similar methods in their studies. Furthermore, the authors have taken 15 generations per year as their chosen generation time and based their mutation rate estimates on this assumption, but how much will the violation of this assumption affect the result?

      This is a great point. We have expanded our discussion of how this assumption affects our conclusions (see Discussion page 9, first paragraph): “Furthermore, we chose a scaling factor that maximized overlap between the peak of estimated Ae. aegypti migration and the peak of the Atlantic Slave Trade (Fig. 2B). If we instead consider alternative scenarios where peak migration occurred at the very beginning of the slave trade era, around 1500, then our inferred mutation rate would be lower (about 2.4e-9, assuming 15 generations per year), pushing back the split of human-specialist lineages to about 10,000 years before present. This scenario seems less plausible, in part because our isolation-with-migration analyses suggest a gradual onset of migration between continents rather than a single, early-pulse model. It would also make it harder to explain the timing of the bottleneck we see in invasive populations; the first signs of this bottleneck occur at the beginning of the slave trade (~500 years ago) with our current calibration (Fig. S1A), but would be pushed to a pre-trade date in this alternative scenario. We can also consider a scenario in which peak Ae. aegypti migration occurred more recently, perhaps around 1850, corresponding to increased global shipping traffic outside the slave trade alone. In this case, our inferred mutation rate would be higher (or generation time lower), and the split of human-specialist lineages would be placed at about 3,000 years ago. Overall, the best match between the existing literature and our data corresponds to our main estimates, but alternative scenarios could gain support if future research finds evidence for a different time course of invasion than is suggested by the epidemiological literature.”

      We have slightly expanded our description of calibration in Results, page 5, last paragraph: “The fact that we see good overlap between the two distributions (yellow–white color) across a wide range of reasonable mutation rates and generation times for Ae. aegypti is consistent with our understanding of the species’ recent history and supports our approach. For example, if we take the common literature value of 15 generations per year (0.067 years per generation) (17, 20), the de novo mutation rate that maximizes correspondence between the two datasets is 4.85x10-9 (black dot in Figure 2A, used in Figure 2B), which is on the order of values documented in other insects. We chose to carry forward this calibrated scaling factor (corresponding to any combination of mutation rate and generation time found along the line in Figure 2A) into subsequent analyses.”

      We have also expanded on the uncertainty of our analyses (see Discussion page 8, last paragraph): “First, the temporal resolution of our inferences is relatively low, and both previously published simulations (39) and our own bootstrap replicates (Figure 2B–D, grey lines) suggest relatively wide bounds for the precise timing of events.”

      3) The effect of selection: all analyses in this paper assume that no selection is at play,and the authors have excluded loci previously found to be under selection from these analyses, but how effective is this? In the ancestry tract length analysis, in particular, the authors have found that the human-specialist ancestry tends to concentrate in key genomic regions and suggested that selection could explain this, but doesn't this mean that excluding known loci under selection was insufficient? If the selection has indeed played an important role at a genome-wide level, how would it affect the main results (qualitatively)?

      We have clarified that we excluded those loci from our timing estimates for both MSMC and ancestry tract analyses, but then re-ran the ancestry tract analysis with all regions included to visualize and assess how tracts were distributed along chromosomes. See Methods, page 12, paragraph 2: “Since selection associated with adaptation to urban habitats could shape lengths of admixture tracts, we masked regions previously identified as under selection between human-specialists and generalists when estimating admixture timing—namely, the outlier regions in (2). However, we used an unmasked analysis to determine and visualize the genome-wide distribution of ancestries (Fig. 3).”

      We have also added additional discussion of the expected effects of selection on our analyses (see Discussion, page 9, last paragraph): “Positive selection during adaptive introgression can increase tract lengths and make admixture appear to be more recent than it actually is. For this reason, we masked regions of the genome thought to underlie adaptation to human habitats before running our analysis. Nevertheless, if selection has acted outside these regions, admixture may be somewhat older than we estimate.”

    1. Author Response

      Reviewer #1 (Public Review):

      The authors have tried to correlate changes in the cellular environment by means of altering temperature, the expression of key cellular factors involved in the viral replication cycle, and small molecules known to affect key viral protein-protein interactions with some physical properties of the liquid condensates of viral origin. The ideas and experiments are extremely interesting as they provide a framework to study viral replication and assembly from a thermodynamic point of view in live cells.

      The major strengths of this article are the extremely thoughtful and detailed experimental approach; although this data collection and analysis are most likely extremely time-consuming, the techniques used here are so simple that the main goal and idea of the article become elegant. A second major strength is that in other to understand some of the physicochemical properties of the viral liquid inclusion, they used stimuli that have been very well studied, and thus one can really focus on a relatively easy interpretation of most of the data presented here.

      There are three major weaknesses in this article. The way it is written, especially at the beginning, is extremely confusing. First, I would suggest authors should check and review extensively for improvements to the use of English. In particular, the abstract and introduction are extremely hard to understand. Second, in the abstract and introduction, the authors use terms such as "hardening", "perturbing the type/strength of interactions", "stabilization", and "material properties", for just citing some terms. It is clear that the authors do know exactly what they are referring to, but the definitions come so late in the text that it all becomes confusing. The second major weakness is that there is a lack of deep discussion of the physical meaning of some of the measured parameters like "C dense vs inclusion", and "nuclear density and supersaturation". There is a need to explain further the physical consequences of all the graphs. Most of them are discussed in a very superficial manner. The third major weakness is a lack of analysis of phase separations. Some of their data suggest phase transition and/or phase separation, thus, a more in-deep analysis is required. For example, could they calculate the change of entropy and enthalpy of some of these processes? Could they find some boundaries for these transitions between the "hard" (whatever that means) and the liquid?

      The authors have achieved almost all their goals, with the caveat of the third weakness I mentioned before. Their work presented in this article is of significant interest and can become extremely important if a more detailed analysis of the thermodynamics parameters is assessed and a better description of the physical phenomenon is provided.

      We thank you for the comments and, in particular, for being so positive regarding the strengths of our manuscript and for raising concerns that will surely improve it. We have taken the following actions to address your concerns:

      1) Extensive revisions have been made to the use of English, particularly in the abstract and introduction. Key terms are defined as they are introduced in the text to enhance the clarity of the argument. This is a significant revision that is highlighted within the text, but it is too extensive to detail here.

      2) In the results section, we improved and extended the discussion of our graphs to the extent possible. However, we found that attempting to explain the graphs' meanings more thoroughly would detract from our manuscript's main focus: identifying thermodynamic changes that could potentially lead to alterations in material properties, specifically aspect ratio, size, and Gibbs free energy. As a result, we introduced the type of information we could obtain from our analyses in the introduction (Lines 112-125) and briefly commented on it in the ‘results’ section (Lines 304-306, sentences below).

      From introduction – lines 112-125:

      “In addition, other parameters like nucleation density determine how many viral condensates are formed per area of cytosol. Overall, the data will inform us if changing one parameter, e.g. the concentration, drives the system towards larger condensates with the same or more stable properties, or more abundant condensates that are forced to maintain the initial or a different size on account of available nucleation centres (Riback et al., 2020:Snead, 2022 #1152). It will also inform us if liquid viral inclusions behave like a binary or a multi-component system. In a binary mixture, Cdilute is constant (Klosin et al., 2020). However, in multi-component systems, Cdilute increases with bulk concentration (Riback et al., 2020). This type of information could have direct implications about the condensates formed during influenza infection. As the 8 different genomic vRNPs have a similar overall structure, they could, in theory, behave as a binary system between units of vRNPs and Rab11a. However, a change in Cdilute with concentration would mean that the system behaves as a multi-component system. This could raise the hypothesis that the differences in length, RNA sequence and valency that each vRNP has may be relevant for the integrity and behaviour of condensates.”.

      From results lines 304-306:

      This indicates that the liquid inclusions behave as a multi-component system and allow us to speculate that the differences in length, RNA sequence and valency that each vRNP may be key for the integrity and behaviour of condensates.

      3) The reviewer has drawn our attention to the absence of phase separation analysis in our study. We believe that the formation of influenza A virus condensates is governed by phase separation (or percolation coupled to phase separation). However, we must exercise caution at this point because the condensates we are studying are highly complex, and the physics of our cellular system may not be adequate to claim phase separation without being validated by an in vitro reconstitution system. IAV inclusions contain a variety of cellular membranes, different vRNPs, and Rab11a. While we have robust data to propose a model in which the liquid-like properties of IAV inclusions arise from a network of interacting vRNPs that bridge multiple cognate vRNP-Rab11 units on flexible membranes, similar to what occurs in phase-separated vesicles in neurological synapses, our model for this system still lacks formal experimental validation. As a note, the data supporting our model includes: the demonstration of the liquid properties of our liquid inclusions (Alenquer et al. 2019, Nature Communications, 10, 1629); and impairment of recycling endocytic activity during IAV infection Bhagwat et al. 2020, Nat Commun, 11, 23; Kawaguchi et al. 2012, J Virol, 86, 11086-95; Vale-costa et al. 2016, J Cell Sci, 129, 1697-710. This leads to aggregated vesicles seen by correlative light and electron microscopy (Vale-Costa et al., 2016 JCS, 129, 1697-710) and by immunofluorescence and FISH (Amorim et al. 2011,. J Virol 85, 4143-4156; Avilov et al. 2012, Vaccine 30, 7411-7417; Chou et al. 2013, PLoS Pathog 9, e1003358; Eisfeld et al. 2011, J Virol 85, 6117-6126 and Lakdawala et al. 2014, PLoS Pathog 10, e1003971.

      To be able to explore the significance of the liquid material properties of IAV inclusions, we used the strategy described in this current work. By developing an effective method to manipulate the material properties of IAV inclusions, we provide evidence that controlled phase transitions can be induced, resulting in decreased vRNP dynamics in cells and a negative impact on progeny virion production. This suggests that the liquid character of liquid inclusions is important for their function in IAV infection. We have improved our explanation addressing this concern in the limitations of our study (as outlined below in the box and in manuscript in lines 857-872).

      We are currently establishing an in vitro reconstitution system to formally demonstrate, in an independent publication, that IAV inclusions are formed by phase separation (or percolation coupled to phase separation). For this future work, we teamed up with Pablo Sartori, a theorical physicist to derive in-depth analysis of the thermodynamics of the viral liquid condensates in the in vitro reconstituted system and compare it to results obtained in the cell. This will provide means to establish comparisons. We think that cells have too many variables to derive meaningful physics parameters (such as entropy and enthalpy) and models that need to be complemented by in vitro systems. For example, increasing the concentration inside a cell is not a simple endeavour as it relies on cellular pathways to deliver material to a specific place. At the same time, the 8 vRNPs, as mentioned above, have different size, valency and RNA sequence and can behave very differently in the formation of condensates and maintenance of their material properties. Ideally, they should be analysed individually or in selected combinations. For the future, we will combine data from in vitro reconstitution systems and cells to address this very important point raised by the reviewer.

      From the paper on the section ‘Limitations of the study’:

      “Understanding condensate biology in living cells is physiological relevant but complex because the systems are heterotypic and away from equilibria. This is especially challenging for influenza A liquid inclusions that are formed by 8 different vRNP complexes, which although sharing the same structure, vary in length, valency, and RNA sequence. In addition, liquid inclusions result from an incompletely understood interactome where vRNPs engage in multiple and distinct intersegment interactions bridging cognate vRNP-Rab11 units on flexible membranes (Chou et al., 2013, Gavazzi et al., 2013, Sugita et al., 2013, Shafiuddin and Boon, 2019, Haralampiev et al., 2020, Le Sage et al., 2020). At present, we lack an in vitro reconstitution system to understand the underlying mechanism governing demixing of vRNP-Rab11a-host membranes from the cytosol. This in vitro system would be useful to explore how the different segments independently modulate the material properties of inclusions, explore if condensates are sites of IAV genome assembly, determine thermodynamic values, thresholds accurately, perform rheological measurements for viscosity and elasticity and validate our findings. The results could be compared to those obtained in cell systems to derive thermodynamic principles happening in a complex system away from equilibrium. Using cells to map how liquid inclusions respond to different perturbations provide the answer of how the system adapts in vivo, but has limitations.

      Reviewer #2 (Public Review):

      During Influenza virus infection, newly synthesized viral ribonucleoproteins (vRNPs) form cytosolic condensates, postulated as viral genome assembly sites and having liquid properties. vRNP accumulation in liquid viral inclusions requires its association with the cellular protein Rab11a directly via the viral polymerase subunit PB2. Etibor et al. investigate and compare the contributions of entropy, concentration, and valency/strength/type of interactions, on the properties of the vRNP condensates. For this, they subjected infected cells to the following perturbations: temperature variation (4, 37, and 42{degree sign}C), the concentration of viral inclusion drivers (vRNPs and Rab11a), and the number or strength of interactions between vRNPs using nucleozin a well-characterized vRNP sticker. Lowering the temperature (i.e. decreasing the entropic contribution) leads to a mild growth of condensates that does not significantly impact their stability. Altering the concentration of drivers of IAV inclusions impact their size but not their material properties. The most spectacular effect on condensates was observed using nucleozin. The drug dramatically stabilizes vRNP inclusions acting as a condensate hardener. Using a mouse model of influenza infection, the authors provide evidence that the activity of nucleozin is retained in vivo. Finally, using a mass spectrometry approach, they show that the drug affects vRNP solubility in a Rab11a-dependent manner without altering the host proteome profile

      The data are compelling and support the idea that drugs that affect the material properties of viral condensates could constitute a new family of antiviral molecules as already described for the respiratory syncytial virus (Risso Ballester et al. Nature. 2021)

      Nevertheless, there are some limitations in the study. Several of them are mentioned in a dedicated paragraph at the end of a discussion. This includes the heterogeneity of the system (vRNP of different sizes, interactions between viral and cellular partners far from being understood), which is far from equilibrium, and the absence of minimal in vitro systems that would be useful to further characterize the thermodynamic and the material properties of the condensates.

      There are other ones.

      We thank reviewer 2 for highlighting specific details that need improving and raising such interesting questions to validate our findings. We have addressed the comments of Reviewer 2, we performed the experiments as described (in blue) below each point raised.

      1) The concentrations are mostly evaluated using antibodies. This may be correct for Cdilute. However, measurement of Cdense should be viewed with caution as the antibodies may have some difficulty accessing the inner of the condensates (as already shown in other systems), and this access may depend on some condensate properties (which may evolve along the infection). This might induce artifactual trends in some graphs (as seen in panel 2c), which could, in turn, affect the calculation of some thermodynamic parameters.

      The concern of using antibodies to calculate Cdense is valid, and we thought it was very important. We addressed this concern by performing the same analyses using a fluorescent tagged virus that has mNeon Green fused to the viral polymerase PA (PA-mNeonGreen PR8 virus). Like NP, PA is a component of vRNPs and labels viral inclusions, colocalising with Rab11 when vRNPs are in the cytosol. However, per vRNP there is only one molecule of PA, whilst of NP there are 37-96 depending on the size of vRNPs. As predicted, we did observe changes in the Cdilute, Cdense and nucleation density. However, the measurements and values obtained for Gibbs free energy, size, aspect ratio detecting viral inclusions with fluorescently tagged vRNPs or antibody staining followed the same trend and allow us to validate our conclusion that major changes in Gibbs free energy occur solely when there is a change in the valency/strength of interactions but not in temperature or concentration (Figure 1 below). Given the extent of these data, we show here the results but, in the manuscript, we will describe the limitations of using antibodies in our study within the section ‘Limitations of the study’ from lines 881-894. Given the importance of the question regarding the pros and cons of the different systems for analysing thermodynamic parameters, we have decided to systematically assess and explore these differences in detail in a future manuscript.

      For more information. This reviewer may be asking why we did not use the PA-fluorescent virus in the first place to evaluate inclusion thermodynamics and avoid problems in accessibility that antibodies may have to get deep into large inclusions. Our answer is that no system is perfect. In the case of the PA-fluorescent virus, the caveats revolve around the fact that the virus is attenuated (Figure 1a below), exhibiting a delayed infection as demonstrated by reduced levels of viral proteins (Figure 1b below). Consistently, it shows differences in the accumulation of vRNPs in the cytosol and viral inclusions form later in infection and the amount of vRNPs in the cytosol does not reach the levels observed in PR8-WT virus. After their emergence, inclusions behave as in the wild-type virus (PR8-WT), fusing and dividing (Figure 1c below) and displaying liquid properties.

      As the overarching goal of this manuscript is to evaluate the best strategies to harden liquid IAV inclusions and given that one of the parameters we were testing is concentration, we reasoned that using PR8-WT virus for our analyses would be reasonable.

      In conclusions, both systems have caveats that are important to systematically assess, and these differences may shift or alter thermodynamic parameters such as nucleation density, inclusion maturation rate, Cdense, Cdilute in particular by varying the total concentration. As a note, to validate all our results using the PA-mNeonGreen PR8 virus, we considered the delayed kinetics and applied our thermodynamic analyses up to 20 hpi rather than 16 hpi.

      However, because of the question raised by this reviewer, on which is the best solution for mitigating errors induced by using antibodies, we re-checked all our data. Not only have we compared the data originated from attenuated fluorescently tagged virus with our data, but also made comparisons with images acquired from Z stacks (as used for concentration and for type/strength of interactions) with those acquired from 2D images. Our analysis revealed that there is a very good match using images acquired with Z-stacks and analysed as Z projections with between antibody staining and vRNP fluorescent virus. Therefore, we re-analysed all our thermodynamic data done with temperature using images acquired from Z stacks and altered entirely Figure 2. We believe that all these comparisons and analyses have greatly improved the manuscript and hence we thank all reviewers for their input.

      Figure 1 – The PA-mNeonGreen virus is attenuated in comparison to the WT virus and data obtained is consistent for Gibbs free energy with analyses done with images processed with antibody fluorescent vRNPs. A. Representation of the PA-mNeonGreen virus (PA-mNG; Abbreviations: NCR: non coding region). B. Cells (A549) were transfected with a plasmid encoding mCherry-NP and co-infected with PA-mNeonGreen virus for 16h, at an MOI of 10. Cells were imaged under time-lapse conditions starting at 16 hpi. White boxes highlight vRNPs/viral inclusions in the cytoplasm in the individual frames. The dashed white and yellow lines mark the cell nucleus and the cell periphery, respectively. The yellow arrows indicate the fission/fusion events and movement of vRNPs/ viral inclusions. Bar = 10 µm. Bar in insets = 2 µm. C-D. Cells (A549) were infected or mock-infected with PR8 WT or PA-mNG viruses, at a multiplicity of infection (MOI) of 3, for the indicated times. C. Viral production was determined by plaque assay and plotted as plaque forming units (PFU) per milliliter (mL) ± standard error of the mean (SEM). Data are a pool from 2 independent experiments. D. The levels of viral PA, NP and M2 proteins and actin in cell lysates at the indicated time points were determined by western blotting. (E-G) Biophysical calculations in cells infected with the PA-mNeonGreen virus upon altering temperature (at 10 hpi, evaluating the concentration of vRNPs (over a time course) in conditions expressing native amounts of Rab11a or overexpressing low levels of Rab11a and upon altering the type/strength of vRNP interactions by adding nucleozin at 10 hpi during the indicated time periods. All data: Ccytoplasm/Cnucleus; Cdense, Cdilute, area aspect ratio and Gibbs free energy are represented as boxplots. Above each boxplot, same letters indicate no significant difference between them, while different letters indicate a statistical significance at α = 0.05 using one-way ANOVA, followed by Tukey multiple comparisons of means for parametric analysis, or Kruskal-Wallis Bonferroni treatment for non-parametric analysis.

      2) Although the authors have demonstrated that vRNP condensates exhibit several key characteristics of liquid condensates (they fuse and divide, they dissolve upon hypotonic shock or upon incubation with 1,6-hexanediol, FRAP experiments are consistent with a liquid nature), their aspect ratio (with a median above 1.4) is much higher than the aspect ratio observed for other cellular or viral liquid compartments. This is intriguing and might be discussed.

      IAV inclusions have been shown to interact with microtubules and the endoplasmic reticulum, that confers movement, and undergo fusion and fission events. We propose that these interactions and movement impose strength and deform inclusions making them less spherical. To validate this assumption, we compared the aspect ratio of viral inclusions in the absence and presence of nocodazole (that abrogates microtubule-based movement). The data in figure 2 shows that in the presence of nocodazole, the aspect ratio decreases from 1.42±0.36 to 1.26 ±0.17, supporting our assumption.

      Figure 2 – Treatment with nocodazole reduces the aspect ratio of influenza A virus inclusions. Cells (A549) were infected with PR8 WT for 8 h and treated with nocodazole (10 µg/mL) for 2h, after which the movement of influenza A virus inclusions was captured by live cell imaging. Viral inclusions were segmented, and the aspect ratio measured by imageJ, analysed and plotted in R.

      3) Similarly, the fusion event presented at the bottom of figure 3I is dubious. It might as well be an aggregation of condensates without fusion.

      We have changed this (check Fig 5A and B in the manuscript), thank you for the suggestion.

      4) The authors could have more systematically performed FRAP/FLAPh experiments on cells expressing fluorescent versions of both NP and Rab11a to investigate the influence of condensate size, time after infection, or global concentrations of Rab11a in the cell (using the total fluorescence of overexpressed GFP-Rab11a as a proxy) on condensate properties.

      We have included a new figure, figure 5 with the suggested data.

    1. This week, we looked at the Nara period, where Japanese rulers began to settle in one consistent capitol and embraced Buddhism. One of the interesting features of the Nara period was that the rulers continued to stay in one city; the most interesting part about that for me is why they had not done that in the first place. I am aware that the most obvious reason is the religious/superstitious issue about not wanting to be where the ghost of the last ruler was, but I wonder if there was any more pragmatic reason to keep moving. It strikes me that moving capitols would necessarily be an expensive process, and would mean that temples and palaces could not be improved over many decades. With these apparent drawbacks, why bother moving the capitol so much? I do not think that I have the information to suggest any other educated answers than the one of getting away from the death of the past ruler, but I would be interested in finding out if any (credible) theories do exist. The only possible (and extremely superficial) reason that I would suggest is that it does effectively convey the wealth of the rulers. Spending all of the funds that it would take to build the new capitol, along with raising all of the corvee labor that would also be required could be a good way to present one’s power to one’s subjects.

      I like the question that you raise about why Emperors/Leaders did not have a consistent location for the Empire. With this in mind, I would like to consider if back then could be considered more tumultuous than today or if each Emperor wanted a sense of individual rule that they may have changed their empire’s location? I liked the questions that you raised about what it meant for the Emperor’s to have fully embraced Buddhism, whether it meant that it affected their ruling style or if they became Monks. This reminds of the same way that Roman/Greeks (I forget, I think Romans) took on the ideas of Stoicism as they ruled. One example of this was Marcus Aurelius and how he was both a Stoic and an Emperor. I believe that the Emperor’s can have multiple identities at once, and embody those multiple identities through not only leading their country but through other aspects of their life, such as how they act.

    1. Thinking back to what we have already learned, I think it is really cool how gender roles were very much included in the basis of Japanese religion, yet they internalize it much differently than how the U.S internalizes our founding ideas. It is crazy to me that a female writer could get this much momentum and have such a long lasting impact on culture, while the U.S still seems to struggle with the idea of women being culture creators

      I first like the comparisons that you drew between the U.S and Japan when it comes to the approach of female writers as it seemed reminiscent of how the U.S only created the 19th Amendment in 1920 which seems to reflect the judgments mentioned about female writers. Also, the comparison between religion is an interesting one as yes, both the U.S and Japan have religious undertones the way that they deal with those undertones may be shaped by the religion itself along with the individuals who convince the public using religion as a basis for being correct. About the idea of the male’s ideal woman, the part that I was most curious about was whether these ideas would trickle down to the non-court men by accident? By exposing the court men for their idealistic and unrealistic ideas of who and what a woman should be I wonder if this idea was already permeating through Japan or if the Tale Of Genji led for other men to model themselves out of these fictional characters as though I can’t think of an example as of this moment, doesn’t things that start up in “high society” trickle down (not like the failed trickle down economy, but what I mean is that before Cars were for everyone, first the rich (equivalent to the court of Heian life) and then to everyone. I am essentially wondering if these ideas were widespread among Japanese men before the Tale of Genji or if the Tale of Genji gave Men an inspiration of Court Life, and in order to act more important/high and mighty if they too, would have copied the Tale of Genji men to be more like court members. I wonder if the court members would have enjoyed the Tale of Genji as to me it reminds me that it would almost be like fictional tabloids which often had psychological effects on celebrities and royalty like Princess Diana and Britney Spears – would they have enjoyed being exposed in this way? I imagined the commoners would be fascinated with this story the same way that we humans buy tabloids or read gossip about the rich/famous instead of the rich/famous reading inserted characters about themselves?

    1. Some may question how a society so intertwined with Bhuddist ideals could ever rationalize the fighting that would ensue. To that I would answer that just as we might have noticed court jealousy and superficiality in our own lives, desire for power is equally a characteristic of human society in of itself. That is to say I think perhaps such strife was inevitable. Perhaps because of these Bhuddist ideals there was less infighting than otherwise would have occurred, but nonetheless so long as there are vacuums or weaknesses in power, some will seek to fill that void.

      I thought this analysis was interesting. In other words, you think that without the influence of Buddhist ideals, there would have been more conflict? And that in any society - no matter how much value is placed on harmony and peacefulness - conflict is inevitable?

    1. consider whether the information the students need tolearn is invariable, since behaviorism stems from the idea thatknowledge is objective and there is one right answer (Keramida,2015). Behaviorism would be a useful approach to helping studentsmemorize and recall terms and facts

      The behaviorist approach is a staple in math, so that even this book included the example of student's receiving immediate feedback for solving a math equation. Indeed there is a time and place for that since many times we are asking students for the right answer to a problem.

      But with the new formats for key exams such as STAAR, it may be time to look at other ways of asking questions. We need to help students to form connections and thinking critically. At first I was quite discouraged when we stopped assigning Quizizz due to how easily students cheat on it. It's the same with other gamified websites such as Kahoot and Blooket. For Quizizz, I used to enjoy the premium features and question types as well as counting with all my customized assignments I had prepared over the years.

      So that has forced me to restructure my assignments and face the reality that I must look beyond behaviorism. It's a shame about the cheating, but all I can do is direct my students to think, struggle, and practice math while they are in the classroom.

    1. It's not a ZK furniture though. Index cards were not used to store atomic notes, or have alphanumeric indexes. :)

      Oh, but it is ZK furniture in every sense! The narrow definition of zettelkasten in common use (in this subreddit and in many other locations on the internet) to describe only card indexes/digital software which have the numbering scheme and form of Niklas Luhmann's only works for his and a number of imitators from roughly 2007/2013 to the present. Prior to this it is a much more generic term in Germany and elsewhere known in English as a card index or card file, but academics and others have been using practices broadly similar to Luhmann's for centuries in a variety of forms.

      You're likely right that this particular piece of furniture had a business-specific market use case for the majority of its users, but I'm sure there was a subset of customers, particularly those in academia, which may have used it primarily as a note storage or personal knowledge management tool in a way highly similar to Luhmann's. Because it was in America, it was unlikely to have been called by the German name zettelkasten, though there were many German-Americans (Gotthard Deutsch and S. D. Goitein come to mind) who had this practice and may have done so, though I've seen no direct evidence of this at present in their writings. Not all card indexes were used for business or library purposes. In addition to academic researchers, we know a variety of mid-century comedians used their card indexes for collation and storage of jokes over their careers.

      The quality of the advertisement is hard to make out, but on close examination it appears to have four drawers and the scale leads me to think that this would likely have accommodated 3 x 5" index cards. Some upcoming research work may uncover the manufacturing specifics and I'll share them as I find them.

      As for Harrison and Placcius they're definitely there and people talk about them occasionally, though few seem as interested in the historical aspects despite the fact that they have a lot to demonstrate about the pros/cons of various practices. I remember adding them both to the English wikipedia page in July 2021. Certainly they could stand to be more widely known for their work, as could Leibniz. More on both can be found mentioned in the following: - Cevolini, Alberto. “Where Does Niklas Luhmann’s Card Index Come From?” Erudition and the Republic of Letters 3, no. 4 (October 24, 2018): 390–420. https://doi.org/10.1163/24055069-00304002. - Blair, Ann M. Too Much to Know: Managing Scholarly Information before the Modern Age. Yale University Press, 2010. https://yalebooks.yale.edu/book/9780300165395/too-much-know. - Blei, Daniela. “How the Index Card Cataloged the World.” The Atlantic, December 1, 2017. https://www.theatlantic.com/technology/archive/2017/12/how-the-index-card-catalogued-the-world/547271/. - Vincentius Placcius. De arte excerpendi. Vom Gelahrten Buchhalten Liber singularis, quo genera et praecepta excerpendi... Gottfried Liebezeit, 1689. http://archive.org/details/bub_gb_IgMVAAAAQAAJ.

      There's also a bit on Placcius in: - Krajewski, Markus. Paper Machines: About Cards & Catalogs, 1548-1929. Translated by Peter Krapp. History and Foundations of Information Science. MIT Press, 2011. https://mitpress.mit.edu/books/paper-machines.

      The bigger hero, in my opinion, is Konrad Gessner and his work from 1548 which outlined much of the common "rules" note takers, practitioners of ars excerpendi, zettelers, and card indexers have been using ever since, including an early idea which many would now call "atomic notes". Much of his work, however was transferring ideas of commonplace book practices of his day into the form of paper slips which were heavily used until mass manufacture of index cards in the 20th century made them cheap and plentiful. Within the note taking space online the community also broadly ignores influential figures like Agricola, Erasmus, and Melanchthon who make some big strides in popularizing a variety of methods in the 1400-1500s.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We would like to thank all the reviewers for their positive evaluations of our work and constructive comments, in particular for highlighting that our work “provides new insight into cancer metabolism knowledge”, is “conceptually interesting and experimentally well performed” and “the findings presented here will be very interesting to a broad range of researchers, including the cancer, metabolism and wider cell biology communities”.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): Nazemi et al. show that the extracellular matrix (ECM) has a crucial role in sustaining the growth of invasive breast and pancreatic cells during nutrient deprivation. In particular, under amino acid starvation, cancer cells internalize ECM by macropinocytosis and activate phenylalanine and tyrosine catabolism, which in turn support cell growth in nutrient stress conditions. The paper is well written and the results shown are very interesting. The experimental plan is well designed to assess the hypothesis and the description of the methods is sufficiently detailed to reproduce the analyses, which are also characterized by appropriate internal controls. Finally, the data provided sustain the conclusions proposed by the authors.

      * Major comment:*

      Since the authors performed their experiments on invasive breast and pancreatic cancers and it has been noted that stress conditions could promote the escape of cancer cells from the site of origin (e.g., Jimenez and Goding, Cell Metabolism 2018; Manzano et al, EMBO Reports 2020), it would be interesting to evaluate how ECM internalization could have a role in sustaining the invasive abilities of cancer cells under amino acid starvation. Which is the impact of the inhibition of macropinocytosis and tyrosine catabolism on cell invasion? The authors could evaluate this aspect by in vitro 2D and 3D analysis.

      This is a very important point, and we are planning to investigate this by using:

      • 2D single cell migration assays on cell-derived matrices (we have extensively used this system to characterise invasive cell migration; Rainero et al., 2015; Rainero et al., 2012)
      • 3D spheroids assays, to assess collective/3D cell invasion through collagen I and matrigel mixtures. Both experiments will be performed under amino acid starvation, in the presence of pharmacological inhibitors and siRNAs targeting macropinocytosis (FRAX597, PAK1) and tyrosine catabolism (Nitisinone, HPDL). Preliminary data suggest that both FRAX597 and Nitisinone reduce cell invasiveness.

      In addition, to strengthen the paper and give a stronger significance in terms of clinical translatability, it could be useful to implement the analysis of breast and pancreatic patients by publicly dataset evaluating for example free survival, disease free survival, overall survival and metastasis free survival.

      We have now included in the manuscript new data in figure 6 O-R showing that high HPDL expression correlates with reduces overall survival, distant metastasis-free survival, relapse-free survival and palliative performance scale in breast cancer patients. In response to other reviewers’ comments, we have removed the pancreatic cancer data from our manuscript.

      Minor Comment:

      The text and the figures are clear and accurate. The references cited support the hypothesis, rightly introduce the results and are appropriate for the discussion. However, the paragraph relative to figure 4 is a little confusing. Changing the order of the description of the results could be useful.

      We apologise for the lack of clarity in this section. We have now re-organised the data both in the figure and in the result section, to describe the findings in a more logical way.

      Reviewer #1 (Significance (Required)): Based on my metabolic background in tumour aetiology and progression, I think that this study provides new insights into cancer metabolism knowledge, in particular on how the stroma may drive metabolic reprogramming of cancer cells sustaining cell growth in nutrient stress conditions. Together with other similar studies on the stromal non-cellular components, the data here shown can contribute to expand the knowledge on the factors that promote cancer metabolic plasticity, which is exploited from cancer cells to obtain advantages in terms of growth, survival and progression. In conclusion, I think that the results shown are new and the manuscript is well presented. Following the short revision process suggested, it will be eligible for a final publication in a medium-high impact factor journal.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      • Please find enclosed my reviewing comments on the manuscript entitled "The extracellular matrix supports cancer cell growth under amino acid starvation by promoting tyrosine catabolism" by Nazemi et al.*

      In this manuscript the group of Elena Romero and colleagues provides evidence that breast cancer cells, and pancreatic cancer cell, use matrix proteins degradation to feed their proliferative metabolic needs under amino acid starvation. Under this drastic condition, cancer cells use micropinocytosis to uptake matrix proteins, a process that requires mTORC1 activation and PAK1. Furthermore, a metabolomic study demonstrates that ECM-dependent cancer cell growth relies on tyrosine catabolism. Altogether, I found this study conceptually interesting and experimentally well performed. Experiments are well controlled and state-of-the-art technologies used in this manuscript make it a good candidate for publication. However, some aspects of the work need to be strengthen to reinforce the overall good quality of the manuscript, therefore, please find below some experimental propositions. 1. Despite the reviewer proposition, I believe that the additional experiments using the PDAC cancer cell does not improve the quality of the manuscript. Instead, it brings confusion to me, since the relative addition is minor compare to what is demonstrated using breast cancer cells.

      We have decided to remove the pancreatic cancer cell data from the manuscript.

      To importantly improve the potential impact of this manuscript, I suggest to add in vivo data using either syngenic mice model of breast cancer or xenografted human breast cancer cells in nude mice. What would be the impact of micropinocytosis and tyrosine catabolism inhibition on cancer growth, in vivo, should be demonstrated? If possible, it may be interesting to demonstrate that this micropinocytosis may interfere with cancer evolution toward a metastatic phenotype using, for example, the PyMT-MMTV mice model of breast cancer development?

      We will perform orthotopic mammary fat pad injections in immunocompetent mice, to monitor primary tumour growth and localised invasion in the presence of Nitisinone or vehicle control. PyMT-driven breast cancer cells have been generated in the Blyth lab, from FVB-pure MMTV-PyMT mice and we have preliminary data indicating that these cells are able to internalise ECM and grow under starvation in an ECM-dependent manner. Prior to performing any in vivo work, we will perform further in vitro experiment to confirm the role of tyrosine catabolism in these cells. Nitisinone is an FDA-approved drug that has already been used in mouse models. Blood tyrosine levels can be measured to assess tyrosine catabolism inhibition by Nitisinone. These experiments will be conducted in collaboration with the Blyth lab at the CRUK Beatson Institute in Glasgow.

      Data obtained using cancer cells with different metastatic property suggest that the ability to use ECM to compensate for soluble nutrient starvation is acquired during cancer progression. To further demonstrate that it is the case, would it be possible that non metastatic breast cancer cells are not able to perform micropinocytosis? Is PAK1 overexpressed with increase cancer cells metastatic ability, without affecting invasive capacity in 3D spheroids?

      To address these points, we have started to measure PAK1 expression across the MCF10 series of cell lines, where MCF10A are non-transformed mammary epithelial cells, MCF10A-DCIS are ductal carcinoma in situ cells and MCF10CA1 are metastatic breast cancer cells. Our preliminary data show that there is no upregulation of PAK1 expression in the metastatic cells compared to non-transformed or non-invasive cancer cells. This suggest that the over-expression of PAK1 might not be a valuable strategy to address this point.

      In addition, we found that collagen I uptake was upregulated in MCF10CA1 compared to MCF10A and MCF10A-DCIS. We will corroborate our preliminary data by quantifying collagen I and cell-derived matrices internalisation across the 3 cell lines.

      What would be the efficacy to promote the ECM-dependent growth under starvation following a mTORC1 in non-invasive cancer cells?

      We will measure the growth of MCF10A and MCF10A-DCIS on ECM under starvation in the presence of the mTOR activator MHY1485. Western blot analysis of downstream targets of mTORC1 (p-S6 and p-4EBP1) will confirm the extent of mTOR activation.

      The discrepancy of cancer cells proliferation under starvation condition between plastic and ECM-based supports could be explained by the massive difference of support rigidity. This is also probably the case between CDM made by normal fibroblast and CAF. It brings the question of studying the role of matrix stiffness in regard to micropinocytosis-dependent cancer cells growth. It would also explain why this process is link to aggressive cancer cell behaviour, as ECM goes stiffer with time in cancer development. It may not be the case, but the demonstration that mechanical cues from the ECM could regulate the micropinocytosis-dependent cancer cells growth under amino acid starvation could bring additional value to the manuscript.

      We will use 2 experimental approaches to address the effect of different stiffness in ECM-dependent cell growth:

      1. Polyacrylamide hydrogels coated with different ECM components.
      2. Collagen I gels in which the stiffness is modified by Ribose treatment (this approach has been published by the Parson’s lab). Our preliminary data confirmed that ribose cross-linking increased YAP nuclear localisation and collagen I can still be internalised under these conditions. We will assess ECM endocytosis and cell growth under starvation conditions (using EdU incorporation in conjunction with A and high throughput imaging with B)

      Along with this, it has been demonstrated that matrix rigidity regulates glutaminolysis in breast cancer, resulting in aspartate production and cancer cells proliferation. Is asparate production increase by micropinocytosis? Could you rescue cancer cells growth by aspartate supplementation?

      Our metabolomics experiments were performed under amino acid starvation; therefore glutamine was not present in the media. Nor glutaminolysis intermediates nor aspartate were upregulated on ECM compared to plastic in our datasets, suggesting that aspartate might not be involved in this system. We added this point in the discussion. However, glutamine, glutamate and aspartate were found to be upregulated on collagen I compared to plastic in complete media, where the most enriched pathway was alanine, aspartate and glutamate metabolism. Future work will address the role of the ECM in supporting cancer cell metabolism in the absence of nutrient starvation.

      Data presented in Fig 1 and SF1 show that breast cancer cell lines growth in a comparable manner either they are cultured on plastic or on 3D ECM substrates in complete media. Again, on thick 3D substrates, in which the stiffness is lower compared to plastic, I would have thought that cancer cells would have grown slower. Could you please discuss this finding in regard to the literature?

      Our experiments in full media were performed in the presence of dialysed serum, to represent a better control for the starvation conditions, which were in the presence of dialysed serum. This is consistent with a vast body of literature assessing nutrient starvation conditions in the presence of dialysed serum. This could explain the discrepancy between ours and published results. We have addressed this point in the discussion.

      If you have the capacity to do so in your lab or in collaboration, would it be possible to measure the exact stiffness of the different matrix you use in this manuscript? Or using hydrogel, would it be possible to study the role of matrix stiffness in the ECM-dependent cancer cells growth under AA starvation? I would understand that this may be out of the scope of the present manuscript, but I again believe that such finding would reinforce the manuscript.

      We don’t have the capacity to measure the stiffness in our lab, however NF-CDM and CAF-CDM, generated by the same cells and using the same protocol, have been previously measured at ~0.4kPa and ~0.8 kPa, respectively (Hernandez-Fernaud et al., 2017). We have now included this in the paper. As mentioned in response to point 4, we will use hydrogels to directly test the effect of matrix stiffness on ECM-dependent cell growth under nutrient starvation.

      In SF 3A-C, it is shown that ECM does not affect caspase-dependent cell death under AA starvation. Did you considered a non-caspase dependent cell death that may be triggered by AA starvation?

      We will complement the caspase 3/7 data by performing PI staining, to detect all forms of cell death. Preliminary data indicate that, consistent with our cas3/7 data, amino acid starvation promotes cell death, but the presence of the ECM doesn’t affect the percentage of PI positive cells, corroborating our conclusions that the ECM modulates cell proliferation and not cell death. We will complete these experiments in both MDA-MB-231 and MCF10CA1 cells and will include them in figure S3.In fig 5, it is shown that inhibition of Focal Adhesion Kinase (FAK) does not impair the ECM-dependent rescue of cancer cell growth under starvation. To further decipher the concept of adhesion dependent signalling, maybe the authors could also inhibit the Src kinase or ITG-beta1 activation?

      Integrin b1 is also required for ECM internalisation (our unpublished data), therefore interfering with integrin function would make the interpretation of the data quite complex. As suggested by the reviewer, we will use the Src inhibitor PP2, which has been extensively used in the literature in MDA-MB-231 cells. Preliminary data indicate that, despite significantly reducing cell proliferation in complete media, Src inhibition does not affect cell growth on collagen I under amino acid starvation, consistent with our FAK inhibitor data. We will complete these experiments on both collagen I and cell-derived matrices and will include them in figure 5.

      Minor comment, in F1B, it is written "AA free starvation" while in every others legend, it is written "AA starvation". I believe the "free" should be removed.

      We apologise for this mistake; we have now removed “free” from the legend.

      Reviewer #2 (Significance (Required)): Altogether, I found this study conceptually interesting and experimentally well performed. Experiments are well controlled and state-of-the-art technologies used in this manuscript make it a good candidate for publication. However, some aspects of the work need to be strengthen to reinforce the overall good quality of the manuscript.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)): In this manuscript the authors explore the mechanisms that metastatic cancer cells use to adapt their metabolism. The authors show that the growth of cancer cell lines is supported by uptake of ECM components in nutrient-starved conditions. The authors propose a very interesting mechanism in which the cells adapt their metabolism to ECM uptake as nutrient source via a PAK1-dependent macropinocytosis pathway which in turn increases tyrosine catabolism. Several key aspects of the authors complex hypothesis require further controls to fully support the authors ideas. As a disclaimer we do not feel qualified enough to comment on the metabolite experiments. Please find our detailed comments below.

      * Major -The ECM mediated increase of cell growth under amino acid (AA) starvation is nicely shown In Fig.1 but the authors should include the full medium data from figure S1 in the graphs of Fig. 1 to enable the reader to evaluate the magnitude of rescue effect of the ECM components. The values should also be included in the results text*.

      We have now moved all the complete media data into the main figure and highlighted the extent of the rescue in the result section.

      Also the authors only glutamine starve in Fig1&2 and then don't mention it again can the authors please include a sentence to explain why this experiment was dropped.

      As now highlighted in the result section, we focused on the amino acid starvation as it resulted in the strongest difference between normal and cancer. On the one hand, also non-invasive breast cancer cells can use ECM (namely matrigel) to grow under glutamine starvation, while this is not the case under amino acid starvation. On the other hand, only CAF-CDM, but not normal-CDM, could rescue cell growth under amino acid starvation. We reasoned that this condition was more likely to identify cancer-specific phenotypes.

      - The evaluation of uptake pathways is very interesting. The focus on macropinocytosis is not entirely justified in our opinion looking at FigS4A. Caveolin1/2 and DNM1/3 seem to have strongest effect on uptake of Matrigel and not PAK1? Statements like "Since our data indicate that macropinocytosis is the main pathway controlling ECM endocytosis..." are not justified nor are they really needed in our opinion. Several pathways can be implicated in passive uptake.

      We have now removed the statement, as suggested by the reviewer. In addition, we will assess CDM uptake upon caveolin 1/2 and DNM 2/3 knock-down, to test whether the effects are matrigel specific.

      - The authors use FAK inhibition to evaluate the effect of focal adhesion signalling on their phenotypes and conclude that there is no connection between the observed increase of cell proliferation in presence of ECM and adhesion signalling. To make this assessment the authors need at the very least to show that their FAK inhibitor treatment at the used concentration results in changes in focal adhesions and the associated force transduction.

      In the result section, we are including a western blot analysis showing that the concentration of FAK inhibitor used in sufficient so significantly reduced FAK auto-phosphorylation. Based on published evidence (Horton et al., 2016), FAK inhibition does not affect focal adhesion formation, but only the phosphorylation events. Therefore, we don’t think that we will be able to detected changes in focal adhesions regardless of the concentration of the inhibitor we use. To strengthen the observation that ECM-dependent cell growth in independent from adhesion signalling, as suggested by reviewer #2, we will use the Src inhibitor PP2, which has been extensively used in the literature in MDA-MB-231 cells. Preliminary data indicate that, despite significantly reducing cell proliferation in complete media, Src inhibition does not affect cell growth on collagen I under amino acid starvation, consistent with our FAK inhibitor data. We will complete these experiments on both collagen I and cell-derived matrices and will include them in figure 5.

      -The pancreatic cancer data currently feels a bit like an afterthought. We suggest to remove this data from the manuscript. If this data is included we suggest the authors should expand this section and repeat key experiments of earlier figures.

      We have now removed these data from the manuscript, as this was also the suggestion of reviewer #2.

      -Was the fetal bovine serum (FBS) and Horse Serum (HS) the authors use in their experiments tested for ECM components? The authors mention that the FBS for MDA231 cells was dialysed but not the HS.

      HS was used at a much lower concentration that FBS in our cell proliferation experiments (2.5% compared to 10%). We will characterise both sera components by mass spectrometry analysis, in collaboration with Dr Collins, biOMICS Facility, University of Sheffield.

      Minor comments:

      -Please can the authors provide experimental data directly comparing NF-CAM versus CAF-CDM on the same graph (Figure 1D-E).

      In the experiments included in the manuscript, the two matrices were generated independently, and we don’t feel it is appropriate to combine the results in the same graph. We are now repeating these experiments by generating both matrices in the same plates, so that we can present the data in the same graph. -Please can the authors give more insight to the use of 25% Plasmax to mimic starved tumor microenvironment. Is there previous research that suggests the nutrient values are representative of TME?

      Apologies for not clarifying this in the initial submission, the rationale behind this choice is based on the observation that, in pancreatic cancers, nutrients were shown to be depleted between 50-75% (Kamphorst et al., 2015). We have now stated this in the result section.

      -Fig3E Can the authors please include example images of the pS6 staining in the supplementary figures and explain "mTOR endosomal index" in figure legend.

      We have now included the representative images (new figure 3E) and we have described how the mTOR endosomal index was calculated both in the figure legend and in the method section. -Can the authors include a negative control for the mTORC1 localisation in Fig.3 (such as use of rapamycin/Torin)?

      Amino acid starvation is the gold-standard control for mTORC1 lysosomal targeting, as described in a variety of publications, including Manifava et al., 2016; Meng et al., 2021; Averous et al., 2014. In addition, Torin 1 treatment has been shown to result in a significant accumulation of mTOR on lysosomes compared with untreated cells (Settembre et al., 2012). Consistent with this, we looked at mTOR localisation in the presence of Rapamycin and we did not detect any reduction in lysosomal targeting.

      - The PAK1 expression level blots in the knockdown experiments should be quantified from N=3.

      We have not included the quantification of the western blots in the new supplementary figure 5.

      -What is the FA index in Fig.5, explain how it is calculated. Why not use FA size alone?

      We have now defined this is the method section. We haven’t used FA size alone, as this measure can be affected by cell size. If a cell is bigger, the overall FA size will be bigger, but this doesn’t necessarily reflect a change in adhesions.

      -Can the authors please use paragraphs on page 9 to improve readability. We apologise for overlooking this, we have now used paragraph in this section.

      Reviewer #3 (Significance (Required)): The findings presented here will be very interesting to a broad range of researchers including the cancer, metabolism and wider cell biology communities. The Rainero lab has progressed the idea that ECM uptake supports cancer progression and the data presented here has the potential to significantly advance our understanding of the underlying cellular mechanisms.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This study by Viedor et al. examines the role of TIS7 (IFRD1) and its ortholog SKMc15 (IFRD2) in the regulation of adipogenesis via their ability to modulate the levels of DLK1 (Pref-1), a well-known inhibitor of adipogenesis. They generate SKMc15 KO mice and cross them to previously published TIS7 KO mice. All 3 mutant strains show decreased fat mass, with the effect being most pronounced in double KO mice (dKO). Using mouse embryonic fibroblasts (MEFs) from mutant mice, they authors ascribe a defect in adipogenic differentiation of mutant cells to an upregulation of DLK-1. In the case of TIS7, they propose that this is due to its known inhibition of Wnt signaling, which regulates DLK-1 expression. In the case of SKMc15, they suggest a new mechanism linked to its ability to suppress translation. Overall, the work is of interest, with the finding, that SKMc15 regulates adipocyte differentiation being its novelty, and generally well done, though multiple aspects need to be improved to bolster the conclusions put forth.

      **Major concerns:**

      1)The main mechanism put forth by the authors to explain the inability of dKO cells to differentiate into adipocytes is the upregulation of DLK-1 levels. However, this notion is never directly tested. Authors should test if knockdown of DLK-1 in dKO cells is sufficient to correct the defect in differentiation, or if additional factors are involved.

      Response: In response to the reviewer’s concerns, we have generated two stable cell lines expressing short hairpin RNAs directed against DLK1 in the TIS7 SKMc15 dKO MEFs. With these two and the parental dKO MEF cell line, we have performed adipogenesis differentiation experiments as explained in the manuscript before. Figure EV2C (left and right panels) shows that knockdown of DLK1 with two different DLK1 shRNA constructs (targeting DLK1 with or without the extracellular cleavage site) significantly (P2)There are multiple instances were the authors refer to "data not shown", such as when discussing the body length of dKO mice. Please show the data in all cases (Supplementary Info is fine) or remove any discussion of data that is not shown and cannot be evaluated.

      Response: Following three results were in the initial version of our manuscript mentioned as “data not shown”:

      • line 137: “body length, including the tail did not significantly differ between WT and dKO mice”
      • line 307: “higher concentrations of free fatty acids in the feces of dKO mice”
      • line 331: “effects of ectopic expression of TIS7, SKMc15 and their co-expression on DLK-1 levels” In the current version of the manuscript, we provide these results as:

      • Figure EV1A shows no significant difference in body length.

      • The significantly elevated levels of free fatty acids and energy determined by bomb calorimetry in the feces of dKO animals fed HFD are shown in Figures 6A and B, respectively.
      • The significant inhibitory effect of ectopic expression of TIS7 and SKMc15 on DLK1 levels was identified by both qPCR and WB analyses, which are shown in Figure 3B. 3)Indirect calorimetry data shown in Fig. S1 should include an entire 24 hr cycle and plots of VO2, activity and other measured parameters shown (only RER and food intake are shown), not just alluded to in the legend.

      Response: Based on the reviewer’s suggestion, we present here a table containing all parameters measured in the indirect calorimetry experiment.

      Metabolic phenotyping presented in Figure EV1B containing 21 hours measurement was performed exactly according to the standardized protocol previously published by Rozman J. et al. [1]. All phenotyping tests were performed following the International Mouse Phenotyping Resource of Standardized Screens (IMPReSS) pipeline routines.

      4)It is surprising that the dKO mice weight so much less than WT even though their food consumption and activity levels are similar, and their RER does not indicate a switch in fuel preference. An explanation could be altered lipid absorption. The authors indicate that feces were collected. An analysis of fat content in feces (NEFAs, TG) needs to be performed to examine this possibility. The discussion alludes to it, but no data is shown.

      __Response: __We thank the reviewer for bringing up this important point that prompted us to present data clarifying this aspect of the metabolic phenotype of dKO mice. As shown in Figures 6A,B, while fed with HFD, dKO mice had higher concentrations of free fatty acids in the feces (109 ± 10.4 µmol/g) when compared to the WT animals (78 ± 6.5 µmol/g) and a consequent increase in feces energy content (WT: 14.442 ± 0.433 kJ/g dry mass compared to dKO: 15.497 ± 0.482 kJ/g dry mass). Thus, lack of TIS7 and SKMc15 reduced efficient free fatty acid uptake in the intestines of mice.

      5)It would be important to know if increased MEK/ERK signaling and SOX9 expression are seen in fat pads of mutant mice, not just on the MEF system. Similarly, what are the expression levels of PPARg and C/EBPa in WAT depots of mutant mice?

      Response: To address this point, we have now performed the MEK/ERK activity measurement for the revised version of the manuscript in gonadal WAT tissue (GWAT). As noted in samples from several mice, there was an increase in p42 and p44 MAPK phosphorylation in G WAT isolated from dKO mice compared with the G WAT from WT control mice (Figure 4G).).

      The mRNA expression levels of PPARg and C/EBPa were significantly downregulated in GWAT samples isolated from dKO mice compared with levels from WT control animals (Figure 4H). However, we did not find any significant difference in SOX9 expression in fat pads. Total amounts of Sox9 mRNA in terminally differentiated adipocytes were very low and not within the reliable detection range, and the variation between animals within the same group was too great. Therefore, we provide these data only for the reviewer’s information here and do not present them in the manuscript.

      6)Analysis of Wnt signaling in Fig. 3c should also include a FOPflash control reporter vector, to demonstrate specificity. Also, data from transfection studies should be shown as mean plus/minus STD and not SEM. This also applies to all other cell-based studies (e.g., Fig. 6b,c).

      Response: To address the reviewer’s concerns, we performed FOPflash control reporter measurements in MEFs of all four genotypes. As expected, in every tested cell line the luciferase activity of the FOPflash reporter was substantially lower than that of TOPflash, confirming the specificity of this reporter system.

      We also thank the reviewer for this important reference to our statistical analyses. We have revised the original data and found that the abbreviation SEM was inadvertently used in the legends instead of STD. STD was always used in the original analyses and therefore we have corrected all legends accordingly in the new version of the manuscript.

      7)It is unclear why the authors used the MEF model rather than adipocyte precursors derived from the stromal vascular fraction (SVF) of fat pads from mutant mice. If they did generate data from SVF progenitors, they should include it.

      __Response: __We agree with this comment, although performing the experiments was challenging enough for us. Therefore, we isolated inguinal fat pads and obtained SVF cells from mice of all four genotypes (WT, TIS7, SKMc15 single and double KOs) and have repeated crucial experiments, i.e. adipocyte differentiation, DLK1, PPARg and C/EBPa mRNA and protein analyses in these cells. Novel data gained in this cell system fully confirmed our previous observations in MEFs. Therefore, in the current version of the manuscript we have replaced figures describing the effects of lacking TIS7 and SKMc15 in MEFs by adipose tissues samples (Figures 2D,E, 4G,H,I and 6C) or SVF cells from inguinal WAT (Figures 2A,B,F,G,H, 3C,D,E and F). In addition to the results obtained from SVF cells of inguinal WAT, we also obtained comparable data from SVF cells isolated from fat pads of gonadal WAT. We provide the results from gonadal WAT hereafter for the reviewers' information only.

      amido black

      • 60 kDa

      • 50 kDa

      __G WAT __tissues

      dKO

      WT

      GAPDH

      DLK-1

      • 60 kDa

      • 50 kDa

      • 35 kDa

      WT

      dKO

      DLK-1

      • 60 kDa

      • 50 kDa

      __G WAT __undifferentiated cells

      undifferentiated G WAT cells

      The only experiments where we have still used data obtained in MEFs are those where the ectopic expression or effects of shRNA were necessary (e.g. Figures 2C, 3B,H,I, 5F,G EV2B,C and EV3 A-F).

      8)Given that the authors' proposed mechanism involves both, transcriptional and post-transcriptional regulation of DLK-1 by TIS7 and SKMc15, Fig. 4d should be a Western blot capturing both of these events, and not just quantitation of mRNA levels.

      Response: As requested by the reviewer, we have added in Figure 3B the Western blot analysis of DLK1 expression. Secondly, this experiment was entirely redone and we now show the effects of ectopic expression of SKMc15, TIS7 alone and their combination side by side with the control GFP. We present here the effects of stable expression of ectopic TIS7 and SKMc15 in dKO MEFs following the viral delivery of expression constructs, antibiotic selection and 8 days of adipocyte differentiation.

      9)There is no mention of the impact on brown adipose tissue (BAT) differentiation of KO of TIS7, SKMc15, or the combination. Given the role of BAT in systemic metabolism beyond energy expenditure, the authors need to comment on this issue.

      Response: We thank the reviewer for bringing up this important point that prompted us to better describe the phenotype of TIS7, SKMc15 and double knockout mice. We measured DLK1 protein levels in BAT isolated from WT, TIS7, and SKMc15 mice with single and double knockout and detected a significant increase in DLK1 protein levels in all three knockout genotypes. Five mice per genotype were analyzed, and the statistical analysis in Figure 4I represents the mean ± STD. The p-values are based on the results of the Student's t-test and one-way Anova analysis (p-value = 0.0241).

      **Minor comments:**

      10)The y axis in Fig. 2c is labeled as gain of body weight (g). Is it really the case that WT mice gained 30 g of body weight after just 3 weeks of HFD? This rate of increase seems extraordinary, and somewhat unlikely. Please re-check the accuracy of this panel.

      Response: We thank the reviewer for drawing our attention to the apparent mislabeling of the y-axis. The correct labeling is: "Increase in body weight in %" and Figure 1F has been corrected accordingly.

      11)The Methods indicates all statistical analysis was performed using t tests, but this is at odds with some figure legends that indicate additional tests (e.g., ANCOVA).

      Response: This inaccurate information in the manuscript was corrected.

      12)Please specify in all cases the WAT depot used for the analysis shown (e.g., Fig. 3d is just labeled as WAT, as are Fig. 4a,e, etc.).

      Response: This information was added at all appropriate places of the manuscript.

      13)Fig. 5d is missing error bars, giving the impression that this experiment was performed only once (Fig. 5c). The legend has no details. Please amend.

      __Response: __We thank the reviewer for this important point regarding the statistical analyses. In the new version of the manuscript, we have included a graph (now Figure 4D) depicting results of three independent experiments including the results of the statistical analysis performed. Statistical analysis was performed using One-Way ANOVA (P=0.0016).

      Reviewer #1 (Significance (Required)):

      The role of TIS7 in adipocyte differentiation is well established. The only truly novel finding in this work is the observation that SKMc15 also plays a role in adipogenesis. The molecular mechanisms proposed (modulation of DLK-1 levels) are not novel, but make sense. However, they need to be bolstered by additional data.

      **Referees cross-commenting**

      I think we are all in agreement that the findings in this work are of interest, but that significant additional work is required to discern the mechanisms involved. In my view, a direct and specific link between SKMc15 and translation of DLK-1 needs to be established and its significance for adipogenesis in cells derived from the SVF of fat pads determined. Reviewer 2 has suggested some concrete ways to provide evidence of a direct link.

      __Response: __We agree with the reviewer's comment and have also noted that this point will be crucial in assessing the novelty value of our manuscript, as was also expressed in the referees cross-commenting. Therefore, we have now additionally performed a polysomal RNA analysis, which has of course been included in the current version of the manuscript.

      We analyzed the differences in DLK-1 translation between wild-type control cells and SKMc15 knockout cells in the gradient-purified ribosomal fractions by DLK-1 qPCR. Our analysis identified significantly (pSimilarly, as proposed by the reviewer, we have established stromal vascular fraction cell cultures from inguinal fat pads. In SVF cells of TIS7 and SKMc15 single and double knockout mice, we found increased DLK1 mRNA and protein levels (Figures 2F,G and H) as well as decreased PPARg and C/EBPa levels (Figures 3C,D,E and F). Specifically, we found that the ability of knockout SVF cells to differentiate into adipocytes was significantly downregulated (Figures 2A and B), fully confirming our original findings in TIS7 and SKMc15 knockout MEFs.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      In the current study, Vietor et al. aimed to explore the regulation of Delta-like homolog 1 (DLK-1), an inhibitor of adipogenesis, and demonstrated a role for TIS7 and its orthologue SKMc15 in the regulation of adipogenesis by controlling the level of DLK-1. Using mouse models with whole body deficiency of TIS7 (TIS7 KO) or SKMc15 (SKMc15KO) and double KO (TIS7 and SKMc15 dKO) mice, the authors used a combination of in-vivo experiments and cell culture experiments with mouse embryonic fibroblasts derived from the KO animals, to show that the concurrent depletion of TIS7 and SKMc15 dramatically reduced the amount of adipose tissues and protected against diet-induced obesity in mice, which was associated with defective adipogenesis in vitro.

      **Major Comments:**

      Overall, this study presents convincing evidence that TIOS7 and SKMc15 are necessary for optimal adipogenesis, and proposes a novel mechanism for the control of DLK1 abundance via coordinated regulation of DLK-1 transcription and translation. However, a number of questions remain largely unanswered. In particular, the direct ability of SKMc15 to regulate the translation of DLK-1 is lacking, and this claim remains speculative. SKMc15 being a general inhibitor of translation, SKMc15 may have an effect on adipogenesis independently of its regulation of DLK-1. Thus, addressing the following comments would further improve the quality of the manuscript:

      Response:

      We have been very attentive to these comments to improve the novelty and quality of our manuscript and have tried to address them experimentally. Therefore, this thorough revision of our manuscript took a longer time. First, we identified polysomal enrichment of DLK-1 RNA in SKMc15 KO MEFs, demonstrating that SKMc15 translationally affects DLK-1 levels (Figure 3I). Second, treatment with a recombinant DLK-1 protein as well as its ectopic expression quite clearly blocked adipocyte differentiation of WT MEFs (Figures EV3B,C). In addition, two different shRNA constructs targeting DLK-1 significantly induced adipocyte differentiation of TIS7 SKMc15 dKO MEFs (Figure EV2C, left and right panels). We believe that these results, taken together, sufficiently support our proposed mechanism, namely that TIS7 and SKMc15 control adipocyte differentiation through DLK-1 regulation.

      • The experimental evidence supporting that SKMc15 controls DLK-1 protein levels comes primarily from the observations that DLK-1 abundance is further increased in SKMc15 KO and dKO WAT than in TIS7KO WAT (Fig 3d), and that translation is generally increased in SKMc15 KO and dKO cells (Fig 5a). However, since the rescue experiment is performed in dKO cells, by restoring both TIS7 and SKMc15 together, it is impossible to disentangle the effects on DLK-1 transcription, DLK-1 translation and on adipogenesis. A more detailed description of the TIS7 and SKM15c single KO cells, with or without re-expression of TIS7 and SKMc15 individually, at the level of DLK-1 mRNA expression and DLK-1 protein abundance would be necessary. In addition, polyribosome fractioning followed by qPCR for DLK-1 in each fraction, and by comparison with DLK-1 global expression in control and SKMc15 KO cells, would reveal the efficiency of translation for DLK-1 specifically, and directly prove a translational control of DLK-1 by SKMc15. Alternatively, showing that DLK-1 is among the proteins newly translated in SKMc15 KO cells (Fig. 5a) would be helpful. Response: As suggested by the reviewer we used single TIS7 and SKMc15 knockout cells and demonstrated that both, TIS7 and SKMc15, affect Dlk-1 mRNA levels. We identified a highly significant effect on total DLK-1 mRNA levels in SKMc15 knockout MEFs as presented in Figure 3H. We also show that DLK-1 mRNA is specifically enriched in polysomal fractions obtained from proliferating SKMc15 knockout MEFs when compared to WT MEFs. However, the strong accumulation of DLK-1 mRNA in polysomes cannot be explained by transcriptional upregulation of DLK-1 alone, suggesting that regulation also occurs at the translational level. We took up this suggestion and ectopically expressed TIS7 and SKMc15 separately or together. For this purpose, we used not only MEF cell lines with double knockout but also with single knockout. Our recent data showed that stable ectopic expression of SKMc15 significantly increased adipocyte differentiation in both, single and double TIS7 and SKMc15 knockout MEF cell lines (Figures EV1C,D and EV2A). Ectopic expression of TIS7 significantly induced the adipocyte differentiation in TIS7 single knockout MEFs (Figure EV1C). In addition, both genes down regulated DLK-1 mRNA expression in dKO MEFs (Figure EV2A, bar chart on the right). We fully agree with the opinion of both reviewers and as already explained above we identified by qPCR in the polysomes that SKMc15 directly regulates DLK-1 translation (Figure 3I).

      • While the scope of the study focuses on the molecular control of adipogenesis by TIS7 and SKMc15 via the regulation of DLK-1, basic elements of the metabolic characterization of the KO animals providing the basis for this study would be useful. Since the difference in body weight between WT and dKO animals is already apparent 1 week after birth (Fig 1a), it would be interesting to determine whether the fat mass is decreased at an earlier age than 6 months (Fig 1b). The dKO mice are leaner despite identical food intake, activity and RER (Sup Fig 1). It remains unclear whether defective fat mass expansion is a result or consequence of this phenotype. Is the excess energy stored ectopically? The authors mention defective lipid absorption, however, these data are not presented in the manuscript. It would be interesting to investigate the relative contribution of calorie intake and adipose lipid storage capacity in the resistance to diet-induced obesity. In addition, data reported in Fig 1c seem to indicate a preferential defect in visceral fat development, as compared to subcutaneous fat. It would be relevant if the authors could quantify it and comment on it. Are TIS7 and SKMc15 differentially expressed in various adipose depots? The authors used embryonic fibroblasts as a paradigm to study adipogenesis. It would be important to investigate, especially in light of the former comment, whether pre-adipocytes from subcutaneous and visceral stroma-vascular fractions present similar defects in adipogenesis. Response: We addressed the issue of lipid storage capacity raised by the reviewer using two experimental methods. First, we have analyzed feces of mice fed with high fat diet. The free fatty acids content in dKO mice feces was significantly (PConcerning the question of younger animals, we have repeated microCT fat measurements on a group of 1-2 months old WT and dKO male mice (n=4 per group). The total amount of abdominal fat was in WT mice significantly higher than in dKO mice (P=0.019; Student’s T-test). We provide these data only for the reviewer’s information here and do not present them in the manuscript.

      We have also followed the reviewer’s advice and revisited our microCT measurements of abdominal fat and anylyzed the possible differences between subcutaneous and visceral fat. In all three types of abdominal fat mass measurement (total, subcutaneous and visceral) there was always significantly (ANOVA P=0.034 subcutaneous, P=0.002 total and P=0.002 visceral fat) less fat in the dKO group (n=8) of mice when compared to WT (n=12) mice. However, the difference was more prominent in visceral (P=0.001; Student’s T-test) than in subcutaneous fat (P=0.027; Student’s T-test). We provide these data only for the reviewer’s information here and do not present them in the manuscript.

      In addition, we have analyzed the expression of TIS7 and SKMc15 mRNA expression in both, inguinal and gonadal WAT. Our qPCR result showed that both genes are expressed in different types of WAT. The qPCR analysis was performed on RNA isolated from undifferentiated SVF cells isolated from several animals. The expression of TIS7 and SKMc15 was normalized on GAPDH. Data represent mean and standard deviation of technical replicates from several mice as labeled in the graph. We provide these data only for the reviewer’s information here and do not present them in the manuscript.

      Topics of a) stromal vascular fraction as a source of pre-adipocytes and b) comparison of TIS7 and SKMc15 roles in gonadal vs. inguinal fat pads we answered in response to the Reviewer #1, point 7. The results are presented in Figures 2, 3 and 4 and in this document.

      Both data and methods are explained clearly. The experiments are, for the most part, adequately replicated. However, whenever multiple groups are compared, ANOVA should be employed instead of t-test for statistical analysis.

      Response: Thank you for pointing this out. Wherever it was applicable, we used ANOVA for the statistical analysis of data.

      **Minor comments:**

      • Figure 4 d. The appropriate control would be WT with empty vector Response: this experiment was entirely replaced by the new Figure 3B where stably transfected MEF cells expressing TIS7 or SKMc15 were used.

      • Figure 7c/d. The appropriate control would be WT with empty vector Response: We have now generated new, confirmatory data in MEF cells stably expressing TIS7 or SKMc15 following lentiviral expression.

      • Figure 5C. An additional control would be WT with WT medium __Response: __We agree with your suggestion and therefore we have incorporated this control in all experimental repeats presented in the new Figure 4C.

      • Figure 2: In the legends, the "x" is missing for the dKO regression formula __Response: __Thank you, we have corrected this mistake. In the current version of the manuscript it is Figure 1D.

      • Since the role of SKMc15 in adipogenesis has never been described, the authors could consider describing the single SKMc15 KO in addition to the dKO, or explain the rationale for focusing the study on dKO. __Response: __The original reason for focusing on dKO mice and cells was the obvious and dominant phenotype in this animal model. However, we have sought to address the reviewer's concerns and have now also examined DLK-1 mRNA levels in proliferating SKMc15 knockout MEFs (Figure 3H). In addition to this experiment, we measured DLK-1 mRNA levels also during the process of adipocyte differentiation of single knockout cells. In WT MEFs we observed a transient increase of DLK-1 mRNA only on day 1. In contrast, significantly elevated DLK-1 mRNA levels were found in TIS7 single-knockout MEFs throughout the differentiation process, with the highest level reached at day 8. Interestingly, in SKMc15 single knockout MEFs we found an upregulation of DLK-1 mRNA level in proliferating cells but not a further increase during the differentiation. This supported our idea that SKMc15 acts mainly via translational regulation of DLK-1. We provide these data only for the reviewer’s information here and do not present them in the manuscript.

      To emphasize this point, we revised the entire manuscript accordingly and added data on SKMc15 knockout mice. In particular, experiments presenting data characterizing SKMc15 single knockout mice are presented in: Figures 1C,D,E and F, Figures 2A,B,C and D, Figures 3E,F,H and I, Figures 4A and I and in Figure EV1D.

      Reviewer #2 (Significance (Required)):

      While the effects of DLK-1 on adipogenesis have been widely documented, the factors controlling DLK-1 expression and function remain poorly understood. Here the authors propose a novel mechanism for the regulation of DLK-1, and how it affects adipocyte differentiation. This study should therefore be of interest for researchers interested in the molecular control of adipogenesis and cell differentiation in general. Furthermore, the characterization of the function of SKMc15 in the control of translation may be of interest to a broader readership.

      **Referees cross-commenting**

      I agree with all the comments raised by the other reviewers. Addressing the often overlapping but also complementary questions would help to clarify the molecular mechanisms by which TIS7 and SKMc15 control adipogenesis, and support the conclusions raised by the authors.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In the article, "The negative regulator DLK1 is transcriptionally regulated by TIS7 (IFRD1) and translationally by its orthologue SKMc15 (IFRD2)", the authors performed a double knockout (dKO) of TIS7 and its orthologue SKMc15 in mice and could show that those dKO mice had less adipose tissue compared to wild-type (WT) mice and were resistant to a high fat-diet induced obesity. The study takes advantage of number of different methods and approaches and combines both in vivo and in vitro work. However, some more detailed analysis and clarifications would be needed to fully justify some of the statements. Including the role of TIS7 as a transcriptional regulator of DLK1, SKMc15 as translational regulator of DLK1 and overall contribution of DLK1 in the observed differentiation defects. The observed results could still be explained by many indirect effects caused by the knock-outs and more direct functional connections between the studied molecules would be needed. Moreover, some assays appear to be missing biological replicates and statistical analysis. Please see below for more detailed comments:

      **Major comments:**

      -Are the key conclusions convincing? Yes.

      -Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? No.

      -Would additional experiments be essential to support the claims of the paper? Yes. Please see my comments.

      -Are the suggested experiments realistic in terms of time and resources? Recombinant DLK1 10 μg - Tetu-bio - 112€ ; 8 days of adipocyte differentiation in 3 biological replicate ~ 1 month.

      __Response: __We followed the advice of the individual reviewers as expressed in “Referees cross-commenting” and tested this idea experimentally. Since the manufacturer couldn’t suppy information on biological activities of recombinant DLK-1 proteins, we analyzed in vivo the effects of two different ones, namely RPL437Mu01 and RPL437Mu02. The 8-day adipocyte differentiation protocol showed that the RPL437Mu02 protein was cytotoxic to WT MEF cells and therefore could not be used for analysis. On the other hand, treatment with the Mu01 recombinant DLK-1 protein did not result in a substantial cell death. According to oil red O staining, incubation with 3.3 mg/ml (final concentration) RPL437Mu01 led to 75% inhibition of adipocyte differentiation when compared to not treated WT MEFs (Figure EV3B and C).

      -Are the data and the methods presented in such a way that they can be reproduced? Yes.

      -Are the experiments adequately replicated and statistical analysis adequate?

      Adequately reproduced yes. Please see my comments concerning the statistical analysis.

      1)Fig1a: In the method section it is written that an unpaired 2-tailed Student's t test was used for all statistical comparisons. However, here something like Multivariate analysis of variance (MANOVA) should rather be used to assess statistical significance between the mice. Moreover, the details of this should be clearly stated in the corresponding Figure legend.

      __Response: __Based on this suggestion, we have revised all of our statistical analyses. In several cases, (Figures 1F, 2B and C) we have replaced the statistical analysis using Student’s T test with Anova. However, based on the definition “the difference between ANOVA and MANOVA is merely the number of dependent variables fit. If there is one dependent variable then the procedure ANOVA is used”, in case of Figure 1A we used ANOVA.

      2)Fig2a: please use an appropriate title for Fig2a instead of "Abdominal fat vs. body mass".

      Response: Title of the Figure 1D (formerly Figure 2a) we changed to “Effect of TIS7 and SKMc15 on the abdominal fat mass”.

      3)Fig2c: in the method section it is written that an unpaired 2-tailed Student's t test was used for all statistical comparisons. However, in Fig2c 4 groups are compared (WT, TIS7 KO, SKMc15 KO and dKO) and thus something like Multivariate analysis of variance (MANOVA) should rather be used to assess statistical significance.

      Response: For Figure 1F (formerly Figure 2c), in the revised version of the manuscript, we applied the ordinary one-way ANOVA with Holm-Šidák's multiple comparison test. This analysis gave us statistically even more significant results concerning the difference between WT and dKO mice than previously found by Student's T test. The results in detail were as follows:

      Holm-Šidák's multiple comparisons test Summary Adjusted P Value

      WT vs. TIS7 KO ** 0,0096

      WT vs. SKMc15 KO * 0,0308

      WT vs. dKO **** 4)Fig2 conclusion: Additive or just showing stronger effect?

      Response: We have re-phrased the concluding summary for Figure 1F (formerly Figure 2c). We agree that the precise description of differences found between the weight of single and double knockout animals should be described as “stronger” and not additive effect of knockout of both genes.

      5)Fig3a: the microscope picture for SKMc15 KO shows that cells might have died. Please state the percentage of cell death.

      Response: We would like to comment on these concerns of the reviewer as follows: In the image in Figure 3 of the original manuscript, the density of SKMc15 KO MEF cells after the adipocyte differentiation protocol was lower than in the WT control. Regarding the possible cell death, the cells stained with Oil Red O were adherent and alive. The adipocyte differentiation protocol consists of 3 days proliferation and further 5 days of differentiation including three changes of media during which dead cells are washed away and their vitality cannot be checked. However, in the meantime, we have repeated this protocol and the density of SKMc15 knockout MEFs was now not substantially lower than those of controls. Despite the comparable cell density, we have seen a substantial negative effect of the SKMc15 knockout on the adipogenic differentiation ability of these cells. Several examples are shown here:

      TIS7 +/+ SKMc15 +/+ MEFs

      TIS7 +/+ SKMc15 -/- MEFs

      oil red O staining; 8d differentiated cells

      Importantly, in the current version of our manuscript we replaced MEFs (shown in the former Figure 3a) by SVF cells (Figure 2A in the current manuscript). In these cells we did not see any significant difference in their density after 8 days of the adipocyte differentiation protocol.

      6)Fig3b: It would be informative to additionally observe some of marker genes for adipogenesis and whether all of them are affected.

      Response: In our newly established SVF cell lines, derived from inguinal WAT we have confirmed data previously identified in MEFs. As shown in the new Figure 3, PPARg and C/EBPa mRNA levels were downregulated in all knockout SVF cell lines, both undifferentiated (Figures 3C and D) and adipocyte differentiated (Figures 3E and F). On the other hand, DLK-1 mRNA and protein levels, both in undifferentiated (Figures 2F and G) and adipocyte differentiated (Figure 2H) SVF cells were significantly upregulated in dKO cells when compared to WT cells.

      7)Fig3b: instead of using an unpaired 2-tailed Student's t test with proportion, an one-way ANOVA would be more appropriate.

      __Response: __On the recommendation of the reviewer, we applied a simple ANOVA to our new data from SVF cells using the Holm-Šidák test for multiple comparisons. The Anova summary using GraphPad Prism Ver. 9.2 identified statistically highly significant (P value 8)Fig3c: Same comment as for Fig3b.

      __Response: __Also, in this experiment (now Figure 2C) we used ordinary one-way ANOVA with Holm-Šidák's multiple comparisons test. The ANOVA summary identified statistically highly significant (P value 9)Fig3d: A representative Western blot for 3 independent experiments is shown. Please add the other two as supplementary materials.

      __Response: __Here we provide examples of the requested two additional, independent experiments. These refer now to the Figure 2D in the revised version of the manuscript:

      31 07 2020

      = manuscript

      b____-catenin

      22 07 2020

      WT

      TIS7 KO

      dKO

      SKMc15 KO

      WT

      TIS7 KO

      dKO

      SKMc15 KO

      actin

      b____-catenin

      30 07 2020

      actin

      b____-catenin

      WT

      TIS7 KO

      dKO

      SKMc15 KO

      WT

      TIS7 KO

      dKO

      SKMc15 KO

      actin

      10)Fig3d:Is this distinguishing between the active and inactive catenin?

      __Response: __No, the b-catenin antibody, that we used is not discriminating between active and inactive b-catenin forms.

      11)Fig4a: Please perform qPCR for measuring DLK-1 mRNA levels in TIS7 KO and SKMc15 KO samples to check whether there is a correlation between mRNA and protein level as the statement of the authors is that "DLK1 is transcriptionally regulated by TIS7 (IFRD1) and translationally by its orthologue SKMc15".

      Response: Similar questions were raised by Reviewer 2 on p. 11 “Since the role of SKMc15 in adipogenesis has never been described, the authors could consider describing the single SKMc15 KO in addition to the dKO, or explain the rationale for focusing the study on dKO.” Please see our reply to his comment.

      12)Fig4c: please add the other two western blots as supplementary materials.

      __Response: __Here we provide data from two additional, independent experiments.

      13)Fig4d: The effects in MEFs appear quite modest. What about a rescue with TIS7 or SKMc15 alone?

      __Response: __As mentioned already in response to the question 2 of Reviewer #1, in our newly performed experiments we found significant inhibitory effects of ectopic TIS7 and SKMc15 expression on DLK1 levels, identified both by qPCR and WB analyses (Figure 3B).

      14)Page 12, row 207: I would not call histones transcription factors.

      __Response: __We re-phrased this sentence accordingly.

      15)Fig4e: Would be good to see a schematic overview of the locations of the ChIP primers in relation to the known binding sites and the gene (TSS, gene body). Moreover, the results include an enrichment for only one region while in the text two different regions are discussed. Importantly, to confirm the specificity of the observed enrichment, a primer pair targeting an unspecific control region not bound by the proteins should be included.

      __Response: __The selection of oligonucleotide sequences used for ChIP analyses of the binding of b-catenin, TIS7 and SKMc15 to the Dlk-1 promoter was, based on the following reference, as mentioned in Methods section of our original manuscript on p.21, line 494: Paul C, Sardet C, Fabbrizio E. “The Wnt-target gene Dlk-1 is regulated by the Prmt5-associated factor Copr5 during adipogenic conversion”. Biol Open. 2015 Feb 13;4(3):312-6. doi: 10.1242/bio.201411247.

      We used two regions of the Dlk-1 promoter: a proximal one, encompassing the TCF binding site 2 (TCFbs2) and a more distal one, annotated as “A”:

      Oligonucleotide sequences used for ChIP PCR:

      Dlk-1 TCFbs2 5'f CATTTGACGGTGAACATATTGG

      5'r GCCCAGACCCCAAATCTGTC

      Dlk-1 region A (-2263/-2143) 5'f TTGTCTAACCACCCTACCTCAAA

      5’r CTCTGAGAAAAGATGTTGGGATTT

      We observed specific binding at the proximal site.

      16)Fig5a: Has this experiment been replicated? That is no mention about the reproducibility or quantification of this result. This is the main experiment regarding the role of SKMc15 as a translational regulator of DLK1, also mentioned in the title of the manuscript.

      __Response: __This relates to the Figure 4A in the revised manuscript. Yes, we repeated this experiment several times. Here we provide images and quantifications of three independent experiments.

      17)Fig5b: Showing another unaffected secreted protein would be an appropriate control here.

      Response: As recommended by the reviewer, we have performed an additional WB with a recombinant anti-Collagen I antibody [Abcam, [EPR22209-75] ab255809]. Medium from 8 days adipocyte differentiated WT and dKO MEFs was concentrated using Centriprep 30K and resolved on 10% SDS-PAGE gel. Western blot presented in the new Fig 4 B shows even slightly higher amounts of Collagen-1 protein in medium from WT than in dKO MEFs. Mw of the detected band was approximately 35 kDa, which corresponded to the manufacturer’s information.

      18)Fig5c: I would recommend to perform additional experiments to prove that DLK-1 secreted in the medium can contribute and is responsible for the inhibition of the differentiation. Indeed, a time course of adipocyte differentiation followed by the addition of soluble DLK-1 would confirm that DLK-1 can inhibit adipocyte differentiation in this experimental setup. Moreover, silencing (for example RNAi) of DLK1 in the dKO cells before harvesting the conditioned media would allow to estimate the contribution of DLK1 to the observed inhibition of differentiation by the media. This is important because many other molecules could also be mediating this inhibition.

      __Response: __We agree with this reviewer’s concern, which are shared by other reviewers. Similarly, as in response to Reviewer #2 and as already mentioned above, in response to “major comments” of Reviewer #3, in our novel experiments we found that treatment with recombinant DLK-1 protein as well as ectopic expression of DLK-1 blocked adipocyte differentiation of WT MEFs (Figures EV3B,C,D and E) as well as medium from dKO shDLK-1 391 cells (Figure EV3F).

      19)Fig5c: The details and the timeline of the experiment with conditioned media are not provided in the figure or in the methods. At what time point was conditioned media changed? How long were the cells kept in conditioned media? How does this compare to the regular media change intervals? Could the lower differentiation capacity relate to turnover of the differentiation inducing compounds in the media due to longer period between media change? Moreover, is the result statistically significant after replication?

      __Response: __Based on the reviewer`s comment we have added technical information concerning the experimental protocol of the treatment with conditioned media. In general, the treatment for adipocyte differentiation was identical with the previous experiments. The only difference was that after three days in proliferation medium, we used either fresh differentiation medium or 2-day-old differentiation medium from dKO control or dKO-shDLK-1 391 cell cultures then for wild-type cells, as shown in the figure (Figure EV3F). Cells were incubated additional five days with the differentiation medium with two changes of media, every second day. The adipocyte differentiation of medium “donor” cells and the DLK-1 protein levels in these cells were monitored by oil red O staining and Western blot analysis, respectively.

      Additionally, we show now in Figure 4C representative images from three independent biological repeats and in Figure 4D the statistical analysis confirming a significant decrease in adipocyte differentiation ability of WT MEFs following their incubation with a conditioned differentiation medium from dKO MEFs.

      20)Fig5d: please add a statistical analysis of the oil-red-o quantification.

      __Response: __As requested, we included statistical analysis of at least three independent experiments. In Figure 4D we present the statistical analysis confirming a significant decrease in adipocyte differentiation of WT MEFs following their incubation with the differentiation medium from dKO cells. Additionally, Figure 4C shows representative images of oil red O staining from several independent experiments.

      21)Fig7c-d: Does overexpression also rescue the PPARg and CEBPa induction during differentiation. The importance of their induction in undifferentiated MEFs is a little difficult to judge.

      __Response: __We have focused our attention primarily on the ability of TIS7 and SKMc15 to “rescue” the adipocyte differentiation phenotype of dKO MEFs. dKO MEFs stably expressing SKMc15, TIS7 or both genes were differentiated into adipocytes for 8 days and afterwards stained with oil red O. There was a statistically significant increase in oil red O staining following the individual ectopic expression of SKMc15 (p=5.7E-03), a negative effect of TIS7 ectopic expression and a significant (p=9.3E-03), positive effect of co-expression of both genes (Figure EV2A). We found a significant decrease in Dlk-1 mRNA expression following the ectopic expression of TIS7 and/or SKMc15 (Figure EV2A, very right panel). However, C/EBPa mRNA levels were only partially rescued in 8 days differentiated MEFs by TIS7 and/or SKMc15 ectopic expression, and PPARg mRNA levels were not significantly altered.

      22)Fig8: it is not surprising that PPARg targets are not induced in the absence of PPARg. What is the upstream event explaining this defect? Is DLK1 alone enough to explain the results? Could there be additional mediators of the differences? How big are transcriptome-wide differences between WT MEFs and dKO MEFs?

      __Response: __We agree with the reviewer that the lean phenotype of dKO mice most likely cannot be explained by simple transcriptional regulation of PPARg. Although we showed that in undifferentiated MEFs, the levels of PPARg and C/EBPa are controlled (or upregulated) by both TIS7 and SKMc15, we also expected differences in the expression of genes regulating fat uptake. To determine changes in expression of lipid processing and transporting molecules, we performed transcriptome analyses of total RNA samples isolated from the small intestines of HFD-fed WT type and dKO animals. Cluster analyses of lipid transport-related gene transcripts revealed differences between WT type and dKO animals in the expression of adipogenesis regulators. Those included among other genes the following, mentioned as examples:

      • peroxisome proliferator-activated receptors γ (PPARγ) and d [2], fatty acid binding proteins 1 and 2 (FABP1, 2) [3],
      • cytoplasmic fatty acid chaperones expressed in adipocytes,
      • acyl-coenzyme A synthetases 1 and 4 (ACSL1,4) found to be associated with histone acetylation in adipocytes, lipid loading and insulin sensitivity [4],
      • SLC27a1, a2 fatty acid transport proteins, critical mediators of fatty acid metabolism [5],
      • angiotensin-converting enzyme (ACE) playing a regulatory role in adipogenesis and insulin resistance [6],
      • CROT, a carnitine acyltransferase important for the oxidation of fatty acids, a critical step in their metabolism [7],
      • phospholipase PLA2G5 robustly induced in adipocytes of obese mice [8]; [9]. Parts of the following text are embedded in the manuscript.

      We decided to study in more detail the regulation of CD36 that encodes a very long chain fatty acids (VLCFA) transporter because CD36 is an important fatty acid transporter that facilitates fatty acids (FA) uptake by heart, skeletal muscle, and also adipose tissues [10]. PPARγ induces CD36 expression in adipose tissue, where it functions as a fatty acid transporter, and therefore, its regulation by PPARγ contributes to the control of blood lipids. Diacylglycerol acyltransferase 1 (DGAT1), a protein associated with the enterocytic triglyceride absorption and intracellular lipid processing is besides CD36 another target gene of adipogenesis master regulator PPARγ [11]. DGAT1 mRNA levels are strongly up regulated during adipocyte differentiation [12], its promoter region contains a PPARγ binding site and DGAT1 is also negatively regulated by the MEK/ERK pathway. DGAT1 expression was shown to be increased in TIS7 transgenic mice [13] and its expression was decreased in the gut of high fat diet-fed TIS7 KO mice [14]. Importantly, DGAT1 expression in adipocytes and WAT is up regulated by PPARγ activation [11].

      Heatmap of hierarchical cluster analysis of intestinal gene expression involved in lipid transport altered in TIS7 SKMc15 dKO mice fed a high-fat diet for 3 weeks.

      What is the upstream event explaining this defect?

      Wnt pathway causes epigenetic repression of the master adipogenic gene PPARγ. There are three epigenetic signatures implicated in repression of PPARγ: increased recruitment of MeCP2 (methyl CpG binding protein 2) and HP-1α co-repressor to PPARγ promoter and enhanced H3K27 dimethylation at the exon 5 locus in a manner dependent on suppressed canonical Wnt. These epigenetic effects are reproduced by antagonism of canonical Wnt signaling with Dikkopf-1.

      Zhu et al. showed that Dlk1 knockdown causes suppression of Wnt and thereby epigenetic de-repression of PPARγ [15]. Dlk1 levels positively correlate with Wnt signaling activity and negatively with epigenetic repression of PPARγ [16]. Activation of the Wnt pathway caused by DLK1 reprograms lipid metabolism via MeCP2-mediated epigenetic repression of PPARγ [17]. Blocking the Wnt signaling pathway abrogates epigenetic repressions and restores the PPARγ gene expression and differentiation [18].

      **Minor comments:**

      1)Please use the same font in the main text for the references.

      Response: We thank the reviewer for the remark. This issue was corrected.

      Reviewer #3 (Significance (Required)):

      The study provides interesting insights into the role of these factors in adipocyte differentiation that would be relevant especially to researchers working on adipogenesis and cellular differentiation in general. The authors find the studied factors to have additive contribution to the differentiation efficiency. However, the exact nature of the roles and whether they are strictly speaking additive or synergistic is not clear. More detailed analysis of their contribution and molecular interplay would add to the broader interest of the study on molecular networks controlling cellular differentiation.

      **Referees cross-commenting**

      I very much agree on the different points raised by the other reviewers, some of which are also matching my own already raised concerns. And therefore it makes sense to request these modifications from the authors.

      References

      1. Rozman, J., M. Klingenspor, and M. Hrabe de Angelis, A review of standardized metabolic phenotyping of animal models. Mamm Genome, 2014. 25(9-10): p. 497-507.
      2. Lefterova, M.I., et al., PPARgamma and the global map of adipogenesis and beyond. Trends Endocrinol Metab, 2014. 25(6): p. 293-302.
      3. Garin-Shkolnik, T., et al., FABP4 attenuates PPARgamma and adipogenesis and is inversely correlated with PPARgamma in adipose tissues. Diabetes, 2014. 63(3): p. 900-11.
      4. Joseph, R., et al., ACSL1 Is Associated With Fetal Programming of Insulin Sensitivity and Cellular Lipid Content. Mol Endocrinol, 2015. 29(6): p. 909-20.
      5. Anderson, C.M. and A. Stahl, SLC27 fatty acid transport proteins. Mol Aspects Med, 2013. 34(2-3): p. 516-28.
      6. Riedel, J., et al., Characterization of key genes of the renin-angiotensin system in mature feline adipocytes and during in vitro adipogenesis. J Anim Physiol Anim Nutr (Berl), 2016. 100(6): p. 1139-1148.
      7. Zhou, S., et al., Increased missense mutation burden of Fatty Acid metabolism related genes in nunavik inuit population. PLoS One, 2015. 10(5): p. e0128255.
      8. Wootton, P.T., et al., Tagging SNP haplotype analysis of the secretory PLA2-V gene, PLA2G5, shows strong association with LDL and oxLDL levels, suggesting functional distinction from sPLA2-IIA: results from the UDACS study. Hum Mol Genet, 2007. 16(12): p. 1437-44.
      9. Sergouniotis, P.I., et al., Biallelic mutations in PLA2G5, encoding group V phospholipase A2, cause benign fleck retina. Am J Hum Genet, 2011. 89(6): p. 782-91.
      10. Coburn, C.T., et al., Defective uptake and utilization of long chain fatty acids in muscle and adipose tissues of CD36 knockout mice. J Biol Chem, 2000. 275(42): p. 32523-9.
      11. Koliwad, S.K., et al., DGAT1-dependent triacylglycerol storage by macrophages protects mice from diet-induced insulin resistance and inflammation. J Clin Invest, 2010. 120(3): p. 756-67.
      12. Cases, S., et al., Identification of a gene encoding an acyl CoA:diacylglycerol acyltransferase, a key enzyme in triacylglycerol synthesis. Proc Natl Acad Sci U S A, 1998. 95(22): p. 13018-23.
      13. Wang, Y., et al., Targeted intestinal overexpression of the immediate early gene tis7 in transgenic mice increases triglyceride absorption and adiposity. J Biol Chem, 2005. 280(41): p. 34764-75.
      14. Yu, C., et al., Deletion of Tis7 protects mice from high-fat diet-induced weight gain and blunts the intestinal adaptive response postresection. J Nutr, 2010. 140(11): p. 1907-14.
      15. Zhu, N.L., et al., Hepatic stellate cell-derived delta-like homolog 1 (DLK1) protein in liver regeneration. J Biol Chem, 2012. 287(13): p. 10355-10367.
      16. Zhu, N.L., J. Wang, and H. Tsukamoto, The Necdin-Wnt pathway causes epigenetic peroxisome proliferator-activated receptor gamma repression in hepatic stellate cells. J Biol Chem, 2010. 285(40): p. 30463-71.
      17. Tsukamoto, H., Metabolic reprogramming and cell fate regulation in alcoholic liver disease. Pancreatology, 2015. 15(4 Suppl): p. S61-5.
      18. Miao, C.G., et al., Wnt signaling in liver fibrosis: progress, challenges and potential directions. Biochimie, 2013. 95(12): p. 2326-35.
    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary: The manuscript analyzes how the constriction of a tissue by an enveloping basement membrane alters the migration of cells migrating through that tissue. The tissue analyzed is the Drosophila egg chamber, an important model for basement membrane studies in vivo, and the cells migrating through it are the border cells. The border cells migrate through the center of the egg chamber, moving as a cluster between the nurse cells, which are in turn surrounded by follicle cells, which secrete the basement membrane on the outside of the egg chamber. The authors decrease and increase the basement membrane stiffness with various genetic perturbations, and they find that the border cells move more rapidly when the stiffness is reduced. They then investigate how basement membrane stiffness is communicated to the border cells several cell layers inside, by measuring cortical tension with laser-recoil. They found that external basement membrane stiffness alters the cortical tension of the nurse cells and the follicle cells, such that reduced matrix stiffness causes reduced cortical tension; further, reducing cortical tension directly within the cells also results in increased border cell migration rates. They conclude that basement membrane stiffness can alter cell migration in a new way, by altering constriction and cortical tension, with an inverse relationship between stiffness and migration rate. This is a strong manuscript and I would request very few changes.

      The authors are commended on the rigor and completeness of their study. Several independent methods are used to alter basement membrane stiffness (loss of laminin, knock-down of laminin, knock-down of collagen IV, over-production of collagen IV - all of which end up changing collagen IV levels) and all show the same result. Further, they are extremely rigorous about testing and excluding an attractive alternative hypothesis, that the basement membrane of the border cell cluster itself controls its migration rate. The use of mirror-Gal4 is very elegant and convincing, as it expressed only in the central part of the egg chamber, and they found border cells responded differently only in that region. Moreover, the authors were exceptionally thorough in reproducing the basement membrane mechanical data in their own hands using the bursting assay. Overall, the experimental data support the claims of the paper. There is only one more control I would like to see, for the knockdown of laminin in the border cell cluster with a triple-Gal4 combination. Presumably using all three Gal4 lines was necessary to get complete knockdown, and it would be nice to see anti-laminin for the border cell cluster under these knockdown conditions.

      Despite the rigor, because all of the manipulations to the basement membrane alter the levels of collagen IV, the authors cannot formally exclude the possibility that collagen IV in the basement membrane has another function besides stiffness, perhaps sequestering a signaling ligand, and that this other function somehow alters the cortical tension of the egg chamber. In the paper by Crest et al, externally applied collagenase served as a control for this possibility, but collagenase will not work for the authors because this study is in vivo. I suggest the authors bring up this caveat in the discussion. If they wanted to extend the study (optional), they could knock down the crosslinking enzyme peroxidasin in the egg chamber, which ought to reduce basement membrane stiffness without changing the collagen content. The problem here is that it hasn't already been shown to work that way in the egg chamber, and so both stiffness and collagen levels would need to be measured. Testing the stiffness directly would be difficult, since the bursting assay is not actually a measurement of stiffness (more on that below). Rather than go this route, I suggest just acknowledging the formal possibility, which seems to me unlikely anyway.

      In terms of clarity, the manuscript absolutely needs a schematic at the beginning to introduce the egg chamber and border cell migration, labeling the cell types, showing the route and direction of border cell migration, and labeling the A/P axis. Without this the non-expert reader cannot readily understand the study.

      Finally, in terms of clarity, the authors repeatedly use statements such as "stiffness influences migration rate". Influences how? These results are not intuitive to me, and it would help enormously if the authors would make statements like, decreasing stiffness increases migration (as I tried to in my summary). Here are two examples of statements to refine: • Line 189 - "We found that reducing laminin levels affected the migration speed of both phases (Fig.1F, G)." Please say increased, not affected. • Line 245 -"Altogether, these results demonstrate that the stiffness of the follicle BM influences dynamics and mode of BC migration." Again, be specific about how. There are many such statements, from the abstract to the results to the discussion, where it would help the clarity to be more precise about what kind of influence.

      Minor comments: • The movies are beautiful! • All the quantitative data are shown in bar charts with means and errors. It is much better to show the individual data points, superimposing the means and distributions on top of the individual points. • The bursting assay does not actually measure basement membrane stiffness; rather, it measures failure after elastic expansion. These are related, as was found by Crest et al and the authors say that at one point, but stiffness and failure are not the same thing. Please change the language discussing this assay to "mechanical properties" rather than stiffness. • The laser-recoil assays are done well and are convincing. Throughout the results section, the authors describe these as measuring "cortical tension", which is correct. However, in the figure legends the language changes to "membrane tension" which is only one component of cortical tension. Change them all to cortical tension. • In the Discussion, it would be nice to include something on the two different modes of migration (tumbling and not tumbling). • I suggest changing the title to remove the word "forces", because forces are never directly measured from basement membrane. • Although Dai et al (Science 2020) is discussed near the end, I suggest bringing this reference up to the introduction, so the reader can have the background on the mechanical aspects of border cell migration at the start of this study. • Two typos (there may be more): At the bottom of Fig. 2, text turns strangely white that should probably be black; and in line 260, you mean Fig. S5 not S4 (laser ablation).

      Significance

      Mechanobiology, and mechanobiology of the basement membrane, is a vibrant area of study now, arising from the intersection of biophysics/engineering and genetics. There is general interest in how the basement membrane alters forces within the tissue, and this study is the first to my knowledge to relate basement membrane mechanics to migration via constriction and cortical tension. The authors do a great job of discussing the broader significance of their work in the Discussion. To greatly broaden the scope of this work in the future, the authors could collaborate with a mouse team to look for similar responses in a mammalian tissue, as they discuss. It is worth noting that there is a lot of work on matrix stiffness and migration showing that stiffness promotes migration speed; in these cases, matrix is a substrate, not a compression mechanism. But the opposite nature of the result in interesting and makes this work non-intutive and perhaps hard for some readers to grasp.<br /> As the paper is written now, I think the audience for this work would mostly be oogenesis, border cell migration, and/or basement membrane researchers in the Drosophila community, of which there are many (I am in this camp). With some rewriting to make it more accessible to other audiences, I think it would be interesting to a larger developmental biology audience. The content is not like any other paper I know, but it may be similar in scope and subject matter to the papers detailing how follicle cells and basement membrane interact during follicle rotation.

    1. The first occasion of our love to hear, Like one I speak that cannot tears restrain. As we for pastime one day reading were How Lancelot by love was fettered fast— All by ourselves and without any fear— Moved by the tale our eyes we often cast On one another, and our colour fled; But one word was it, vanquished us at last. When how the smile, long wearied for, we read Was kissed by him who loved like none before, This one, who henceforth never leaves me, laid A kiss on my mouth, trembling the while all o’er.

      Francesca di Rimini was forced to marry Giovanni (Gianciotto) Malatesta an older crippled man but falls in love with his younger brother Paolo. Francesca and Paolo are in second circle of hell and forever trapped in a whirlwind because they gave into their desires for each other and committed adultery. In this passage Francesca is recalling how they were reading the story of Lancelot and Guinevere. Francesca romanticizes the kiss and gives Paolo chivalric virtues. This is significant because “If Francesca sees in her lover the peer of such a worthy as Lancelot, she may also see in herself the equal of his Queen: and she may even think that she had a right to betray Gianciotto…” (Poggioli 337). This serves to highlight Dante’s theme: the perfection of Gods Justice.

      Poggioli, Renato. “Tragedy or Romance? A Reading of the Paolo and Francesca Episode in Dante’s Inferno.” PMLA, vol. 72, no. 3, 1957, pp. 313–58. JSTOR, https://doi.org/10.2307/460460. Accessed 10 Mar. 2023.

    1. There's some interesting comparison to the ideas here and the long term state-of-the-art in information management, particularly in business and library settings which Bush wholly ignores.

      Most fascinatingly Bush "coins" memex here, but prior art for the Memindex as a similar product in the office/business productivity space easily goes back to 1906 and was popular to and through at least the early 1950s.

      For details on this, see:

      https://boffosocko.com/2023/03/09/the-memindex-method-an-early-precursor-of-the-memex-hipster-pda-43-folders-gtd-basb-and-bullet-journal-systems/

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2022-01776

      Corresponding author(s): David Bryant

      1. General Statements [optional]

      We describe an ARF6 GTPase module that controls integrin recycling to drive invasion in PTEN-null Ovarian Cancer (OC). We used high-throughput, time-lapse imaging and machine learning to characterise spheroid behaviours from a series of cell lines modelling common genetic lesions in OC patients. We identified that PTEN loss was associated with increased invasion, the formation of invasive protrusions enriched for the PTEN substrate PI(3,4,5)P3, and enhanced recycling of integrins in an ARF6-dependent matter. We utilised Mass Spectrometry proteomics and unbiased labelling to investigate the interactome of ARF6, identifying a single ARF GAP (AGAP1) and a single ARF GEF (CYTH2). Importantly, this ARF6-AGAP1-CYTH2 modality was associated with poor clinical outcome in patients.

      We thank all Reviewers for their highly complementary assessment of our manuscript, describing our paper as a "very impressive study, very well done and controlled with rigorous statistical analyses that uses sophisticated methods", a study that is "stunning in its thoroughness and depth and breadth of its molecular analysis", with "experiments are properly designed, and the data are well presented. The conclusions are appropriate and supported by the data". Finally, we would like to thank the reviewers for appreciating that our results are "of significance for both scientific discovery and clinical application, which will interest the broad audience in both basic and clinical research".

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      Reviewer comments in bold. Our response in non-bold.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This paper by Konstantinou et al aims at deciphering the mechanisms by which PTEN loss could be driving poorer prognosis in patients. The authors use their great high-throughput 3D screening method coupled to an unbiased proteomic method and a CRISPR screen to uncover a new pro-invasive axis driving collective invasion of high-grade serous ovarian carcinoma (HGSOC) cells. Overall, this is a very impressive study, very well done and controlled with rigorous statistical analyses that uses sophisticated methods to convincingly show that the CYTH2-ARF6-AGAP1-ITGA6/ITGB1 module is required for the pro-invasive effect of PTEN depletion and discriminates patients with poorest prognosis.

      __

      MAJOR COMMENTS __

      Below are listed all the claims that, in my opinion, are not adequately supported by the data.

      1) Choice of the cell line: More justification on the use of the ID8 cell line and on the p53 deletion is needed. The authors need to clearly state that most p53 mutations in ovarian cancer are missense mutations that lead to a strong accumulation of a p53 protein devoid of transcriptional activity. Nevertheless, it seems that p53 mutations are not associated to differences in patient survival. Hence the choice of studying PTEN loss in the complete absence of p53, a situation that does not mirror the clinical situation, needs to be explained. Moreover, the in vivo experiments already performed in the literature mentioned in the discussion should be mentioned in the introduction to provide more context and physiological relevance to this study (especially regarding the special focus on the p53 null/ dKO cells throughout the study).

      We will update the manuscript with a detailed explanation of the cell line of choice. Briefly, while indeed Tp53 is found mutated in HGSOC, approximately 30-35 % of these are classified as null mutations (PMID: 21552211), making models with null Trp53 representative of the clinical situation. Further, there is no difference in patient outcome in HGSOC by Tp53 mutation type (PMID: 20229506), while gene expression data from TCGA suggest that HGSC is marked by loss of wild-type P53 signalling regardless of Tp53 mutation type (PMID 25109877). Thus, we believe our choice of model can faithfully mirror the clinical situation.

      2) "Therefore, PTEN loss in ovarian cancer, particularly at the protein level, occurs in the tumour epithelium and is associated with upregulated AKT signalling and poor overall survival". This claim is an over-interpretation and over-generalisation of the data presented. I appreciate the honesty of the authors in showing all the ovarian datasets that are available and highlight the discrepancies in expression of the proteins they study in stroma and epithelium. I think the way to present these data in the text without over-interpreting and generalizing would be to show that there is a clear epithelial-specific downregulation of PTEN at the mRNA level. Most likely due to the contribution to other cell types in the stroma, only 3 out of 5 bulk tumour mRNA datasets show a tumour specific downregulation of PTEN and no association with survival based on a median split of PTEN mRNA expression. Nevertheless, although there is no direct correlation between PTEN mRNA and protein levels, patients with low PTEN protein levels have poorer survival that is associated to an upregulation of Akt signalling. This allows to have a clearer conclusion, based solely on the protein data presented and no over-generalisation using the mRNA data. This, to me, makes a stronger case for studying PTEN loss in ovarian cancer and is fully supported by the data presented.

      We will incorporate this reviewer suggestion into the modified manuscript.

      3) PTEN loss induces modest effects in 2D culture. The authors make claims regarding the fact that some of the phenotypes they look at happen after PTEN depletion alone or in combination with p53 loss and are more prominent in 3D vs 2D. Many of these are insufficiently backed up by data. A few key experiments are also only performed in 2D and should be done in 3D. Finally, some clarifications if the role of PTEN is most prominent on either collective, ECM-induced or 3D-dependent invasion.

      some clarifications if the role of PTEN is most prominent on either collective, ECM-induced or 3D-dependent invasion

      We believe that the reviewer may be confused. Both of our models, either spheroids or invading monolayers, are events occurring inside gels of ECM. Therefore, these are all are 3D, ECM-induced, collective invasion. We have not performed 2D migration assays. We apologise that the this was not clearer in the first submission. We will correct this in the updated manuscript.

      First, the authors claim that PTEN loss alone (i.e. without p53 deletion) leads to changes in Akt signalling. Supp fig 1H clearly shows that there is no significant increase in Akt activation, although there seems to be one in the Western Blot (WB) presented in supp fig 1G. There is a clear, significant increase in the Akt activation in all the PTEN KO clones when in association with p53 loss though. This claim is hence not backed up by data and the conclusion seems to be that the effect on Akt signalling requires both deletion of p53 and PTEN.

      The reviewer is correct: that the increase to pAKT levels upon PTEN KO is more robust with co-KO of TP53, thereby indicating synergy with p53. We will update the manuscript to note this, accordingly.

      It will be interesting to see a quantification of the pS473-Akt staining (supp fig S1J), as it seems from these images that pAkt is preferentially found on rounded cells. It should also be performed in 3D conditions to see if there is an enrichment at invasive tips and back-up the invasion data.

      This observation made us realise that the images we had included were giving the wrong impression (that pAkt levels would be highest in round cells). Based on the quantitation in Fig. S1M, PTEN KO cells (which have elevated pAkt levels), show a marked depletion of rounded cells. Therefore, pAkt elevated is not associated with being enriched in rounded cells. We will replace this image with cells mirroring the phenotypes quantified in Fig S1M.

      We used 2D for quantitation of pAKT staining, as we perform a like for like comparison. We cannot compare pAkt in 3D protrusions accurately between genotypes because of the frequency of protrusions: in p53 KO protrusion are rare. In 3D, therefore, it is not a situation where protrusions are present in both genotypes and we compare enrichment or depletion in a stable structure. Rather, what we can provide is whether when protrusions form, there is clear pAkt labelling in a protrusion. We will include for the revision a representative image of each phenotype in 3D, including a 3D Trp53-/-;Pten-/- spheroid stained for pAKT S473.

      Arf6 is recruited to the invasive tips of cells invading a 2D wound (fig4D). How do the authors reconcile the fact that all the machinery required for 3D invasion is present but that PTEN loss has a modest effect on cells in 2D? If the wound assay was done on glass, it should be done again on ECM coated glass to see if it recapitulates the effects seen in 3D. This experiment will help deconvolute if the effect of PTEN loss is more linked to collective behaviour than 3D organization or presence of ECM.

      We again apologise for not being clearer in our description. Both the wound assays and the IF of invading monolayer were performed with cell monolayers invading into Matrigel. Monolayers are grown on top of Matrigel, wounded, and then overlayed with Matrigel. Therefore, this is orthogonal to our spheroid assay, and completely 3D. We will address this comment by changing the text in the results section to highlight the 3D nature of the method.

      The recycling assays are all done in 2D, condition under which the authors claim that the PTEN phenotype is weakest. Although I understand that it is not possible to do this assay in 3D, its contribution to elucidating the mechanism by which integrins participate in the PTEN loss invasive phenotype is not clear. The requirement of integrins relies on the data showing that ITGB1 KO results in no collagen4-positive basement membrane of the cysts and greatly impaired invasion. Experiments looking at the integrin localisation would be helpful: can an enrichment at the invasive tips can be seen? Are ITGA6 and/or ITGB1 repartitions homogeneous between the cysts membranes and the invasive tips? In my opinion the Src/FAK data is not enough to draw the conclusions of fig7I schematic.

      We will endeavour to include images of 3D spheroids of Trp53-/-;Pten-/- cells and stained for β1 integrin (total and active) and α5 integrin to interrogate localisation at the tips.

      4) Expression of AGAP1 isoforms do not alter ARF6 levels. Data in fig 6C, D show a significant downregulation of Arf6 and Akt signalling after expression of AGAP1S. Can the authors clarify what they mean?

      We thank the reviewer for picking up that discrepancy between the results and the text. We will change the relevant text to highlight that expression of AGAP1S is associated with a statistically significant reduction of roughly 30% in ARF6 levels and 10% in p:t AKT. We do not know why AGAP1s may enact such an effect.

      5) Arf6 is not modulated in the different cell lines: data in fig4B (far right graph) and supp fig 4B, J seem to indicate otherwise. Can the authors clarify what they mean?

      It is not clear exactly what the reviewer is referring to here. If the reviewer is referring to Supplementary Figure 4B, this is an experiment examining the levels of ARF5 or ARF6 upon knockdown, so levels would be expected to vary. Fig S4B does not correspond to the experiment performed in S4J. Our interpretation is that loss of p53 alone or in combination with Pten does not seem to be consistently be accompanied with an increase in either the levels of total or bulk GTP-bound ARF6 that could explain the dependency of Trp53-/-;Pten-/- on the GTPase for the invasive phenotype. We will make our interpretation clearer in the text

      6) Immunofluorescence panels without quantifications: Quantifications for the different stainings shown in fig3A; 4D, E; 5H; 7B and supp fig S1L, J; S3 need to be included to fully back the conclusions of the authors. Indeed, these images are used to draw conclusions and not only as illustrations.

      It is not possible to do a direct comparison between protrusion vs no protrusion (see our response above). We will include a line scan to show clear enrichment at the end of the tip for image shown. Quantitation for Figure S1L is already included (S1K and M), quantitation for Figure S1J is presented in Fig S1I and for Fig 5H quantitation of the phenotype is present in Fig 5I.

      7) Quantifications of invasion show that WT cysts become hyper-protrusive at around the half experiment mark (around 30-40hrs). Nevertheless, all movies or galleries show spherical cysts, which does not seem representative. Can the authors change this or explain why these images/movies were chosen?

      We present the fold change at each time point because that is intuitively easier to understand rather than the raw number. The quantitation does not show that the cysts necessarily become hyper-protrusive at the specific timepoint, but rather that the proportion of hyper-protrusive cysts observed in this genotype peaks at the specific timepoint. This phenotype may still be in the minority of behaviours. As an example, something that occurs 5% of the time in the control, with a two-fold increase in behaviours, might still only be 10% of the population. Therefore, adding in a picture that may be representative of a small proportion of the population may not be a realistic depiction of what is happening across the entire population. We will provide the reviewer with the exact percentage of spheroids that are classified as hyper-protrusive at the specific cell line across timepoints, to make this clearer.

      8) Since it seems that the main effect of PTEN is to drive the localisation and intensity of recycling of Arf6 cargoes, it will be helpful to confirm that all the proteins involved in the Arf6 module be shown to be accumulated/present at the pro-invasive tips. Immunofluorescence stainings showing the presence of AGAP1 (could be done with the AGAP1S isoform that is mNeon-tagged), pS473-Akt, ITGB1 (active integrin if possible, otherwise total integrin), ITGA5, PI3K should be included if possible. A quantification comparing signal in the cysts and in the invasive tips should also be included to see if there is an accumulation to PIP3-enriched areas.

      We will endeavour to include the requested images.

      9) Data in fig5I convincingly show that PTEN loss induces a fragmented collagen4-positive basement membrane. The authors use this data to claim that this is one of the ways that PTEN could be driving invasion but no correlation between these structures and the hyper-protrusive phenotype is made. This experiment needs to be done to support this claim.

      This comment made us realise that in an attempt to make images simpler (displayed nuclei and COL4 only), we omitted a staining for where protrusions were moving through gaps in the ECM. We will update these times to demonstrate such events.

      __

      MINOR COMMENTS __

      1) Data visualization: I think that the heatmap representation is overkill when only 2 or 3 conditions are presented. A graph showing the evolution of area or spherical/Hyper-protrusive phenotype proportions across time would be easier to read and more impactful: each genotype could be presented with a colour and the spherical/hyper-protrusive phenotypes as either plain or dashed lanes across time. I understand that this representation allows for the stats to be done at each time points but they are generally pretty clear (especially for the PTEN KO or dKO phenotypes) and do not need to be done for each time point in my opinion. These heatmaps could be put in supplementary figures if the authors feel strongly about putting stats for each time points.

      We thank their reviewer for their suggestion. We believe that our approach, while complex, is the best visualisation to reflect both the changes across time but also between conditions while allowing appreciation of the statistical significance. This visualisation has been optimised by our lab over years of working with this type of data and we would prefer that they remain consistent with the accepted standard of our other publications. We are, however, happy to expand the explanation in the text on how to interpret the bubble heatmaps.

      Fig supp S1M, fig 5I should be presented as a stacked histogram to improve readability and merged with fig supp S1K.

      We will merge Figures S1M and S1K. We believe that Figure 5I is easier to read as is.

      Displaying fold change as antilog rather than log values would be easier for the reader to realise the magnitude of the differences.

      We disagree with the reviewer.

      A bar graph would be easier to read than the matrix representation for fig 6B.

      We disagree with the reviewer as we feel it makes it easier to directly compare each lipid between the two cell lines.

      The way Area data is presented throughout to me makes it very difficult to understand what is going on. Could the authors at least give some explanations in figure legends. A curve graph displaying the evolution of the area across time would be easier to read and see the differences between conditions.

      Please see our response to Minor point 1

      2) It is confusing that, in fig supp S1M, there is a significant decrease of the rounded phenotype after PTEN loss that is not associated to a significant change in another of the categories. Could the authors explain how?

      This can be simply explained from our data: while the rounded phenotype was reduced in a consistent way across replicate experiments (therefore resulting in significance), the effect on the other two phenotypes was not consistent (not set in magnitude and directionality). This therefore does not lead to a significant (i.e. consistent) effect on the latter two phenotypes. PTEN loss therefore seems to allow cells to undergo – at the expense of being round - a range of shape changes, rather than a set phenotype.

      3) One of the big differences of the PTEN KO cells seems their ability to invade through the matrigel bed and migration on the glass below (supp movie S2). From what I gather, these cysts would be considered out of focus and excluded from the analysis. Would it be possible that this would minimize some of the results? Would it be possible to include a quantification of this particular phenotype to confirm it is specific to PTEN KO cells?

      In the same spirit, could the authors provide the percentage of non-classified cysts, to make sure that the same proportion of cysts is quantified across all different genotypes.

      Indeed, we cannot exclude that we under-estimate the magnitude of the effect on the PTEN null. We will include this point in the discussion. We can include a reviewer-only figure showing the proportion of cysts and levels of the ‘OutOfFocus’ objects across cell lines.

      __

      4) Can the authors clarify how a 0 fold change (in log value) in fig 2D can be highly significant? __

      We believe that the reviewer is equating statistical significance with something being biologically meaningful. Statistical analysis does not indicate a priori whether something is biologically meaningful. Rather, it assesses the likelihood that an observed result is occurring by chance (or not). For instance, if a small change (e.g 0.04 in a log2 fold change) occurred repeatedly across experimental replicates this is unlikely to be a result of chance, and therefore could be statistically significant. Yet, such a small magnitude of effect is probably biologically minor. This is why our heatmaps provide both statistical significance, fold change, and consistency in magnitude of effect.

      5) Delta isoform of PI3K seems to have an effect on area in the middle of the experiment, but has no effect at all on invasion. Could the authors comment? Are these smaller cysts still as invasive? There might be an interesting uncoupling between proliferation and invasion there.

      The cysts are actually slightly larger with PI3Kδ inhibition and there is no change in invasion. We will expand our comments in text as well to account for this observation.

      6) ITGB1 depletion seems to induce a downregulation of Akt protein. Is that right? Does it change Akt localisation? Is there a dose effect whereby there is not enough Akt protein to mediate invasion?

      The p:t AKT ratio does not change consistently across all gRNAs (Figure 5C) but we can look at Akt (total) protein levels and include this information if needed.

      __

      7) Stats should be added directly on the graphs for the recycling assays, doing a pairwise comparison of the different genotypes for each time points. Can the authors clarify what the t-32min quantification graphs adds (fig7E, supp fig S8G-I)? I would advise to remove them, as this data is already presented in the recycling assay graphs. __

      We don't include these because although they are technical replicates, they are demonstrative of a single experiment. What we include instead is the quantitation across independent biological experiments (which each have their own internal multiple technical replicates), where it is appropriate to include statistical analysis.

      8) There is a substantial amount of typos and erroneous references to figures. I listed below the ones that I spotted and I encourage the authors to carefully check.

      1. there are some mistakes in referencing the number of cysts in supp table 1. There is for example no cysts experiments in Figure 1 but yet there are some references to figure 1 in supp table 1. Please correct it. I think it will be easier for the reader if the number of cysts quantified for each conditions was also indicated in the figure legends. Supp table 1 can still be included for readers that want additional details.
      2. comma missing page 3
      3. page 3 and 4: PI(3,4)2 means PI(3,4)P2? Can be shorten to PIP2 for ease of read and specify if it is another PIP2 specie otherwise
      4. define CYTH abbreviation: I suppose this is for cytohesin?
      5. fig1F-I: don't understand why TCGA.OV is specified on some but not all the graphs. It seems to me that all the data are from TCGA.OV? Makes it seems it is nit the case
      6. legend of fig1H, I: y axis is -Log10 values in 1I, not Log10 values
      7. page 6: dKO abbreviation is already specified above and should be used to avoid repetition and for ease of read
      8. supp fig S1D: missing legend for the second bar (after Wild Type)
      9. supp fig S1N: legend of the X-axis should be below the axis
      10. supp fig S1O: the numerotation of the X-axis needs to be below the line of the axis for ease of read, not above it
      11. legend of S2A: clones 1.12 and 1.15 are p53-/-;PTEN-/- and not PTEN-/-
      12. supp figS2C can the authors specify the different stages of matrigel (liquid or gel) that are used for the invasion assay, to make it easier for the non-specialist to understand what is going on. Please confirm that the 50% GFR matrigel makes a gel on top of the cells and fill in the wound to produce the 3D invasion assay setup.
      13. page 7: no parental cells are used in S3A, B only p53 null and p53 null and dKO. Please also specify what cells are being compared in the text
      14. description of arrow heads and colours need to be moved to figure legends and not in main text (page 7)
      15. fig 2D: the signification of the dot in the circles needs to be in the legends (since it is its first apparition in the manuscript). It only appears later on, in supp2A legend. Additional description of the matrices is necessary, as they contain a lot of information to digest to understand fully what is going on
      16. legend of fig3: error in figure reference: area data is D and not E, protrusive phenotypes are E and not F
      17. arrow missing in fig3B
      18. fig 3D,E, G, H: please indicate the cell line studied
      19. fig 3I: the different genotypes need to be stated on the galleries for clarity
      20. page 8: define Arf6-mNG in the text
      21. __ page 9: "We thank the reviewer for their careful examination of the manuscript. We will go through all above points and make the corresponding careful adjustments to the manuscript.

      OPTIONAL SUGGESTIONS

      1) Choice of cell line: There is a high number of patients (around 9% according to (Cole et al. 2016)) that present the R248Q gain-of-function mutation. A recent study has shown that this mutant p53 protein is associated to an activation of Akt signalling and an increase of the intercellular trafficking of EGFR (Lai et al. 2021). Given that EGFR was also a hit in this screen, that is seems to have a central role in Arf6 cargoes (fig 4G), I think it would be a great addition to this study. It could hence cooperate with PTEN loss to drive strong, robust invasion.

      This is an excellent observation and one we will likely follow-up in an independent study.

      2) Are MAPK involved in the PTEN KO pro-invasive phenotype? In particular Erk1/2, since EGFR is one of the PTEN loss induced Arf6 cargoes.

      This is an excellent observation and one we will likely follow-up in an independent study.

      __

      REFERENCE Cole, Alexander J., Trisha Dwight, Anthony J. Gill, Kristie-Ann Dickson, Ying Zhu, Adele Clarkson, Gregory B. Gard, et al. 2016. « Assessing Mutant P53 in Primary High-Grade Serous Ovarian Cancer Using Immunohistochemistry and Massively Parallel Sequencing ». Scientific Reports 6 (1): 26191. _https://doi.org/10.1038/srep26191_.

      Lai, Zih-Yin, Kai-Yun Tsai, Shing-Jyh Chang, et Yung-Jen Chuang. 2021. « Gain-of-Function Mutant TP53 R248Q Overexpressed in Epithelial Ovarian Carcinoma Alters AKT-Dependent Regulation of Intercellular Trafficking in Responses to EGFR/MDM2 Inhibitor ». International Journal of Molecular Sciences 22 (16): 8784. _https://doi.org/10.3390/ijms22168784_. __

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The authors have conducted a study of the molecular requirements for cancer invasion that is stunning in its thoroughness and depth and breadth of its molecular analysis. The writing is exceptionally precise though also very dense (see below). The molecular model proposed is that PTEN loss (in a p53 null background) leads to reliance upon ARF6 for invasion, with regulation through interactions with AGAP1 and beta1-integrin and it is convincingly demonstrated. They focus on interpreting the consequences of genetic and pharmacologic manipulations in a cell line, using a series of 2D and 3D assays. The phenotypes are more prominent in 3D assays.

      Concerns and Suggestions:

      • There is a disconnect between the essentially complete loss of protrusions and invasion in 3D (e.g. 4A) and the reduction in magnitude of protrusive invasion but the continued presence of elongated cells with protrusions in 2D (e.g. S4C). This discrepancy is present in a couple of comparisons and is glossed over in quick callouts to many figure panels.

        We thank the reviewer for mentioning this as this comment was very helpful in determining that we needed to clarify our description of the role of ARF6 to protrusion formation vs maturation. In the Trp53-/- genotype, protrusions can form, but they rapidly retract, failing to mature into structures that drive invasion through ECM (e.g. Figure S2E). This protrusion maturation occurs upon PTEN KO. When ARF6 depleted, PTEN-null cells can form protrusions, but now again lack the ability to mature into invasion-inducing structures.

      This concept of needing ARF6 for protrusion maturation and maintenance is underpinned by our model of ARF6 regulating recycling of active integrin back to the protrusion front. Indeed, we have observed ARF6 being required not for protrusion initiation, but rather ensuring protrusions are not retracted in other contexts (i.e. upon loss of the ARF6 GEF protein IQSEC1 in invading 3D culture of PC3 cells; PMID: 33712589).

      We also note that, as responded to Reviewer 1, the assay is a 3D invasion rather than 2D migration assay, with cells sandwiched between Matrigel.

      We will update the relevant sections of the results and discussion with the point above.

      Once a journal has been identified, it would be wise for the editor to allow some flexibility in word limit to enable some very dense sections to be expanded slightly to guide the reader through the experiments and results more clearly. For example, in the section "ARF6 regulates active integrin pools...", there are callouts like (Fig. 7C,E; S8A-C; G-I) and then (Fig. 7D,E; Fig. S8E-F, H-I). It takes a lot of time to unpack these different experimental claims based on a single sentence.

      We greatly appreciate the refreshing comments of this reviewer to advocate for actions to improve clarity in our reporting. We would take glad advantage of such a possibility.

      The patient data on CYTH2 and its relationship to survival is modestly convincing.

      In Ovarian Cancer, effects on survival are often minor. This is not a disease where one often sees large shifts in survival, which is why we are so excited about the large shifts that we do see with the ARF GTPase module we identified. However, we concede that the effects on CYTH2, although significant, are not vast changes. We will point this out and tone down our language.

      Very minor- search on %- there are a few inconsistencies in terms of spaces and commas vs. periods. The Methods also have some inconsistencies in terms of spaces between numbers and units or numbers and degrees Celsius. References are also in a different font. Overall it was extremely carefully written though (just dense).

      We thank the reviewer for their careful inspection of our manuscript. We will carefully go over the sections flagged before resubmission

      Reviewer #2 (Significance (Required)):

      One limitation of the experimental design is that the depth of molecular analysis in vitro comes at the expense of any in vivo validation, which the authors acknowledge in the Discussion. They attempt to make similar points using analysis of patient survival data from public databases but these analyses generally yielded small magnitude differences. The main audience for this study is likely to be cell biologists interested in cell migration, cell-ECM adhesion, cancer invasion, and GTPases. I don't see any need for new experiments- what can be done has been done and then some. I do think that it would benefit readers if the text could be made less dense.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: Using a murine HGSOC 3D cell model, in combination with analysis of human ovarian cancer datasets, the authors uncover a CYTH2-ARF6-AGAP1 signaling module regulated by PTEN and identify a biomarker for tumor invasion and targeted therapy.

      Major comments:

      __The findings of this study are significant as they reveal a critical signaling module that controls tumor invasion by mediating tumor cell interaction with the extracellular matrix. The experiments are properly designed, and the data are well presented. The conclusions are appropriate and supported by the data. The limitation of the study has also been discussed properly.

      One suggestion regarding the survival analysis in Fig. 6 and 7. __

      The authors noted that the CYTH2-ARF6-AGAP1 module is not specifically or only induced in Pten-null contexts, but rather that Pten-null cells become more dependent on the module for enacting the invasive phenotype. Based on this, it would be interesting to evaluate how the PTEN status impacts the survival difference by integrating the PTEN genomic status (WT versus mutation) or its expression level (protein or mRNA) into the survival analysis of patient cohorts in Fig. 6 and Fig. 7.

      We thank the reviewer for this excellent point. We will include such analysis, where possible. One consideration will be that extensive division of patients based on these molecular characteristics may results in patient numbers too low to draw conclusions of significance.

      **Referees cross-commenting**

      Gene deletion and mutation may elicit different functional outcomes. I therefore agree with Reviewer #1 that "the choice of studying PTEN loss in the complete absence of p53, a situation that does not mirror the clinical situation, needs to be explained".

      We will make our reasons for this choice clear in the text before submission. Please refer to response to Reviewer 1, Major comment 1.

      Reviewer #3 (Significance (Required)):

      The model used and data presented in this study are of significance for both scientific discovery and clinical application, which will interest the broad audience in both basic and clinical research.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      4. Description of analyses that authors prefer not to carry out

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      We are grateful to the reviewers for highlighting the value and power of our 3D chimeric dataset to explore cancer/stellate interactions in pancreatic cancer invasion. We also appreciate their support of our findings identifying divergent roles for the two related enzymes ADAMTS2 and ADAMTS14. We thank the reviewers for their detailed comments, which have allowed us to prepare a significantly stronger and clearer manuscript.

      Following the reviewers comments we have made three major changes to the manuscript, which we will outline here in addition to the point-by-point rebuttal.

      1. i) Revised manuscript structure. We have modified the structure of the manuscript, which we hope improves the clarity and accessibility of the work.

      Figure 1 remains the description of our 3D invasion model and our approach to identify stellate cell and cancer cell transcriptomic information from this context.

      Figure 2 describes our focus on proteases and now includes concordance of our data with clinical data sets. This is also now where we describe the strikingly opposing roles for ADAMTS2 and ADAMTS14 in regulating invasion.

      Figure 3 is now the figure demonstrating that ADAMTS2 and ADAMTS14 have an equal contribution to collagen processing from stellate cells. This is an important experiment given that the main physiological roles for these enzymes are in the processing of collagen, and the importance of collagen for cancer progression. It was therefore reasonable to hypothesise that the effect of these enzymes on invasion could be due to differences in their collagen processing in this context. The finding that both have an equal effect on collagen processing points towards a wider, and more diverse, role for these enzymes in regulating biology.

      Figure 4 describes the divergent roles of these two enzymes on myofibroblast differentiation, and by extension TGFβ bioavailability. In this figure we now include experiments with TGFβ reporter constructs, which demonstrate an increase in active TGFβ following loss of ADAMTS14 and a reduction in TGFβ activity following loss of ADAMTS2.

      Figure 5 is our matrisomic experiment to identify enriched enzyme-specific substrates following knockdown of either ADAMTS2 or ADAMTS14.

      Figure 6 details our investigation into the substrate responsible for the reduction in invasion following loss of ADAMTS2. As the previous matrisomic experiment identified only two enriched ADAMTS2 substrates, we investigated both in our 3D assays, identifying SERPINE2 as the responsible substrate. Further analysis identified a reduction in plasmin activity in ADAMTS2 deficient cells. This was rescued with co-knockdown of SERPINE2, implicating this pathway as being crucial for mediating the effect of ADAMTS2. Additionally, we now include experiments demonstrating that concomitant knockdown of SERPINE2 alongside ADAMTS2 rescues the reduction in TGFβ activity observed with ADAMTS2 loss alone.

      Figure 7 describes our analysis of ADAMTS14 substrates. As the matrisomics identified a large change in proteins following ADAMTS14 knockdown, we performed an siRNA screen of candidates to identify those responsible for ADAMTS14 phenotype. This, followed by further validation in our 3D invasive assay, revealed Fibulin2 as the responsible substrate. Fibulin2 has a well-established role in regulating TGFβ release from the matrix. In accordance with this we present new data using TGFβ reporter constructs, which demonstrate that the increase in active TGFβ following ADAMTS14 knockdown can be reversed with co-knockdown of Fibulin2.

      1. ii) Improvement of the clinical significance of our chimeric data set and ADAMTS proteins. Ideally, we would like to present IHC images of ADAMTS2 and ADAMTS14 expression in PDAC tissue samples to corroborate our in vitro findings. However as these enzymes are secreted, this precludes antibody based imaging, as it would not provide cell type specific information. RNA scope presents an alternative, however we have experienced technical issues with this technique due to RNA degradation in PDAC tissue and unavailability of ADAMTS2/14 specific probes. In place of this we have used a range of publically available resources.

      We have compared our chimeric data set with human clinical data using the resource published by Maurer and colleagues (PMID: 30658994). This paper presents transcriptomic data from PDAC tumour and stromal compartments using laser microdissection of clinical tissue. In accordance with our data set, the majority of metzincins, including ADAMTS2 and ADAMTS14, are expressed in the stromal compartment. These data are presented in updated figure 2.

      We have also examined ADAMTS2 and ADAMTS14 expression in PDAC and CAF subtypes using publically available data sets. Using the TCGA dataset, we identified that ADAMTS2 and ADAMTS14 are highly expressed in PDAC tumours compared to normal counterparts. As the majority of PDAC is comprised of stroma, the bulk transcriptomic data from TCGA, combined with the results from the Maurer publication, lead us to conclude that this expression reflects the stromal origin of these proteases. In addition, using publically available single cell RNA sequencing data published by Luo and colleagues (PMID: 36333338), we identified ADAMTS2 and ADAMTS14 expression in the prominent PDAC CAF subtypes, inflammatory and myofibroblastic CAFs. Together these data demonstrate that these enzymes are enriched in clinical disease, which when combined with our mechanistic 3D studies implies a greater role for these enzymes in disease progression than previously appreciated.

      iii) Improved mechanistic link between ADAMTS2 and ADAMTS14 with TGFβ bioavailability

      To strengthen the association between ADAMTS2 and ADAMTS14 function, their substrates SERPINE2 and Fibulin2, and TGFβ bioavailability, we have performed the following experiments using TGFβ reporter constructs:

      We have taken conditioned media from stellate cells lacking either ADAMTS2 or ADAMTS14, along with co-knockdown of their substrate, and stimulated a recipient cell line expressing a SMAD Luciferase reporter. These cells express luciferase in response to TGFβ stimulation. In accordance with a role for ADAMTS14 and Fibulin2 in regulating TGFβ, we demonstrate that following ADAMTS14 knockdown there is a strong increase in active TGFβ in the media (Figure 4I), which is abrogated with co-knockdown of Fibulin2 (Figure 7F).

      We have also obtained a fluorescent reporter, CAGA-eGFP, which expresses GFP in response to TGFβ stimulation in order to examine TGFβ activity in 3D cultures. Stellate cells expressing this construct were embedded in collagen: Matrigel hydrogels following knockdown of either ADAMTS2 or ADAMTS14 and CAGA fluorescence recorded after 72 hours of culture. In accordance with our data, stellate cells deficient in ADAMTS14 showed increased fluorescence in 3D, indicative of increased TGFβ activity, which was abrogated with co-knockdown of Fibulin2 (Figure 4J, K and 7G, H). Equally, loss of ADAMTS2 reduced TGFβ activity in 3D culture, which was rescued with co-knockdown of SERPINE2 (Figure 4J, K and 6 D, E).

      These experiments confirm a link between the ADAMTS enzyme, its relevant substrate, and TGFβ bioavailability. Together with extensive published work linking SERPINE2 and Fibulin2 with TGFβ release we are confident in our proposed mechanism for the dichotomic relationship of ADAMTS2 and ADAMTS14 in regulating TGFβ and thus myofibroblast action.

      2. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      • *

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      • *

      This study aims to explain the opposing contributions of stromal stellate cells/CAFs to PDAC. By first identifying stroma-specific proteases, followed by a process of candidate selection and elimination, the authors find that two specific metalloproteases that share enzymatic activity against collagen in fact have differential activity on TGFb availability. This could be interpreted as a way of shaping the CAF population and tumor-promotin or -restricting properties of the stroma.

      There are several flaws that the authors could address to improve the manuscript:

      1. In the flow of experiments and analyses, there is a strange mix of fully unbiased discovery phases followed by functional experiments that do not consider all possible candidates to test, and vice versa. For instance, from the mixed-species transcript analysis, ADAMTS2 and -14 are chosen based on their shared collagenase activity based on literature. However, the authors then perform again a proteomics analysis to identify things from the entire matrisome that are cleaved by these enzymes? Then, for ADAMTS2 a co-silencing approach is done on one selected candidate (Serpine2), but for ADAMTS14 an siRNA screen is performed? The problem of this approach is that the rationale for some studied enzymes is very strong, where as for others it is not.

      We thank the reviewer for their comment and trust the revised manuscript provides more clarity for the rationale of our approach. We performed the chimera sequencing as a discovery experiment to reveal the communication between cancer and stellate cells in a 3D, invasive context. We present the chimera experiment and data here as a resource for the community, with our analysis of ADAMTS2 and ADAMTS14 function serving as a first example of the biological insight this data set can reveal. Other insights revealed from this dataset are active avenues of research in our group.

      Our finding that ADAMTS2 and ADAMTS14 have dramatically opposing roles in regulating invasion was especially striking given their equal contribution to collagen processing in this context. This led us to conclude that the divergent nature of these enzymes must be due to enzyme-specific substrates. A substrate repertoire for these enzymes has been previously published (PMID: 26740262) and we reasoned that the responsible substrate would be enriched following knockdown of the relevant enzyme. Thus we preformed matrisomics on cells lacking either of these enzymes, which did indeed reveal enrichment of known, enzyme-specific substrates that we could use for further analysis.

      The matrisome following ADAMTS2 knockdown was minimally changed and only presented enrichment of two ADAMTS2 substrates. As there was only a minimal cellular phenotype in 2D following loss of ADAMTS2, we decided to concentrate our studies on the two identified substrates in our 3D assay. Conversely as the matrisome following ADAMTS14 knockdown was dramatically different from control cells, and ADAMTS14 knockdown presented a clear phenotype in αSMA expression, we decided to perform a screen of all matrisome hits. This highlighted the role of IL-1β in mediating myofibroblast differentiation, which has been reported elsewhere and validated our approach. Further, this refined the number of enriched ADAMTS14 substrates to two, MMP1 and Fibulin2, with Fibulin2 being identified as the responsible candidate in our 3D assays.

      The ECM is more than just collagen. Choosing these two metalloproteases based on their shared collagen substrate is an approach that perhaps oversimplifies the ECM a bit, and again, does not provide the strongest rationale that these metalloproteases are most likely to explain counteracting stromal activities on tumor growth and progression.

      We fully agree with the reviewers comment and feel our work acutely demonstrates this point. Loss of either ADAMTS2 or ADAMTS14 had similar effects on collagen processing; implicating their divergent roles on invasion was independent of their effects on collagen regulation. This work therefore showcases the incredible complexity of ECM regulation in tumour progression. As discussed in the manuscript, collagen along with other elements of the ECM can regulate tumour progression and we believe our work adds an additional facet to this.

      Related to the above: How were the stellate cells used for the matrisome analysis grown? In the suspension setup or adherent? This will have a large impact on the outcome. Is there for instance hyaluronic acid in this matrix?

      The matrisome analysis was conducted on cells cultured in 2D. Vitamin C was added to the media to promote matrix production. We agree that this is not truly reflective of the in vivo situation but as a discovery tool this led us to identify the ADAMTS2 and ADAMTS14 substrates responsible for the function observed in 3D.

      1. Performing the species-specific transcript analysis both ways is a neat approach, but why did the authors ignore the opportunity to formally overlay/compare the two stromal gene sets to define likely candidates based on statistics?

      We primarily used this approach as a discovery tool to identify key differences between cancer and stellate cell compartments. Comparing the two species data sets is problematic as the murine cancer cells express many elements found in the stellate cells, while the human data set presents a cleaner comparison. This is evident from comparing metzincin expression in the two data sets. The human data set (Figure 2A) shows clear separation between cancer and stellate compartments, which is less evident in the murine data set (Supp figure 2A). As noted in supplementary figure 1A, unlike the human cancer cells used in this study, the murine cancer cells are capable of invading without stellate support (although when cultured with stellate cells invasive projections are always stellate led). Nevertheless the murine data set matches the human, although with less clarity.

      Minor comments: The bioinformatics Methods need more details on how reads were mapped to the different genomes. How many mismatches were allowed and was the mapping done separately or using for instance Xenofilter?

      We have improved the methodology section to include more detail for this separation. Using STAR aligner, reads were mapped to host species using a combined human and mouse genome. Ambiguous reads were subsequently discarded from the analysis. While there are bioinformatic packages that seek to match ambiguous reads to parent species we did not use these for our analysis.

      The authors use the knowledge on the activities of both ADAMTS2 and -14 on collagen as a rationale to choose these two. Is there really a need for the paragraph (and associated figures) from line 102 on?

      Given the prominent role collagen has been shown to have in regulating PDAC progression and the primary role for ADAMTS2 and ADAMTS14 being collagen processing, we initially hypothesised that the divergent role for these enzymes on invasion could be due to differences in collagen processing in this context. The fact that both equally contribute to collagen processing is surprising and adds to the novelty of our findings that these enzymes have a more complex role in regulating stromal biology.

      We have altered the structure of the manuscript to emphasise this point. The divergent roles of ADAMTS2 and ADAMTS14 on invasion are now presented in Figure 2, with their equal role in collagen processing now presented after in Figure 3. Figure 4 onwards now details the opposing roles of these enzymes in myofibroblast differentiation and our investigation into the enzyme-specific substrates responsible for this.

      Abstract, line 21; some words are missing?

      We thank the reviewer for bringing this to our attention and have now amended the abstract.

      Were the siRNA screen hits validated?


      Yes, hits relevant for our further investigations, MMP1 and Fibulin2, are presented in the manuscript.

      What is the genotype of the mouse cancer cells? KPC-derived?

      DT6066 are KPC derived while R254 are derived from KPF mice. This has been added to the methods with relevant reference.

      Reviewer #1 (Significance (Required)):

      The trick of dissecting tumor from stromal signals in spheroid cocultures by RNA-Seq is a cool trick, but not new and the authors should probably cite some prior work.

      We have included reference to other work where researchers have used species deconvolution to explore heterocellular interactions (Lines 68-72). However, we believe our work is one of the first to use this approach to explore cellular interactions in an in vitro, 3D, invasive context.

      What this all means for patients (or in vivo tumors even) remains unclear. There is some debate on whether highly activated CAFs (ACTA2/aSMA+ cells, some call them myCAFs) are indeed tumor-restrictive or whether they promote invasion. The authors appear to argue the latter (which I can agree with) but without any translational work to show what the net outcome of this mechanism is, the study remains descriptive and perhaps of limited interest.

      We contend that our 3D invasion model is a powerful tool to understand the role of stellate cells in leading invasion. We have shown the utility of this model in several studies to dissect the biology of this cell type, revealing the importance of the nuclear translocation of FGFR1 in stellate invasion (PMID: 36357571), the role of the kinase PKN2 in regulating stellate heterogeneity (PMID: 35081338) and the influence of cancer cell-derived exosomes on stellate invasion (PMID: 33592190).

      CAFs within PDAC stroma are highly plastic and can adopt multiple functions depending on distinct environmental cues. Thus, identifying how they are regulated is of paramount importance if they are to be therapeutically targeted. We contend that our mechanistic studies using heterocellular 3D models can aid in the dissection of the biology of these cells with more granularity than offered by clinical or in vivo studies, particularly in the context of secreted proteases. To add clinical relevance for our findings we have compared our chimera data set with previously published laser microdissected tumour and stroma PDAC tissue (Figure 2B), and identified ADAMTS2 and ADAMTS14 expression in prominent CAF subtypes (inflammatory and myofibroblastic) from published single cell RNA seq data taken from tumours (Supp figure 2C). As these enzymes are produced in multiple CAF subtypes, genetically targeting them in vivo appears prohibitive. The generation of ADAMTS2 and ADAMTS14 specific inhibitors would be required to assess their roles in vivo.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Carter and colleagues examines that role of cancer-associated fibroblasts (CAFs) in regulation of invasion in a 3D co-culture assay with epithelial cells. The authors propose that invasive chains of cancer cells are led by fibroblasts. The authors utilise a system of co-culture to create chimeric human-mouse fibroblast-cancer cell spheroids (both directions utilised, to eliminate species bias) to allow for in situ sequencing of the co-operating transcriptional programmes of each cell type during 3D invasion. From this powerful approach, this allowed the authors to identify two key two collagen-processing enzymes, ADAMTS2 and ADAMTS14, as contributing to CAF function in their system. The authors identify that these two enzymes have opposing roles in invasion, and map some of their key substrates in invasion, which extend beyond collagen-processing. The authors propose that one key function is to control the processing of TGFbeta, the latter of which is a regulator of the myofibroblast subpopulation of CAF. Overall, the findings of the manuscript are interesting, but need some further proof-of-principal demonstrations to extend their findings to support the claims within.

      • The authors demonstrate a clear role of ADAMTS2 and ADAMTS14 in stellate function during differentiation and invasion. Is there any evidence of such changes in patient materials? Could the authors query publicly available databases of micro-dissected stroma vs epithelium to validate the translational relevance of their findings?

      We thank the reviewer for their suggestion; we have now explored clinical relevance of ADAMTS2 and ADAMTS14 expression in two ways. We have used previously published work by Maurer and colleagues (PMID: 30658994), which descibes transcriptomic analysis of laser microdissected tumour and stroma from pancreatic cancer tissue. In accordance with our chimeric data set the majority of metzincins, including ADAMTS2 and ADAMTS14, are expressed in the sromal compartment (Figure 2B). We have also used publically available scRNA seq data to examine ADAMTS2 and ADAMTS14 expression in distinct CAF subtypes (Supp Figure 2C). Both ADAMTS2 and ADAMTS14 are expressed in inflammatory and myofibroblastic CAFs, with ADAMTS14 expression lower than that of ADAMTS2. Given the complexity of CAF heterogeneity it is possible that ADAMTS2/14 secretion by one population regulates the resulting phenotype of surrounding CAFs, however this hypothesis if beyond the scope of our current work.

      Major comments: - Page no. 4, Line 71, The authors conclude that the invasion in the chimeric spheroids is "led by" stellate cells. This is a key concept in the manuscript. How do the authors define the "led by" phenomena? What is the frequency that this occurs?

      In our experience all invasive projections are stellate led, defined as a stellate-labelled nucleus present at the tip of invasive projections. Indeed the human cancer cells used in this study are incapable of invading in the absence of stellate cells (Supp figure 1 A). We have previously reported this model where we demonstrated FGFR1 activity in the stellate cells is crucial for invasion (PMID: 36357571). Others have demonstrated the general importance for fibroblasts in leading invasion (PMID: 18037882, 28218910). Interestingly in our study, mouse cancer cells were capable of invading in the absence of stellate cells. However, when cultured with stellate cells, projections were predominantly stellate led.

      • For Figure 2A and S2A, the text suggests that the heatmap represents the stellate vs cancer cell expression (as shown in Figure 1B and S1B) in the respective species but the labelling below the heatmap suggests they are all cancer cells (Mia, Pan, R2 and DT). Is this a typo? Could the authors clarify this?

      We use Mia, Pan, R2 and DT to define the sphere combination from which the data originated. We have improved the clarity of the heatmaps by colour coding the different cell types within each sphere, and matching it with the cell type data presented in the heat map. We hope this improved labelling makes the heatmaps more accessible.

      • The text and the figures are lacking information about the cell line names used in the experiment, e.g, Figure 2C, 2D, 2SB, 2SC and 2SD does not indicate what cell line was used in the study. This is the same with other figures as well. Please indicate in all instances exactly which samples are queried.

      We have now included reference to the cell type and stellate cell species used in each experiment in relevant figure legends. Key 3D invasive experiments were conducted with both human and mouse stellate cells.

      • It's mentioned in the text that the authors have used the cancer and stellate cells in a 1:2 ratio but the numbers of stellate cells look different between different spheroids confocal images. e.g. The numbers look very different between the Miapaca2:PS1 vs Miapaca2:mPSC spheroids. Is this simply the representative images, or are their bona fide differences. This, in turn, would impact on claims of cells being 'led' by stellate cells. Can the authors clarify?

      This is a consequence of the method by which the stellate cells were immortalised. Human PS1 stellate cells were immortalised with hTERT, while mouse stellate cells were immortalised with SV40. A consequence of this is that the mouse stellate cells proliferate faster in 3D than the human stellate cells, with both proliferating slower than the cancer cell compartment. So while spheroids start at 1000 cells (666 stellate, 333 cancer) with stellate cells as the prominent component they are quickly overtaken by the cancer cells. Despite this difference in proliferation we find no difference in the invasive capacity of the stellate cells, with invasive projections always stellate led irrespective of whether they are human or mouse.

      • While for most of the experiments the authors generated the chimeric spheroids first and then performed the respective experiments, it appears that for the invasion assay simply co-culture of Cancer cells and stellate cells was done. Is this correct? Have the authors tried performing the assay with the chimeric spheroids to see if the stellate cells still invade?

      The Boyden chamber migration assay was conducted by seeding a co-culture of stellate and cancer cells in the apical compartment then imaging their migration to the basolateral side. This provided a second method to predominantly showcase the enhanced migration of cells lacking ADAMTS14 in a manner that could be quantified over time. We have not tried placing spheroids in the apical compartment and imaging invasion through the pores.

      • The authors claim that ADAMTS2 and ADAMTS14 regulate the bioavailability of TGFB, and this is a key reason that these regulate CAF differentiation. However, there is no direct demonstration of this concept, which is conspicuous by absence. Could the authors either directly demonstrate this, or remove such notions from the results, and explicitly state that this is an untested speculation in discussion? Examples of this are:

      o Line 173, authors state "ADAMTS2 facilitates TGFβ release through degradation of the plasmin inhibitor, SERPINE2 (Figure 5D)"

      o Line 196 authors conclude "Together these data implicate 197 ADAMTS14 as a key regulator of TGFβ bioavailability (Figure 6F)."

      o Line 240 states "This reduces the activation of Plasmin, preventing the release of TGFβ (Figure 5C)." Since this is just a model without detailed experiments, It will be better to propose rather than conclude.

      We appreciate the reviewer’s concern and have now added additional experiments to strengthen the association of ADAMTS enzymes and TGFβ bioavailability.

      Using a TGFβ-responsive luciferase reporter we demonstrate that the media from stellate cells lacking ADAMTS14 has greatly increased amounts of active TGFβ (Figure 4), which is abrogated when Fibulin2 is knocked down alongside (Figure 7). This links ADAMTS14 and Fibulin2 to TGFβ activity. Given the extensive literature detailing a role for Fibulin2 in regulating matrix TGFβ release through interactions with fibrillin (e.g, PMID: 19349279, 12598898, 12429738) we believe this is how ADAMTS14 is regulating myofibroblast differentiation. As we do not directly examine the association of Fibulin2 with fibrillin in this manuscript we have amended the associated statements to reflect this.

      We have also used a TGFβ-responsive fluorescent reporter to examine TGFβ activity of stellate cells in 3D. Consistent with our results, loss of ADAMTS2 reduces, while loss of ADAMTS14 enhances, TGFβ activity (Figure 4), which can be reversed with concomitant knockdown of their respective substrates SERPINE2 (Figure 6) and Fibulin2 (Figure 7).

      • Figure S5C shows a less invasive phenotype in the NTCsi + ADAMTS14si spheroids compared to the NTCsi + NTCsi control. However, there appears no appreciable difference between NTCsi + ADAMTS14si and NTCsi + NTCsi spheroids' brightfield images in Figure 5SD.

      Could the authors comment on this?

      We thank the reviewer for bringing this to our attention and apologise for our mistake. The images were positioned erroneously. This has now been corrected and the images reflect the quantification that demonstrates a clear increase in invasion following loss of ADAMTS14, which is abrogated with co-knockdown of Fibulin2.

      Minor Comments: - Page no. 2, Line 20 has an incomplete sentence "Crosstalk between cancer and stellate cells is pivotal in pancreatic cancer, resulting in differentiation 21 of stellate cells into myofibroblasts that drive."

      Apologies for the error. This has been rectified.

      • Figure 2C; Figure S2C and Figure S5E lack quantification for the western blots.

      We have now included densitometry for all western blots, presenting values relative to the respective loading control and normalised to the experimental control. Values are averages taken from all biological repeats with significance indicated where relevant.

      • Why did the authors choose to investigate the Metzincin family? Could the authors provide their reasoning to investigate these proteins, to the exclusion of other candidates?

      We focused on the metzincin family, as they are best known for their involvement in cancer invasion. A goal for this manuscript is to present our chimera data set as a discovery tool for the community. While this initial manuscript focuses on protease activity, we have further projects on-going that have used this data set to identify important elements of cancer/stellate communication.

      • Info about the number of fields imaged per sample for the microscopy data is missing in the figure legends (e.g. Figure 2F and 2I, Figure 5SF).

      We have now included a statement in each relevant figure legend to indicate that quantification was performed on at least five fields of view per biological repeat.

      • Any particular reason why the ADAMTS2 expression was not checked through Western blotting like ADAMTS14 in Figure S2B.

      We attempted to examine ADAMTS2 by western blotting but were unable to find an antibody that produced consistent results with our samples, and corroborated consistent knockdown by PCR.

      • The legends for Figure 3SC and 5SF mention that "Images are representative of at least two biological replicates". How many technical replicates were used? It would be useful if the relative intensity of the images is measured and plotted in a graph.

      We have now moved these images to the main figure alongside quantification of αSMA intensity. Images are collected from two biological repeats with quantification obtained from at least five fields of view per image. Together these data strongly demonstrate that loss of ADAMTS14 increases αSMA fibre intensity, which is blocked by either an inhibitor of TGFβ signalling (Figure 4), or co-knockdown of Fibulin2 (Figure 7).

      Reviewer #2 (Significance (Required)):

      This work provides an examination of the cross talk between fibroblasts and cancer cells in a 3-Dimensional culture model of pancreatic tumour cell invasion. By using chimeric human-mouse spheroids, the authors are able to identify cell-type specific transcripts by bulk RNA sequencing in situ. This advance is not to be underestimated as a number of existing approaches for cell type-specific profiling (eg. single-cell sequencing) relies upon dissociation of cell communities prior to sequencing. It is very likely that transcriptional programmes change during this isolation process. This approach allows the authors to identify transcriptional co-operating programmes in situ. This data provides a resource to understand this key co-operation of these two cell types during tumourigenesis, and will be of interest to the pancreatic cancer field. In addition, the mapping of the key substrate of these enzymes provides further insights that may be useful in understanding the expanded target repertoire of these enzymes beyond collagen processing.

      We thank the reviewer for their strong support of our chimeric spheroid approach and resulting investigation into the dichotomic roles of ADAMTS2 and ADAMTS14.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      AMADTS2 and ADAMTS14 belong to the disintegrin and metalloproteinase with thrombospondin motif protein family, mainly produced by pancreatic stellate cells (PSC) and are related to cancer cell invasion. This study reveals that ADAMTS2 and ADAMTS14 have opposite roles in myofibroblast differentiation based on experiments testing HSC driven cancer cell invasion, variant expression of HSC activation makers and the related downstream targets analysed upon RNA sequencing analyses. The authors (TA) established PSC/cancer cell chimeric spheroids for investigating the crosstalk between these cell types in 3D in vitro. Based on their findings, they claim that ADAMTS2 and ADAMTS14 have different functions regarding PSC and TGF-β activation. However, their conclusions mainly rely on quantitatve data of invasion and mechanistic details are completely lacking.

      Comments: Typos, even in the abstract, e.g. first sentence incomplete

      We apologise for the error in the abstract and have rectified this in the revised manuscript.

      Introduction is rather sparce with one third of the text repeating the results of the study

      Our manuscript details a discovery experiment using chimeric spheroids to identify cancer cell and stellate cell transcriptomes in a 3D invasive context. We then showcase the power of this data set by using it to identify and then describe divergent roles for ADAMTS2 and ADAMTS14 in shaping stellate cell biology. Given this two-tiered approach we incorporated text that would normally be placed in the introduction into the results section (e.g. our description of the importance of collagen processing in PDAC, presented as a prelude to the results from figure 3). We feel this improves the flow of the manuscript, rather than having information that isn’t necessarily relevant to the reader at the outset.

      Some citations do not at all fit with the position where they are placed; needs approval

      We have examined this in detail and are confident in our use of appropriate references throughout.

      In this study, it is said that TS2 (AMADTS2) and TS14 (ADAMTS14) have opposite functions on myofibroblast differentiation, with individual depletion leading to distinct matrisomal phenotypes in PSC. However, both similarly contribute to collagen processing. As we know, collagen is increased in response to TGFβ signaling, since TS2 depletion (knock down, kd) inhibits and TS14 kd is suggested to promote TGFβ activation, it is expected that this has impact on the available collagen levels. How do the authors explain that nevertheless the kd effect on collagen is very similar?

      The primary effects of these enzymes are on the processing of pro-collagen to its mature form, rather than on the production of collagen. This is evidenced in figure 3B where collagen expression in the whole cell lysate is the same following ADAMTS2 knockdown, and slightly reduced with loss of ADAMTS14, but the mature form is lost in the cell culture supernatant.

      While myofibroblast differentiation is associated with increased collagen production, it is possible that this is perturbed in a situation where the cell is surrounded by collagen that is incompletely processed (e.g. through biomechanical feedback). Given that our results clearly indicated that the effect of ADAMTS2 and ADAMTS14 on invasion is independent of their roles in collagen processing, this avenue is beyond the scope of the current manuscript.

      The authors claim that TS2 facilitates TGFβ release and TS14 is a key regulator of TGFβ bioavailability. However, throughout the whole data, there is no experimental evidence for this conclusion. TGFβ activation, LAP concentration and downstream effects should be provided.

      Most of the conclusions in the manuscript are based on effects to invasion and the estimated quantification histograms. "Black boxes in between the treatment, e.g. knockdown and readout, that relate to the signals and mechanisms remain black boxes throughout. For example, the impact of the treatments on stellate cell activation markers, the cancer cells invasion signaling, the SERPINE2- and Fibulin2-dependent myofibroblast differentiation pathways should be mechanistically investigated.

      We disagree with this comment. Our invasive model shows a clear role for ADAMTS2 and ADAMTS14 in regulating invasion, which is mitigated by disrupting their substrates SERPINE2 and Fibulin2.

      ADAMTS2 loss is associated with a reduction in plasmin activity, which again is mitigated with concurrent loss of SERPINE2. Equally, inhibition of plasmin activity with Aprotinin matches the loss of invasion observed with loss of ADAMTS2. Plasmin has a well-established role in mediating TGFβ release from the matrix. We have now included additional experiments using a TGFβ fluorescent reporter in 3D culture. This demonstrates that loss of ADAMTS2 reduces TGFβ activity, which can be rescued with co-knockdown of SERPINE2 (Figure 6). Our data therefore support a mechanism where ADAMTS2 blocks TGFβ release from the matrix, and therefore myofibroblast differentiation, through its regulation of SERPINE2 activity.

      We have strengthened our proposed mechanism for ADAMTS14 regulation of TGFβ through Fibulin2 with the use of both luciferase and fluorescent TGFβ reporter constructs. Using these reporters, we demonstrate that stellate cells lacking ADAMTS14 exhibit increased TGFβ activity (Figure 4), which is mitigated with co-knockdown of Fibulin2 (Figure 7). Combined with the effects on αSMA expression and 3D invasion, our data fit with a model where ADAMTS14 regulates TGFβ bioavailability through Fibulin2.

      The authors investigate one cell line each for their conclusions; we know that different cell lines behave differently; can they confirm that the finding they present is of general validity or a finding that is specific for the tested cancer/PSC cell lines. Can the principle findings also be proven in primary cells. More importantly, the authors should proof their findings in PaCa tissue of patients as follows: Expression of the proteases in the tissue, related variation of matrisome signatures, e.g. by snRNASeq, to confirm relevance of the finding.

      All our key 3D invasive experiments are repeated with both human and mouse stellate cells, adding strength to our proposed association with ADAMTS2 and SERPINE2, and ADAMTS14 and Fibulin2, on the invasive capacity of stellate cells. As detailed above we have explored the clinical relevance of our findings by examining laser dissected tumour and stromal data from PDAC tissue, and scRNA fibroblast data. These data confirm that ADAMTS2 and ADAMTS14 are predominantly expressed in the stromal compartment of the tumour and are associated with key CAF subtypes present in the PDAC environment, inflammatory and myofibroblastic CAFs.

      Details related to the figures: Figure 1: Are the numbers of PSC and PaCa cells integrated in the spheres related to the numbers found in patients?

      The 2:1 ratio of stellate to cancer cells used to produce spheres is a technical requirement and reflects the numbers in patients (PMID: 23359139). Cancer cells will proliferate substantially faster than the stellate cells so at the end of the experiment (day 3) the spheres are predominantly cancer cells. Nevertheless the stellate cells are able to drive invasion of the cancer cells, which can be quantitatively assessed in this model.

      B, it seems that the PSC in the spheroid are not equally distributed but instead are all located in close vicinity to eachother in a cloud; is that the representative situation for the spheres and is this similar in the PaCa cancer tissue? Does this have influence on the results?

      We have replaced this image with a more representative image that shows mouse stellate cells dispersed throughout the sphere.

      Figure 2: It is interesting to hear that BMP1, which is actually a ligand for BMP signaling is a protease for Collagen. How does this work?

      While the BMP family generally belong to the TGFβ superfamily, BMP1 is the exception in that it is a C-terminal collagenase. Please refer to reference 21 in the manuscript (PMID: 33879793), which details the role of BMP1 on collagen processing and the resulting effect on PDAC progression.

      C, Quantification of all blots should be presented.

      We have now included densitometry for all western blots, presenting values relative to the loading control and normalised to the experimental control. Values are averages taken from all biological repeats with significance indicated by stars.

      Figure S2: TS2 kd and TS14 kd should be confirmed and provided by both qrt PCR and WB data.

      We were unable to assess ADAMTS2 knockdown by western blot due to the quality of available antibodies. We are confident that either western or PCR confirmation of knockdown is sufficient, especially given the strong phenotype observed with the resulting knockdown.

      Figure 3: F; this result is arguing against the conclusion that TGFb bioavailability is a function of the ADAMs, since the kd impacts on the treatment result with exogenous TGFb. This suggests an effect downstream of ligand activation by proteasomal cleavage, e.g. receptor activation or signal transduction; this needs clarification. H, I: TGFβR inhibitor reduces TS14kd enhanced αSMA expression. How is unclear and needs clarification, since from F we know that already activated TGFβ needs TS2 to fully induce αSMA expression.

      SupplFig.3: B, C, as above!

      αSMA expression in stellate cells requires continuous exposure to TGFβ over 48 hours. Active TGFβ has an incredibly short half-life (minutes) and so requires positive feedback to maintain signalling. We propose that following ADAMTS2 knockdown the cells are incapable of releasing further TGFβ to maintain the phenotype. Equally following ADAMTS14 knockdown the cells are able to release more TGFβ, which is incapable of initiating signalling when the receptor is blocked.

      Figure 4: TIMP1 is a canonical TGFb signaling target gene in fibrosis. How the authors explain that TIMP1 is upregulated in both knockdowns, when they claim that TS2 and 14 have opposing functions on TGFb activation. This result as well puts their conclusions as regards TGFb and also the myofibroblast phenotype into question. Especially, since TIMP1 signifies stellate cell activation not only in the pancreas, but also in the liver and kidney. C, D, E should be explained in more detail and all details of the results should be presented.

      TIMP1 is a substrate for both ADAMTS2 and ADAMTS14, so its enrichment following knockdown of either is unsurprising, reflective of reduced cleavage of TIMP1. Both our 3D invasive assessment in Figure 6 and αSMA imaging in supplementary figure 5 demonstrate that TIMP1 is not responsible for the effect observed as a consequence from loss of either ADAMTS2 or ADAMTS14.

      This holds also for the different myofibroblast phenotypes. All data should be included. From recent scRNASeq investigations, several myofibroblast populations were described and compared, e.g my-stellate cells vs i-stellate cells. To which of these phenotypes the identified populations belong?

      As mentioned above, we have interrogated publically available data sets and identified ADAMTS2 and ADAMTS14 expression in multiple CAF subtypes. As these proteases are secreted it is probable that one CAF subtype can control the phenotype of surrounding CAFs through ADAMTS2 and ADAMTS14 production. While intriguing, this hypotheses is beyond the scope of the current work.

      Figure 5: C, Only brightfield images are provided, confocal images are suggested for comparison of +/- Aprotinin treatment.

      We do not think the addition of confocal images will add to the comparison. Aprotinin clearly reduces invasion, which coupled with the action of stellate-derived SERPINE2 on invasion, and reduced plasmin activity following ADAMTS2 knockdown, suggests that plasmin is important for regulating the effects of ADAMTS2 on invasion.

      The efficiency of TS2 and Serpine2 kd should be provided by qrt PCR and WB.

      TS2 kd promoted SERPINE2 expression should also be presented by qrt PCR and WB.

      We are confident that either western or PCR confirmation of knockdown is sufficient. Of note is that following ADAMTS2 knockdown, SERPINE2 expression is unchanged (sup figure 4C). This would indicate that the enrichment of SERPINE2 observed in the matrisome following loss of ADAMTS2 is reflective of reduced cleavage, rather than a change in expression.

      Figure 6: A, why ta use aSMA and not invasive activity as a readout here?

      Increased αSMA expression following ADAMTS14 knockdown provides a strong, clear, 2D phenotype to act as a readout for an siRNA screen with high-content imaging. Performing such a screen with our 3D invasive model is currently impractical.

      There are many parameters leading to decreased aSMA expression upon kd; (1) why only MMP1 and Fibulin were selected as candidates?

      From our αSMA screen, MMP1 and Fibulin2 knockdown were the only candidates that were able to both prevent an increase in αSMA seen with ADAMTS14 loss alone, and are known ADAMTS14 substrates. Further validation in our 3D invasive model demonstrated that Fibulin2 and not MMP1 was responsible for the effect of ADAMTS14 loss on invasion.

      (2) the single kd control of the screen candidates is missing!

      We feel this control is not needed, as the goal of the experiment was to establish which candidate was responsible for mediating the effects brought about by ADAMTS14 knockdown. Increased αSMA expression with IL-1β loss validates our approach, as this is a known negative regulator of TGFβ signalling.

      (3) Can it be expected that all these matrisomal proteins are involved in aSMA expression regulation? I have doubts.

      We agree with the reviewers comment, from the siRNA screen (sup figure 5B) it is clear that the majority of the identified matrisome proteins have a minimal effect on αSMA expression following loss of ADAMTS14.

      C, D, E, why MMP1 was not also tested in these assays?

      Our spheroid assay clearly demonstrated that invasion was enhanced following ADAMTS14 knockdown even with co-knockdown of MMP1. Given the strong rescue observed with co-knockdown of Fibulin2 we proceeded to further analyse this candidate over MMP1.

      F, Fibrillin is shown in the figure but not described in the text. It would be quite interesting to see whether Fibrillin kd has the same effect as TS14 kd on LTGF-β activation (which of course need to be shown experimentally).

      The association of fibrillin with TGFβ release is well established as it underpins the biology behind Marfan syndrome. Loss of fibrillin, or mutations to its TGFβ binding sites results in a phenotype consistent with super active TGFβ signalling.

      E, what is the meaning of αSMA intensity quantification? By IF staining of αSMA? PSC αSMA expression should be quantified by qrt PCR and WB.

      We have now incorporated the confocal images analysing αSMA expression into the main figure and labelled the quantification accordingly. We feel this improves the clarity of the figures. Every western blot is now presented with quantification.

      Also here, kd efficiency of TS14 and Fibulin2 should be provided by qrt PCR and WB.

      Figure S5E should be part of figure 6, qrt PCR of Fibulin2 should be added.

      We have moved this western blot to the main figure (Fig 7C). We feel additional PCR validation of Fibulin 2 knockdown is not necessary.

      Figure 5/6 and throughout: It is claimed that ADAMTS2 and ADAMTS14 regulate TGFβ bioavailability through SREPINE2-Plasmin and Fibulin2. As mentioned above, TGFβ activation is only mentioned in the schemes, but no experimental evidence is given. In addition, according to previous studies, ADAMTSs can activate latent TGFβ directly by interaction with the LAP of latent TGFβ. .

      We have now included extra experimental evidence to support an association of ADAMTS proteins with TGFβ bioavailability. Using a TGFβ luciferase reporter construct, we demonstrate that active TGFβ is increased following loss of ADAMTS14, which is abrogated with concomitant loss of Fibulin2. This provides further evidence that ADAMTS14 is mediating its effects on myofibroblast differentiation / invasion through TGFβ release.

      Figure 3B, C, and 6D: We are confused from the migration/invasion assays. Invasion should be based on migration of tumor cells, whereas in the migration assays only stellate cells seem to be active? Can you explain this to us? According to Figure 3B, stellate and cancer cells are cocultured in the chamber. Is this the same condition as for the experiment presented as figure 6D?

      In our migration assay, stellate and cancer cells are co-cultured in the apical chamber and cell migration imaged over time. We pooled data of both cancer and stellate cell migration following stellate specific knockdown of either ADAMTS2 or ADAMTS14, which showed an increase in cell migration following loss of ADAMTS14. In figure 7, we again use this assay to demonstrate that Fibulin2 expression accounts for the phenotype observed from loss of ADAMTS14.

      In summary, this study for the first time found that ADAMTS2 and ADAMTS14 have opposite roles on myofibroblast differentiation, which is shown by using chimeric spheroids of stellate and pancreatic cancer cells. The authors claim a therapeutic potential for pancreatic cancer by regulating ADAMTS2/14-mediated stellate cell activation, which should avoid cancer cell invasion. The approach is interesting and there is preliminary evidence, however the study has many gaps and requires substantive workload.

      We thank the reviewer for their support of our findings. We hope the additional data, combined with the known role for these substrates in the regulation of TGFβ, strengthens the clarity of our manuscript.