5,256 Matching Annotations
  1. Aug 2020
    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this study, Müller, Matthews, Vali, and Baulcombe have used data-driven machine learning approaches to annotated and classified sRNA loci of Chlamydomonas reinhardtii. I have found the manuscript very interesting and a handy handbook for the appropriate way to annotate sRNA loci in different organisms. I believe this is not only a great resource paper on its own, but it also contains essential information to start understanding how Chalmydomonas silence TEs without a RdDM pathway. I have a few comments that may help to improve the manuscript.

      -L50-52: Can you predict where the unmapped read came from? Could viral infections be the source as in land plants? -L67-68, which is the explanation?

      • Fig 1D the reference to the A,C,G,U 5' should be re-positioned within Figure 1D panel space. -Figure 3: it could be a supplementary figure based on the relevance given in the manuscript to this point. -P5, line 107: while commenting on strand bias there seems to be a mistake in strong bias definition, it should be x < 0.2 and x > 0.8, not "strong bias (0.2 < x < 0.8)", as in the text. -P5, line 110: marked changes regarding locus size are not as striking in my opinion, in particular log size 6 and following, which is not marked in the graph (the cut off between 6 and 8). Maybe this curve should be split into two distribution graphs based on some important features (as repetitiveness?) that might allow a better definition of cut-offs.
      • Fig 5: the legend has the C subfigure twice, the second should be D.
      • Table 1: I believe the data would be better presented in a plot, potentially something similar to the plot in Figure 1 A and B. The numbers are already presented in the supplementary spreadsheet.
      • Fig 6A: The boxplots regarding Stability of the clusters should be better described. What exactly does the y-axis in each "small plot" represent?
      • P6, line 142: analyses of stability and variance shows 7 as the optimal k, while gap statistics and NMI suggested 6 as the optimal. It is not clear why 6 was preferred. The MCA section in Methods is unclear regarding this point too.
      • Fig S2-S5: please check legends, they are identical, although they should cover examples of loci in LC2 through LC5. These figures are not cited in the text, only S1 and S2. -Fig 9: I suggest using different colors in density plots to ease interpretation. LC tracks could share a color and Gene, TEs, DNA meth, and All loci should have a different color each. -Supplementary Files S1: The full-annotated locus map should be provided as a spreadsheet file or as a text (.csv) file, not as a pdf file. -I may be misunderstanding Fig. 6E, but it looks strange that the observed sum-of-squares is smooth, but the expected is not. Is it possible that the in-figure reference is inverted?

      Significance

      This is a very interesting aticle. It may looks a little bit technical but is provide useful information for people studying Chlamydomonas. In addition, the way the authors approached the annotation of sRNA is very meticulous and elegant. I would suggest people exploring small RNAs in non-model organisms to use this article as a handbook of how to annotate sRNAs. In this particular way the artivle will be of interest beyong the Chlamydomonas, and event plant, research field.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer1

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript is clearly written and the figures appropriate and informative. Some descriptions of data analyses are a little dense but reflect what would appear long hard efforts on the part of the authors to identify and control for possible sources of misinterpretation due to sensitivities of parameters in their fitness model. The authors efforts to retest interactions under non-competition conditions allay fears of most concerns that I would have. One problem though that I could not see explicitly addressed was that of potential effects of interactions between methotrexate and the other conditions and how this is controlled for. Specifically, I could be argued that the fact that a particular PPI is observed under a specific condition could have more to do with a synthetic effect of treatment of cells with a drug plus methotrexate. Is this controlled for and how? I raise this because in a chemical genetic screen for fitness it was shown that methotrexate is particularly promiscuous for drug-drug interactions (Hillenmeyer ME ,et al. Science 2008). I tried to think of how this works but couldn't come up with anything immediately. I'd appreciate if the authors would take a crack at resolving this issue. Otherwise I have no further concerns about the manuscript.

      We thank the reviewer for the kind comments. We agree with the reviewer’s point that methotrexate could be interacting with drugs or other perturbagens, similar to how the chosen nitrogen source, carbon source, or other growth conditions may interact with a drug. However, the methotrexate concentration is held constant across all conditions, as is the rest of the media components such as the nitrogen and carbon source (with the exception of the raffinose perturbation). Any interactions with methotrexate, or other media components, is undetectable without systematically varying all components for all stressors. Therefore, we use the typical experimental design of measuring molecular variation from a reference, holding invariant media components (such as methotrexate, glucose, or vitamins) fixed between conditions. This is a general practice, and we describe that every condition contains methotrexate on page 3, line 10.

      The library was grown under mild methotrexate selection in 9 environments for 12-18 generations in serial batch culture, diluting 1:8 every ~3 generations, with a bottleneck population size greater than 2 x 109 cells (Table S1).

      We also list the full details of each environment in Table S1.

      Reviewer #1 (Significance (Required)):

      Lui et al expand on previous work from the Levy group to explore a massive in vivo protein interactome in the yeast S. cerevisiae. They achieve this by performing screens cross 9 growth conditions, which, with replication, results in a total of 44 million measurements. Interpreting their results based on a fitness model for pooled growth under methotrexate selection, they make the key observation that there is a vastly expanded pool of protein-protein interactions (PPI) that are found under only one or two condition compared to a more limited set of PPI that are found under a broad set of conditions (mutable versus immutable interactors). The authors show that this dichotomy suggests some important features of proteins and their PPIs that raise important questions about functionality and evolution of PPIs. Among these are that mutable PPIs are enriched for cross-compartmental, high disorder and higher rates of evolution and subcellular localization of proteins to chromatin, suggesting roles in gene regulation that are associated with cellular responses to new conditions. At the same time these interactions are not enriched for changes in abundance. These results are in contrast to those of immutable PPIs, which seem to form a core background noise, more determined by changes in abundance than what the authors interpret must be post-translational processes that may drive, for instance, changes in subcellular localization resulting in appearance of PPIs under specific conditions. The authors are also able to address a couple of key issues about protein interactomes, including the controversial Party-date Hub hypothesis of Vidal, in which they could now affirm support for this hypothesis based on their results and notably negative correlation of PPIs to protein abundance for mutable PPIs. Finally, they also addressed the problem of predicting the upper limit of PPIs in yeast, showing the remarkable results that it may be no more than about 2 times the number of proteins expressed by yeast. Such an upper limit is profoundly important to modelling cellular network complexity and, if it holds up, could define a general upper limit on organismal complexity.

      This manuscript is a very important contribution to understanding dynamics of molecular networks in living cells and should be published with high priority.

      Reviewer 2

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Report on Liu et al. "A large accessory protein interactome is rewired across environments"

      Liu et al. use a mDHFR-based, pooled barcode sequencing / competitive growth / mild methotrexate selection method to investigate changes of PPI abundance of 1.6 million protein pairs across different 9 growth conditions. Because most PPI screens aim to identify novel PPIs in standard growth conditions, the currently known yeast PPI network may be incomplete. The key concept is to define immutable" PPIs that are found in all conditions and "mutable" PPIs that are present in only some conditions.

      The assay identified 13764 PPIs across the 9 conditions, using optimized fitness cut offs. Steady PPI i.e. across all environments, were identified in membrane compartments and cell division. Processes associated with the chromosome, transcription, protein translation, RNA processing and ribosome regulation were found to change between conditions. Mutable PPIs are form modules as topological analyses reveals.

      Interestingly, a correlation on intrinsic disorder and PPI mutability was found and postulated as more flexible in the conformational context, while at the same time they are formed by less abundant proteins.

      I appreciate the trick to use homodimerization as an abundance proxy to predict interaction between heterodimers (of proteins that homodimerize). This "mass-action kinetics model" explains the strength of 230 out of 1212 tested heterodimers.

      A validation experiment of the glucose transporter network was performed and 90 "randomly chosen" PPIs that were present in the SD environment were tested in NaCl (osmotic stress) and Raffinose (low glucose) conditions through recording optical density growth trajectories. Hxt5 PPIs stayed similar in the tested conditions, supported by the current knowledge that Hxt5 is highly expressed in stationary phase and under salt stress. In Raffinose, Hxt7, previously reported to increase the mRNA expression, lost most PPIs indicating that other factors might influence Hxt7 PPIs.

      **Points for consideration:**

      *) A clear definition of mutable and immutable is missing, or could not be found e.g. at page 4 second paragraph.

      We thank the reviewer for pointing this out. We have now added better definition of mutable and immutable on line 19 page 4:

      We partitioned PPIs by the number of environments in which they were identified and defined PPIs at opposite ends of this spectrum as “mutable” PPIs (identified in only 1-3 environments) and “immutable” (identified in 8-9 environments).

      *) Approximately half of the PPIs have been identified in one environment. Many of those mutable PPIs were detected in the 16{degree sign}C condition. Is there an explanation for the predominance of this specific environment? What are these PPIs about?

      The reviewer is correct that ~40% of the PPIs identified in only one environment were found in the 16 ℃ environment. One reason for this could be technical: the positive predictive value (PPV) is the lowest amongst the conditions (16 ℃: 31.6%, mean: 57%, Table SM6). It must be noted, however, that PPVs are calculated using reference data that has generally been collected in standard growth conditions. So, it might be expected that the most divergent environment from standard growth conditions (resulting in the most differences in PPIs) would result in a lower PPV in our study even if the true frequency of false positives was equivalent across environments. We have attempted to be transparent about the quality of the data in each environment by reporting PPVs and other metrics in Table SM6. However, we suspect that the large number of PPIs unique to 16 ℃ is due in part to the fact that it causes the largest changes in the protein interactome, and believe that it should be included, even at the risk of lowering the overall quality of the data. The main reason for this is that this data is likely to contain valuable information about how the cell copes with this stress. For example, we find, but do not highlight in the manuscript, that 16 ℃-specific PPIs contain two major hubs (DID4: 285 PPIs involved in endocytosis and vacuolar trafficking, and DED1: 102 PPIs involved in translation), both of which are reported to be associated with cold adaptation in yeast (Hilliker et al., 2011; Isasa et al., 2015).

      To assess whether the potentially higher false-positive rate in 16 ℃ could be impacting our conclusions related to PPI network organization and features of immutable and mutable PPIs, we repeated these analyses leaving out the 16 ℃ data and found that our main conclusions did not change. This new analysis is now presented in Figure S8 and described on page 5, line 10.

      Finally, we used a pair of more conservative PPI calling procedures that either identified PPIs with a low rate of false positives across all environments (FPR

      We have also added references to other panels in Figure S8 throughout the manuscript, where appropriate.

      *) 50 % overall retest validation rate is fair and reflects a value comparable to other large-scale approaches. However what is the actual variation, e.g. between mutable PPIs and immutable or between condition. e.g. at 16{degree sign}C.

      We validated 502 PPIs present in the SD environment and an additional 36 PPIs in the NaCl environment. As the reviewer suggests, we do indeed observe differences in the validation rate across mutability bins. This data is reported in Figures 3B and S6B, and we use this information to provide a confidence score for each PPI on page 5, line 4.

      To better estimate how the number of PPIs changes with PPI mutability, we used these optical density assays to model the validation rate as a function of the mean PPiSeq fitness and the number of environments in which a PPI is detected. This accurate model (Spearman's r =0.98 between predicted and observed, see Methods) provided confidence scores (predicted validation rates) for each PPI (Table S5) and allowed us to adjust the true positive PPI estimate in each mutability bin. Using this more conservative estimate, we still found a preponderance of mutable PPIs (Figure S6E).

      The validation rate in NaCl is similar to SD (39%, 14/36), suggesting that validation rates do not vary excessively across environments. Because validation experiments are time consuming (we performed 6 growth experiments per PPI), performing a similar scale of validations in all environments as in SD would be resource intensive. Insead, we report a number of metrics (true positive rate, false positive rate, positive predictive value) in Table SM6 using large positive and random reference sets. We believe these metrics are sufficient for readers to compare the quality of data across environments.

      *) What is the R correlation cutoff for PPIs explained in the mass equilibrium model vs. not explained?

      We do not use an R correlation cutoff to assess if a PPI is explained by the mass-action equilibrium model. We instead rely on ordinary least-squares regression as detailed in the methods on page 68, line 13.

      ...we used ordinary least-squares linear regression in R to fit a model of the geometric mean of the homodimer signals multiplied by a free constant and plus a free intercept. Significantly explained heterodimer PPIs were judged by a significant coefficient (FDR 0.05, single-test). This criteria was used to identify PPIs for which protein expression does or does not appear to play as significant of a role as other post-translational mechanisms.

      The first criterion identifies a quantitative fit to the model of variation being related. The second criterion is used to filter out PPIs for which the relationship appears to be explained by more than just the homodimer signals. This approach is more stringent, but we believe this is the most appropriate statistical test to assess fit to this linear model.

      *) 90 "randomly chosen" PPIs for validation. It needs to be demonstrated that these interaction are a random subset otherwise is could also mean cherry picked interactions.

      We selected 90 of the 284 glucose transport-related PPIs for validation using the “sample” function in R (replace = FALSE). We have now included text that describes this on page 63, line 3 in the supplementary methods:

      Diploids (PPIs) on each plate were randomly picked using the “sample” function in R (replace = FALSE) from PPIs that meet specific requirements.

      *) Figure 4 provides interesting correlations with the goal to reveal properties of mutable and less mutable PPIs. PPIs detected in the PPIseq screen can partially be correlated to co-expression (4A) as well as co-localization. Does it make sense to correlate the co-expression across number of conditions? Are the expression correlation condition specific. In this graph it could be that expression correlation stems from condition 1 and 2 and the interaction takes place in 4 and 5 still leading to the same conclusion ... Is the picture of the co-expression correlation similar when you simply look at individual environments like in S4A?

      We use co-expression mutual rank scores from the COXPRESdb v7.3 database (Obayashi et al., 2019). These mutual rank scores are derived from a broad set of 3593 environmental perturbations that are not limited to the environments we tested here. By using this data, we are asking if co-expression in general is correlated with mutability and report that it is in Figure 4A. We thank the reviewer for pointing out that this was not clear and have now added text to clarify that the co-expression analysis is derived from external data on page 6, line 7.

      We first asked whether co-expression is indeed a predictor of PPI mutability and found that it is: co-expression mutual rank (which is inversely proportional to co-expression across thousands of microarray experiments) declined with PPI mutability (Figures 4A and S11) (Obayashi and Kinoshita, 2009; Obayashi et al., 2019).

      The new figure S11 examines how the co-expression mutual rank changes with PPI mutability for PPIs identified in each environment, as the reviewer suggested. For each environment, we find the same general pattern as in Figure 4A (which considers PPIs from all environments).

      *) Figure 4C: Interesting, how dependent are the various categories?

      It is well known that many of these categories are correlated (e.g. mRNA expression level and protein abundance, and deletion fitness effect and genetic interaction degree). However, we believe it is most valuable to report the correlation of each category with PPI mutability independently in Figures 4C and S12, since similar correlations with related categories provide more confidence in our conclusions.

      *) Figure 4 F: When binned in the number of environments in which the PPI was found, the distribution peaks at 6 environments and decreases with higher and lower number of environments. The description /explanation in the text clearly says something else.

      We reported on page 7, line 15:

      We next used logistic regression to determine what features may underlie a good or poor fit to the model (Figure S14C) and found that PPI mutability was the best predictor, with more mutable PPIs being less frequently explained (Figure 4F). Unexpectedly, mean protein abundance was the second best predictor, with high abundance predicting a poor fit to the model, particularly for less mutable PPIs (Figure S14D and S14E).

      As the reviewer notes, Figure 4F shows that the percent of heterodimers explained by the model does appear to decrease for PPIs observed in the most environments. We suspect that the reviewer is correct that something more complicated is going on. One possibility is that extraordinarily stable PPIs (stable in all conditions) would have less quantitative variation in protein or PPI abundance across environments. If this is true, it would be statistically difficult to fit the mass action kinetics model for these PPIs (lower signal relative to noise), thereby resulting in the observed dip.

      A second possibility is that multiple correlated factors are associated with contributing positively or negatively to a good fit, and the simplicity of Figure 4F or a Pearson correlation does not capture this interplay. This second possibility is why we used multivariate logistic regression (Figure S14C) to dissect the major contributing factors. In the text quote above, we report that high abundance is anti-correlated with a good fit to the model (S14D, S14E). Figure 4C shows that immutable PPIs tend to be formed from highly abundant proteins. One possible explanation is that highly abundant proteins saturate the binding sites of their binding partners, breaking from the assumptions of mass action kinetics model. We have now changed the word “limit” to “saturate” on page 7, line 22 to make this concept more explicit.

      Taken together, these data suggest that mutable PPIs are subject to more post-translational regulation across environments and that high basal protein abundance may saturate the binding sites of their partners, limiting the ability of gene expression changes to regulate PPIs.

      A third possibility is that the dip is simply due to noise. Given the complexity of the possible explanations and our uncertainty about which is more likely, we chose to leave this description out of the main text and focus on the major finding: that PPIs detected in more environments are generally associated with a better fit to the mass action kinetics model.

      *) Figure 6: I apologize, but for my taste this is not a final figure 6 for this study. Investigation of different environments increases the PPI network in yeast, yes, yet it is very well known that a saturation is reached after testing of several conditions, different methods and even screening repetition (sampling). It does not represent an important outcome. Move to suppl or remove.

      We included Figure 6 to summarize and illustrate the path forward from this study. This is an explicit reference to impactful computational analyses done using earlier generations of data to assess the completeness of single-condition interaction networks (Hart et al., 2006; Sambourg and Thierry-Mieg, 2010). Here, we are extending PPI measurement of millions-scale networks across multiple environments, and are using this figure to extend these concepts to multi-condition screens. We agree that the property of saturation in sampling is well known, but it is surprising that we can quantitatively estimate convergence of this expanded condition-specific PPI set using only 9 conditions. Thus, we agree with Reviewer 1 that these are “remarkable results” and that the “upper limit is profoundly important to modelling cellular network complexity and, if it holds up, could define a general upper limit on organismal complexity.” We think this is an important advance of the paper, and this figure is useful to stimulate discussion and guide future work.

      Reviewer #2 (Significance (Required)):

      Liu et al. increase the current PPI network in yeast and offer a substantial dataset of novel PPIs seen in specific environments only. This resource can be used to further investigate the biological meaning of the PPI changes. The data set is compared to previous DHFR providing some sort of quality benchmarking. Mutable interactions are characterized well. Clearly a next step could be to start some "orthogonal" validation, i.e. beyond yeast growth under methotrexate treatment.

      The reviewer makes a great point that we also discuss on page 9, line 33:

      While we used reconstruction of C-terminal-attached mDHFR fragments as a reporter for PPI abundance, similar massively parallel assays could be constructed with different PCA reporters or tagging configurations to validate our observations and overcome false negatives that are specific to our reporter. Indeed, the recent development of “swap tag” libraries, where new markers can be inserted C- or N-terminal to most genes (Weill et al., 2018; Yofe et al., 2016), in combination with our iSeq double barcoder collection (Liu et al., 2019), makes extension of our approach eminently feasible.

      Reviewer 3

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      **Summary**

      The manuscript "A large accessory protein interactome is rewired across environments" by Liu et al. scales up a previously-described method (PPiSeq) to test a matrix of ~1.6 million protein pairs of direct protein-protein interactions in each of 9 different growth environments.

      While the study found a small fraction of immutable PPIs that are relatively stable across environments, the vast majority were 'mutable' across environments. Surprisingly, PPIs detected only in one environment made up more than 60% of the map. In addition to a false positive fraction that can yield apparently-mutable interactions, retest experiments demonstrate (not surprisingly) that environment-specificity can sometimes be attributed to false-negatives. The study authors predict that the whole subnetwork within the space tested will contain 11K true interactions.

      Much of environment-specific rewiring seemed to take place in an 'accessory module', which surrounds the core module made of mostly immutable PPIs. A number of interesting network clustering and functional enrichment analyses are performed to characterize the network overall and 'mutable' interactions in particular. The study report other global properties such as expression level, protein abundance and genetic interaction degree that differ between mutable and immutable PPIs. One of the interesting findings was evidence that many environmentally mutable PPI changes are regulated post-translationally. Finally, authors provide a case study about network rewiring related to glucose transport.

      **Major issues**

      -The results section should more prominently describe the dimensions of the matrix screen, both in terms of the set of protein pairs attempted and the set actually screened (I think this was 1741 x 1113 after filtering?). More importantly, the study should acknowledge in the introduction that this was NOT a random sample of protein pairs, but rather focused on pairs for which interaction had been previously observed in the baseline condition. This major bias has a potentially substantial impact on many of the downstream analyses. For example, any gene which was not expressed under the conditions of the original Tarrasov et al. study on which the screening space was based will not have been tested here. Thus, the study has systematically excluded interactions involving proteins with environment-dependent expression, except where they happened to be expressed in the single Tarrasov et al. environment. Heightened connectivity within the 'core module' may result from this bias, and if Tarrasov et al had screened in hydrogen peroxide (H2O2) instead of SD media, perhaps the network would have exhibited a code module in H2O2 decorated by less-densely connected accessory modules observed in other environments. The paper should clearly indicate which downstream analyses have special caveats in light of this design bias.

      We have now added text the matrix dimensions of our study on page 3, line 3:

      To generate a large PPiSeq library, all strains from the protein interactome (mDHFR-PCA) collection that were found to contain a protein likely to participate in at least one PPI (1742 X 1130 protein pairs), (Tarassov et al., 2008) were barcoded in duplicate using the double barcoder iSeq collection (Liu et al., 2019), and mated together in a single pool (Figure 1A). Double barcode sequencing revealed that the PPiSeq library contained 1.79 million protein pairs and 6.05 million double barcodes (92.3% and 78.1% of theoretical, respectively, 1741 X 1113 protein pairs), with each protein pair represented by an average of 3.4 unique double barcodes (Figure S1).

      We agree with the reviewer that our selection of proteins from a previously identified set can introduce bias in our conclusions. Our research question was focused on how PPIs change across environments, and thus we chose to maximize our power to detect PPI changes by selecting a set of protein pairs that are enriched for PPIs. We have now added a discussion of the potential caveats of this choice to the discussion on page 9, line 4:

      Results presented here and elsewhere (Huttlin et al., 2020) suggest that PPIs discovered under a single condition or cell type are a small subset of the full protein interactome emergent from a genome. We sampled nine diverse environments and found approximately 3-fold more interactions than in a single environment. However, the discovery of new PPIs began to saturate, indicating that most condition-specific PPIs can be captured in a limited number of conditions. Testing in many more conditions and with PPI assays orthogonal to PPiSeq will undoubtedly identify new PPIs, however a more important outcome could be the identification of coordinated network changes across conditions. Using a test set of ~1.6 million (of ~18 million) protein pairs across nine environments, we find that specific parts of the protein interactome are relatively stable (core modules) while others frequently change across environments (accessory modules). However, two important caveats of our study must be recognized before extrapolating these results to the entire protein interactome across all environment space. First, we tested for interactions between a biased set of proteins that have previously been found to participate in at least one PPI as measured by mDHFR-PCA under standard growth conditions (Tarassov et al., 2008). Thus, proteins that are not expressed under standard growth conditions are excluded from our study, as are PPIs that are not detectable by mDHFR-PCA or PPiSeq. It is possible that a comprehensive screen using multiple orthogonal PPI assays would alter our observations related to the relative dynamics of different regions of the protein interactome and the features of mutable and immutable PPIs. Second, we tested a limited number of environmental perturbations under similar growth conditions (batch liquid growth). It is possible that more extreme environmental shifts (e.g. growth as a colony, anaerobic growth, pseudohyphal growth) would introduce new accessory modules or alter the mutability of the PPIs we detect. Nevertheless, results presented here provide a new mechanistic view of how the cell changes in response to environmental challenges, building on the previous work that describes coordinated responses in the transcriptome (Brauer et al., 2007; Gasch et al., 2000) and proteome (Breker et al., 2013; Chong et al., 2015).

      -Related to the previous issue, a quick look at the proteins tested (if I understood them correctly) showed that they were enriched for genes encoding the elongator holoenzyme complex, DNA-directed RNA polymerase I complex, membrane docking and actin binding proteins, among other functional enrichments. Genes related to DNA damage (endonuclease activity and transposition), were depleted. It was unclear whether the functional enrichment analyses described in the paper reported enrichments relative to what would be expected given the bias inherent to the tested space?

      We did two functional enrichment analyses in this study: network density within Gene Ontology terms (related to Figure 2) and gene ontology enrichment of network communities (related to Figure 3). For both analyses, we performed comparisons to proteins included in PPiSeq library. This is described in the Supplementary Materials on page 63, line 35:

      To estimate GO term enrichment in our PPI network, we constructed 1000 random networks by replacing each bait or prey protein that was involved in a PPI with a randomly chosen protein from all proteins in our screen. This randomization preserves the degree distribution of the network.

      And on page 66, line 38:

      The set of proteins used for enrichment comparison are proteins that are involved in at least one PPI as determined by PPiSeq.

      -Re: data quality. To the study's great credit, they incorporated positive and random reference sets (PRS and RRS) into the screen. However, the results from this were concerning: Table SM6 shows that assay stringency was set such that between 1 and 3 out of 67 RRS pairs were detected. This specificity would be fine for an assay intended for retest or validate previous hits, where the prior probability of a true interaction is high, but in large-scale screening the prior probability of true interactions that are detectable by PCA is much lower, and a higher specificity is needed to avoid being overwhelmed by false positives. Consider this back of the envelope calculation: Let's say that the prior probability of true interaction is 1% as the authors' suggest (pg 49, section 6.5), and if PCA can optimistically detect 30% of these pairs, then the number of true interactions we might expect to see in an RRS of size 67 is 1% * 30% * 67 = 0.2 . This back of the envelope calculation suggests that a stringency allowing 1 hit in RRS will yield 80% [ (1 - 0.2) / 1 ] false positives, and a stringency allowing 3 hits in RRS will yield 93% [ (3 - 0.2) / 3] false positives. How do the authors reconcile these back of the envelope calculations from their PRS and RRS results with their estimates of precision?

      We thank the reviewer for bringing up with this issue. We included positive and random reference sets (PRS:70 protein pairs, RRS:67 protein pairs) to benchmark our PPI calling (Yu et al., 2008). The PRS reference lists PPIs that have been validated by multiple independent studies and is therefore likely to represent true PPIs that are present in some subset of the environments we tested. For the PRS set, we found a rate of detection that is comparable to other studies (PPiSeq in SD: 28%, Y2H and yellow fluorescent protein-PCA: ~20%) (Yu et al., 2008). The RRS reference, developed ten years ago, is randomly chosen protein pairs for which there was no evidence of a PPI in the literature at the time (mostly in standard growth conditions). Given the relatively high rate of false negatives in PPI assays, this set may in fact contain some true PPIs that have yet to be discovered. We could detect PPIs for four RRS protein pairs in our study, when looking across all 9 environments. Three of these (Grs1_Pet10, Rck2_Csh1, and YDR492W_Rpd3) could be detected in multiple environments (9, 7, and 3, respectively), suggesting that their detection was not a statistical or experimental artifact of our bar-seq assay (see table below derived from Table S4). The remaining PPI detected in the RRS, was only detected in SD (standard growth conditions) but with a relatively high fitness (0.35), again suggesting its detection was not a statistical or experimental artifact. While we do acknowledge it is possible that these are indeed false positives due to erroneous interactions of chimeric DHFR-tagged versions of these proteins, the small size of the RRS combined with the fact that some of the protein pairs could be true PPIs, did not give us confidence that this rate (4 of 70) is representative of our true false positive rate. To determine a false positive rate that is less subject to biases stemming from sampling of small numbers, we instead generated 50 new, larger random reference sets, by sampling for each set ~ 60,000 protein pairs without a reported PPI in BioGRID. Using these new reference sets, we found that the putative false positive rate of our assay is generally lower than 0.3% across conditions for each of the 50 reference sets. We therefore used this more statistically robust measure of the false positive rate to estimate positive predictive values (PPV = 62%, TPR = 41% in SD). We detail these statistical methods in Section 6 of the supplementary methods and report all statistical metrics in Table SM6.

      PPI

      Environment_number

      SD

      H2O2

      Hydroxyurea

      Doxorubicin

      Forskolin

      Raffinose

      NaCl

      16℃

      FK506

      Rck2_Csh1

      7

      0.35

      0.35

      0

      0.20

      0.54

      0.74

      0

      0.17

      0.59

      Grs1_Pet10

      9

      0.44

      0.39

      0.34

      0.25

      0.65

      1.19

      0.2

      0.16

      0.95

      YDR492W_Rpd3

      3

      0

      0.18

      0

      0

      0

      0

      0

      0.17

      0.61

      Mrps35_Bub3

      1

      0.35

      0

      0

      0

      0

      0

      0

      0

      0

      Positive_control

      9

      1

      0.8

      0.73

      0.62

      1.4

      2.44

      0.4

      0.28

      1.8

      Table. Mean fitness in each environment

      -Methods for estimating precision and recall were not sufficiently well described to assess. Precision vs recall plots would be helpful to better understand this tradeoff as score thresholds were evaluated.

      We describe in detail our approach to calling PPIs in section 6.6 of the supplementary methods, including Table SM6, and Figures SM3, SM4, SM6, and now Figure SM5. We identified positive PPIs using a dynamic threshold that considers the mean fitness and p-value in each environment. For each dynamic threshold, we estimated the precision and recall based on the reference sets (described supplementary methods in section 6.5). We then chose the threshold with the maximal Matthews correlation coefficient (MCC) to obtain the best balance between precision and recall. We have now added an additional plot (Figure SM5) that shows the precision and recall for the chosen dynamic threshold in each environment.

      -Within the tested space, the Tarassov et al map and the current map could each be compared against a common 'bronze standard' (e.g. literature curated interactions), at least for the SD map, to have an idea about how the quality of the current map compares to that of the previous PCA map. Each could also be compared with the most recent large-scale Y2H study (Yu et al).

      We thank the reviewer for this suggestion. We have now added a figure panel (Figure S4) that compares PPiSeq in SD (2 replicates) to mDHFR PCA (Tarassov et al., 2008), Y2H (Yu et al., 2008), and our newly constructed ‘bronze standard’ high-confidence positive reference set (PRS, supplementary method section 6.4).

      • Experimental validation of the network was done by conventional PCA. However, it should be noted that this is a form of technical replication of the DHFR-based PCA assay, and not a truly independent validation. Other large-scale yeast interaction studies (e.g., Yu et al, Science 2008) have assessed a random subset of observed PPIs using an orthogonal approach, calibrated using PRS and RRS sets examined via the same orthogonal method, from which overall performance of the dataset could be determined.

      We appreciate the reviewer’s perspective, since orthogonal validation experiments have been a critical tool to establish assay performance following early Y2H work. We know from careful work done previously that modern orthogonal assays have a low cross validation rate ((Yu et al., 2008) and that they tend to be enriched for PPIs in different cellular compartments (Jensen and Bork, 2008), indicating that high false negative rates are the likely explanation. High false negative rates have been confirmed here and elsewhere using positive reference sets (e.g. Y2H 80%, PCA 80%, PPiSeq 74% using the PRS in (Yu et al., 2008)). Therefore, the expectation is that PPiSeq, as with other assays, will have a low rate of validation using an orthogonal assay -- although we would not know if this rate is 10%, 30% or somewhere in between without performing the work. However, the exact number -- whether it be 10% or 30% -- has no practical impact on the main conclusions of this study (focused on network dynamics rather than network enumeration). Neither does that number speak to the confidence in our PPI calls, since a lower number may simply be due to less overlap in the sets of PPIs that are callable by PPiSeq and another assay. Our method uses bar-seq to extend an established mDHFR-PCA assay (Tarassov et al., 2008). The validations we performed were aimed at confirming that our sequencing, barcode counting, fitness estimation, and PPI calling protocols were not introducing excessive noise relative to mDHFR-PCA that resulted in a high number of PPI miscalls. Confirming this, we do indeed find a high rate of validation by lower throughput PCA (50-90%, Figure 3B). Finally, we do include independent tests of the quality of our data by comparing it to positive and random reference sets from literature curated data. We find that our assay performs extremely well (PPV > 61%, TPR > 41%) relative to other high-throughput assays.

      -The Venn diagram in Figure 1G was not very informative in terms of assessing the quality of data. It looks like there is a relatively little overlap between PPIs identified in standard conditions (SD media) in the current study and those of the previous study using a very similar method. Is there any way to know how much of this disagreement can be attributed to each screen being sub-saturation (e.g. by comparing replica screens) and what fraction to systematic assay or environment differences?

      We have now added a figure panel (Figure S4) that compares PPiSeq in SD (2 replicates) to mDHFR-PCA (Tarassov et al., 2008), Y2H (Yu et al., 2008), and our newly constructed ‘bronze standard’ high-confidence positive reference sets (PRS, supplementary methods section 6.4). We find that SD replicates have an overlap coefficient of 79% with each other, ~45% with mDHFR-PCA, ~45% the ‘bronze standard’ PRS, and ~13% with Y2H. Overlap coefficients between the SD replicates and mDHFR-PCA are much higher than those found between orthologous methods ((Yu et al., 2008), indicating that these two assays are identifying a similar set of PPIs. We do note that PPiSeq and mDHFR-PCA do screen for PPIs under different growth conditions (batch liquid growth vs. colonies on agar), so some fraction of the disagreement is due to environmental differences. PPIs that overlap between the two PPiSeq SD replicates are more likely to be found in mDHFR-PCA, PRS, and Y2H, indicating that PPIs identified in a single SD replicate are more likely to be false positives. However, we do find (a lower rate of) overlaps between PPIs identified in only one SD replicate and other methods, suggesting that a single PPiSeq replicate is not finding all discoverable PPIs.

      -In Figure S5C, the environment-specificity rate of PPIs might be inflated due to the fact that authors only test for the absence of SD hits in other conditions, and the SD condition is the only condition that has been sampled twice during the screening. What would be the environment-specific verification rate if sample hits from each environment were tested in all environments? This seems important, as robustly detecting environment-specific PPIs is one of the key points of the study.

      We use PPIs found in the SD environment to determine the environment-specificity because this provides the most conservative (highest) estimate of the number of PPIs found in other environments that were not detectable by our bar-seq assay. To identify PPIs in the SD environment, we pooled fitness estimates across the two replicates (~ 4 fitness estimates per replicate, ~ 8 total). The higher number of replicates results in a reduced rate of false positives (an erroneous fitness estimate has less impact on a PPI call), meaning that we are more confident that PPIs identified in SD are true positives. Because false positives in one environment (but not other environments) are likely to erroneously contribute to the environment-specificity rate, choosing the environment with the lowest rate of false positives (SD) should result in the lowest environment-specificity rate (highest estimate of PPIs found in other environments that were not detectable by our bar-seq assay).

      **Minor issues**

      -Re: "An interaction between the proteins reconstitutes mDHFR, providing resistance to the drug methotrexate and a growth advantage that is proportional to the PPI abundance" (pg 2). It may be more accurate to say "monotonically related" than "proportional" here. Fig 2 from the cited Freschi et al ref does suggests linearity with colony size over a wide range of inferred complex abundances, but non-linear at low complex abundance. Also note that Freschi measured colony area which is not linear with exponential growth rate nor with cell count.

      We agree with the reviewer and have changed “proportional” to “monotonically related” on page 2, line 41.

      -Re: "Using putatively positive and negative reference sets, we empirically determined a statistical threshold for each environment with the best balance of precision and recall (positive predictive value (PPV) > 61% in SD media, Methods, section 6)." (pg 3). Should state the recall at this PPV.

      We agree with the reviewer and have added the recall (41%) in the main text (line 26, page3).

      Using putatively positive and negative reference sets, we empirically determined a statistical threshold for each environment with the best balance of precision and recall (positive predictive value (PPV) > 61% and true positive rate > 41% in SD media, Methods, section 6).

      -Authors could discuss the extent to which related methods (e.g. PMID: 28650476, PMID: 27107012, PMID: 29165646, PMID: 30217970) would be potentially suitable for screening in different environments.

      We have now added a reference to a barcode-based Y2H study that examined interactions between yeast proteins to the introduction on page 2, line 2:

      Yet, little is known about how PPI networks reorganize on a global scale or what drives these changes. One challenge is that commonly-used high-throughput PPI screening technologies are geared toward PPI identification (Gavin et al., 2002; Ito et al., 2001; Tarassov et al., 2008; Uetz et al., 2000; Yu et al., 2008, Yachie et al., 2016), not a quantitative analysis of relative PPI abundance that is necessary to determine if changes in the PPI network are occurring. The murine dihydrofolate reductase (mDHFR)‐based protein-fragment complementation assay (PCA) provides a viable path to characterize PPI abundance changes because it is a sensitive test for PPIs in the native cellular context and at native protein expression levels (Freschi et al., 2013; Remy and Michnick, 1999; Tarassov et al., 2008).

      We have excluded the references to other barcode-based Y2H studies that reviewer mentions because they test heterologous proteins within yeast, and the effect of perturbations to yeast on these proteins would be difficult to interpret in the context of our questions. The yeast protein Y2H study, although a wonderful approach and paper, would also not be an appropriate method to examine how PPI networks change across environments because protein fusions are not expressed under their endogenous promoters and must be transported to, in many cases, a non-native compartment (cell nucleus) to be detected. Rather than explicitly discuss the caveats of this particular approach, we have instead chosen to discuss why we use PCA.

      • the term "mutable" is certainly appropriate according to the dictionary definition of changeable. The authors may wish to consider though, that in a molecular biology context the term evokes changeability by mutation (a very interesting but distinct topic). Maybe another term (environment-dependent interactions or ePPIs?) would be clearer. Of course this is the authors' call.

      We thank the reviewer for this suggestion, and have admittedly struggled with the terminology. For clarity of presentation, we strived to have a single word that describes the property of a PPI that is at the core of this manuscript -- how frequently a PPI is found across environments. However, the most descriptive words come with preloaded meanings in PPI research (e.g. transient, stable, dynamic), as does “mutable” with another research field. We are, quite frankly, open to suggestions from the reviewers or editors for a more appropriate word that does not raise similar objections.

      -Some discussion is warranted about the phenomenon that a PPI that is unchanged in abundance could appear to change because of statistical significance thresholds that differ between screens. This would be a difficult question for any such study, and I don't think the authors need to solve it, but just to discuss.

      We agree with the reviewer that significance thresholds could be impacting our interpretations and discuss this idea at length on page 4, line 23 of the Results. This section has been modified to include an additional analysis (excluding 16 ℃ data) in response to another reviewer’s comment:

      Immutable PPIs were likely to have been previously reported by colony-based mDHFR-PCA or other methods, while the PPIs found in the fewest environments were not. One possible explanation for this observation is that previous PPI assays, which largely tested in standard laboratory growth conditions, and variations thereof, are biased toward identification of the least mutable PPIs. That is, since immutable PPIs are found in nearly all environments, they are more readily observed in just one. However, another possible explanation is that, in our assay, mutable PPIs are more likely to be false positives in environment(s) in which they are identified or false negatives in environments in which they are not identified. To investigate this second possibility, we first asked whether PPIs present in very few environments have lower fitnesses, as this might indicate that they are closer to our limit of detection. We found no such pattern: mean fitnesses were roughly consistent across PPIs found in 1 to 6 conditions, although they were elevated in PPIs found in 7-9 conditions (Figure S6A). To directly test the false-positive rate stemming from pooled growth and barcode sequencing, we validated randomly selected PPIs within each mutability bin by comparing their optical density growth trajectories against controls (Figures 3B). We found that mutable PPIs did indeed have lower validation rates in the environment in which they were identified, yet putative false positives were limited to ~50%, and, within a bin, do not differ between PPIs that have been previously identified and those that have been newly discovered by our assay (Figure S65B). We also note mutable PPIs might be more sensitive to environmental differences between our large pooled PPiSeq assays and clonal 96-well validation assays, indicating that differences in validation rates might be overstated. To test the false-negative rate, we assayed PPIs identified in only SD by PPiSeq across all other environments by optical density growth and found that PPIs can be assigned to additional environments (Figure S6C). However, the number of additional environments in which a PPI was detected was generally low (2.5 on average), and the interaction signal in other environments was generally weaker than in SD (Figure S6D). To better estimate how the number of PPIs changes with PPI mutability, we used these optical density assays to model the validation rate as a function of the mean PPiSeq fitness and the number of environments in which a PPI is detected. This accurate model (Spearman's r =0.98 between predicted and observed, see Methods) provided confidence scores (predicted validation rates) for each PPI (Table S5) and allowed us to adjust the true positive PPI estimate in each mutability bin. Using this more conservative estimate, we still found a preponderance of mutable PPIs (Figure S6E). Finally, we used a pair of more conservative PPI calling procedures that either identified PPIs with a low rate of false positives across all environments (FPR

      We later examine major conclusions of our study using more conservative calling procedures, and find that they are consistent. On page 6, line 14:

      Both the co-expression and co-localization patterns were also apparent in our higher confidence PPI sets (Figures S7B, and S7C, S8B, S8C ), indicating that they are not caused by different false positive rates between the mutability bins.

      And on page 6, line 19:

      We binned proteins by their PPI degree, and, within each bin, determined the correlation between the mutability score and another gene feature (Figure 4C and S12A, Table S8) (Costanzo et al., 2016; Finn et al., 2014; Gavin et al., 2006; Holstege et al., 1998; Krogan et al., 2006; Levy and Siegal, 2008; Myers et al., 2006; Newman et al., 2006; Östlund et al., 2010; Rice et al., 2000; Stark et al., 2011; Wapinski et al., 2007; Ward et al., 2004; Yang, 2007; Yu et al., 2008). These correlations were also calculated using our higher confidence PPI sets, confirming results from the full data set (Figures S7D and, S7E, S8D, S8E). We found that mutable hubs (> 15 PPIs) have more genetic interactions, in agreement with predictions from co-expression data (Bertin et al., 2007; Han et al., 2004), and that their deletion tends to cause larger fitness defects.

      -More discussion would be helpful about the idea that immutability may to some extent favor interactions that PCA is better able to detect (possibly including membrane proteins?)

      We agree with the reviewer and now added a discussion of this potential caveats to the discussion on page 9, line 4:

      Results presented here and elsewhere (Huttlin et al., 2020) suggest that PPIs discovered under a single condition or cell type are a small subset of the full protein interactome emergent from a genome. We sampled nine diverse environments and found approximately 3-fold more interactions than in a single environment. However, the discovery of new PPIs began to saturate, indicating that most condition-specific PPIs can be captured in a limited number of conditions. Testing in many more conditions and with PPI assays orthogonal to PPiSeq will undoubtedly identify new PPIs, however a more important outcome could be the identification of coordinated network changes across conditions. Using a test set of ~1.6 million (of ~18 million) protein pairs across nine environments, we find that specific parts of the protein interactome are relatively stable (core modules) while others frequently change across environments (accessory modules). However, two important caveats of our study must be recognized before extrapolating these results to the entire protein interactome across all environment space. First, we tested for interactions between a biased set of proteins that have previously been found to participate in at least one PPI as measured by mDHFR-PCA under standard growth conditions (Tarassov et al., 2008). Thus, proteins that are not expressed under standard growth conditions are excluded from our study, as are PPIs that are not detectable by mDHFR-PCA or PPiSeq. It is possible that a comprehensive screen using multiple orthogonal PPI assays would alter our observations related to the relative dynamics of different regions of the protein interactome and the features of mutable and immutable PPIs. Second, we tested a limited number of environmental perturbations under similar growth conditions (batch liquid growth). It is possible that more extreme environmental shifts (e.g. growth as a colony, anaerobic growth, pseudohyphal growth) would introduce new accessory modules or alter the mutability of the PPIs we detect. Nevertheless, results presented here provide a new mechanistic view of how the cell changes in response to environmental challenges, building on the previous work that describes coordinated responses in the transcriptome (Brauer et al., 2007; Gasch et al., 2000) and proteome (Breker et al., 2013; Chong et al., 2015).

      -Re: "As might be expected, we also found that mutable hubs, but not non-hubs, are more likely to participate in multiple protein complexes than less mutable proteins." (pg 6) This is a cool result. To what extent was this result driven by members of one or two complexes? If so, it would worth noting them.

      We thank the reviewer for this question. We have now included Figue S13, which shows the number and size of protein complexes that underlie the finding that mutable hubs are more likely to participate in multiple protein complexes. We find that proteins in our screen that participate in multiple complexes are distributed over a wide range of complexes, indicating that this observation is not driven by one or two complexes. On page 6, line 34:

      As might be expected, we also found that mutable hubs, but not non-hubs, are more likely to participate in multiple protein complexes than less mutable proteins (Figures S13A-C) (Costanzo et al., 2016).

      -Re: "Borrowing a species richness estimator from ecology (Jari Oksanen et al., 2019), we estimate that there are ~10,840 true interactions within our search space across all environments, ~3-fold more than are detected in SD (note difference to Figure 3, which counts observed PPIs)." (pg 8) Should note that this only allows estimation of the number of interactions that are detectable by PCA methods. Previous work (Braun et al, 2019) showed that every known protein interaction assay (including PCA approaches) can only detect a fraction of bona fide interactions.

      We agree with the reviewer and have modified the discussion to make this point explicit on page 9, line 4:

      Results presented here and elsewhere (Huttlin et al., 2020) suggest that PPIs discovered under a single condition or cell type are a small subset of the full protein interactome emergent from a genome. We sampled nine diverse environments and found approximately 3-fold more interactions than in a single environment. However, the discovery of new PPIs began to saturate, indicating that most condition-specific PPIs can be captured in a limited number of conditions. Testing in many more conditions and with PPI assays orthogonal to PPiSeq will undoubtedly identify new PPIs, however a more important outcome could be the identification of coordinated network changes across conditions.

      We continue in this paragraph to discuss the implications:

      Using a test set of ~1.6 million (of ~18 million) protein pairs across nine environments, we find that specific parts of the protein interactome are relatively stable (core modules) while others frequently change across environments (accessory modules). However, two important caveats of our study must be recognized before extrapolating these results to the entire protein interactome across all environment space. First, we tested for interactions between a biased set of proteins that have previously been found to participate in at least one PPI as measured by mDHFR-PCA under standard growth conditions (Tarassov et al., 2008). Thus, proteins that are not expressed under standard growth conditions are excluded from our study, as are PPIs that are not detectable by mDHFR-PCA or PPiSeq. It is possible that a comprehensive screen using multiple orthogonal PPI assays would alter our observations related to the relative dynamics of different regions of the protein interactome and the features of mutable and immutable PPIs.

      -Re: "This analysis shows that the number of PPIs present across all environments is much larger than the number observed in a single condition, but that it is feasible to discover most of these new PPIs by sampling a limited number of conditions." (pg 8). The main point is surely correct, but it is worth noting that extrapolation to the number of true interactions depends on the nine chosen environments being representative of all environments. The situation could change under more extreme, e.g., anaerobic, conditions.

      We agree with the reviewer and make this point explicit, continuing from the paragraph quoted above on page 9, line 22:

      Second, we tested a limited number of environmental perturbations under similar growth conditions (batch liquid growth). It is possible that more extreme environmental shifts (e.g. growth as a colony, anaerobic growth, pseudohyphal growth) would introduce new accessory modules or alter the mutability of the PPIs we detect. Nevertheless, results presented here provide a new mechanistic view of how the cell changes in response to environmental challenges, building on the previous work that describes coordinated responses in the transcriptome (Brauer et al., 2007; Gasch et al., 2000) and proteome (Breker et al., 2013; Chong et al., 2015).

      -It stands to reason that proteins expressed in all conditions will yield less mutable interactions, if 'mutability' is primarily due to expression change at the transcriptional level. They should at least discuss that measuring mRNA levels could resolve questions about this. Could use Waern et al G3 2013 data (H202, SD, HU, NaCl) to predict the dynamic interactome purely by node removal, and see how conclusions would change

      We agree with the reviewer that mRNA abundance could potentially be used as a proxy for protein abundance and have added this point on page 10, line 28:

      Here we use homodimer abundance as a proxy for protein abundance. However, genome-wide mRNA abundance measures could be used as a proxy for protein abundance or protein abundance could be measured directly in the same pool (Levy et al., 2014) by, for example, attaching a full length mDHFR to each gene using “swap tag” libraries mentioned above (Weill et al., 2018; Yofe et al., 2016).

      However, using mRNA abundance as a proxy for protein abundance in this study has several important caveats that would make interpretation difficult. First, mRNA and protein abundance correlate, but not perfectly (R2 = 0.45) (Lahtvee et al., 2017), and our findings suggest that post-translational regulation may be important to driving PPI changes. Second, mRNA abundance measures are for a single time point, while our PPI measures coarse grain over a growth cycle (lag, exponential growth, diauxic shift, saturation). Although we may be able to take multiple mRNA measures across the cycle, time delays between changes in mRNA and protein levels, combined with the fact that we do not know when a PPI is occurring or most prominent over the cycle, would pose a significant challenge to making any claims that PPI changes are driven by changes in protein abundance. We instead chose to focus on a subset of proteins (homodimers) where abundance measures can be coarse grained in the same way as PPI measures. In the above quote, we point to a potential method by which this can be done for all proteins. We also point to how a continuous culturing design could be used to better determine how protein (or mRNA proxy) abundance impacts PPI abundance on page 10, line 6:

      Finally, our assays were performed across cycles of batch growth meaning that changes in PPI abundance across a growth cycle (e.g. lag, exponential growth, saturation) are coarse grained into one measurement. While this method potentially increases our chance of discovering a diverse set of PPIs, it might have an unpredictable impact on the relationship between fitness and PPI abundance (Li et al., 2018). To overcome these issues, strains containing natural or synthetic PPIs with known abundances and intracellular localizations could be spiked into cell pools to calibrate the relationship between fitness and PPI abundance in each environment. In addition, continuous culturing systems may be useful for refining precision of growth-based assays such as ours.

      -The analysis showing that many interactions are likely due to post-translational modifications is very interesting, but caveats should be discussed. Where heterodimers do not fit the expression-level dependence model, some cases of non-fitting may simply be due to measurement error or non-linearity in the relationship between abundance and fitness.

      We show the measurement error in Figures 1, S2, S3. While we agree with the reviewer that measurement error is a general caveat for all results reported, we do not feel that it is necessary to point to that fact in this particular case, which uses a logistic regression to report that PPI mutability was the best predictor of fit to the expression-level dependence model. We discuss the non-linearity caveat on page 9, line 41:

      Our assay detected subtle fitness differences across environments (Fig S5B and S5C), which we used as a rough estimate for changes in relative PPI abundance. While it would be tempting to use fitness as a direct readout of absolute PPI abundance within a cell, non-linearities between fitness and PPI abundance may be common and PPI dependent. For example, the relative contribution of a reconstructed mDHFR molecule to fitness might diminish at high PPI abundances (saturation effects) and fitness differences between PPIs may be caused, in part, by differences in how accessible a reconstructed mDHFR molecule is to substrate. In addition, environmental shifts might impact cell growth rate, initiate a stress response, or result in other unpredictable cell effects that impact the selective pressure of methotrexate and thereby fitness (Figure S2 and S3).

      -Line numbers would have been helpful to note more specific minor comments

      We are sorry for this inconvenience. We have added line numbers in our revised manuscript.

      -Sequence data should be shared via the Short-Read Archive.

      The raw sequencing data have been uploaded to the Short-Read Archive. We mentioned it in the Data and Software Availability section on page 68, line 41.

      Raw barcode sequencing data are available from the NIH Sequence Read Archive as accession PRJNA630095 (https://trace.ncbi.nlm.nih.gov/Traces/study/?acc=SRP259652).

      Reviewer #3 (Significance (Required)):

      Knowledge of protein-protein interactions (PPIs) provides a key window on biological mechanism, and unbiased screens have informed global principles underlying cellular organization. Several genome-scale screens for direct (binary) interactions between yeast proteins have been carried out, and while each has provided a wealth of new hypotheses, each has been sub-saturation. Therefore, even given multiple genome-scale screens our knowledge of yeast interactions remains incomplete. Different assays are better suited to find different interactions, and it is now clear that every assay evaluated thus far is only capable (even in a saturated screen) of detecting a minority of true interactions. More relevant to the current study, no binary interaction screen has been carried out at the scale of millions of protein pairs outside of a single 'baseline' condition.

      The study by Liu et al is notable from a technology perspective in that it is one of several recombinant-barcode approaches have been developed to multiplex pairwise combinations of two barcoded libraries. Although other methods have been demonstrated at the scale of 1M protein pairs, this is the first study using such a technology at the scale of >1M pairs across multiple environments.

      A limitation is that this study is not genome-scale, and the search space is biased towards proteins for which interactions were previously observed in a particular environment. This is perhaps understandable, as it made the study more tractable, but this does add caveats to many of the conclusions drawn. These would be acceptable if clearly described and discussed. There were also questions about data quality and assessment that would need to be addressed.

      Assuming issues can be addressed, this is a timely study on an important topic, and will be of broad interest given the importance of protein interactions and the status of S. cerevisiae as a key testbed for systems biology.

      *Reviewers' expertise:* Interaction assays, next-generation sequencing, computational genomics. Less able to assess evolutionary biology aspects.

      References

      Brauer, M.J., Huttenhower, C., Airoldi, E.M., Rosenstein, R., Matese, J.C., Gresham, D., Boer, V.M., Troyanskaya, O.G., and Botstein, D. (2007). Coordination of Growth Rate, Cell Cycle, Stress Response, and Metabolic Activity in Yeast. Mol. Biol. Cell 19, 352–367.

      Breker, M., Gymrek, M., and Schuldiner, M. (2013). A novel single-cell screening platform reveals proteome plasticity during yeast stress responses. J. Cell Biol. 200, 839–850.

      Chong, Y.T., Koh, J.L.Y., Friesen, H., Kaluarachchi Duffy, S., Cox, M.J., Moses, A., Moffat, J., Boone, C., and Andrews, B.J. (2015). Yeast Proteome Dynamics from Single Cell Imaging and Automated Analysis. Cell 161, 1413–1424.

      Gasch, A.P., Spellman, P.T., Kao, C.M., Carmel-Harel, O., Eisen, M.B., Storz, G., Botstein, D., and Brown, P.O. (2000). Genomic Expression Programs in the Response of Yeast Cells to Environmental Changes. Mol. Biol. Cell 11, 4241–4257.

      Hart, G.T., Ramani, A.K., and Marcotte, E.M. (2006). How complete are current yeast and human protein-interaction networks? Genome Biol. 7, 120.

      Hilliker, A., Gao, Z., Jankowsky, E., and Parker, R. (2011). The DEAD-box protein Ded1 modulates translation by the formation and resolution of an eIF4F-mRNA complex. Mol. Cell 43, 962–972.

      Isasa, M., Suñer, C., Díaz, M., Puig-Sàrries, P., Zuin, A., Bichmann, A., Gygi, S.P., Rebollo, E., and Crosas, B. (2015). Cold Temperature Induces the Reprogramming of Proteolytic Pathways in Yeast. J. Biol. Chem. jbc.M115.698662.

      Jensen, L.J., and Bork, P. (2008). Not Comparable, But Complementary. Science 322, 56–57.

      Lahtvee, P.-J., Sánchez, B.J., Smialowska, A., Kasvandik, S., Elsemman, I.E., Gatto, F., and Nielsen, J. (2017). Absolute Quantification of Protein and mRNA Abundances Demonstrate Variability in Gene-Specific Translation Efficiency in Yeast. Cell Syst. 4, 495-504.e5.

      Obayashi, T., Kagaya, Y., Aoki, Y., Tadaka, S., and Kinoshita, K. (2019). COXPRESdb v7: a gene coexpression database for 11 animal species supported by 23 coexpression platforms for technical evaluation and evolutionary inference. Nucleic Acids Res. 47, D55–D62.

      Sambourg, L., and Thierry-Mieg, N. (2010). New insights into protein-protein interaction data lead to increased estimates of the S. cerevisiae interactome size. BMC Bioinformatics 11, 605.

      Tarassov, K., Messier, V., Landry, C.R., Radinovic, S., Molina, M.M.S., Shames, I., Malitskaya, Y., Vogel, J., Bussey, H., and Michnick, S.W. (2008). An in Vivo Map of the Yeast Protein Interactome. Science 320, 1465–1470.

      Yu, H., Braun, P., Yıldırım, M.A., Lemmens, I., Venkatesan, K., Sahalie, J., Hirozane-Kishikawa, T., Gebreab, F., Li, N., Simonis, N., et al. (2008). High-Quality Binary Protein Interaction Map of the Yeast Interactome Network. Science 322, 104–110.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      The manuscript "A large accessory protein interactome is rewired across environments" by Liu et al. scales up a previously-described method (PPiSeq) to test a matrix of ~1.6 million protein pairs of direct protein-protein interactions in each of 9 different growth environments.

      While the study found a small fraction of immutable PPIs that are relatively stable across environments, the vast majority were 'mutable' across environments. Surprisingly, PPIs detected only in one environment made up more than 60% of the map. In addition to a false positive fraction that can yield apparently-mutable interactions, retest experiments demonstrate (not surprisingly) that environment-specificity can sometimes be attributed to false-negatives. The study authors predict that the whole subnetwork within the space tested will contain 11K true interactions.

      Much of environment-specific rewiring seemed to take place in an 'accessory module', which surrounds the core module made of mostly immutable PPIs. A number of interesting network clustering and functional enrichment analyses are performed to characterize the network overall and 'mutable' interactions in particular. The study report other global properties such as expression level, protein abundance and genetic interaction degree that differ between mutable and immutable PPIs. One of the interesting findings was evidence that many environmentally mutable PPI changes are regulated post-translationally. Finally, authors provide a case study about network rewiring related to glucose transport.

      Major issues

      -The results section should more prominently describe the dimensions of the matrix screen, both in terms of the set of protein pairs attempted and the set actually screened (I think this was 1741 x 1113 after filtering?). More importantly, the study should acknowledge in the introduction that this was NOT a random sample of protein pairs, but rather focused on pairs for which interaction had been previously observed in the baseline condition. This major bias has a potentially substantial impact on many of the downstream analyses. For example, any gene which was not expressed under the conditions of the original Tarrasov et al. study on which the screening space was based will not have been tested here. Thus, the study has systematically excluded interactions involving proteins with environment-dependent expression, except where they happened to be expressed in the single Tarrasov et al. environment. Heightened connectivity within the 'core module' may result from this bias, and if Tarrasov et al had screened in hydrogen peroxide (H2O2) instead of SD media, perhaps the network would have exhibited a code module in H2O2 decorated by less-densely connected accessory modules observed in other environments. The paper should clearly indicate which downstream analyses have special caveats in light of this design bias.

      -Related to the previous issue, a quick look at the proteins tested (if I understood them correctly) showed that they were enriched for genes encoding the elongator holoenzyme complex, DNA-directed RNA polymerase I complex, membrane docking and actin binding proteins, among other functional enrichments. Genes related to DNA damage (endonuclease activity and transposition), were depleted. It was unclear whether the functional enrichment analyses described in the paper reported enrichments relative to what would be expected given the bias inherent to the tested space?

      -Re: data quality. To the study's great credit, they incorporated positive and random reference sets (PRS and RRS) into the screen. However, the results from this were concerning: Table SM6 shows that assay stringency was set such that between 1 and 3 out of 67 RRS pairs were detected. This specificity would be fine for an assay intended for retest or validate previous hits, where the prior probability of a true interaction is high, but in large-scale screening the prior probability of true interactions that are detectable by PCA is much lower, and a higher specificity is needed to avoid being overwhelmed by false positives. Consider this back of the envelope calculation: Let's say that the prior probability of true interaction is 1% as the authors' suggest (pg 49, section 6.5), and if PCA can optimistically detect 30% of these pairs, then the number of true interactions we might expect to see in an RRS of size 67 is 1% 30% 67 = 0.2 . This back of the envelope calculation suggests that a stringency allowing 1 hit in RRS will yield 80% [ (1 - 0.2) / 1 ] false positives, and a stringency allowing 3 hits in RRS will yield 93% [ (3 - 0.2) / 3] false positives. How do the authors reconcile these back of the envelope calculations from their PRS and RRS results with their estimates of precision?

      -Methods for estimating precision and recall were not sufficiently well described to assess. Precision vs recall plots would be helpful to better understand this tradeoff as score thresholds were evaluated.

      -Within the tested space, the Tarassov et al map and the current map could each be compared against a common 'bronze standard' (e.g. literature curated interactions), at least for the SD map, to have an idea about how the quality of the current map compares to that of the previous PCA map. Each could also be compared with the most recent large-scale Y2H study (Yu et al).

      • Experimental validation of the network was done by conventional PCA. However, it should be noted that this is a form of technical replication of the DHFR-based PCA assay, and not a truly independent validation. Other large-scale yeast interaction studies (e.g., Yu et al, Science 2008) have assessed a random subset of observed PPIs using an orthogonal approach, calibrated using PRS and RRS sets examined via the same orthogonal method, from which overall performance of the dataset could be determined.

      -The Venn diagram in Figure 1G was not very informative in terms of assessing the quality of data. It looks like there is a relatively little overlap between PPIs identified in standard conditions (SD media) in the current study and those of the previous study using a very similar method. Is there any way to know how much of this disagreement can be attributed to each screen being sub-saturation (e.g. by comparing replica screens) and what fraction to systematic assay or environment differences?

      -In Figure S5C, the environment-specificity rate of PPIs might be inflated due to the fact that authors only test for the absence of SD hits in other conditions, and the SD condition is the only condition that has been sampled twice during the screening. What would be the environment-specific verification rate if sample hits from each environment were tested in all environments? This seems important, as robustly detecting environment-specific PPIs is one of the key points of the study.

      Minor issues

      -Re: "An interaction between the proteins reconstitutes mDHFR, providing resistance to the drug methotrexate and a growth advantage that is proportional to the PPI abundance" (pg 2). It may be more accurate to say "monotonically related" than "proportional" here. Fig 2 from the cited Freschi et al ref does suggests linearity with colony size over a wide range of inferred complex abundances, but non-linear at low complex abundance. Also note that Freschi measured colony area which is not linear with exponential growth rate nor with cell count. -Re: "Using putatively positive and negative reference sets, we empirically determined astatistical threshold for each environment with the best balance of precision and recall (positive predictive value (PPV) > 61% in SD media, Methods, section 6)." (pg 3). Should state the recall at this PPV.

      -Authors could discuss the extent to which related methods (e.g. PMID: 28650476, PMID: 27107012, PMID: 29165646, PMID: 30217970) would be potentially suitable for screening in different environments.

      • the term "mutable" is certainly appropriate according to the dictionary definition of changeable. The authors may wish to consider though, that in a molecular biology context the term evokes changeability by mutation (a very interesting but distinct topic). Maybe another term (environment-dependent interactions or ePPIs?) would be clearer. Of course this is the authors' call.

      -Some discussion is warranted about the phenomenon that a PPI that is unchanged in abundance could appear to change because of statistical significance thresholds that differ between screens. This would be a difficult question for any such study, and I don't think the authors need to solve it, but just to discuss.

      -More discussion would be helpful about the idea that immutability may to some extent favor interactions that PCA is better able to detect (possibly including membrane proteins?)

      -Re: "As might be expected, we also found that mutable hubs, but not non-hubs, are more likely to participate in multiple protein complexes than less mutable proteins." (pg 6) This is a cool result. To what extent was this result driven by members of one or two complexes? If so, it would worth noting them.

      -Re: "Borrowing a species richness estimator from ecology (Jari Oksanen et al., 2019), we estimate that there are ~10,840 true interactions within our search space across all environments, ~3-fold more than are detected in SD (note difference to Figure 3, which counts observed PPIs)." (pg 8) Should note that this only allows estimation of the number of interactions that are detectable by PCA methods. Previous work (Braun et al, 2019) showed that every known protein interaction assay (including PCA approaches) can only detect a fraction of bona fide interactions.

      -Re: "This analysis shows that the number of PPIs present across all environments is much larger than the number observed in a single condition, but that it is feasible to discover most of these new PPIs by sampling a limited number of conditions." (pg 8). The main point is surely correct, but it is worth noting that extrapolation to the number of true interactions depends on the nine chosen environments being representative of all environments. The situation could change under more extreme, e.g., anaerobic, conditions.

      -It stands to reason that proteins expressed in all conditions will yield less mutable interactions, if 'mutability' is primarily due to expression change at the transcriptional level. They should at least discuss that measuring mRNA levels could resolve questions about this. Could use Waern et al G3 2013 data (H202, SD, HU, NaCl) to predict the dynamic interactome purely by node removal, and see how conclusions would change

      -The analysis showing that many interactions are likely due to post-translational modifications is very interesting, but caveats should be discussed. Where heterodimers do not fit the expression-level dependence model, some cases of non-fitting may simply be due to measurement error or non-linearity in the relationship between abundance and fitness.

      -Line numbers would have been helpful to note more specific minor comments

      -Sequence data should be shared via the Short-Read Archive.

      Significance

      Knowledge of protein-protein interactions (PPIs) provides a key window on biological mechanism, and unbiased screens have informed global principles underlying cellular organization. Several genome-scale screens for direct (binary) interactions between yeast proteins have been carried out, and while each has provided a wealth of new hypotheses, each has been sub-saturation. Therefore, even given multiple genome-scale screens our knowledge of yeast interactions remains incomplete. Different assays are better suited to find different interactions, and it is now clear that every assay evaluated thus far is only capable (even in a saturated screen) of detecting a minority of true interactions. More relevant to the current study, no binary interaction screen has been carried out at the scale of millions of protein pairs outside of a single 'baseline' condition.

      The study by Liu et al is notable from a technology perspective in that it is one of several recombinant-barcode approaches have been developed to multiplex pairwise combinations of two barcoded libraries. Although other methods have been demonstrated at the scale of 1M protein pairs, this is the first study using such a technology at the scale of >1M pairs across multiple environments.

      A limitation is that this study is not genome-scale, and the search space is biased towards proteins for which interactions were previously observed in a particular environment. This is perhaps understandable, as it made the study more tractable, but this does add caveats to many of the conclusions drawn. These would be acceptable if clearly described and discussed. There were also questions about data quality and assessment that would need to be addressed.

      Assuming issues can be addressed, this is a timely study on an important topic, and will be of broad interest given the importance of protein interactions and the status of S. cerevisiae as a key testbed for systems biology.

      Reviewers' expertise: Interaction assays, next-generation sequencing, computational genomics. Less able to assess evolutionary biology aspects.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Report on Liu et al. "A large accessory protein interactome is rewired across environments" Liu et al. use a mDHFR-based, pooled barcode sequencing / competitive growth / mild methotrexate selection method to investigate changes of PPI abundance of 1.6 million protein pairs across different 9 growth conditions. Because most PPI screens aim to identify novel PPIs in standard growth conditions, the currently known yeast PPI network may be incomplete. The key concept is to define immutable" PPIs that are found in all conditions and "mutable" PPIs that are present in only some conditions. The assay identified 13764 PPIs across the 9 conditions, using optimized fitness cut offs. Steady PPI i.e. across all environments, were identified in membrane compartments and cell division. Processes associated with the chromosome, transcription, protein translation, RNA processing and ribosome regulation were found to change between conditions. Mutable PPIs are form modules as topological analyses reveals.

      Interestingly, a correlation on intrinsic disorder and PPI mutability was found and postulated as more flexible in the conformational context, while at the same time they are formed by less abundant proteins.

      I appreciate the trick to use homodimerization as an abundance proxy to predict interaction between heterodimers (of proteins that homodimerize). This "mass-action kinetics model" explains the strength of 230 out of 1212 tested heterodimers.

      A validation experiment of the glucose transporter network was performed and 90 "randomly chosen" PPIs that were present in the SD environment were tested in NaCl (osmotic stress) and Raffinose (low glucose) conditions through recording optical density growth trajectories. Hxt5 PPIs stayed similar in the tested conditions, supported by the current knowledge that Hxt5 is highly expressed in stationary phase and under salt stress. In Raffinose, Hxt7, previously reported to increase the mRNA expression, lost most PPIs indicating that other factors might influence Hxt7 PPIs.

      Points for consideration:

      *) A clear definition of mutable and immutable is missing, or could not be found e.g. at page 4 second paragraph.

      *) Approximately half of the PPIs have been identified in one environment. Many of those mutable PPIs were detected in the 16{degree sign}C condition. Is there an explanation for the predominance of this specific environment? What are these PPIs about?

      *) 50 % overall retest validation rate is fair and reflects a value comparable to other large-scale approaches. However what is the actual variation, e.g. between mutable PPIs and immutable or between condition. e.g. at 16{degree sign}C.

      *) What is the R correlation cutoff for PPIs explained in the mass equilibrium model vs. not explained?

      *) 90 "randomly chosen" PPIs for validation. It needs to be demonstrated that these interaction are a random subset otherwise is could also mean cherry picked interactions ...

      *) Figure 4 provides interesting correlations with the goal to reveal properties of mutable and less mutable PPIs. PPIs detected in the PPIseq screen can partially be correlated to co-expression (4A) as well as co-localization. Does it make sense to correlate the co-expression across number of conditions? Are the expression correlation condition specific. In this graph it could be that expression correlation stems from condition 1 and 2 and the interaction takes place in 4 and 5 still leading to the same conclusion ... Is the picture of the co-expression correlation similar when you simply look at individual environments like in S4A?

      *) Figure 4C: Interesting, how dependent are the various categories?

      *) Figure 4 F: When binned in the number of environments in which the PPI was found, the distribution peaks at 6 environments and decreases with higher and lower number of environments. The description /explanation in the text clearly says something else.

      *) Figure 6: I apologize, but for my taste this is not a final figure 6 for this study. Investigation of different environments increases the PPI network in yeast, yes, yet it is very well known that a saturation is reached after testing of several conditions, different methods and even screening repetition (sampling). It does not represent an important outcome. Move to suppl or remove.

      Significance

      Liu et al. increase the current PPI network in yeast and offer a substantial dataset of novel PPIs seen in specific environments only. This resource can be used to further investigate the biological meaning of the PPI changes. The data set is compared to previous DHFR providing some sort of quality benchmarking. Mutable interactions are characterized well. Clearly a next step could be to start some "orthogonal" validation, i.e. beyond yeast growth under methotrexate treatment.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript is clearly written and the figures appropriate and informative. Some descriptions of data analyses are a little dense but reflect what would appear long hard efforts on the part of the authors to identify and control for possible sources of misinterpretation due to sensitivities of parameters in their fitness model. The authors efforts to retest interactions under non-competition conditions allay fears of most concerns that I would have. One problem though that I could not see explicitly addressed was that of potential effects of interactions between methotrexate and the other conditions and how this is controlled for. Specifically, I could be argued that the fact that a particular PPI is observed under a specific condition could have more to do with a synthetic effect of treatment of cells with a drug plus methotrexate. Is this controlled for and how? I raise this because in a chemical genetic screen for fitness it was shown that methotrexate is particularly promiscuous for drug-drug interactions (Hillenmeyer ME ,et al. Science 2008). I tried to think of how this works but couldn't come up with anything immediately. I'd appreciate if the authors would take a crack at resolving this issue. Otherwise I have no further concerns about the manuscript.

      Significance

      Lui et al expand on previous work from the Levy group to explore a massive in vivo protein interactome in the yeast S. cerevisiae. They achieve this by performing screens cross 9 growth conditions, which, with replication, results in a total of 44 million measurements. Interpreting their results based on a fitness model for pooled growth under methotrexate selection, they make the key observation that there is a vastly expanded pool of protein-protein interactions (PPI) that are found under only one or two condition compared to a more limited set of PPI that are found under a broad set of conditions (mutable versus immutable interactors). The authors show that this dichotomy suggests some important features of proteins and their PPIs that raise important questions about functionality and evolution of PPIs. Among these are that mutable PPIs are enriched for cross-compartmental, high disorder and higher rates of evolution and subcellular localization of proteins to chromatin, suggesting roles in gene regulation that are associated with cellular responses to new conditions. At the same time these interactions are not enriched for changes in abundance. These results are in contrast to those of immutable PPIs, which seem to form a core background noise, more determined by changes in abundance than what the authors interpret must be post-translational processes that may drive, for instance, changes in subcellular localization resulting in appearance of PPIs under specific conditions. The authors are also able to address a couple of key issues about protein interactomes, including the controversial Party-date Hub hypothesis of Vidal, in which they could now affirm support for this hypothesis based on their results and notably negative correlation of PPIs to protein abundance for mutable PPIs. Finally, they also addressed the problem of predicting the upper limit of PPIs in yeast, showing the remarkable results that it may be no more than about 2 times the number of proteins expressed by yeast. Such an upper limit is profoundly important to modelling cellular network complexity and, if it holds up, could define a general upper limit on organismal complexity.

      This manuscript is a very important contribution to understanding dynamics of molecular networks in living cells and should be published with high priority.
      
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their close reading and constructive comments on our manuscript. We believe that their insight has substantially strengthened our manuscript. Please find our response/revision plan for each comment below (in blue). Note, because of the substantial changes to the figures and the additional experiments that are we are undertaking, we have not initially revised the text. The proposed textual revisions will be included in the full revision.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The Katz lab has contributed greatly to the field of epigenetic reprogramming over the years, and this is

      another excellent paper on the subject. I enjoyed reviewing this manuscript and don't have any major

      comments/suggestions for improving it. The findings presented are novel and important, the results are clear

      cut, and the writing is clear.

      It's important to stress the novelty of the findings, which build upon previous studies from the same lab (upon

      a shallow look one might think that some of the conclusions were described before, but this is not the case).

      Despite the fact that this system has been studied in depth before, it remained unclear why and how

      germline genes are bookmarked by H3K36 in the embryo, and it wasn't known why germline genes are not

      expressed in the soma.

      To study these questions Carpenter et al. examine multiple phenotypes (developmental aberrations,

      sterility), that they combine with analysis of multiple genetic backgrounds, RNA-seq, CHIP-seq, single

      molecule FISH, and fluorescent transgenes.

      Previous observations from the Katz lab suggested that progeny derived from spr-5;met-2 double mutants

      can develop abnormally. They show here that the progeny of these double mutants (unlike spr-5 and met-2

      single mutants) develop severe and highly penetrate developmental delays, a Pvl phenotype, and sterility.

      They show also that spr-5; met-2 maternal reprogramming prevents developmental delay by restricting

      ectopic MES-4 bookmarking, and that developmental delay of spr-5;met-2 progeny is the result of ectopic

      expression of MES-4 germline genes. The bottom line is that they shed light on how SPR-5, MET-2 and

      MES-4 balance inter-generational inheritance of H3K4, H3K9, and H3K36 methylation, to allow correct

      specification of germline and somatic cells. This is all very important and relevant also to other organisms.

      **(very) Minor comments:**

      -Since the word "heritable" is used in different contexts, it could be helpful to elaborate, perhaps in the

      introduction, on the distinction between cellular memory and transgenerational inheritance.

      We are happy to elaborate on this in the revised manuscript.

      -It might be interesting in the Discussion to expand further about the links between heritable chromatin

      marks and heritable small RNAs. The do hint that the result regarding the silencing of the somatic transgene

      are especially intriguing.

      We are happy to expand this in the revised manuscript.

      Reviewer #1 (Significance (Required)):

      This is an exciting paper which build upon years of important work in the Katz lab. The novelty of the paper

      is in pinpointing the mechanisms that bookmark germline genes by H3K36 in the embryo, and explaining

      why and how germline genes are prevented from being expressed in the soma.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Katz and colleagues examine the interaction between the methyltransferase MES-4 and spr-5; met-2 double

      mutants. Their prior analysis (PNAS, 2014) showed the dramatic enhancement in sterility and development

      for spr-5; met-2; this paper extends that finding by showing these effects depend on MES-4. The results are

      interesting and the genetic interactions dramatic. The examination by RNAseq and ChIP helps move the

      phenotypes into a more molecular analysis. The authors hypothesize that SPR-5 and MET-2 modify

      chromatin of germline genes (MES-4 targets) in somatic cells, and this is required to silence germline genes

      in the soma. A few issues need to be resolved to test these ideas and rule out others.

      **Main comments:**

      The authors' hypothesis is that SPR-5 and MET-2 act directly, to modify chromatin of germline genes (MES-

      4 targets), but alternate hypothesis is that the key regulated genes are i) MES-4 itself and/or ii) known

      regulators of germline gene expression e.g. the piwi pathway. Mis regulation of these factors in the soma

      could be responsible for the phenotypes. Therefore, the authors should analyze expression (smFISH and

      where possible protein stains) for MES-4 and PIWI components in the embryo and larvae of wildtype, double

      and triple mutant strains. These experiments are essential and not difficult to perform.

      In our RNA-seq analysis we see a small elevation of MES-4 itself (average 1.18 log2 fold change across 5 replicates). This does not seem likely to be solely driving such a dramatic phenotype. Nevertheless, it is possible that the small increase in expression of MES-4 itself could be contributing. To determine if MES-4 is being ectopically expressed in spr-5; met-2 double mutants, we have obtained a tag version of MES-4 from Dr. Susan Strome and will use this to examine the localization of MES-4 protein in spr-5; met-2 double mutants. We are definitely interested in the potential interaction between PIWI components and the histone modifying enzymes that we have explored in this study. However, since RNAi of MES-4 is sufficient to rescue the developmental delay of spr-5; met-2 mutants, we have chosen to focus on that interaction in this paper. In the future, we hope to examine the role of PIWI components in this system.

      A second aspect of the hypothesis is that spr-5 and met-2 act before mes-4 and that while these genes are

      maternally expressed, they act in the embryo. There really aren't data to support these ideas - the timing and

      location of the factors' activities have not been pinned down. One way to begin to address this question

      would be to perform smFISH on the target genes and on mes-4 in embryos and determine when and where

      changes first appear. smFISH in embryos is critical - relying on L1 data is too late. If timing data cannot be

      obtained, then I suggest that the authors back off of the timing ideas or at least explain the caveats.

      Certainly, figure 8 should be simplified and timing removed. (note: Typical maternal effect tests probably

      won't work because if the genes' RNAs are germline deposited, then a maternal effect test will reflect when

      the RNA is expressed but not when the protein is active. A TS allele would be needed, and that may not be

      available.)

      To determine the timing of the ectopic expression of MES-4 targets, we have performed smFISH on two MES-4 targets in embryos. Thus far, these experiments show that MES-4 targets are ectopically expressed in the embryo, but only after the maternal to zygotic transition. This is consistent with our proposed model. A figure containing this data will be added to the revised manuscript. In addition, our model is predicated on the known embryonic protein localization of SPR-5 and MES-4. Maternal SPR-5 protein is present in the early embryo up to around the 8-cell stage, but absent in later embryos (Katz et al., 2009). In addition, in mice, the SPR-5 ortholog LSD1 is required maternally prior to the 2-cell stage (Wasson et al., 2016 and Ancelin et al., 2016). In contrast, MES-4 continues to be expressed in the embryo until later embryonic stages where it is concentrated into the germline precursors Z2 and Z3 (Fong et al., 2002). This is consistent with SPR-5 establishing a chromatin state that continues to be antagonized by MES-4. There is evidence that MET-2 is expressed both in early embryos and later embryos. However, since the phenotype of MET-2 so closely resembles the phenotype of SPR-5 (Kerr et al., 2014), we have included it in our model as working with SPR-5. Further experimentation will be required to substantiate the model, but we believe the model is consistent with all of the current data.

      Writing/clarity:

      -It would be helpful to include a table that lists the specific genes studied in the paper and how they behaved

      in the different assays e.g. RNAseq 1, RNAseq 2, MES-4 target, ChIP. That way, readers will understand

      each of the genes better.

      We are happy to include a table in the revised manuscript.

      -At the end of each experiment, it would be helpful to explain the conclusion and not wait until the

      Discussion. For readers not in the field, the logic of the Results section is hard to follow.

      This seems like a stylistic choice. Traditionally, papers did not include any conclusions in the results section, and it is our preference to keep our paper organized this way. However, if the reviewer would still like us to change this, we are happy to do so.

      -The model is explained over three pages in the Discussion. It would be great to begin with a single

      paragraph that summarizes the model/point of the paper simply and clearly.

      The discussion in the revised manuscript will altered to include this.

      **Specific comments:**

      -Figure 1 has been published previously and should be moved to the supplement.

      In our original paper (Kerr et al.) we reported in the text that spr-5; met-2 mutants have a developmental delay. However, we did not characterize this developmental delay. Nor did we include any images of the double mutants, except for one image of the adult germline phenotype. As a result, we believe that the inclusion of the developmental delay in the main body of this manuscript is warranted.

      -Cite their prior paper for the vulval defects e.g. page 6 or show in supplement.

      We are happy to include a citation of our previous paper for the vulval defects in the revised manuscript.

      -The second RNAseq data should be shown in the Results since it is much stronger. The first RNAseq,

      which is less robust, should be moved to supplement.

      The revised manuscript will include this alteration.

      -Figure 3 is very nice. Please explain why the RNAs were picked (+ the table, see comment above), and

      please add here or in a new figure mes-4 and piwi pathway expression data in wildtype vs double/triple

      mutants.

      We performed RT-PCR on 9 MES-4 targets. These 9 targets were picked because they had the highest ectopic expression in spr-5; met-2 mutants and largest change in H3K36me3 in spr-5; met-2 mutants versus Wild Type. Amongst these 9 genes, we performed smFISH on htp-1 and cpb-1 because they are relatively well characterized as germline genes.

      The revised manuscript will include added panels to supplemental figure 2 showing the expression of PIWI pathway components.

      -Figure 3 here or later, please show if mes-4 RNAi removes somatic expression of target genes.

      We are currently carrying out this experiment. Once it is completed, the data will hopefully be added to the paper.

      -Is embryogenesis delayed?

      Embryogenesis seems to be sped up in spr-5; met-2 mutants. A supplemental figure will be added to the revised manuscript showing this. It is unclear why embryogenesis is sped up. However, this confirms that the developmental delay is unique to the L1/L2 stages.

      -Figure 4 since htp-1 smFISH is so dramatic, it would be helpful to include htp-1 in the lower panels.

      htp-1 will be added to the lower panels in the revised manuscript.

      -Figure 4, please add an extra 2 upper panels showing all the genes in N2 vs spr-5;met-2, for comparison to

      the mes-4 cohort.

      As a control, we will add panels showing a comparison to all germline genes, excluding MES-4 targets. This new data shows that germline genes that are not MES-4 targets do not have ectopic H3K36me3. This data, which further suggests that the phenomenon is confined to MES-4 targets, is consistent with our results showing that MES-4 RNAi is sufficient to suppress the developmental delay.

      -Figure 6. Please show a control that met-1 RNAi is working.

      We performed RT-PCR to try and confirm that met-1 RNAi was working. Despite controls repeating the MES-4 suppression and verifying that RNAi was working, we were unable to demonstrate that met-1 was knocked down. As a result, we will remove this result from the paper. Importantly, this does not affect the conclusion of the paper.

      -To quantify histone marks more clearly, it would be wonderful to have a graph of the mean log across the

      gene. showing the mean numbers would help clarify the degree of the effect. we had an image as an

      example but it does not paste into the reviewer box. Instead, see figure 2 or figure 4

      here: https://www.nature.com/articles/ng.322

      We will attempt to include this analysis in the revised manuscript.

      Reviewer #2 (Significance (Required)):

      Katz and colleagues examine the interaction between the methyltransferase MES-4 and spr-5; met-2 double

      mutants. Their prior analysis (PNAS, 2014) showed the dramatic enhancement in sterility and development

      for spr-5; met-2; this paper extends that finding by showing these effects depend on MES-4. The results are

      interesting and the genetic interactions dramatic. The examination by RNAseq and ChIP helps move the

      phenotypes into a more molecular analysis.

      This work will be of interest to people following transgenerational inheritance, generally in the C. elegans

      field. People using other organisms may read it also, although some of the worm genetics may be

      complicated. Some of the writing suggestions could make a difference.

      I study C. elegans embryogenesis, chromatin and inheritance.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In the paper entitled "C. elegans establishes germline versus soma by balancing inherited histone

      methylation" Carpenter BS et al examined a double mutant worm strain they had previously produced of the

      H3K4me1/2 demethylase spr-5 and the predicted H3K9me1/me2 methylase met-2. These mutant worms

      have a developmental delay that arises by the L2 larval stage. They performed an analysis of what genes

      get misexpressed in these double mutants by performing RNAseq and compare this to datasets generated

      from other labs on an H3K36me2/me3 methylase MES-4 where they see a high degree of overlap. They

      validate the misexpression of some germline specific genes in the soma by in situ and validate that there is a

      dysregulation of H3K36me3 in their double mutant worms. They further find that knocking down mes-4

      reverts the developmental delay.

      I think that the authors need to make more of an effort to be a bit more scholarly in terms of placing their

      work in the context of the field as a whole and also need to add a few additional experiments as well as

      reorganize a bit before this is ready for publication. Remember that the average reader is not necessarily an

      expert in C. elegans or this particular field and you really want to try and make the manuscript as accessible

      to everyone as possible.

      **Major Points**

      1)It would be good to see western blots or quantitative mass spec examining H3K36me3 in the WT and spr-

      5;met-2 double mutant worms. I believe this was also previously reported by Greer EL et al Cell Rep 2014 in

      the single spr-5 mutant worm so that work should be cited here in addition to the identification of JMJD-2 as

      an enzyme involved in the inheritance of H3K4me2 phenotype.

      The ectopic H3K36me3 is confined to a small set of MES-4 targets. We don’t even see ectopic H3K36me3 at non-MES-4 germline genes (see above). Therefore, we don’t expect to see any global differences in bulk H3K36me3. Greer et al reported that there are elevated H3K36me3 levels in spr-5 mutants. This discrepancy may be due to different stages (embryos, germline) present in their bulk preparation. Alternatively, the met-2 mutant may counteract the effect of the spr-5 mutation on H3K36me3. Regardless, we believe that the genome-wide ChIP-seq is more informative than bulk H3K36me3 levels.

      We will add a citation for the Greer paper in the revised manuscript.

      2)Missing from Fig.5 is mes-4 KD by itself. This is needed to determine whether these effects are specific to

      the spr-5;met-2 double mutants or more general effects that KD of mes-4 would decrease the expression of

      all these genes to a similar extent. Then statistics should be done to see if the decrease in the WT context is

      the same or greater than the decrease in the double mutants.

      The MES-4 targets are generally expressed only in the germline and defined by having mes-4 dependent H3K36me3. Knocking down mes-4 would be expected to prevent the expression of these genes in the germline, but this is difficult to test because mes-4 mutants basically don’t make a germline. Regardless, knocking down mes-4 by itself would only assess the role of MES-4 in germline transcription, not the ectopic expression that is being assayed in spr-5; met-2 mutants in Fig 5. Importantly, it remains possible that spr-5; met-2 mutants might also result in an increase in the expression of MES-4 targets in the germline. However, the experiments performed in this manuscript were conducted on L1 larvae, which do not have any germline expression, to eliminate this potential confounding contribution.

      **Minor Points**

      1)A greater attempt needs to be made to be more scholarly for citing previously published literature. This

      includes work on the inheritance of H3K27 and H3K36 methylation in C. elegans and other species as well.

      A few papers which seem germane to this story which should be cited in the intro are (Nottke AC et al PNAS

      2011, Gaydos LJ et al Science 2014, Ost A et al Cell 2014, Greer EL et al Cell Rep 2014, Siklenka K et al

      Science 2015, Tabuchi TM et al Nat Comm 2018, Kaneshiro KR et al Nat Comm 2019). This problem is not

      restricted to the intro.

      Although many of these excellent papers are broadly relevant to this current work, they are not necessarily directly relevant to this paper. For this reason, they were not originally cited. Nevertheless, we will attempt to cite these papers in the revised version when possible.

      2)I think that the authors need to be a little less definitive with your language. Theories should be introduced

      as possibilities rather than conclusions. Should remove "comprehensive" from intro as there are many other

      methods which could be done to test this.

      Throughout the manuscript, we have tried to be clear what the data suggests versus what is model based on the data. Nevertheless, to further clarify this, we are happy to remove “comprehensive” from the intro.

      3)The authors should describe what PIE-1 is. Is this a transcription factor?

      PIE-1 is a transcriptional inhibitor that is thought to block RNA polII elongation by mimicking the CTD of RNA polII and competing for phosphorylation. We are happy to add a reference to this function in the revised manuscript.

      4)The language needs clarification about MES-4 germline genes and bookmark genes. Are these bound by

      MES-4 or marked with K36me2/3?

      The revised manuscript will be modified to make this definition more clear.

      5)I think Fig S1 E+F should be in the main figure 1 so readers can see the extent of the phenotype.

      The original single image of the spr-5; met-2 adult germline phenotype (including the protruding vulva) was included in our previous publication. In this manuscript, we have now quantified this phenotype, which is why it is included in the supplement here. However, because the original picture was included in our original publication, we prefer to leave it as supplemental.

      6)For Fig S2 it would be good to do the same statistics that is done in Fig 2 and mention them in the text so

      the readers can see that the overlap is statistically significant.

      We are happy to include these statistics in the revised manuscript.

      7)Fig S2.2 should be yellow blue rather than red green for the colorblind out there.

      Thanks for pointing this out. We are happy to change the colors in the revised manuscript.

      8)When saying "Many of these genes involved in these processes..." the authors need to include numbers

      and statistics.

      We will amend the revised text to make the definition of the MES-4 genes more clear.

      9)Should use WT instead of N2 and specify what wildtype is in methods.

      We will use WT instead of N2 in the revised manuscript.

      10)Fig. 2A + B could be displayed in a single figure. And Fig 2D seems superfluous and could be combined

      with 2C or alternatively it could be put in supplementary.

      Figure 2A and 2B were purposely separated to make it clear how many of the overlapped changes are up versus down. In the revised manuscript, Figure

      2D will be moved to the supplement.

      11)Non-C. elegans experts won't understand what balancers are. An effort should be made to make this

      accessible to all. Explaining when genes are heterozygous or homozygous mutants seems relevant

      here.

      The text of the revised manuscript will be amended to make it more accessible for non-C. elegans readers.

      12)The GO categories (Fig. S2) should be in the main figure and need to be made to look more scientific

      rather than copied and pasted from a program.

      The GO categories were included to be comprehensive and do not contribute substantially to the main conclusion of the paper. This is why they are supplemental. In the revised manuscript, we will edit the GO results so that they look more scientific.

      13)Fig. 7 seems a bit out of place. If the authors were to KD mes-4 and similarly show that the phenotype

      reverts that would help justify its inclusion in this paper. Without it seems like a bit of an add on that belongs

      elsewhere.

      We believe that the somatic expression of a transgene in spr-5; met-2 mutants adds to our potential understanding of how this double mutant may lead to developmental delay. This is true, regardless of whether of whether the somatic transgene expression is mes-4 dependent or not.

      Reviewer #3 (Significance (Required)):

      I think this is an interesting and timely piece of work. A little more effort needs to be put in to make sure it is

      accessible to the average reader and has sufficient inclusion of more of the large body of work on

      inheritance of histone modifications. I think C. elegans researchers as well as people interested in

      inheritance and the setup of the germline will be interested in this work.

      REFEREES CROSS COMMENTING

      I agree with Reviewer #2's comments on experiments to include or exclude alternative models. I also agree

      about their statement about rewriting to make it more accessible to others who aren't experts in this

      specialized portion of C. elegans research. All in all it seems like the experiments which are required by

      reviewer #2 and myself as well as the rewriting should be quite feasible.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In the paper entitled "C. elegans establishes germline versus soma by balancing inherited histone methylation" Carpenter BS et al examined a double mutant worm strain they had previously produced of the H3K4me1/2 demethylase spr-5 and the predicted H3K9me1/me2 methylase met-2. These mutant worms have a developmental delay that arises by the L2 larval stage. They performed an analysis of what genes get misexpressed in these double mutants by performing RNAseq and compare this to datasets generated from other labs on an H3K36me2/me3 methylase MES-4 where they see a high degree of overlap. They validate the misexpression of some germline specific genes in the soma by in situ and validate that there is a dysregulation of H3K36me3 in their double mutant worms. They further find that knocking down mes-4 reverts the developmental delay.

      I think that the authors need to make more of an effort to be a bit more scholarly in terms of placing their work in the context of the field as a whole and also need to add a few additional experiments as well as reorganize a bit before this is ready for publication. Remember that the average reader is not necessarily an expert in C. elegans or this particular field and you really want to try and make the manuscript as accessible to everyone as possible.

      Major Points

      1)It would be good to see western blots or quantitative mass spec examining H3K36me3 in the WT and spr-5;met-2 double mutant worms. I believe this was also previously reported by Greer EL et al Cell Rep 2014 in the single spr-5 mutant worm so that work should be cited here in addition to the identification of JMJD-2 as an enzyme involved in the inheritance of H3K4me2 phenotype.

      2)Missing from Fig.5 is mes-4 KD by itself. This is needed to determine whether these effects are specific to the spr-5;met-2 double mutants or more general effects that KD of mes-4 would decrease the expression of all these genes to a similar extent. Then statistics should be done to see if the decrease in the WT context is the same or greater than the decrease in the double mutants.

      Minor Points

      1)A greater attempt needs to be made to be more scholarly for citing previously published literature. This includes work on the inheritance of H3K27 and H3K36 methylation in C. elegans and other species as well. A few papers which seem germane to this story which should be cited in the intro are (Nottke AC et al PNAS 2011, Gaydos LJ et al Science 2014, Ost A et al Cell 2014, Greer EL et al Cell Rep 2014, Siklenka K et al Science 2015, Tabuchi TM et al Nat Comm 2018, Kaneshiro KR et al Nat Comm 2019). This problem is not restricted to the intro.

      2)I think that the authors need to be a little less definitive with your language. Theories should be introduced as possibilities rather than conclusions. Should remove "comprehensive" from intro as there are many other methods which could be done to test this.

      3)The authors should describe what PIE-1 is. Is this a transcription factor?

      4)The language needs clarification about MES-4 germline genes and bookmark genes. Are these bound by MES-4 or marked with K36me2/3?

      5)I think Fig S1 E+F should be in the main figure 1 so readers can see the extent of the phenotype.

      6)For Fig S2 it would be good to do the same statistics that is done in Fig 2 and mention them in the text so the readers can see that the overlap is statistically significant.

      7)Fig S2.2 should be yellow blue rather than red green for the colorblind out there.

      8)When saying "Many of these genes involved in these processes..." the authors need to include numbers and statistics.

      9)Should use WT instead of N2 and specify what wildtype is in methods.

      10)Fig. 2A + B could be displayed in a single figure. And Fig 2D seems superfluous and could be combined with 2C or alternatively it could be put in supplementary.

      11)Non-C. elegans experts won't understand what balancers are. An effort should be made to make this accessible to all. Explaining when genes are heterozygous or homozygous mutants seems relevant here.

      12)The GO categories (Fig. S2) should be in the main figure and need to be made to look more scientific rather than copied and pasted from a program.

      13)Fig. 7 seems a bit out of place. If the authors were to KD mes-4 and similarly show that the phenotype reverts that would help justify its inclusion in this paper. Without it seems like a bit of an add on that belongs elsewhere.

      Significance

      I think this is an interesting and timely piece of work. A little more effort needs to be put in to make sure it is accessible to the average reader and has sufficient inclusion of more of the large body of work on inheritance of histone modifications. I think C. elegans researchers as well as people interested in inheritance and the setup of the germline will be interested in this work.

      REFEREES CROSS COMMENTING

      I agree with Reviewer #2's comments on experiments to include or exclude alternative models. I also agree about their statement about rewriting to make it more accessible to others who aren't experts in this specialized portion of C. elegans research. All in all it seems like the experiments which are required by reviewer #2 and myself as well as the rewriting should be quite feasible.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Katz and colleagues examine the interaction between the methyltransferase MES-4 and spr-5; met-2 double mutants. Their prior analysis (PNAS, 2014) showed the dramatic enhancement in sterility and development for spr-5; met-2; this paper extends that finding by showing these effects depend on MES-4. The results are interesting and the genetic interactions dramatic. The examination by RNAseq and ChIP helps move the phenotypes into a more molecular analysis. The authors hypothesize that SPR-5 and MET-2 modify chromatin of germline genes (MES-4 targets) in somatic cells, and this is required to silence germline genes in the soma. A few issues need to be resolved to test these ideas and rule out others.

      Main comments:

      The authors' hypothesis is that SPR-5 and MET-2 act directly, to modify chromatin of germline genes (MES-4 targets), but alternate hypothesis is that the key regulated genes are i) MES-4 itself and/or ii) known regulators of germline gene expression e.g. the piwi pathway. Mis regulation of these factors in the soma could be responsible for the phenotypes. Therefore, the authors should analyze expression (smFISH and where possible protein stains) for MES-4 and PIWI components in the embryo and larvae of wildtype, double and triple mutant strains. These experiments are essential and not difficult to perform.

      A second aspect of the hypothesis is that spr-5 and met-2 act before mes-4 and that while these genes are maternally expressed, they act in the embryo. There really aren't data to support these ideas - the timing and location of the factors' activities have not been pinned down. One way to begin to address this question would be to perform smFISH on the target genes and on mes-4 in embryos and determine when and where changes first appear. smFISH in embryos is critical - relying on L1 data is too late. If timing data cannot be obtained, then I suggest that the authors back off of the timing ideas or at least explain the caveats. Certainly, figure 8 should be simplified and timing removed. (note: Typical maternal effect tests probably won't work because if the genes' RNAs are germline deposited, then a maternal effect test will reflect when the RNA is expressed but not when the protein is active. A TS allele would be needed, and that may not be available.)

      Writing/clarity:

      -It would be helpful to include a table that lists the specific genes studied in the paper and how they behaved in the different assays e.g. RNAseq 1, RNAseq 2, MES-4 target, ChIP. That way, readers will understand each of the genes better.

      -At the end of each experiment, it would be helpful to explain the conclusion and not wait until the Discussion. For readers not in the field, the logic of the Results section is hard to follow.

      -The model is explained over three pages in the Discussion. It would be great to begin with a single paragraph that summarizes the model/point of the paper simply and clearly.

      Specific comments:

      -Figure 1 has been published previously and should be moved to the supplement.

      -Cite their prior paper for the vulval defects e.g. page 6 or show in supplement.

      -The second RNAseq data should be shown in the Results since it is much stronger. The first RNAseq, which is less robust, should be moved to supplement.

      -Figure 3 is very nice. Please explain why the RNAs were picked (+ the table, see comment above), and please add here or in a new figure mes-4 and piwi pathway expression data in wildtype vs double/triple mutants.

      -Figure 3 here or later, please show if mes-4 RNAi removes somatic expression of target genes.

      -Is embryogenesis delayed?

      -Figure 4 since htp-1 smFISH is so dramatic, it would be helpful to include htp-1 in the lower panels.

      -Figure 4, please add an extra 2 upper panels showing all the genes in N2 vs spr-5;met-2, for comparison to the mes-4 cohort.

      -Figure 6. Please show a control that met-1 RNAi is working.

      -To quantify histone marks more clearly, it would be wonderful to have a graph of the mean log across the gene. showing the mean numbers would help clarify the degree of the effect. we had an image as an example but it does not paste into the reviewer box. Instead, see figure 2 or figure 4 here: https://www.nature.com/articles/ng.322

      Significance

      Katz and colleagues examine the interaction between the methyltransferase MES-4 and spr-5; met-2 double mutants. Their prior analysis (PNAS, 2014) showed the dramatic enhancement in sterility and development for spr-5; met-2; this paper extends that finding by showing these effects depend on MES-4. The results are interesting and the genetic interactions dramatic. The examination by RNAseq and ChIP helps move the phenotypes into a more molecular analysis.

      This work will be of interest to people following transgenerational inheritance, generally in the C. elegans field. People using other organisms may read it also, although some of the worm genetics may be complicated. Some of the writing suggestions could make a difference.

      I study C. elegans embryogenesis, chromatin and inheritance.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The Katz lab has contributed greatly to the field of epigenetic reprogramming over the years, and this is another excellent paper on the subject. I enjoyed reviewing this manuscript and don't have any major comments/suggestions for improving it. The findings presented are novel and important, the results are clear cut, and the writing is clear.

      It's important to stress the novelty of the findings, which build upon previous studies from the same lab (upon a shallow look one might think that some of the conclusions were described before, but this is not the case). Despite the fact that this system has been studied in depth before, it remained unclear why and how germline genes are bookmarked by H3K36 in the embryo, and it wasn't known why germline genes are not expressed in the soma.

      To study these questions Carpenter et al. examine multiple phenotypes (developmental aberrations, sterility), that they combine with analysis of multiple genetic backgrounds, RNA-seq, CHIP-seq, single molecule FISH, and fluorescent transgenes.

      Previous observations from the Katz lab suggested that progeny derived from spr-5;met-2 double mutants can develop abnormally. They show here that the progeny of these double mutants (unlike spr-5 and met-2 single mutants) develop severe and highly penetrate developmental delays, a Pvl phenotype, and sterility. They show also that spr-5; met-2 maternal reprogramming prevents developmental delay by restricting ectopic MES-4 bookmarking, and that developmental delay of spr-5;met-2 progeny is the result of ectopic expression of MES-4 germline genes. The bottom line is that they shed light on how SPR-5, MET-2 and MES-4 balance inter-generational inheritance of H3K4, H3K9, and H3K36 methylation, to allow correct specification of germline and somatic cells. This is all very important and relevant also to other organisms.

      (very) Minor comments:

      -Since the word "heritable" is used in different contexts, it could be helpful to elaborate, perhaps in the introduction, on the distinction between cellular memory and transgenerational inheritance.

      -It might be interesting in the Discussion to expand further about the links between heritable chromatin marks and heritable small RNAs. The do hint that the result regarding the silencing of the somatic transgene are especially intriguing.

      Significance

      This is an exciting paper which build upon years of important work in the Katz lab. The novelty of the paper is in pinpointing the mechanisms that bookmark germline genes by H3K36 in the embryo, and explaining why and how germline genes are prevented from being expressed in the soma.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the three reviewers for providing insightful critiques on our manuscript.

      Changes to document and comments made are marked e.g. “Reply 1.1” (referring the Reviewer #1 item #1, etc.) as described below.

      Reviewer #1

      I found this study to be very convincing. Prior studies are referenced appropriately, the text is well written and clear, the figures are clear also. In my opinion the paper does not need further experiment.

      [1.1] The conclusions are well supported by the data. However, the concatenation model seems very speculative at this point. Also, it does not take into account the dynamics of these molecules.

      Reply 1.1: The concatenation model combines the structural data from our manuscript with prior biochemical insights into tetraspanin homodimerization and with scanning-EM data on immunogold-labeled CD81 and CD9 on cells. It is not completely clear to us what reviewer #1 refers to with “the dynamics of these molecules”. The cryo-EM data revealed that CD9 - EWI-F is a dynamic complex with straight and bent conformations, which could account for both circular and linear arrangements of tetraspanin-microdomains in cell membranes through the higher-order oligomerization of stable CD9 - EWI-F tetramers. Moreover, transient CD9 - CD9 interactions likely yield a variable number of complexes present in these concatenated and flexible strings of complexes. Such a concatenation model indeed requires further validation. However, it is consistent with experimental data and, importantly, provides a long-awaited molecular basis for TEM assembly. Although it was not within the scope of the current study, it will be of great interest to further investigate the concatenation model through detailed cell-biology based approaches.

      **Minor comment:**

      [1.2] There seems to be a mix up between the two structures in the following sentence p4: "In CD9EC2 - 4C8, the D loop adopts a partially helical conformation and central residue F176 is sandwiched by 4E8 residues W59 of CDR2 and W102 and R105 of CDR3 (Fig. 1D). In the 4C8-bound CD9EC2 structure the tip of the D loop points more outward and the Cα atom of F176"

      Reply 1.2: The first sentence indeed mixed up the two structures and wrongfully mentioned CD9EC2 - 4C8 instead of CD9EC2 - 4E8. This has now been updated: “In CD9EC2 - 4E8, the D loop adopts …”

      Reviewer #2

      The paper is well written and the conclusions made are supported by the data presented.

      [2.1] The ternary structure is in agreement with that of CD9 in complex with the related EWI-2 published earlier this year by Umeda et al (ref #25). The present work thus adds little structural insights but may be useful in showing that the interaction pattern seen extends to another EWI protein family member.

      Reply 2.1: We agree with reviewer #2 that that the CD9 - EWI-F structure presented in our work is similar to the CD9 - EWI-2 structure published recently by Umeda et al. (ref #25). However, as also pointed out by reviewer #1, we believe that the CD9 - EWI-F structure adds new important information to understand the molecular mechanism underlying the assembly of tetraspanin-enriched microdomains. Notably, the different conformations of the CD9 - EWI-F complex observed in the cryo-EM data provide structural biology evidence for the dynamic nature of the interaction between a tetraspanin and a partner protein, which is consistent with a wealth of prior biochemical data. Guided by the distinct shape of the CD9EC2 - 4C8 densities, we were able to distinguish a range of straight to bent conformations of the complex. CD9 regions that represent known tetraspanin homo-dimerization sites, orient away from EWI-F and are available for interactions. Thus, combining our structural data with previous biochemical interaction data allowed for the generation of a long-awaited model for the assembly of tetraspanin-microdomains at the molecular level. We believe that these implications for TEM assembly will stimulate new, innovative research into the molecular principles that govern the function of tetraspanins.

      [2.2] As such it may be acceptable for publication. In this case, the authors should improve the quality of Figs. 3D and 4D.

      Reply 2.2: Figures 3D and 4D depict raw cryo-electron microscopy images (micrographs). The protein complexes imaged in this study only contain light atoms (H, N, C, O, S). Therefore, the collected micrographs only reveal low-contrast images of protein particles, and, for a typical cryo-EM experiment, it is required to average particles from thousands of micrographs to obtain a 3-dimensional reconstruction. We would like to keep the raw micrographs in figures 3 and 4, as it will aid cryo-EM scientists in judging the quality of the data.

      Reviewer #3

      The work is technically well performed and clearly presented including methodological details. I just have a few minor comments:

      [3.1] Page 4 and Figure S1: it is hard to see how a reliable affinity for 4E8 can be obtained from the cell binding data in S1A, as there is no indication of saturation. It would be good to at acknowledge that this is at best a rough estimate. Fortunately the data for this nanobody in purified situation seems solid.

      Reply 3.1: The obtained affinities are indeed an ±estimation based on a non-linear regression curve fitting on the measured data, performed in triplicate. The text has been updated and now reads as “4C8 and 4E8 bind to purified, full-length CD9 as well as to endogenous CD9 expressed on HeLa cells with apparent binding affinities in the nanomolar range (Fig. S1A, B, C)”. Next to that, a table stating the calculated KDs has been included as Fig. S1C.

      [3.2] Page 6: Does the absence of micellar density for the EWI-F complex indicate flexibility of the extracellular domain relative to the TM? Does this happen because the classification focuses on the highly elongated Ig region?

      Reply 3.2: These are indeed plausible assumptions. We observed highly heterogeneous, elongated particles in the micrograph shown in Fig. 3D, indicating inter-domain flexibility. If the alignment software focusses on certain Ig-like domains, other regions of the protein complex will be averaged out. An additional complexity with these elongated particles was to select an appropriate box size for particle picking and particle extraction, because the particles differ greatly in size based on their orientation (fully elongated side-views vs. much smaller top-views). When taken together, the complex of CD9 with full-length EWI-F was unsuitable for high-resolution structure determination; the subsequent strategy using EWI-FΔIg1-5 resulted in globular particles with less flexibility (Fig. 4D), which allowed for a more detailed structural characterization of the complex.

      [3.3] Page 8: "Recently, a cryo-EM density map has been reported..." - please reference here.

      Reply 3.3: We added the appropriate reference to the sentence: “Recently, a cryo-EM density map has been reported of CD9 in complex with an EWI-F homolog, EWI-2 (25).”

      [3.4] Relatively little is known about how tetraspanins help to organize partner receptors into defined membrane domains, evidence for which has emerged from super-resolution light microscopy. Based on their structural analysis of the CD9-EWI-F complex, including the heterogeneity apparent in the cryo-EM structure, they propose a feasible concatenation model for higher order oligomerization of these complexes in the membrane. Obviously the model will need to be tested rigorously by mutational analysis, particularly the EWI Ig6 interface, but as it stands the paper is a significant contribution to the field of tetraspanins.

      Reply 3.4: From the 8.6 Å cryo-EM data, the amino-acid residues that form the EWI-F Ig6 dimer interface can indeed not be distinguished. However, our data on CD9 in complex with full-length EWI-F (Fig. 3E) and previous cross-linking data (André et al. In situ chemical cross-linking on living cells reveals CD9P-1 cis-oligomer at cell surface - PMID: 19703604) support that EWI-F forms dimeric assemblies. Regarding the concatenation model, we therefore think that it will be of great interest to establish the putative CD9 - CD9 interactions (identified through biochemical approaches), that would link CD9 - EWI-F tetramers into higher assemblies, in the context of native membranes. However, investigating these transient interactions would require various non-trivial experiments and was therefore not within the scope of the current study.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This paper describes the structure of the tetraspanin CD9 and its interaction with the single pass protein EWI-F. The variability in the D loop of EC2 and the domain swapping is a useful addition to the limited structural database of these proteins and correlates with the relatively poor sequence conservation of this region. The key message is that dimerization of the single pass protein extracellular region, and interaction of its transmembrane helix with the tetraspanin, produces a heterodimeric structure that may further oligomerize. The authors propose a feasible concatenation model for higher order oligomerization of these complexes in the membrane.

      The work is technically well performed and clearly presented including methodological details. I just have a few minor comments:

      Page 4 and Figure S1: it is hard to see how a reliable affinity for 4E8 can be obtained from the cell binding data in S1A, as there is no indication of saturation. It would be good to at acknowledge that this is at best a rough estimate. Fortunately the data for this nanobody in purified situation seems solid.

      Page 6: Does the absence of micellar density for the EWI-F complex indicate flexibility of the extracellular domain relative to the TM? Does this happen because the classification focuses on the highly elongated Ig region?

      Page 8: "Recently, a cryo-EM density map has been reported..." - please reference here.

      Significance

      Relatively little is known about how tetraspanins help to organize partner receptors into defined membrane domains, evidence for which has emerged from super-resolution light microscopy. Based on their structural analysis of the CD9-EWI-F complex, including the heterogeneity apparent in the cryo-EM structure, they propose a feasible concatenation model for higher order oligomerization of these complexes in the membrane. Obviously the model will need to be tested rigorously by mutational analysis, particularly the EWI Ig6 interface, but as it stands the paper is a significant contribution to the field of tetraspanins.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this paper, Dr. Oosterheert and colleagues report the crystal structures of CD9EC2 bound to nanobodies 4C8 and 4E8. The CD9EC2/4C8 structure was useful in determining a low resolution cryo-EM structure of EWI-F in complex with CD9/4C8. The observed sample heterogeneity of this ternary complex was reduced by deleting the n-terminal five Ig domains of EWI-F, yielding a modest maximum global resolution of ~ 8.6 Å. The structural approaches used are standard. The crystallographic and structure refinement statistics are sound as are the cryo-EM image processing. The overall cryo-EM structure of the ternary complex shows a central EWI-F protein dimer flanked by one CD9 molecule on each side. The paper is well written and the conclusions made are supported by the data presented.

      Significance

      The ternary structure is in agreement with that of CD9 in complex with the related EWI-2 published earlier this year by Umeda et al (ref #25). The present work thus adds little structural insights but may be useful in showing that the interaction pattern seen extends to another EWI protein family member. As such it may be acceptable for publication. In this case, the authors should improve the quality of Figs. 3D and 4D.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this article, the authors provide new insights into the structure of the tetraspanin CD9. On the one hand, they provide crystal structures of the large extracellular domain of CD9, alone or bound to two nanobodies. The 3 structures are similar and similar to that of CD81, a related tetraspanin, except for a portion of the molecule, the so-called D-domain, showing flexibility of this domain. On the other hand, they obtained the cryo-EM structure of CD9 in association with a known-partner (EWI-F) with a resolution of 8.6A. More precisely, the complex of CD9 and the full-length EWI-F showed heterogeneity which they interpret as a consequence of the flexibility between the six Ig-like domains of EWI-F, precluding high-resolution structure determination. However, they showed that CD9 still interacted with a molecule lacking the 5 most membrane-distal Ig domains of EWI-F, and obtained the structure using this construct and an anti-CD9 nanobody. This structure reveals a hetero-tetrameric arrangement of CD9-EWIF, with a central EWI-F dimer flanked by a CD9 molecule on each side. CD9 and EWI-F interact through their transmembrane domains and the two truncated EWI-F molecules through the remaining Ig domains. Importantly, CD9 and EWI-F do not make contacts in the extracellular region, and CD9 shows a semi-open conformation. The structure also shows different configurations of the complex.

      I found this study to be very convincing. Prior studies are referenced appropriately, the text is well written and clear, the figures are clear also.

      In my opinion the paper does not need further experiment.

      The conclusions are well supported by the data. However, the concatenation model seems very speculative at this point. Also, it does not take into account the dynamics of these molecules.

      Minor comment:

      There seems to be a mix up between the two structures in the following sentence p4: "In CD9EC2 - 4C8, the D loop adopts a partially helical conformation and central residue F176 is sandwiched by 4E8 residues W59 of CDR2 and W102 and R105 of CDR3 (Fig. 1D). In the 4C8-bound CD9EC2 structure the tip of the D loop points more outward and the Cα atom of F176"

      Significance

      Tetraspanins have been shown over the years to play an essential role in various biological functions. Among them, CD9 which is strongly expressed on the oocyte plasma membrane is essential for sperm-egg fusion. However, the mechanisms by which CD9 regulates this fusion process as well as other cell-cell fusion events remain unknown. The elucidation of its structure and of how it interacts with well characterized partner proteins is clearly a major advance in our understanding of the function of this molecule.

      The absence of a structure for tetraspanins has been for a long time a knowledge gap. Following a breakthrough in 2001 with the publication of the crystal structure of the large extracellular domain of CD81 (Kitadokoro et al., EMBO J 2001), it was only recently that the structure of a full length tetraspanin, again that of CD81, was published (Zimmermann et al., Cell 2016). Earlier this year was published the crystal structure of a truncated version of CD9 as well as the cryo-EM structure of CD9 in association with another molecular partner EWI-2 (Umeda et al.,Nature com 2020).

      The present structure adds new important information such as the existence of different conformation in the large extracellular domain of CD9 or the structure of CD9 with another molecular partner. It also highlights the different configurations of the complex. It will be of interest to researchers interested in tetraspanins, in membrane organization as well as researchers interested in the biological processes regulated by CD9, notably sperm-egg fusion.

      My field of expertise concerns tetraspanins. I cannot comment on the technical aspects of the structures.

  2. Jul 2020
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      1. The first hypothesis of the manuscript is that, rather than a change in a single immune pathway being responsible for the lack of response to the virus, the response will be systemic involving multiple inter-related pathways. The data show that this was the case after presenting convincing transcriptome analysis.

      We thank the reviewer agreeing that we have convincingly shown that the response to the virus is systemic involving the induction of interrelated pathways

      The second hypothesis is that the differences in responses between bats and humans are due to evolutionarily divergent genes. The authors provide evidence for this in the transcriptome differences in the C-reactive protein, aspects of the complement system, iron regulation and M1/M2 macrophage polarization. The second hypothesis is broad, but there are clearly differences in the genes involved in humans and bats. Without mechanistic information on the function of the proteins/cells investigated, it is hard to determine that the changes the authors are observing are the cause of the different responses, rather than an effect of some upstream response, and so difficult to pin-point specific divergent genes.

      We agree that mechanistic studies will be required to test causal links between the genes we identified and specific anti-viral responses, an effort that is likely to require multiple laboratories and some time. The aim of this study was to enable this effort by identifying a list of candidate genes affected by EBOV and MARV infection in bats, not merely in cultured bat cells.

      The authors wish to compare the response to the virus in bats to the better characterized human tissue responses, but because this relies on previously published work in humans, it is sometimes unclear whether "more bat-like" responses are definitely associated with positive outcomes in humans. As the benefit of certain responses in human infections can depend on the timing of the response, it might be helpful to include summarized human data in manuscript to aid comparison with the bat responses.

      We agree and have added the following data and discussion (inserted into Discussion, page 9, and added two new tables, Tables 2 and 3).

      Comparing our observations to human responses to filoviruses is limited by the scarcity of studies in humans. Nevertheless, this comparison suggests potential directions to explore. In one study, individuals who succumbed to the disease showed stronger upregulation of interferon signaling and acute phase responses compared to survivors during the acute phase of infection[1], consistent with the anti-inflammatory response gene expression signature identified in this study in bats. However, most of the genes used in the study by Liu et al. to classify survivors are either barely expressed in bats or do not respond to filoviral infection (Table 2), the differences that provide potential clues to find why bats can tolerate the infection.

      A study of patients infected with Sudan Ebola virus (SUDV) analyzed protein levels for a panel of genes using a Luminex multiplex assay (using antibodies)[2]. The panel was based on results from other studies and pathways involved in the response to infections. The patients were classified into 3 possible dichotomies (fatal/non-fatal, hemorrhaging/non-hemorrhaging, or high/low viremia) correlated with genes that characterized these states. Most of these genes either are barely expressed, if at all, or are unaffected by infection in bats, except for ferritin (FTL, FTH1) whose expression is lowered by MARV infection, consistent with the observation that ferritin is higher is fatal human cases (Table 3).

      For instance, the T-cell response section concludes "Bats mount a T cell response against the infection" but there is no discussion of the impaired but complex lymphocyte response in humans, so comparison is not possible.

      We have expanded the discussion on T cells (Results, page 7) as follows.

      Previous studies on the adaptive immune response to Ebola and Marburg viruses in humans, non-human primates, and non-primate mammals, shows that long-term immunity is conferred by both T cell and antibody responses. Mostly CD8+ T cells were elicited and helpful against Ebola in mice[3],[4], while SUDV infection in humans[5]) and MARV infection in cynomolgus monkeys[6] and humans[7] ) elicited mostly CD4+ T cells . In most human EBOV infections, CD8+ T cells against the EBOV NP protein dominated the responses, while a minority of individuals harbored memory CD8+ T cells against the EBOV-GP [8].

      Consistent with this, in MARV-infected bats, CD4 expression (specific to CD4+ T cells) was higher, while in EBOV-infected bats, CD8 expression (specific to CD8+ T cells) was higher, the overall levels are low, because the tissue samples are heterogenous and expression of these markers is not high in the T cells to begin with. T cell markers (such as CCL3, ANAX1, TIMD4 and MAGT1) are also upregulated in liver, suggesting a T cell response is mounted.

      Mock infected IHC should be included in Figure 1F to demonstrate the antibodies are not background.

      We have added IHC data of two mock-infected animals (Fig. S1 panels A and B).

      See comment in hypotheses- a summarized table of findings from previous studies of early responses to the virus would be helpful for comparisons to the bat response and for determining the second hypothesis.

      We have expanded our comparisons to previous studies by adding the following text to Introduction (page 3)

      A potential source of the difficulty to understand how bats tolerate or eliminate the viruses that are deadly to humans is the lack of studies that analyze the response to infection in bats rather than in cultured bat cells. The results obtained using cell lines have been contradictory. Some studies claim both EBOV and MARV replicate to similar levels in ERB and human derived cell lines[9], with a robust innate immune response mounted by ERB and to a lesser degree, human cells, while others claim MARV inhibited the antiviral program in ERB cells, like in primate cells, and did not induce almost any IFN gene [10], or little anti-viral gene induction[11]. An experiment with the pig (PK15A) and bat (EhKiT) cells suggested they responded to EBOV through the upregulation of immune, inflammatory, and coagulation pathway, in contrast to a limited response in the human (HEK293T) cells[12]. To comprehensively understand the pathways involved in the bat filoviral response, we infected bats, rather than their isolated cells, and analyzed tissue-specific RNA expression through mRNA-seq in the organs of the infected animals.

      Reviewer #2

      1. The authors provide this contribution to the extremely interesting topic of the immunobiology that facilitates filovirus infections of bats without overt pathology. They focused entirely on gene transcription signatures from different tissue sites following experimental infection, and sometimes compare those signatures with those generated in humans following natural exposures to filoviruses. The strengths of the paper is the shear breadth of data generated that is available openly to the scientific community and the development of novel mRNA datasets from bats, in the absence and presence of infection. One of the major limitations of this systems-based approach is that there is no mechanistic data that links gene function to the immune response to filovirus infection. Rather, associations are made and functional links are inferred. This limitation makes the title of the manuscript "...is controlled by a systemic response" an overstatement.

      We thank the reviewer and agree that mechanistic studies were out of scope of this study and have reflected this fact in the title by replacing “is controlled” with “induces”:

      Ebola and Marburg filovirus infection in bats induces a systemic response

      The authors indicate that one of their main objectives is to understand differences in the responses to infection between bats and humans. But this submission says little about the transcriptome-level responses to filovirus infection in humans. It does, on at least one occasion, state that some of the bat genes with altered expression levels were also altered in a study of human filovirus infections (reference #67). I think it would be helpful if the authors devoted a figure or table to the direct comparison between their analysis of MARV- and EBOV-infected bats and the findings of filovirus-infected humans, highlighting genes that are differentially up- or downregulated between the two species.

      This discussion, which was also requested by Reviewer 1, is now included in the manuscript (Discussion page 9 and Tables 2 and 3).

      Figure 2 is not described nor presented usefully. Instead of providing a figure title ""Upset plot..." the authors should clearly describe the type of transcriptomic data being presented. Moreover, it way the data is plotted does not reveal any direct information about the genes that are up- or downregulated in each condition, thus reducing its utility to the reader. I suggest that this Figure be placed in the Supplemental information. In fact, Figures 3 could also be moved to the Supplemental information

      Figure 2 makes that point that the response is a broad one while Figure 3 presents evidence from expression data that there is tissue-specific responses to the viruses. Both together provide convincing evidence of a systemic, wide-ranging response to both MARV and EBOV infections. We have edited the caption to Figure 2 by changing it to the following:

      Figure 2: Broad response of bat liver genes to filoviral infection. Many genes in the liver respond to filoviral infections, with MARV having a bigger impact compared to EBOV (840 genes that are responsive to MARV alone, compared to the 43 specific to EBOV alone). The EBOV-specific (EBOV/MARV) and MARV-specific (MARV/EBOV)genes are likely host responses specific to the viral VP40, VP35 and VP24 genes. In the plot, mock refers to mock-infected bats, EBOV to EBOV-infected bats, and MARV to MARV-infected bat livers. Each row in the lower panel represents a set, there are six sets of genes based on various comparisons, e.g., EBOV/mock is the set of genes at least 2-fold up regulated in EBOV infection, compared to the mock samples. The gray bars at the lower left representing membership in the sets. The vertical blue lines with bulbs represent set intersections, e.g., the last bar is the set of genes common to EBOV/MARV, EBOV/mock and MARV/mock, so the genes in this set are up 2-fold in EBOV compared to the mock and MARV samples, and at least 2-fold up in MARV compared to mock. The main bar plot (top) is number of genes unique to that intersection, so the total belonging to a set, say mock/EBOV, is a sum of the numbers in all sets that have mock/EBOV as a member (41+203+6+31=281).

      The authors do not specify in the main text, figure captions, or methods sections how they objectively assigned bat homologs as being "similar to " or "divergent from" their human counterparts. What is the cut-off in terms of sequence similarity?

      We apologize for this omission. In addition to a description in Methods, we have added the following statement to the Results section (Page 4).

      To identify divergent genes, we relied on BLASTn[13]. Genes detected as homologues (16004, 87% out of 18443 genes in our databse) using BLASTn default settings were labelled “similar”. The remaining 2439 genes (13%) were considered “divergent”. Of these genes, 1,548 transcripts (8% of the total), could be identified as homologous by reducing the word-size in BLASTn from 11, the default, to 9. This approach is equivalent to matching at the protein level, but we find that using nucleotide level matches provides a cleaner separation of the two classes than using translated proteins (Fig. 4, Methods).

      In the Discussion, it is surprising that the authors state that "the majority of interferon response genes are not divergent from human homologs" since genes involved in innate immunity are some of the most rapidly evolving genes known to exist. Again, clarification over what dictates "divergence" over "similarity" is warranted. Many previous studies have shown how a single residue change in an innate immune effector can drastically alter its specificity and/or potency.

      We have clarified this point by adding the following statement in the Discussion (pages 8,9)

      There are hundreds of genes involved in the interferon response, some key components can mutate to change specificity of their interactions, but most, especially those in the core ISG category[14], evolve slowly and have conserved function and sequence[15]. Our analysis of gene divergence shows that the majority of interferon response genes are not divergent from their human homologs, consistent with prior observations that the innate responses are quite similar between human and bat cell lines[9]. This implies that other systems are involved in generating the difference in response between bats and humans.

      The authors state in the introduction, and point to citation #21, that ERBs are "refractory to infection." In Figure 1, the authors indicate that experimental of ERBs with EBOV led to detectable infection in some animals, particularly in the liver. At this point in the manuscript, the authors should state if and how this result differs from what is published in #21, and they should comment on whether this is scientifically significant, or not. This is eventually discussed briefly in the Discussion but adding a sentence to Results section would be helpful for readers.

      To emphasize that our results contradict prior reports of ERB being refractory to EBOV infection, we have modified the statement in the Results (page 3) as follows.

      Two of the three EBOV-inoculated animals presented with histopathological lesions in the liver, consisting of pigmented and unpigmented infiltrates of aggregated mononuclear cells compressing adjacent tissue structures, and eosinophilic nuclear and cytoplasmic inclusions, changes consistent with previous reports[16], [17]. In EBOV-infected animals, focal immunostaining with both pan-filovirus and EBOV-VP40 antibodies was observed in the liver of one animal, but very few foci were found, suggesting limited viral replication.

      The research question at hand, concerning how bats serve as reservoirs for multiple viruses which are pathogenic to humans without succumbing to disease, is one of the hottest topics in immunology and virology. However, the authors do not provide a clear enough explanation of how their approach to study the transcriptome response following filovirus infection goes beyond what has been published in previous studies. This manuscript would greatly benefit from a discussion of its novelty in the Introduction and Discussion sections.

      We have reviewed prior human and bat studies (Introduction -page 3 and Discussion- page 9 shown above) to highlight the novelty of our findings. We have also added the following sentence at the end of the Introduction highlighting the novelty of the study.

      This is the first in vivo study that focuses on the coordinated transcriptional response to filoviruses at the level of individual organs in bats.

      References

      [1] X. Liu et al., “Transcriptomic signatures differentiate survival from fatal outcomes in humans infected with Ebola virus,” Genome Biology, vol. 18, no. 1, p. 4, Jan. 2017, doi: 10.1186/s13059-016-1137-3.

      [2] A. K. McElroy et al., “Ebola hemorrhagic Fever: novel biomarker correlates of clinical outcome,” J. Infect. Dis., vol. 210, no. 4, pp. 558–566, Aug. 2014, doi: 10.1093/infdis/jiu088.

      [3] S. B. Bradfute, K. L. Warfield, and S. Bavari, “Functional CD8+ T cell responses in lethal Ebola virus infection,” J. Immunol., vol. 180, no. 6, pp. 4058–4066, Mar. 2008, doi: 10.4049/jimmunol.180.6.4058.

      [4] M. N. Rahim et al., “Complete protection of the BALB/c and C57BL/6J mice against Ebola and Marburg virus lethal challenges by pan-filovirus T-cell epigraph vaccine,” PLOS Pathogens, vol. 15, no. 2, p. e1007564, Feb. 2019, doi: 10.1371/journal.ppat.1007564.

      [5] A. Sobarzo et al., “Multiple viral proteins and immune response pathways act to generate robust long-term immunity in Sudan virus survivors,” EBioMedicine, vol. 46, pp. 215–226, Aug. 2019, doi: 10.1016/j.ebiom.2019.07.021.

      [6] L. Fernando et al., “Immune Response to Marburg Virus Angola Infection in Nonhuman Primates,” J Infect Dis, vol. 212, no. suppl_2, pp. S234–S241, Oct. 2015, doi: 10.1093/infdis/jiv095.

      [7] S. W. Stonier et al., “Marburg virus survivor immune responses are Th1 skewed with limited neutralizing antibody responses,” J. Exp. Med., vol. 214, no. 9, pp. 2563–2572, Sep. 2017, doi: 10.1084/jem.20170161.

      [8] S. Sakabe et al., “Analysis of CD8+ T cell response during the 2013–2016 Ebola epidemic in West Africa,” PNAS, vol. 115, no. 32, pp. E7578–E7586, Aug. 2018, doi: 10.1073/pnas.1806200115.

      [9] I. V. Kuzmin et al., “Innate Immune Responses of Bat and Human Cells to Filoviruses: Commonalities and Distinctions,” J. Virol., vol. 91, no. 8, Apr. 2017, doi: 10.1128/JVI.02471-16.

      [10] C. E. Arnold et al., “Transcriptomics Reveal Antiviral Gene Induction in the Egyptian Rousette Bat Is Antagonized In Vitro by Marburg Virus Infection,” Viruses, vol. 10, no. 11, 02 2018, doi: 10.3390/v10110607.

      [11] M. Hölzer et al., “Differential transcriptional responses to Ebola and Marburg virus infection in bat and human cells,” Scientific Reports, vol. 6, p. 34589, Oct. 2016, doi: 10.1038/srep34589.

      [12] J. W. Wynne et al., “Comparative Transcriptomics Highlights the Role of the Activator Protein 1 Transcription Factor in the Host Response to Ebolavirus,” Journal of Virology, vol. 91, no. 23, Dec. 2017, doi: 10.1128/JVI.01174-17.

      [13] S. F. Altschul, W. Gish, W. Miller, E. W. Myers, and D. J. Lipman, “Basic local alignment search tool,” J. Mol. Biol., vol. 215, no. 3, pp. 403–410, Oct. 1990, doi: 10.1016/S0022-2836(05)80360-2.

      [14] A. E. Shaw et al., “Fundamental properties of the mammalian innate immune system revealed by multispecies comparison of type I interferon responses,” PLOS Biology, vol. 15, no. 12, p. e2004086, Dec. 2017, doi: 10.1371/journal.pbio.2004086.

      [15] T. B. Sackton, B. P. Lazzaro, T. A. Schlenke, J. D. Evans, D. Hultmark, and A. G. Clark, “Dynamic evolution of the innate immune system in Drosophila,” Nat. Genet., vol. 39, no. 12, pp. 1461–1468, Dec. 2007, doi: 10.1038/ng.2007.60.

      [16] M. E. B. Jones et al., “Experimental Inoculation of Egyptian Rousette Bats (Rousettus aegyptiacus) with Viruses of the Ebolavirus and Marburgvirus Genera,” Viruses, vol. 7, no. 7, pp. 3420–3442, Jun. 2015, doi: 10.3390/v7072779.

      [17] J. T. Paweska, N. Storm, A. A. Grobbelaar, W. Markotter, A. Kemp, and P. Jansen van Vuren, “Experimental Inoculation of Egyptian Fruit Bats (Rousettus aegyptiacus) with Ebola Virus,” Viruses, vol. 8, no. 2, Jan. 2016, doi: 10.3390/v8020029.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors provide this contribution to the extremely interesting topic of the immunobiology that facilitates filovirus infections of bats without overt pathology. They focused entirely on gene transcription signatures from different tissue sites following experimental infection, and sometimes compare those signatures with those generated in humans following natural exposures to filoviruses. The strengths of the paper is the shear breadth of data generated that is available openly to the scientific community and the development of novel mRNA datasets from bats, in the absence and presence of infection. One of the major limitations of this systems-based approach is that there is no mechanistic data that links gene function to the immune response to filovirus infection. Rather, associations are made and functional links are inferred. This limitation makes the title of the manuscript "...is controlled by a systemic response" an overstatement.

      Major points:

      The authors indicate that one of their main objectives is to understand differences in the responses to infection between bats and humans. But this submission says little about the transcriptome-level responses to filovirus infection in humans. It does, on at least one occasion, state that some of the bat genes with altered expression levels were also altered in a study of human filovirus infections (reference #67). I think it would be helpful if the authors devoted a figure or table to the direct comparison between their analysis of MARV- and EBOV-infected bats and the findings of filovirus-infected humans, highlighting genes that are differentially up- or downregulated between the two species.

      Figure 2 is not described nor presented usefully. Instead of providing a figure title ""Upset plot..." the authors should clearly describe the type of transcriptomic data being presented. Moreover, it way the data is plotted does not reveal any direct information about the genes that are up- or downregulated in each condition, thus reducing its utility to the reader. I suggest that this Figure be placed in the Supplemental information. In fact, Figures 3 could also be moved to the Supplemental information.

      The authors do not specify in the main text, figure captions, or methods sections how they objectively assigned bat homologs as being "similar to " or "divergent from" their human counterparts. What is the cut-off in terms of sequence similarity?

      In the Discussion, it is surprising that the authors state that "the majority of interferon response genes are not divergent from human homologs" since genes involved in innate immunity are some of the most rapidly evolving genes known to exist. Again, clarification over what dictates "divergence" over "similarity" is warranted. Many previous studies have shown how a single residue change in an innate immune effector can drastically alter its specificity and/or potency.

      Minor points:

      The authors state in the introduction, and point to citation #21, that ERBs are "refractory to infection." In Figure 1, the authors indicate that experimental of ERBs with EBOV led to detectable infection in some animals, particularly in the liver. At this point in the manuscript, the authors should state if and how this result differs from what is published in #21, and they should comment on whether this is scientifically significant, or not. This is eventually discussed briefly in the Discussion but adding a sentence to Results section would be helpful for readers.

      Significance

      The research question at hand, concerning how bats serve as reservoirs for multiple viruses which are pathogenic to humans without succumbing to disease, is one of the hottest topics in immunology and virology. However, the authors do not provide a clear enough explanation of how their approach to study the transcriptome response following filovirus infection goes beyond what has been published in previous studies. This manuscript would greatly benefit from a discussion of its novelty in the Introduction and Discussion sections.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The manuscript by Jayaprakash et al investigates the response to the filoviruses Marburg and Ebola virus in Rousettus aegyptiacus bats, the natural reservoir of Marburg virus. The response to infection is investigated by comparing transcriptomes of different bat tissues in infected and uninfected bats. The manuscript groups the observed transcriptome changes into pathways that are impacted, and discusses how those pathways may cause subclinical infection in bats, compared to severe disease in humans. The data included also sheds light on bat immunology and reservoir characteristics more generally, which is particularly timely during the SARS-CoV-2 pandemic.

      Major comments:

      Are the key conclusions convincing?

      The first hypothesis of the manuscript is that, rather than a change in a single immune pathway being responsible for the lack of response to the virus, the response will be systemic involving multiple inter-related pathways. The data show that this was the case after presenting convincing transcriptome analysis. The second hypothesis is that the differences in responses between bats and humans are due to evolutionarily divergent genes. The authors provide evidence for this in the transcriptome differences in the C-reactive protein, aspects of the complement system, iron regulation and M1/M2 macrophage polarization. The second hypothesis is broad, but there are clearly differences in the genes involved in humans and bats. Without mechanistic information on the function of the proteins/cells investigated, it is hard to determine that the changes the authors are observing are the cause of the different responses, rather than an effect of some upstream response, and so difficult to pin-point specific divergent genes. The authors wish to compare the response to the virus in bats to the better characterized human tissue responses, but because this relies on previously published work in humans, it is sometimes unclear whether "more bat-like" responses are definitely associated with positive outcomes in humans. As the benefit of certain responses in human infections can depend on the timing of the response, it might be helpful to include summarized human data in manuscript to aid comparison with the bat responses. For instance, the T-cell response section concludes "Bats mount a T cell response against the infection" but there is no discussion of the impaired but complex lymphocyte response in humans, so comparison is not possible.

      Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      No, speculative discussion of potential drugs is already qualified as speculative, and adds to the understanding of the significance of the data.

      Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      No

      Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      N/A

      Are the data and the methods presented in such a way that they can be reproduced?

      Yes

      Are the experiments adequately replicated and statistical analysis adequate?

      Yes

      Minor comments:

      Specific experimental issues that are easily addressable.

      Mock infected IHC should be included in Figure 1F to demonstrate the antibodies are not background.

      Are prior studies referenced appropriately?

      Mostly yes. The discussion of the T-cell responses in infection could be expanded to include more information on human responses

      Are the text and figures clear and accurate?

      Yes

      Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      See comment in hypotheses- a summarized table of findings from previous studies of early responses to the virus would be helpful for comparisons to the bat response and for determining the second hypothesis.

      Significance

      Nature and Significance of the advance.

      Bat immune responses to filoviruses are poorly characterized, and this paper contains much information that can aid future investigation of reservoir responses. This data also has broad application to other bat-borne pathogens.

      Compare to existing published knowledge.

      There is little about in vivo bat immune response to filoviral infections. Significantly, this report has a non-refractory response to Ebola virus infection in Rousettus aegyptiacus.

      Audience

      This paper would be of interest to filovirologists and those interested in zoonotics and bat immunology.

      Your expertise.

      I am a viral immunologist with >15 years' experience with filoviruses. Ms. Clarke is a senior graduate student whose thesis focuses on immune responses to filovirus glycoproteins.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      INITIAL RESPONSE TO REVIEWERS / REVISION PLAN

      We are grateful to the three reviewers for reviewing our manuscript and providing their comments which helped to improve further the quality of the current study. We attach an initial revised version of the manuscript with changes corresponding to reviewers’ comments being highlighted. We now provide:

      • 18 new main figure panels (Fig.1E, Figs.2D-F, Figs.3E-F, Figs.4B,C,E, Figs.6B-F, Figs.7B,D,E,F),
      • 9 new supplementary figures, and
      • 13 new supplementary tables, that correspond to the points raised by the reviewers. In this initial response to reviewers and revision plan we have already performed the bioinformatics analysis and the majority of new wet lab experiments requested by the reviewers, while we are still awaiting only for the results of three sets of wet lab experiments (RIP-seq, additional protein/RT-qPCR confirmations and B2 incubations with other proteins), which, due to their nature, take longer. We have also revised the main text accordingly with only a number of updates (regarding some methods of experiments currently in progress and the respective discussion) still missing.

      In detail:

      REVIEWER 1

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      B2 RNAs, encoded from SINE B2 elements has been directly implicated in stress response by its inherent ability to bind RNA Pol II and suppress stress response genes (SRG) in homeostatic conditions. However, upon stimuli, B2 RNAs are cleaved and degraded, resulting in the release of RNA pol II and upregulation of SRGs. Previous work from the senior author identified PRC2 component EZH2 to be the B2 RNA processing factor, cleaving B2, and releasing POL2. SRGs are upregulated upon stress, for example in age-associated neuropathologies like Alzheimer's disease (AD). Considering that the hippocampus is a primary target of amyloid pathologies as well as since SRGs are suggested to be key for the function of a healthy hippocampus, the authors set to understand the role of B2 RNAs that are linked to SRG regulation in the mouse hippocampus with amyloid pathology. They use disease-relevant in vivo and in vitro models combined with unbiased RNA seq data analysis for this endeavor, which indicates the potential relevance of B2 RNAs in APP mediated neuronal pathologies in mice as well as identifies Hsf1 as the factor cleaving B2 RNAs in the hippocampus.

      This reviewer generally remarks that “The work is interesting and identification of Hsf1 as the processing factor for B2 RNAs in the hippocampus is significant. I would like to credit the authors for their elegant in vivo experimental design in Figure 2.”

      We appreciate the encouraging comments made by this reviewer.

      General comment: The reviewer finds “some of the conclusions to be overstated” and has brought a number of concerns to our attention. Indeed, we agree that provision of additional data and details is needed to avoid any confusion about the gene pathways to which our findings apply. In the initial manuscript, (Figures 2 D, F and 6 D, F), we presented the gene expression levels of all B2 RNA regulated SRGs identified in our previous study (Zovoilis et al, Cell 2016), referred as B2 RNA regulated SRGs or B2-SRGs throughout the manuscript. To this end, we performed the respective statistical tests between the different conditions considering these genes, in order to show the transcription dynamics of these genes in either amyloid beta pathology (APP mice /Figs. 2D, F) or amyloid beta toxicity (HT22 cells / Figs. 6D, F). Since we were not looking for new candidate genes upregulated in APP mice or in our HT22 cell culture system, we did not narrow our analysis only to genes delivered by a general-purpose differential gene expression approach such as DESeq but tested all B2-SRGs. However, based on the reviewer’s comments below, we realize that the paper would benefit by presenting in the main figures only those B2 RNA regulated SRGs that overlap with differentially expressed genes identified by DEseq in each experimental system. This will help to avoid confusion and any misunderstanding that all B2 RNA regulated genes are equally affected in our system, which is not the case and would be an overstatement. We are now presenting in new Figure 2 (2E, 2F) only those B2-SRGs that overlap with upregulated genes identified by DESeq in 6m old APP mice (listed in new Suppl. Table 5) and in new Figure 7 (7D, F) we are now presenting only those B2-SRGs that overlap with upregulated genes identified by DESeq in HT22 cells treated with amyloid beta (listed in new Suppl. Table 11). The conclusions drawn by the new figures remain the same as with the old ones and we believe that this new way of presentation of this data will prevent confusion and potential over-statements. We thank the reviewer for bringing this to our attention. Based also on this reviewer’s minor point 3, we recommend that the old figures that included all B2-SRGs (and not only the differentially expressed ones identified by DESeq) are moved to the Supplement as new Supplementary Figures 1 and 7, respectively, so that readers can still get a view of all the data and the transcription dynamics of all B2-SRGs, while we provide both in text and the supplement an explanation about the value as well as limitations of these figures.

      **Major comments:**

      Major point 1. The reviewer asks: “In figure 1, the authors indicate a strong connection between B2 RNA regulated SRGs and learning and memory. In figure 2, they identify the SRGs in the hippocampus, please provide a direct comparison of learning and memory associated SRGs and the SRGs they identify in figure 2 that are significantly upregulated in APP mice in 6 months.”

      In the revised version of the manuscript we now provide: i) As a new figure panel (lower panel in new Fig.1E), the number of B2 RNA regulated SRGs that are associated with learning based on our Peleg et al, Science 2010 paper and as a new Supplementary Table 3, the exact list of these genes. ii) As a new Supplementary Table 4, the list of all genes that are significantly upregulated in APP mice (6 months). iii) As a new Supplementary Table 5, the list of those genes upregulated in amyloid pathology (APP 6 months) that are B2-SRGs (expression levels of these genes are presented in new Figure 2E,F). Per reviewer’s question, we now provide as a new Supplementary Table 6, the list of B2 RNA regulated SRGs that are both learning associated genes and upregulated in 6 month old APP mice. In the text (first two sections of the results), we provide direct comparisons of the number of genes in each category and their overlap.

      Major point 2. The reviewer asks: “To better understand the data in the context of hippocampal function, please include functional annotation of SRGs they identified in Figure 2F as they do it in Figure 1 (desirably for each time point, at least for 6M). How many of the SRGs they identify in Figure 1 are part of Figure 2F? Please include functional annotation of significantly upregulated B2 regulated SRGs in Fig2 and compare them with that of Figure 1.”

      The number of B2 RNA regulated SRGs in Figure 1 that are part of Figure 2 (in particular Figs.2E,F) is now presented in the new Supplementary Table 5 and also in the text. We now provide as a new Supplementary Table 7 the functional annotation of these genes (see also general comment for this reviewer) and discuss the findings in the text.

      We recommend to include only the 6M old mice as this is the time point in which B2 RNA processing was found to differ between WT and APP mice. However, if the reviewer thinks that this is necessary we will add also differential expression lists of other ages as additional supplementary tables.

      Major point 3. The reviewer asks: “In figure 3, the authors report that the B2 processing rates are high at the 6M time point at in hippocampi of the APP mice. Please include the levels of unprocessed and processed B2 RNAs in these samples along with this figure, without which it is difficult to gauge the significance of its correlation with SRGs in Figure 2.”

      We now provide as new figure panels 3E and 3F the levels of processed B2 RNA fragments and unprocessed (full length) B2 RNAs in these samples, respectively, along with the processing ratio which is now labeled as subfigure 3G.

      Major point 4. The reviewer asks: “What is the % of B2 regulated SRGs that are hsf1 bound in Figure 4C? What is there dynamics in the wild type and APP hippocampi?”.

      Old Figure 4C is now Figure 4A. The exact number of B2 RNA regulated SRGs that are close to Hsf1 binding sites is now presented as a new figure (Figure 4C) and discussed in the text. A list of these genes is provided as new Supplementary Table 8. For genes that are upregulated in APP mice compared to wild type, the difference in Hsf1 binding dynamics between B2 RNA regulated and not regulated genes is now presented as Suppl. Figure 4D.

      Major point 5. The reviewer asks: “What is the distribution of Hsf1 binding sites on (a) non-B2 regulated SRGs and (b) non-SRG genes in hippocampi?”.

      This point is related with point 4. We now present a new panel (Fig. 4B) for non B2 RNA regulated genes (listed in Suppl. Table 13) along with the distribution we have in the initial manuscript for all B2 RNA regulated SRGs (now presented as Fig. 4A). The direct comparison of these genes is presented in the new Suppl Figure 4C together with a similar comparison only for genes upregulated in APP mice (Suppl. Fig.4D)

      Major point 6. The reviewer notes: “In Figure 4D, the 3months old Wt HSF1 levels are high, yet B2 processing (Figure 3E) is low. Please comment.”

      The reviewer’s comment made us realize that we should include a plot that describes the correlation between Hsf1 levels and B2 RNA processing ration across all sequenced samples. This should reveal whether differences such as those observed by the reviewer affect our conclusion regarding the relationship between these two parameters. We now provide this in the new Supplementary Figure 6D, where we found a strong positive correlation between Hsf1 levels and B2 RNA processing ratio. We thank the reviewer for this comment which helped us to substantiate further this relationship.

      Major point 7. The reviewer notes: While the authors show in vitro cleavage of B2 RNA by Hsf1, the experiment lacks controls to be conclusive. At least, please include a similar size protein as HSF1 with no-known RNA binding activity and a similar size protein with RNA binding activity as controls in 5A. Please justify the use of PNK as the control protein. Please include the use domain-based deletions of Hsf1 to map the region of HSF1 that is binding and potentially cleaving the B2 RNA. Please include an RNA of similar size and Antisense-B2 RNA to show the specificity of the Hsf1 based cleavage of B2 RNA. Without these controls, the conclusions in Figure 5 cannot be substantiated.

      The endogenous ribozyme activity of B2 RNA compared to other control RNAs has already been shown in two previous works but we will also include the relative controls here by providing control incubations with other RNAs. We will also include the incubations with additional control proteins as suggested by the reviewer. We are currently performing these experiments and will include them in the revised version. PNK is used as a control protein because it is an RNA binding protein that is used in the construction of our short RNA libraries and we wanted show that short RNA seq data are free of such confounding factors that could potentially generate artificial fragments. We now include this information in the text.

      We feel that the application of domain based deletions for Hsf1, while it would add additional information on the exact biochemistry underlying B2 RNA processing though Hsf1, is beyond the scope of this manuscript. In the current manuscript we are just focusing on the fact that Hsf1 can accelerate B2 RNA processing in vitro and not on the mechanism how this happens. This should be addressed in our opinion on a separate manuscript.

      Major point 8. The reviewer asks: “The authors should show that the incubated APP peptides are taken up by the cells (experiments in Figure 5F and Figure 6).” These figures are now labelled as Fig.6C and Figure 7, respectively. That’s a very interesting point and we thank the reviewer for this comment. Multiple studies have shown that toxicity after incubation by amyloid beta is mediated mainly by cell surface receptors, which through cell signalling leads to the response to cellular toxicity that induces stress genes such as Hsf1. Nevertheless, APP peptides may enter the cell, and the reviewer’s questions raised the possibility that oligomers entering the cell could have a direct impact on the stability of the B2 RNA. In that case, providing evidence that the amyloid enters the cell would be important if we had indications that amyloid beta interacts directly with B2 RNA. We did test this and we found no direct effect of amyloid beta on B2 RNA, so the processing in our case is not induced by oligomers that may have entered the cell. We were planning to present this information in a different manuscript, but if the reviewer or editor thinks that it would be beneficial for the paper, we could present this as supplement figure that shows that amyloid beta incubations with B2 RNA do not induce further processing beyond what Hsf1 causes. For the moment we just present this below:

      Major point 9. The reviewer asks: “Please provide the list, functional annotation, and % of the SRGs upregulated upon incubation with APP in HT22 cells in comparison to 6month old APP mice. Comment on learning-related Genes.”

      In the revised version, we now provide and mention in the text the following data: i) a list of genes upregulated in HT22 cells during amyloid toxicity upon incubation with amyloid beta (new Suppl. Table 9), ii) a list of genes according to point (i) that are common with genes upregulated in APP mice (new Suppl. Table 10), iii) the list and number of B2-SRGs that are upregulated in HT22 cells during amyloid toxicity (the reviewer’s question) (new Suppl. Table 10). We mention in the text the gene numbers and also the genes that are common in all three lists. iv) Functional annotation of genes of point (iii) (new Suppl. Table 12),

      We also mention in the text the limitations of our comparisons between the in vivo model of amyloid pathology (APP mice) and the in vitro cell culture model of amyloid toxicity (HT 22 cells) and we clarify that the cell culture model is used just as a simulation of the effect of amyloid beta in gene pathways associated with response to cellular stress and the role of Hsf1 on B2 RNA processing.

      Major point 10. The reviewer asks: “The authors should show the efficient downregulation of Hsf1 (protein) upon anti-Hsf1 LNA transfection.”

      In the revised version, in addition to the RNA-seq data we provide a second confirmation at the mRNA level with an independent method (RT-qPCR) in new figures 4E and 7B (lower panel). We are currently performing the protein extractions and will provide a WB or an Elisa in the revised version.

      Major point 11. The reviewer asks: “Please present the total B2 RNA levels for conditions in Figure 6C.”

      We now provide as new supplementary figure (Suppl. Fig. 6B and C) the levels of processed B2 RNA fragments and the total levels of unprocessed full length B2 RNAs of these samples that relate to old Figure 6C (now labeled as Fig.7C)

      Major point 12. The reviewer notes: “Hsf1 levels are not significantly downregulated in Control cells which were inoculated with the reverse APP peptide. Please comment.”

      We assume that the reviewer here refers to the lack of reduction in Hsf1 levels in the cells inoculated with the reverse peptide and the anti-Hsf1 LNA. Indeed, this lack of reduction is confirmed also by the new qPCR we performed (new Figure 7B, lower panel, R-ctrl vs R-anti-Hsf1). This should likely be attributed to compensation during non-stress conditions. In contrast, under stress conditions, Hsf1 is heavily used in stress response, which could explain the differences we see as cellular needs surpass the available Hsf1 transcripts due to degradation by the LNA. This is also supported by the new RT-qPCR experiments we have performed for B2-SRGs (new Figure 7E). In agreement with what is known for stress response genes such as immediately early genes (for example FosB), levels of these genes are minimal in both R-ctrl and R-anti-Hsf1 conditions and only become activated during stress response. We now discuss this in the text of the revised manuscript.

      Major point 13. The reviewer asks: “Please compare and contrast the % of genes, the overlap, and the functional distinctions in 6F to that of 5G and Figure1. What are the genes that are common between Figure1, and that are specifically upregulated upon Anti-Hsf1 LNA transfection along with 1-42 APP. What is % of the occurrence of B2 binding sites in those genes? What are their functional annotations and what is their connection to learning, memory, and cell survival?”

      Old Figure 6F is now Figure 7F, while old Figure 5G is now Figure 6C. This point is discussed in the response to points 1 and 9 of this reviewer. In summary, genes upregulated in our amyloid toxicity model included 25 B2-SRGs (new Suppl. Table 11). When testing for enriched terms in these 25 genes, biological processes related with apoptosis, such as regulation of apoptotic process and programmed cell death were at the top of the list (new Suppl. Table 12) and included, among others, genes such as FosB and Mitf that have been connected with Alzheimer’s disease. Out of the 25 genes that are up-regulated in both mice and our cell culture system, six are B2-SRGs (4932438A13Rik, Fosb, Pag1, Ptprs, Sema5a, and Sgms1) and include a well-known immediate early gene (Fosb), genes associated with sensitivity to amyloid toxicity (Pag1, Sema5a, Sgms1, Fosb), as well as genes associated with p53 (Ptprs, Fosb). All these genes get upregulated in amyloid toxicity (42-Ctrl vs R-Ctrl) but are not upregulated when Hsf1 LNA is applied (42-anti-Hsf1 vs R-anti-Hsf1, no significant difference). This information is now included in the text.

      **Minor.**

      1 . Please include TPM/ FPKM values for hippocampal markers as control in Figure 2 to do justice to the hippocampus specific RNA seq conducted by the Authors.

      To our understanding, the reviewer here suggests the testing of well-known hippocampal markers in our mouse data as controls to confirm that they are indeed hippocampus specific. We have selected as reference markers, the genes employed by the Allen Brain Atlas RNA-sequencing project and we provide a comparison of their data in hippocampal cells with our data from mouse hippocampus. This is now presented as new Supplementary Figure 2.

      2 . In figure 2D the authors show that B2 RNA regulated SRGs in the 3 months' wild type mice are significantly high. P53 has been reported to be high in young wild types hippocampus, but not SRGs in my opinion. The authors should comment on this.

      Old Figure 2D is now Figure 2E. We now mention the reviewer’s comment particularly in the discussion and cite a landmark review article in Neuron journal by Michael Greenberg regarding the role of stress response genes, such as FosB, early during development. As to prevent any confusion, we have also replaced SRGs with B2-SRGs since we tested only B2-SRGS in our study.

      3 . In figure 2F, under the 6m APP condition, the replicate 3 looks substantially different from the other replicate. This can significantly impact the analysis and conclusions made. Either remove that replicate and present the analysis without it or please provide a valid explanation. To make the data more valid, please provide hierarchical clustering of the entire data, the non-B2 regulated genes and the B2 regulated SRGs.

      We now provide in the new Supplementary Figure 9C a PCA plot, which includes 6m APP mice vs. their WT counterparts and HT22 cells, and shows that this variability is within the biological replicate variability we can expect in these models. To substantiate this further, we have constructed the correlation matrix of the RNA-seq data of both WT and APP 6 month old mice in the new Supplementary Figure 9D. As shown in this matrix, all APP mice clearly correlate with each other and not with their WT counterparts.

      In the initial manuscript the heatmaps of former Figure 2 were indeed provided with hierarchical clustering of the entire data and also included non-B2 RNA regulated genes. This data is included now as Supplementary figure 2.

      In Figure 2C RNA seq data is represented in TPM while its FPKM in Figure 2D.

      Figure 2D is now Figure 2E, while Figure 2C remains labelled with the same number. Given that TPM already includes scaling of the data, it is unsuitable for the averaging of the gene expression levels of multiple genes (B2-SRGs) used in the boxplots of Figure 2. This does not apply in the case of single genes as in Fig 2C (p53) or in the heatmap where each gene is presented in a separate row. This explanation is now included in the methods section.

      Figure 2: the number of replicates in the case of 3-month-old wild types only 2. Please specifically denote it and comment why only 2 replicates are provided.

      During the hippocampal RNA extractions, the RNA of one of the three 3m old mice had very low RIN scores, which could be a confounding factor for the short-RNA-seq. As this happened some months after the hippocampal extractions, we did not have any other 3 month mice of the same cohort used for the behavioral and IHC studies. Thus, we decided to include only two replicates in this condition. Since the results presented in the current study focus mainly on 6 month old mice, we expect the impact to be minimal. We include this note in the methods section.

      4 . Considering that p53 and SRGs are significantly upregulated in 6months in the APP model, it would be great if (allowing that these samples are still available) the authors can include a staining for apoptotic markers, for example, Active Casp3 or similar. This will allow us to better gauge the gene expression changes presented by the authors especially regarding SRGs.

      Unfortunately, we do not have these slides but in the revised version we will provide qPCR data for some of these markers.

      5 . Under subheading: Hsf1 accelerates B2 RNA processing, 3rd paragraph when the authors comment on known hsf1 binding sites on SRG genes, please correct from: Increased Hsf1-binding was found.... "To the increased number of hsf1 binding sites were found", unless the authors would like to show increased Hsf1 binding by performing CHIP-seq for Hsf1 in the hippocampus at least at the 6-month time point between Wt and APP mice.

      We have changed the text accordingly.

      Reviewer #1 (Significance (Required)):

      B2 RNAs, encoded from SINE B2 elements has been directly implicated in stress response by its inherent ability to bind RNA Pol II and suppress stress response genes (SRG) in homeostatic conditions. However, upon stimuli, B2 RNAs are cleaved and degraded, resulting in the release of RNA pol II and upregulation of SRGs. Previous work from the senior author identified PRC2 component EZH2 to be the B2 RNA processing factor, cleaving B2, and releasing POL2. SRGs are upregulated upon stress, for example in age-associated neuropathologies like Alzheimer's disease (AD). Considering that the hippocampus is a primary target of amyloid pathologies as well as since SRGs are suggested to be key for the function of a healthy hippocampus, the authors set to understand the role of B2 RNAs that are linked to SRG regulation in the mouse hippocampus with amyloid pathology. They use disease-relevant in vivo and in vitro models combined with unbiased RNA seq data analysis for this endeavor, which indicates the potential relevance of B2 RNAs in APP mediated neuronal pathologies in mice as well as identifies Hsf1 as the factor cleaving B2 RNAs in the hippocampus.

      The work is interesting and identification of Hsf1 as the processing factor for B2 RNAs in the hippocampus is significant. I would like to credit the authors for their elegant in vivo experimental design in Figure 2.

      REVIEWER 2

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      This manuscript follows from previous work by the corresponding author showing that SINE-encoded B2 RNAs function as regulators of the expression of stress response genes (SRGs). Specifically, stimulus triggers the processing of repressive B2 RNAs that are bound at the SRGs, thereby activating SRG transcription. In this work, the authors investigate whether a similar mechanism might be controlling the expression of genes in models of amyloid beta neuropathology (i.e. mouse hippocampi from an amyloid precursor protein knock-in mouse model, and a cell culture model of amyloid beta toxicity). They performed RNA-seq in these models. Their data show a correlation between the progression of amyloid pathology, expression of genes thought to be regulated by B2 RNA, and the processing of B2 RNA. In addition, they show biochemical data supporting a role for Hsf1 in enhancing the processing of B2 RNA. Knockdown of Hsf1 also reduced B2 RNA processing and the expression of SRGs.

      **Major comments:**

      Major point 1. The reviewer asks: “In the RNA-seq data one cannot distinguish between Pol III transcribed B2 RNA and Pol II transcribed B2 RNA (typically embedded within introns and UTRs of mRNAs). The models they present, and the structures they show, clearly imply regulation by Pol III transcribed B2 RNA. However, there is no way to know that the short B2 RNAs they sequence aren't coming from degraded mRNAs. This needs to addressed. Minimally, in writing as a caveat of their model. Ideally, it would be addressed experimentally.”

      That’s a very interesting point, as it implies that the regulatory role of B2 RNAs may extend from PolIII transcribed B2 RNAs into B2 RNAs embedded into mRNAs (likely nascent ones) that may be also under the same endogenous ribozyme activity of this sequence, suppress PolII and are processed in response to stimuli. The RNA RIN values of our samples were pretty high except one 3m old mouse sample which was for this reason excluded from further analysis. Moreover, during the library construction shorter and longer RNAs have been separated. Thus, any generation of B2 RNA fragment that may have originated from mRNA should be biologically but not technically related and must have happened in the cell before our RNA extraction. To address this point, we now provide a new supplementary figure (Suppl. Figure 8), where we have separated the B2 elements against which we map the RNA fragments into two categories, those that fall within exonic/genic regions and those outside of these regions. Although B2 RNAs are produced by multiple copies in the genome, each copy does harbor multiple SNPs, insertions and deletions, which means that each B2 RNA fragment is mapped to a specific set of B2 elements and not to all of them. In other words, despite multiple mapping a level of spatial specificity is maintained. If the B2 RNAs we map were coming exclusively from either only Pol III B2 elements or mRNA embedded B2 elements, we would expect at least some difference in the distribution of fragments between B2 elements of these two categories, as the second one overlaps with mRNAs. As shown in the new supplementary figure 8, the fact that distribution models are very similar between the two categories indeed supports the hypothesis that both types of B2 elements may contribute to B2 RNA processing. Most importantly, the profile of B2 RNAs in genic regions shows that B2 RNA processing is not random but follows the same processing rules as B2 RNAs from Pol III promoters. Given the limitations posed by the repetitive nature of B2 RNAs, it remains difficult though to provide an exact number regarding the portion of B2 RNA fragments produced by each category and this is clearly noted in our revised discussion part. However, even the indication that B2 RNAs embedded in mRNAs may also play an important role in our model provides a new perspective that should be investigated further in future studies.

      Major point 2. The reviewer asks: “The direct regulation of SRGs by B2 RNA was not shown in their model systems for amyloid beta neuropathology. Rather, the authors' used the genes identified in their prior studies as B2 RNA-regulated, which I believe were in the NIH3T3 cell line. Given that transcription is highly cell-type specific, these genes might not be regulated by B2 RNA in mouse hippocampi or their cell culture model, despite the correlations shown. This needs to be addressed. Ideally, a targeted approach to show that transcription of even a couple genes in their system is indeed regulated by B2 RNA would provide stronger support for their conclusions.”

      We agree with the reviewer and we now provide a new figure (Fig.6D-F) with the targeted approach that this reviewer proposed. In particular, we have tested whether fragmentation of full length B2 RNAs is in connection with activation of target genes also in our biological system (HT22 cells) as it did in NIH/3T3 cells in our Cell paper. We now show in new Figure 6 that this is indeed the case.

      Major point 3. The reviewer proposes a number of additional information that needs to be provided: “The following bioinformatics analyses would strengthen their conclusions. This should be straightforward to do because it involves data they already have, and perhaps analyses they have already have performed.”

      a. Regarding the plot in Figure 3A (lower panel). The same plot should be shown for the 3m old and the 12m old APP mice (i.e. not just the 6m data). This would show the specificity of processing B2 RNA and that it indeed correlates with disease progression.

      We now provide this plot as new supplementary figure (Suppl. Figure 3). It shows that increased B2 RNA processing coincides only with the active neurodegeneration phase at 6 months and not the terminal stage.

      b. Regarding the plots of B2 RNA processing rate. This value could increase either due to more short RNAs or less full length RNA. Which is it for the 3m, 6m, and 12m APP mice? Showing the short and long B2 RNAs as boxplots (as opposed to only the processing rate) would address this and also provide additional insight into the regulation involved. The same applies to the data in Figure 6. (As an aside... do the authors mean processing ratio as opposed to rate? I'm not clear where the time component is coming into play to call this a rate.)

      Old Figure 6 is now Figure 7. We now provide all these figures that show that increase in processing ratio at 6 months is mainly due to increase in the processed fragments and not a decrease in full length B2 RNAs. For APP mice these are new Figures 3E and F, and for HT22 cells , these are new Supp. Figures 6B and C.

      c. The random genes in Figures 2E and 6E are plotted as heat maps, but statistical significance is hard to see. What do boxplots of the random genes look like, and is the significant difference between 6m old APP and 6m old WT then lost?

      Old Figure 2E is now new Suppl. Figure 1C, while old Figure 6E is now new Suppl. Figure 7C. We now provide these boxplots in new supplementary figures 1B and 7B.

      Major point 4. The reviewer comments: “ It is interesting that B2 RNA self-processing is enhanced by both Ezh2 and also Hsf1. It would strengthen the data to perform a control with a protein prepared more similarly to the Hsf1 (rather than PNK) to confirm that the enhanced B2 RNA breakdown is indeed attributable to Hsf1 and not a contaminant in the protein prep. Similarly, the authors should provide information on which RNA was added as the negative control for Hsf1-stimulated breakdown (i.e. the ~80 nt RNA).”

      This point is also discussed in Reviewer 1 point 7. The ribozyme endogenous activity of B2 RNA has been shown already in two previous studies that performed incubations with control RNAs and proteins. We are currently preparing and will provide these additional incubations as anew supplementary figure in the revised manuscript.

      **Minor comments:**

      1 . Regarding the GO analyses in Figure 1 (panels B, C, and D). I wasn't clear whether the authors are showing all statistically enriched terms, or only those relevant to neuronal processes and learning. I recommend showing a supplemental table with all terms that have an adjusted p value below a specified cut-off (e.g. 0.05).

      The statistical threshold used was an EASE score of 0.05 and all presented terms were above this threshold. In the initial manuscript we filtered only the top 5 terms in tissue enrichment and the top 10 terms for GO Biol process and Cell Compartment that had passed the threshold. We now provide all the terms that passed the threshold as a new Supplementary Table 2, including gene counts, exact gene numbers and related statistics.

      2 . The authors show several figures that are not new data (2B, 4A, 4B, Suppl. Fig 1 and 2). I think it would be more clear if these data were summarized and referenced in the results, rather than shown.

      Old Suppl. Fig1 and 2 that were results of previous studies or web resources directly available (such as Human Protein Atlas) have been now removed and they are now just referenced in the text. Old Figures 4A and 4B have been removed from the main figures but may be helpful to the readers if they are still available in the Supplement (currently as Suppl. Figure 4A and B), as not all users are familiar with the RNA-seq browsing tools of Allen Brain Atlas resources. Regarding figure 2B that contains data from our previous study on this exact cohort of mice: If the reviewer and the editor agree we recommend that it remains in the main figure (with the appropriate image credit citations), as it provides in an efficient way the clear connection between amyloid load and our results at the molecular level, and, most importantly, it clearly draws a line in amyloid pathology progression between 3m old and 6m old, that agrees with our findings in the RNA-seq data of these mice.

      3 . In Figure 3A the schematic shows that B2 is 155 nt, the plots in Figures 3A,B,C show B2 RNA is 120 nt, and Figure 5 shows the RNA is 188 nt. Can the authors please clarify these differences?

      The full length of B2 consensus sequence is 188nt and this is the one we use for the in vitro experiments. However, the structure of the B2 RNA has been resolved only for the first 155nt by the Kugel lab, and this is the only publicly available structure that we can reference in our figures. For the mapping of 5’ends of short fragments in Fig.3A we have used the same range tested in our Cell paper to maintain consistency of the results. The reason why this 120nt threshold was selected in the Cell paper was to exclude artifacts from short RNAs mapping partially in our metagene as well as downstream of those B2 elements that are shorter from the consensus sequence. We now explain in methods section these differences.

      4 . In the Methods section, the sequence of the g block template didn't contain the T7 promoter sequence that was used as the forward primer for PCR amplification?

      We have now included this sequence in lower case.

      5 . In Figure 6B, why were Hsf1 levels not decreased in the R treated cells after treatment with the LNA?

      Old Figure 6B is now new Figure 7B. Please see response to Reviewer 1, major point 12.

      Reviewer #2 (Significance (Required)):

      Finally, this reviewer generally remarks that “The models presented for the regulation of stress response genes (SRGs) in amyloid beta neuropathologies are compelling. As are the correlations they found between the progression of amyloid pathology, expression of genes thought to be regulated by B2 RNA, and the processing of B2 RNA. This is a unique direction of research for brain disease and represents an interesting conceptual advance. Most prior studies in this area use common model cell lines, and this lab seems well-positioned to unravel the proposed molecular mechanisms in neuronal systems.”

      We appreciate the encouraging comments made by this reviewer.

      REVIEWER 3

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This manuscript describes a regulatory mechanism involving Hsf1 and B2 RNAs in the control of stress response genes (SRGs) during amyloid induced toxicity. In particular Hsf1, upregulated in 6m old APP mice and in HT22 cells treated with beta amyloid peptides, is shown to stimulate the B2 RNA destabilization leading to SRGs activation. While in healthy cells this upregulation can be reverted once the stimulus is removed, the pathological condition fuels the circuitry leading to p53 upregulation and neuronal cell death. The authors previously described the same mechanism acting during cellular heath shock response but in this case the protein identified as trigger of B2 RNA destabilization and SRGs activation was EZH2 (Zovoilis et al, 2016).

      This reviewer generally remarks that “Indeed, the first part of the manuscript describes additional analyses of the previous data that prompts further investigation on the potential role of B2 RNA in AD condition. Nevertheless, it is not clear how the prior findings obtained in not biologically related cellular models might be used to obtain helpful indication of B2 RNA neuronal activity.”

      We thank the reviewer for this comment. Indeed, the current study’s main aim was to expand the findings of our previous work on the role of B2 RNA in cellular response to thermal stress in NIH/3T3 cells to other types of cellular response to stress, in our case to amyloid toxicity and the resulting amyloid pathology in neural cells. Response to thermal stress (Heat Shock) has been used for years as a basic study model for cellular response to stress. Proteins and gene pathways initially identified in heat shock have been subsequently shown to play identical pro-survival roles in other biological systems and there are studies showing the role of Hsf1, heat shock related proteins and cell stress response pathways in neural cells and the mammalian brain (we will provide these references in the revised version). For example, pathways such as the MAPK pathway and early response genes, that constitute the basis of response to heat shock, have been shown in studies by us and others to be activated and play a critical role in hippocampal function. Thus, examining the role of B2 RNA in the context of neural response to stress constituted a natural continuation of our previous study in NIH/3T3 cells. The fact that the list of B2 RNA regulated SRGs was found to be highly enriched in neuronal tissue terms and cellular compartments related to neuronal functions plainly confirms the close relationship among cellular response pathways in the two biological systems. Due to these facts we were compelled to investigate in more detail our previous findings also in a neural cell model. However, as discussed in point 2 of Reviewer 2, the initial manuscript did not confirm the direct control of B2 RNA on expression of target genes also in our cellular model. This information is now part of the new figure 6 and we thank both reviewers for bringing this to our attention.

      The reviewer also remarks that “The research fields of non coding RNAs and neurodegeneration are attractive and challenging and, in my opinion, the molecular circuitry involving B2 RNAs might add important insights for understanding beta amyloid toxicity and neuronal death; however, the data provided are not in the shape making the manuscript suitable for publication: some controls are missing, the way the experiments are presented is not easy to follow and more importantly the authors does not provide any data (tables or lists) of the NGS experiments and the study lacks validation of them. Therefore, in my opinion the manuscript needs a profound revision before to be considered for publication in Review Commons.”

      Based on this reviewer’s and the other reviewers’ suggestions we now provide additional controls, detailed tables and gene lists, and qPCR validation of these results. We have also substantially revised the text in the first section of the results and beginning of the discussion, to make our rational for testing B2-SRGs more clear and easier to follow.

      **major concerns:**

      Major point 1. The reviewer asks: “The first paragraph of the Results is entirely dedicated to re-analyze the data previously published by the same group (Zovoilis et al., 2016). However, this is not adequately explained. In line with this, the table 1 is not required since the data are already provided by Zovoilis et al., 2016, unless the authors handled the data using additional new criteria that have to be explained.”

      We now explain our rational for using this data in more detail in the text. Please see also response to the general comment of this reviewer and response to the next point.

      In the Zovoilis et al (2016) study, the data presented did not include the list of regulated genes in a direct way but as part of the annotation of the B2 CHART peaks. This may pose difficulty to non-experts to extract the gene list from that data and we thought to include them as separate gene list here so that readers can directly use it for their analysis. Nevertheless, if the reviewer or the editor think that the list is redundant, we can surely omit it.

      In addition, the reviewer comments: “Moreover, Zovoilis and colleagues (2016) focused on SRGs regulated upon heat shock and using NIH/3T3 and HeLa cell lines, therefore, it is difficult to me understand how, searching for "cellular function connected with B2 RNA regulated SRGs", the list resulted enriched of neuronal tissue terms or cellular compartments related to neuronal functions. Please clarify this point since the following analyses are based on these findings.”

      Neural pathologies, such as amyloid pathology in brain, are often connected with cellular stress due to proteotoxicity. The ability of neural cells to respond to proteotoxicity challenges is connected with various molecular mechanisms, including stress related proteins that were firstly described in the context of heat shock. Thus, both contexts (heat shock and amyloid toxicity) refer to cellular response to stress, which explains why genes identified to be regulated during stress response in NIH/3T3 cells constitute part of the basic stress response toolbox that neural cells have also been described to possess. We have now modified the text accordingly to make our rational more clear.

      Major point 2. The reviewer comments: “In Figure 1F there is no arrow indicating that some of the SRGs regulate directly miR-34 as stated in the main text. Moreover, it is more appropriate to replace SRGs with learning‐associated genes both in the figure and in text (2nd paragraph of the results) since Zovoilis and colleagues focused on them. Finally, they did not show in their manuscript the rescue of p53 expression mediated by mir-34; indeed, for miR-34-p53 regulatory axis Zovoilis and colleagues referred to Peleg et al, 2010 and Yamakuchi & Lowenstein, 2009. Please fix all these concerns.”

      We have restructured the figure as suggested by the reviewer and made clear the distinction between learning genes and B2 RNA regulated SRGs (B2-SRGs) from the two different studies. In connection with point 1 of Reviewer 1, we believe that new Figure 1E, that includes the exact number of B2-SRGs that are learning associated, will represent more efficiently and accurately the data. We have also corrected in the text the citation regarding miR-34c and p53 in both the introduction and first section of the results (last paragraph).

      -The Fig.1A and Fig.1F are wrongly indicated at the end of the sentence "....levels of these genes are normally downregulated in 6m and 12m old mice compared to 3m old mice (p=0.02 and p=0.04, respectively)"; please correct this point.

      The error has been corrected.

      Major point 3. The reviewer comments regarding Figure 2:

      a) Since three mice for each condition have been used for the RNA seq analyses, please provide a blot with the Principal Component Analysis (PCA).

      Please see also response to minor point 3 of Reviewer 1. We provide the PCA plots for WT and APP mice in the new Supplementary Figure 9 and we also provide a comparison of the six month old mice with the HT cell samples as well as a correlation matrix for 6 month old mice in the same figure.

      b) Fig 2F comes first of Fig 2E in the text, however, I suggest to move this latter to supplementary material.

      Old figure 2E has now been moved to supplementary material as new Supplementary Figure 2C and we also provide in a boxplot the exact gene expression levels as new Supplementary Figure 2B.

      c) In general, this study lacks validation of the RNA-seq results. Western blot and/or qRTR-PCR to verify the variation of p53 and of some selected SRGs have to be provided.

      In the current revised version we already provide qPCRs for p53 and Hsf1 in APP mice and we will include additional genes in the final version.

      d) It is also not clear how the authors defined SRGs in the hippocampus: do they correspond to learning‐associated genes described by in Zovoilis et al, 2011 or to B2 RNA H/S regulated genes by Zovoilis et al, 2016?

      The way we presented B2 RNA SRGs in the results with regard to learning associated genes was indeed unclear. We now present the distinction between the two gene categories and their relationship as a new Fig.1E panel and we also provide detailed gene lists of common genes and the exact numbers (please see also response to Review 1, major point 1).

      -APP 12 month old mice show the sever phenotype of the terminal AD-like pathology, however this does not correlate with significant SRGs and B2 processing increase. Can the author make a comment on this?

      That’s a very important point and we thank the reviewer for raising this point. We now comment on this in the discussion part explaining how our findings are characteristic of the initial active neurodegeneration phase of amyloid pathology rather than more terminal stages.

      Major point 4: The reviewer comments regarding Figure 5:

      a) a gel with no-protein control for the time course of panel B was cited in the text but missing among the panels. Moreover, the time course shown in the graph in 5C does not correspond to the one in 5B.

      Indeed, the no-protein control time line should refer only to panel C and not to B, we have now corrected the text. Nevertheless, we now present in the new Supplementary Fig. 5 the gels, based on which the graph in panel C was calculated, including also the gel with no protein timeline. The time course shown in the initial 5C had been mislabeled. It has now been corrected. We apologize for this and we thank the reviewer for bringing this to our attention.

      b) 5G indicates that four samples for each condition have been analysed by RNA-seq, since they do not seem to be homogeneous please provide a PCA analysis together with the validation by qRT-PCR of a selected group of deregulated genes.

      Old Figure 5G is new Figure 6C. PCA analysis for these samples is now provided in Supplementary Figure 9 and qPCR validation of a number of these genes is provided in new Fig. 7E.

      Moreover, it is not clear whether all the genes shown in the heatmap or a number of them, as stated in the text, were found upregulated in 6m old APP mice. Please clarify this point and modify the figure and the text accordingly. A Venn diagram showing the overlap between genes upregulated in 42vsR treatment and those upregulated in 6m old APP mice might help the comprehension of the experiment.

      Please see response to Reviewer 1, point 9. We now provide as new supplementary tables the exact overlapping lists and mention these numbers in the text.

      Major point 5: The reviewer comments regarding Figure 6 (now labeled as Fig.7):

      a) The evaluation of the levels of Hsf1 mRNA and protein upon LNA transfection is missing for both R and 42 treated HT22 cells. From TPM in panel B, Hsf1 downregulation seems to have been more effective in 42 than in R condition. This would mess up the interpretation of the data.

      We now provide qPCR data for Hsf1 gene expression levels which confirm the ones from the RNAseq. The reason why Hsf1 downregulation seems not to affect the R condition is discussed in our response to Reviewer 1, major point 12, and the respective explanation is provided in the revised text.

      b) Again, in this case any validation of the RNA seq data is provided (any B2 regulated SRGs).

      Now, we provide qPCR data for these genes in Fig.7B and new Fig.7E

      c) Panels E and F should be swapped or panel E moved to supplementary material.

      Panel E is now moved to supplementary material as new Suppl. Figure 7C.

      Major point 6. The reviewer comments: “In a previous paper the authors discovered B2 RNAs as a class of transcripts bound to EZH2 and this interaction leads to B2 RNA destabilization in heath shock (H/S) condition. The authors also conclude that the genes controlled by B2 RNAs may not overlap with the ones controlled by Hsf1 during H/S. The author should make a comment on this explaining why during H/S B2 RNAs work independently from Hsf1 and on different target SRGs while, during beta amyloid stress ,the two act together on the same SRGs. Moreover, as shown for EZH2, Hsf1-RIP experiment should be performed in order to confirm the direct involvement of Hsf1 in the SRGs-B2 destabilization.”

      In the last two paragraphs of our discussion we indicate that B2 RNA regulation is a new process implicated in the response to stress in amyloid pathology but certainly not the only one. We have revised the text in this part accordingly in the revised version to prevent any confusion. We are currently performing a series of RIP-seq experiments with various antibodies. As, to our knowledge, there is no prior published study performing RIP-seq or CLIP-seq for any tissue using Hsf1 antibodies, the success of this experiment is not guaranteed and depends on the existence of appropriate antibodies.

      Major point 7. The reviewer comments: “There is any table listing the results of the RNA seq experiments performed in this paper: control vs APP 3-6-12 m old mice and in R vs 42 treated HT22 cells in presence or absence of LNA against Hsf1. Please provide these data.”

      We now provide these lists as new supplementary tables. Please see response to major points 1 and 9 of reviewer 1.

      Major point 8. The reviewer comments: “In the discussion the authors claim that healthy cells are able to restore the expression of Hsf1, SRGs and B2 RNA upon removal of the stress. Since there are evidence for the rescue of SRGs and B2 RNA expression post H/S, no data are available for Hsf1, SRGs and B2 RNA upon the removal of 1-42 beta amyloid peptide. This might be a nice information to add to the manuscript.”

      This would indeed substantiate further our results in our HT22 cell model. We have now performed this experiment, in which HT-22 cells were removed from the amyloid 42 (and the respective R peptide control) and left to recover for 12 hours before estimating through RT-qPCR the Hsf1 levels ( see graph below, REC corresponds to recovered HT-22 cells). Hsf1 levels in 42-REC have returned to the same levels as in R, p We currently perform the RT-qPCRs of these samples also for B2-SRGs and will include them in the final version as a supplementary figure.

      **Minor criticisms:**

      -In the introduction the reference Yamakuchi M and Lowenstein CJ, (2009) MiR‐34, SIRT1 and p53: the feedback loop. Cell Cycle, should be added in the sentence: "In contrast, hippocampi of mouse models of amyloid pathology and post- mortem brains of human patients of AD.....and neural death (Zovoilis et al., 2011)."

      We have now changed the text at that point accordingly and also updated the legend of Figure 1F that also refers to this same study.

      -Authors refer to Hernandez et al., 2020 to state that B2 self cleavage is stimulated by some proteins however, Hernandez and colleagues studied only the effect of EZH2 protein. Please rephrase the sentence accordingly.

      Text has been modified accordingly.

      -Indicate a reference for the sentence: "......Ezh2, was reported as being responsible for the B2 RNA accelerated destabilization and processing during response to stress."

      The respective citation was added.

      -The format of many references is not consistent and has to be revised.

      We have switched to the Vancouver style. Some references in the legend and methods sections are referred independently from EndNote in case these text sections have to be moved to supplement in the final version in order to not create inconsistencies with endnote.

      Reviewer #3 (Significance (Required)):

      Finally, this reviewer generally remarks that “The research fields of non coding RNAs and neurodegeneration are attractive and challenging and, in my opinion, the molecular circuitry involving B2 RNAs might add important insights for understanding beta amyloid toxicity and neuronal death.

      However, this manuscript does not really add technical advances since the authors employed experimental approaches and bioinformatic analyses previously published by Zovoilis and colleagues in 2011 and 2016.”

      Our aim in the current manuscript was not to introduce a new method or experimental approach but rather to study the mechanisms behind B2 RNA regulation of gene expression in neural cells and particularly in amyloid pathology. Nevertheless, the current study constitutes the first reported short-RNA seq in this tissue and offers for the first time the ability to study B2 RNA processing in this tissue which is not possible with standard small and long RNA-seq.

      The reported findings might of interest of an audience of experts in non coding RNAs and neurodegeneration. The area of my expertise almost regards the biology of non coding RNAs from biogenesis to function manly focusing on neuronal and muscular systems both in physiological and pathological conditions.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This manuscript describes a regulatory mechanism involving Hsf1 and B2 RNAs in the control of stress response genes (SRGs) during amyloid induced toxicity. In particular Hsf1, upregulated in 6m old APP mice and in HT22 cells treated with beta amyloid peptides, is shown to stimulate the B2 RNA destabilization leading to SRGs activation. While in healthy cells this upregulation can be reverted once the stimulus is removed, the pathological condition fuels the circuitry leading to p53 upregulation and neuronal cell death. The authors previously described the same mechanism acting during cellular heath shock response but in this case the protein identified as trigger of B2 RNA destabilization and SRGs activation was EZH2 (Zovoilis et al, 2016). Indeed, the first part of the manuscript describes additional analyses of the previous data that prompts further investigation on the potential role of B2 RNA in AD condition. Nevertheless, it is not clear how the prior findings obtained in not biologically related cellular models might be used to obtain helpful indication of B2 RNA neuronal activity. The research fields of non coding RNAs and neurodegeneration are attractive and challenging and, in my opinion, the molecular circuitry involving B2 RNAs might add important insights for understanding beta amyloid toxicity and neuronal death; however, the data provided are not in the shape making the manuscript suitable for publication: some controls are missing, the way the experiments are presented is not easy to follow and more importantly the authors does not provide any data (tables or lists) of the NGS experiments and the study lacks validation of them. Therefore, in my opinion the manuscript needs a profound revision before to be considered for publication in Review Commons.

      major concerns:

      -The first paragraph of the Results is entirely dedicated to re-analyze the data previously published by the same group (Zovoilis et al., 2016). However, this is not adequately explained. In line with this, the table 1 is not required since the data are already provided by Zovoilis et al., 2016, unless the authors handled the data using additional new criteria that have to be explained. Moreover, Zovoilis and colleagues (2016) focused on SRGs regulated upon heat shock and using NIH/3T3 and HeLa cell lines, therefore, it is difficult to me understand how, searching for "cellular function connected with B2 RNA regulated SRGs", the list resulted enriched of neuronal tissue terms or cellular compartments related to neuronal functions. Please clarify this point since the following analyses are based on these findings.

      -In Figure 1F there is no arrow indicating that some of the SRGs regulate directly miR-34 as stated in the main text. Moreover, it is more appropriate to replace SRGs with learning‐associated genes both in the figure and in text (2nd paragraph of the results) since Zovoilis and colleagues focused on them. Finally, they did not show in their manuscript the rescue of p53 expression mediated by mir-34; indeed, for miR-34-p53 regulatory axis Zovoilis and colleagues referred to Peleg et al, 2010 and Yamakuchi & Lowenstein, 2009. Please fix all these concerns.

      -The Fig.1A and Fig.1F are wrongly indicated at the end of the sentence "....levels of these genes are normally downregulated in 6m and 12m old mice compared to 3m old mice (p=0.02 and p=0.04, respectively)"; please correct this point.

      -Figure 2:

      a) Since three mice for each condition have been used for the RNA seq analyses, please provide a blot with the Principal Component Analysis (PCA).

      b) Fig 2F comes first of Fig 2E in the text, however, I suggest to move this latter to supplementary material.

      c) In general, this study lacks validation of the RNA-seq results. Western blot and/or qRTR-PCR to verify the variation of p53 and of some selected SRGs have to be provided.

      d) It is also not clear how the authors defined SRGs in the hippocampus: do they correspond to learning‐associated genes described by in Zovoilis et al, 2011 or to B2 RNA H/S regulated genes by Zovoilis et al, 2016?

      -APP 12 month old mice show the sever phenotype of the terminal AD-like pathology, however this does not correlate with significant SRGs and B2 processing increase. Can the author make a comment on this?

      -Figure 5:

      a) a gel with no-protein control for the time course of panel B was cited in the text but missing among the panels. Moreover, the time course shown in the graph in 5C does not correspond to the one in 5B.

      b) 5G indicates that four samples for each condition have been analysed by RNA-seq, since they do not seem to be homogeneous please provide a PCA analysis together with the validation by qRT-PCR of a selected group of deregulated genes. Moreover, it is not clear whether all the genes shown in the heatmap or a number of them, as stated in the text, were found upregulated in 6m old APP mice. Please clarify this point and modify the figure and the text accordingly. A Venn diagram showing the overlap between genes upregulated in 42vsR treatment and those upregulated in 6m old APP mice might help the comprehension of the experiment.

      -Figure 6:

      a) The evaluation of the levels of Hsf1 mRNA and protein upon LNA transfection is missing for both R and 42 treated HT22 cells. From TPM in panel B, Hsf1 downregulation seems to have been more effective in 42 than in R condition. This would mess up the interpretation of the data.

      b) Again, in this case any validation of the RNA seq data is provided (any B2 regulated SRGs).

      c) Panels E and F should be swapped or panel E moved to supplementary material.

      -In a previous paper the authors discovered B2 RNAs as a class of transcripts bound to EZH2 and this interaction leads to B2 RNA destabilization in heath shock (H/S) condition. The authors also conclude that the genes controlled by B2 RNAs may not overlap with the ones controlled by Hsf1 during H/S. The author should make a comment on this explaining why during H/S B2 RNAs work independently from Hsf1 and on different target SRGs while, during beta amyloid stress ,the two act together on the same SRGs. Moreover, as shown for EZH2, Hsf1-RIP experiment should be performed in order to confirm the direct involvement of Hsf1 in the SRGs-B2 destabilization.

      -There is any table listing the results of the RNA seq experiments performed in this paper: control vs APP 3-6-12 m old mice and in R vs 42 treated HT22 cells in presence or absence of LNA against Hsf1. Please provide these data.

      -In the discussion the authors claim that healthy cells are able to restore the expression of Hsf1, SRGs and B2 RNA upon removal of the stress. Since there are evidence for the rescue of SRGs and B2 RNA expression post H/S, no data are available for Hsf1, SRGs and B2 RNA upon the removal of 1-42 beta amyloid peptide. This might be a nice information to add to the manuscript.

      Minor criticisms:

      -In the introduction the reference Yamakuchi M and Lowenstein CJ, (2009) MiR‐34, SIRT1 and p53: the feedback loop. Cell Cycle, should be added in the sentence: "In contrast, hippocampi of mouse models of amyloid pathology and post- mortem brains of human patients of AD.....and neural death (Zovoilis et al., 2011)."

      -Authors refer to Hernandez et al., 2020 to state that B2 self cleavage is stimulated by some proteins however, Hernandez and colleagues studied only the effect of EZH2 protein. Please rephrase the sentence accordingly.

      -Indicate a reference for the sentence: "......Ezh2, was reported as being responsible for the B2 RNA accelerated destabilization and processing during response to stress."

      -The format of many references is not consistent and has to be revised.

      Significance

      The research fields of non coding RNAs and neurodegeneration are attractive and challenging and, in my opinion, the molecular circuitry involving B2 RNAs might add important insights for understanding beta amyloid toxicity and neuronal death. However, this manuscript does not really add technical advances since the authors employed experimental approaches and bioinformatic analyses previously published by Zovoilis and colleagues in 2011 and 2016.

      The reported findings might of interest of an audience of experts in non coding RNAs and neurodegeneration.

      The area of my expertise almost regards the biology of non coding RNAs from biogenesis to function manly focusing on neuronal and muscular systems both in physiological and pathological conditions.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      This manuscript follows from previous work by the corresponding author showing that SINE-encoded B2 RNAs function as regulators of the expression of stress response genes (SRGs). Specifically, stimulus triggers the processing of repressive B2 RNAs that are bound at the SRGs, thereby activating SRG transcription. In this work, the authors investigate whether a similar mechanism might be controlling the expression of genes in models of amyloid beta neuropathology (i.e. mouse hippocampi from an amyloid precursor protein knock-in mouse model, and a cell culture model of amyloid beta toxicity). They performed RNA-seq in these models. Their data show a correlation between the progression of amyloid pathology, expression of genes thought to be regulated by B2 RNA, and the processing of B2 RNA. In addition, they show biochemical data supporting a role for Hsf1 in enhancing the processing of B2 RNA. Knockdown of Hsf1 also reduced B2 RNA processing and the expression of SRGs.

      Major comments:

      1 . In the RNA-seq data one cannot distinguish between Pol III transcribed B2 RNA and Pol II transcribed B2 RNA (typically embedded within introns and UTRs of mRNAs). The models they present, and the structures they show, clearly imply regulation by Pol III transcribed B2 RNA. However, there is no way to know that the short B2 RNAs they sequence aren't coming from degraded mRNAs. This needs to addressed. Minimally, in writing as a caveat of their model. Ideally, it would be addressed experimentally.

      2 . The direct regulation of SRGs by B2 RNA was not shown in their model systems for amyloid beta neuropathology. Rather, the authors' used the genes identified in their prior studies as B2 RNA-regulated, which I believe were in the NIH3T3 cell line. Given that transcription is highly cell-type specific, these genes might not be regulated by B2 RNA in mouse hippocampi or their cell culture model, despite the correlations shown. This needs to be addressed. Ideally, a targeted approach to show that transcription of even a couple genes in their system is indeed regulated by B2 RNA would provide stronger support for their conclusions.

      3 . The following bioinformatics analyses would strengthen their conclusions. This should be straightforward to do because it involves data they already have, and perhaps analyses they have already have performed.

      a. Regarding the plot in Figure 3A (lower panel). The same plot should be shown for the 3m old and the 12m old APP mice (i.e. not just the 6m data). This would show the specificity of processing B2 RNA and that it indeed correlates with disease progression.

      b. Regarding the plots of B2 RNA processing rate. This value could increase either due to more short RNAs or less full length RNA. Which is it for the 3m, 6m, and 12m APP mice? Showing the short and long B2 RNAs as boxplots (as opposed to only the processing rate) would address this and also provide additional insight into the regulation involved. The same applies to the data in Figure 6. (As an aside... do the authors mean processing ratio as opposed to rate? I'm not clear where the time component is coming into play to call this a rate.)

      c. The random genes in Figures 2E and 6E are plotted as heat maps, but statistical significance is hard to see. What do boxplots of the random genes look like, and is the significant difference between 6m old APP and 6m old WT then lost?

      4 . It is interesting that B2 RNA self-processing is enhanced by both Ezh2 and also Hsf1. It would strengthen the data to perform a control with a protein prepared more similarly to the Hsf1 (rather than PNK) to confirm that the enhanced B2 RNA breakdown is indeed attributable to Hsf1 and not a contaminant in the protein prep. Similarly, the authors should provide information on which RNA was added as the negative control for Hsf1-stimulated breakdown (i.e. the ~80 nt RNA).

      Minor comments:

      1 . Regarding the GO analyses in Figure 1 (panels B, C, and D). I wasn't clear whether the authors are showing all statistically enriched terms, or only those relevant to neuronal processes and learning. I recommend showing a supplemental table with all terms that have an adjusted p value below a specified cut-off (e.g. 0.05).

      2 . The authors show several figures that are not new data (2B, 4A, 4B, Suppl. Fig 1 and 2). I think it would be more clear if these data were summarized and referenced in the results, rather than shown.

      3 . In Figure 3A the schematic shows that B2 is 155 nt, the plots in Figures 3A,B,C show B2 RNA is 120 nt, and Figure 5 shows the RNA is 188 nt. Can the authors please clarify these differences?

      4 . In the Methods section, the sequence of the g block template didn't contain the T7 promoter sequence that was used as the forward primer for PCR amplification?

      5 . In Figure 6B, why were Hsf1 levels not decreased in the R treated cells after treatment with the LNA?

      Significance

      The models presented for the regulation of stress response genes (SRGs) in amyloid beta neuropathologies are compelling. As are the correlations they found between the progression of amyloid pathology, expression of genes thought to be regulated by B2 RNA, and the processing of B2 RNA. This is a unique direction of research for brain disease and represents an interesting conceptual advance. Most prior studies in this area use common model cell lines, and this lab seems well-positioned to unravel the proposed molecular mechanisms in neuronal systems.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      B2 RNAs, encoded from SINE B2 elements has been directly implicated in stress response by its inherent ability to bind RNA Pol II and suppress stress response genes (SRG) in homeostatic conditions. However, upon stimuli, B2 RNAs are cleaved and degraded, resulting in the release of RNA pol II and upregulation of SRGs. Previous work from the senior author identified PRC2 component EZH2 to be the B2 RNA processing factor, cleaving B2, and releasing POL2. SRGs are upregulated upon stress, for example in age-associated neuropathologies like Alzheimer's disease (AD). Considering that the hippocampus is a primary target of amyloid pathologies as well as since SRGs are suggested to be key for the function of a healthy hippocampus, the authors set to understand the role of B2 RNAs that are linked to SRG regulation in the mouse hippocampus with amyloid pathology. They use disease-relevant in vivo and in vitro models combined with unbiased RNA seq data analysis for this endeavor, which indicates the potential relevance of B2 RNAs in APP mediated neuronal pathologies in mice as well as identifies Hsf1 as the factor cleaving B2 RNAs in the hippocampus. The work is interesting and identification of Hsf1 as the processing factor for B2 RNAs in the hippocampus is significant. I would like to credit the authors for their elegant in vivo experimental design in Figure 2. However, I find some of the conclusions to be overstated and I would like to bring the following concerns I have to your attention:

      Major comments:

      1 . In figure 1, the authors indicate a strong connection between B2 RNA regulated SRGs and learning and memory. In figure 2, they identify the SRGs in the hippocampus, please provide a direct comparison of learning and memory associated SRGs and the SRGs they identify in figure 2 that are significantly upregulated in APP mice in 6 months.

      2 . To better understand the data in the context of hippocampal function, please include functional annotation of SRGs they identified in Figure 2F as they do it in Figure 1 (desirably for each time point, at least for 6M). How many of the SRGs they identify in Figure 1 are part of Figure 2F? Please include functional annotation of significantly upregulated B2 regulated SRGs in Fig2 and compare them with that of Figure 1.

      3 . In figure 3, the authors report that the B2 processing rates are high at the 6M time point at in hippocampi of the APP mice. Please include the levels of unprocessed and processed B2 RNAs in these samples along with this figure, without which it is difficult to gauge the significance of its correlation with SRGs in Figure 2.

      4 . What is the % of B2 regulated SRGs that are hsf1 bound in Figure 4C? What is there dynamics in the wild type and APP hippocampi?

      5 . What is the distribution of Hsf1 binding sites on (a) non-B2 regulated SRGs and (b) non-SRG genes in hippocampi?

      6 . In Figure 4D, the 3months old Wt HSF1 levels are high, yet B2 processing (Figure 3E) is low. Please comment.

      7 . While the authors show in vitro cleavage of B2 RNA by Hsf1, the experiment lacks controls to be conclusive. At least, please include a similar size protein as HSF1 with no-known RNA binding activity and a similar size protein with RNA binding activity as controls in 5A. Please justify the use of PNK as the control protein. Please include the use domain-based deletions of Hsf1 to map the region of HSF1 that is binding and potentially cleaving the B2 RNA. Please include an RNA of similar size and Antisense-B2 RNA to show the specificity of the Hsf1 based cleavage of B2 RNA. Without these controls, the conclusions in Figure 5 cannot be substantiated.

      8 . The authors should show that the incubated APP peptides are taken up by the cells (experiments in Figure 5F and Figure 6).

      9 . Please provide the list, functional annotation, and % of the SRGs upregulated upon incubation with APP in HT22 cells in comparison to 6month old APP mice. Comment on learning-related Genes.

      10 . The authors should show the efficient downregulation of Hsf1 (protein) upon anti-Hsf1 LNA transfection.

      11 . Please present the total B2 RNA levels for conditions in Figure 6C.

      12 . Hsf1 levels are not significantly downregulated in Control cells which were inoculated with the reverse APP peptide. Please comment.

      13 . Please compare and contrast the % of genes, the overlap, and the functional distinctions in 6F to that of 5G and Figure1. What are the genes that are common between Figure1, and that are specifically upregulated upon Anti-Hsf1 LNA transfection along with 1-42 APP. What is % of the occurrence of B2 binding sites in those genes? What are their functional annotations and what is their connection to learning, memory, and cell survival?

      Minor.

      1 . Please include TPM/ FPKM values for hippocampal markers as control in Figure 2 to do justice to the hippocampus specific RNA seq conducted by the Authors.

      2 . In figure 2D the authors show that B2 RNA regulated SRGs in the 3 months' wild type mice are significantly high. P53 has been reported to be high in young wild types hippocampus, but not SRGs in my opinion. The authors should comment on this.

      3 . In figure 2F, under the 6m APP condition, the replicate 3 looks substantially different from the other replicate. This can significantly impact the analysis and conclusions made. Either remove that replicate and present the analysis without it or please provide a valid explanation. To make the data more valid, please provide hierarchical clustering of the entire data, the non-B2 regulated genes and the B2 regulated SRGs. In Figure 2C RNA seq data is represented in TPM while its FPKM in Figure 2D. Figure 2: the number of replicates in the case of 3-month-old wild types only 2. Please specifically denote it and comment why only 2 replicates are provided

      4 . Considering that p53 and SRGs are significantly upregulated in 6months in the APP model, it would be great if (allowing that these samples are still available) the authors can include a staining for apoptotic markers, for example, Active Casp3 or similar. This will allow us to better gauge the gene expression changes presented by the authors especially regarding SRGs.

      5 . Under subheading: Hsf1 accelerates B2 RNA processing, 3rd paragraph when the authors comment on known hsf1 binding sites on SRG genes, please correct from: Increased Hsf1-binding was found.... "To the increased number of hsf1 binding sites were found", unless the authors would like to show increased Hsf1 binding by performing CHIP-seq for Hsf1 in the hippocampus at least at the 6-month time point between Wt and APP mice.

      Significance

      B2 RNAs, encoded from SINE B2 elements has been directly implicated in stress response by its inherent ability to bind RNA Pol II and suppress stress response genes (SRG) in homeostatic conditions. However, upon stimuli, B2 RNAs are cleaved and degraded, resulting in the release of RNA pol II and upregulation of SRGs. Previous work from the senior author identified PRC2 component EZH2 to be the B2 RNA processing factor, cleaving B2, and releasing POL2. SRGs are upregulated upon stress, for example in age-associated neuropathologies like Alzheimer's disease (AD). Considering that the hippocampus is a primary target of amyloid pathologies as well as since SRGs are suggested to be key for the function of a healthy hippocampus, the authors set to understand the role of B2 RNAs that are linked to SRG regulation in the mouse hippocampus with amyloid pathology. They use disease-relevant in vivo and in vitro models combined with unbiased RNA seq data analysis for this endeavor, which indicates the potential relevance of B2 RNAs in APP mediated neuronal pathologies in mice as well as identifies Hsf1 as the factor cleaving B2 RNAs in the hippocampus.

      The work is interesting and identification of Hsf1 as the processing factor for B2 RNAs in the hippocampus is significant. I would like to credit the authors for their elegant in vivo experimental design in Figure 2.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their useful suggestions to improve the manuscript and their support for publication. We have addressed all the comments that have been raised and carried out the suggested additional analyses, resulting in a significantly improved revised version of the manuscript. We provide hereafter a detailed point-by-point response to all questions and comments of the three reviewers.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Centriole structure has been an attractive but challenging research topic for years. Pierre Gonczy's group has been working on its structure using cryo-electron tomography (cryo-ET). While the axoneme, which has longitudinal periodicity, was analyzed by several groups by cryo-ET for more than a decade, cryo-ET study on the centriole suffers from poor signal to noise ratio due to its limited length and thus fewer periodicity. They chose the centriole of flagellate Trichonympha, which have exceptionally long centrioles and thus offer opportunity of relatively straightforward sub-tomogram averaging. Their approach has been successful, and they revealed intermediate resolution structure of the cartwheel, key of 9-fold symmetry formation, and it's joint to triplet microtubules (Guichard et al. 2012, 2013, 2018).

      In this work, they employed modern state-of-art cryo-ET technique, such as direct electron detection and 3D image classification to upgrade our knowledge of centriole structure. In their past works, the central hub of the cartwheel, made of SAS-6 protein forming 9-fold complex, was described as an 8nm periodic object. With improved spatial resolution, they provided further detail with clear polarity, which will deepen our thought about the initial stage of ciliogenesis. They also compared two Trichonympha species (spp and agilis) as well as another flagellate, Teranympha mirabilis, and extended their intriguing evolutional and mechanical hypotheses based on structural differences.

      Despite improved spatial resolution, it is still not possible to identify proteins in the cryo-ET map (cellular cryo-ET will not reach such high resolution in the near future). Therefore, this work is rather geometrically descriptive, which will inspire molecular biologists to identify molecules by other methods. Nevertheless, this work demonstrated capability of cellular cryo-ET, especially analysis of structural heterogeneity. Thus, while biological topics handled are rather specialized for cilia from flagellate, this work will attract attention of any biologist interested in molecular structure in vivo. It is worth for publication in a high Journal after addressing the points below. This reviewer believes that the authors can address these points easily with additional analysis.

      We are grateful to the reviewer for the favorable evaluation and the many valuable suggestions, in particular concerning the processing pipeline, which we addressed by additional analyses, as detailed below.

      Major points:

      1. Entire scheme A graphic diagram of the entire cartwheel area, summarizing this work, is necessary for the readers' understanding (similar to Fig.6 of the other manuscript, Klena et al.).

      We thank the reviewer for this interesting suggestion, which we fully adhere to. As a result, we have generated a graphical summary of the work, which is shown in the new Figure panels 6B-F. Moreover, Figure 6A provides an evolutionary perspective regarding the presence of the CID and of what is now referred to as the fCID (filamentous CID, previously: FLS, see response to reviewer 3). This also helps to link our findings with the companion manuscript by Klena et al. This new Figure 6 is referred to extensively in the discussion of the revised manuscript (pages 13-16).

      Then average scheme should be shown in more detail, especially assumption of periodicity, Materials and Methods. The cartwheel hub was averaged with 25nm periodicity (as discussed below). Was the pinhead averaged with 16nm (as detected by FFT in Fig.S2L)? How about the triplet?

      This reviewer is not completely sure if the longitudinal averaging strategy is justifiable. Since periodicity of each domain is not trivial, logically the initial average must be done with the size of least common multiple (or larger). It is likely 96nm, assuming 25nm of the central hub is 3 times of microtubule periodicity and 16nm of the pinhead is twice of MT. 96nm average should be possible with a long cartwheel in this work. Alternative, in case periodicity is independent of MT and thus there is no least common multiple, is random picking and classification mentioned in "4. Periodicity". This should also be possible, since they can pick enough number of particles from long cartwheels.

      We apologize that the initial version of the manuscript was not sufficiently clear regarding the averaging pipeline that was pursued. To rectify this, we now provide a new Figure S1B to graphically explain the approach followed for STA. As depicted in this figure panel, the step size for sub-volume extraction was 25 nm both centrally and peripherally. This step size was selected because it corresponds to ~3x the major periodicity of ~8.5 nm observed in the power spectra of the sub-volumes. The 25 nm step size is larger than that previously used (i.e. 17 nm in Guichard et al. 2013), in order to identify potential features with larger periodicities. The fact that the step size was of 25 nm in all cases is now mentioned explicitly in the Materials and Methods section of the revised manuscript (line 649).

      We agree with the reviewer that 96 nm averaging is possible given the long cartwheel analyzed here, and such a piece of data was in fact included in the original submission, although with a different purpose. Indeed, we carried out STA using ~(100 nm)3 sub-volumes (with binning 3 to reduce computational time), the results of which are reported in Figure S7 (previously Fig. S6). For the purpose of this analysis, we focused on the lateral organization of the cartwheel, but did not use this dataset to explore other periodicities because of the limitations inherent to a binning 3 data set.

      • Classification*

      The authors analyzed structural heterogeneity inside the cartwheel hub, employing reference-free classification by Relion software. The program reveals multiple coexisting structures - two from Trichonympha agilis and three from Teranympha, respectively. Whereas this is an exciting finding and shows future research direction of this field, interpretation of this classification must be done carefully. ** It is puzzling that major (55%) population of T. agilis shows more ambiguous features than the minor population (45%), while spatial resolutions by FSC are not so different - for example, Fig.2H vs Fig.S5C. In case of Teranympha, it is even more drastic - Fig.4D (major class) seems blurred along the centriolar axis, compared to Fig. 4E (minor class). This reviewer is afraid that these "major" classes might contain more than one structure and after subaveraging be blurred in detailed features. The apparent good spatial resolution could be explained, when two structures coexist and subtomograms are aligned within each subclass. Probably lower resolution at the spoke region of the major class (Fig.S2A) than that of the minor class (Fig.S2D) is a sign of heterogeneity within this class. Another risk could be subtomograms with poorer S/N being categorized to one class (due to lack of feature to be properly classified). Fig.S5F (black dots localized in one tomogram) raised this concern.

      The following investigation will help to solve this issue. 1. Extract and re-classify subtomograms belonging to the major population. 2. Direct observation of tomograms. The authors could plot two classes of Teranympha (as they did for T. agilis in Fig.S5) and find features of the cylindrical cartwheel hub in two conformations (as shown Fig.4DE). Since such a feature was directly observed in tomograms from the other manuscript (left panels of Fig.S6AC in Klena et al.), it should be possible in this work as well.

      We agree with the reviewer that the interpretation of the classification must be done with care, and share her/his interest in better understanding the structural variability between cartwheels classes in T. agilis and T. mirabilis. Although poor S/N may in theory result in erroneous joint classifications, we note that all maps in the original submission stemmed from extensive focused 3D classification, which removed defective and spurious sub-volumes, nevertheless defining distinct classes in the cases reported. Obviously, however, we cannot exclude that much larger data sets and future software advances may lead to the identification of additional features that would allow further sub-classes to be identified.

      Regardless, we followed the two suggestions the reviewer offered to us and have (1) extracted and re-classified sub-tomograms belonging to the major populations and (2) undertaken a direct observation of tomograms. These two points are developed in turn below.

      (1) We have performed a further round of classification of the major populations in T. agilis (55 % class) and T. mirabilis (64 % class), to assess whether additional sub-classes might be identified and thus help further improve the quality of the central cartwheel map. However, this additional round did not yield new sub-classes nor notable improvement in the map quality as judged by visual inspections. We show in Rebuttal Figure 1 a comparison in each case of the original STA and the corresponding STA upon such re-classification. Importantly, all conclusions spelled out in the original submission hold upon further re-classification, indicating that the initial classification converged to the best map quality based on the current data set and available computational resources.

      (2) We have followed the suggestion of the reviewer and now show raw tomograms to confirm that the classes correspond to bona fide structures and not to processing artefacts (new Figures S1C-F). The resulting new Figure S1D for instance shows that the striking variations observed between classes in the T. agilis STA are also visible in the raw tomogram. The more subtle variations among T. mirabilis classes are more difficult to observe in the raw tomogram, but inherent variations that reflect the presence of two classes are nevertheless observed.

      Furthermore, following the reviewer’s suggestion, we now mapped the distribution of the two T. mirabilis cartwheel classes onto tomograms, revealing that both classes can occur next to each other within the same centriole (new Figure S8E).

      • Periodicity mismatch*

      In Fig. 2CD, periodicity of CID has discrepancy from that of the stacked SAS-6 ring (8.5nm and 8.0nm). Do the authors think this is a significant difference or within an error? The same question can occur to other subtomogram averages. It would be nice to show errors as shown in their other manuscript (Fig.3C of Klena et al.) and clarify their idea. If it is systematic difference of periodicity between the stacked ring and CID, this shift will be accumulated through the entire cartwheel region - after 100nm, 8.5nm/8.0nm difference can be accumulated to ~6nm, which should change the entire view of the subtomogram - and the main factor to be classified (periodicity mismatch). This artifact (or influence) should be removed (or separately evaluated) by masking CID (out and in) and run classification separately. By clarifying this, the quality of the major subaverages (mentioned in the previous paragraph) could be improved.

      The reviewer wonders whether there might be a periodicity discrepancy within one map, for instance between CID and spokes in the T. spp. cartwheel map (Fig. 2C and Fig. 2D). Here, the periodicity determined from the STA maps is 8.5 ± 0.2 nm (SD, N=4) for the CID and 8.0 ± 1.5 nm (SD, N=2) for the spokes. Based on these standard deviations, there is indeed no significant difference between the two, and thus no periodicity discrepancy. The same applies for measurements in T. agilis and T. mirabilis. The SDs were reported already in the figure legends of the original submission, and we would prefer to leave them there if possible and not mention them in the figures, which are pretty busy as is. We apologize if this was not clear enough in the initial manuscript. Likewise, one may wonder whether there might be periodicity discrepancies between structures from distinct maps, for instance between CID and A-links from T. spp. (Fig. 2C and Fig. 3D). Again, the measurements are within error, since the distance between adjacent CIDs is 8.5 ± 0.2 nm (N=4) and between adjacent A-links 8.4 ± 0.4 nm (N=6); a similar conclusion applies for the corresponding measurement comparisons in T. agilis and T. mirabilis. The figure legends have been altered in the revised manuscript to spell out that there are no significant differences between periodicities (lines 856-858).

      Furthermore, we would like to stress that, by definition, STA value are average distances. For instance, in the case of T. spp., the central cartwheel STA was obtained from 511 sub-volumes, and thus the reported N=2 represents the average distance from 511 sub-volumes. Since this is an average, errors can therefore not accumulate over longer distances. This point has also been clarified in the figure legends (line 856-858).

      • Periodicity*

      They averaged subtomograms extracted with spacing of 252A with initial average as the first template (p.18 Line22). This means they assumed 25nm periodicity from the beginning and excluded different or larger unit size (if they take search range wide, they could detect difference periodicity, but will still be biased by initially assumed 25nm). 25nm average allowed them to see more detail than before (when they assumed 8nm periodicity), but there is still a risk of bias from references. To avoid this risk, this reviewer would propose classification of randomly extracted (but of course along the cylindrical hub or along the triplet microtubules, so one-dimensionally random picking) subtomograms. This experiment will end up with multiple sub-averages, which are 25nm (or multiple times of that) shifted from each other. Then it will prove their assumption.

      We agree with the reviewer that in theory the choice of periodicity could introduce a bias. This is why we have chosen a larger step size than in our initial work, corresponding to ~3x the major periodicity of ~8.5 nm observed in the power spectrum of the sub-volumes, as mentioned above. Regardless, following the reviewer’s suggestion, we have now explored other types of periodicities by re-analyzing the dataset through extraction of non-overlapping sub-volumes along the proximal-distal centriole axis. In doing so, we randomized the starting position of the first box between tomograms, reaching the same goal as with random picking but maximizing the number of sub-volumes. We carried out this analysis for all T. spp., T. agilis and T. mirabilis cartwheel classes, and found no notable differences that would affect the conclusions of the manuscript compared to the initial overlapping sub-volume classification, albeit generally with a noisier STA due to the lower number of sub-volumes. A comparison of the two approaches is provided in Rebuttal Figure 2. Moreover, all the points regarding the choice of periodicity have been further clarified in the expanded Materials and Methods section (pages 19-21).

      Minor points:

      They discussed difference of stacked SAS-6 rings in the cartwheel from various species. How much is the sequence difference of SAS-6 among these species?

      Unfortunately, no genomic or transcriptomic data has been published for the species investigated here, although the sparse molecular data available from small subunit rRNA sequences allows one to establish an overall molecular phylogeny. We previously identified a SAS-6 homologue in T. agilis (Guichard et al. 2013), which shares 20 % identity and 45 % similarity with C. reinhardtii SAS-6. Despite low sequence conservation, the structural conservation of SAS-6 is predicted to be high between the two organisms (Guichard et al. 2013). We apologize if these points were not expressed sufficiently clearly in the initial rendition and have adapted the wording in the revised manuscript (lines 325-332).

      Are the authors sure that CID is nine-fold symmetric? It is not trivial.

      We thank the reviewer for bringing up this interesting point. We have applied 9-fold symmetrization to the entire central cartwheel comprising spokes, hub and CID/ fCID, a choice guided by the apparent 9-fold symmetry of the spokes and peripheral element. We investigated the impact of symmetrization on the CID by relaxing symmetry from C9 to C1 during refinement, but did not observe a difference, and thus continued with C9 symmetry, which improves map resolution by S/N ratio enhancement and additional missing wedge compensation. In addition, we have also analyzed the CID without symmetrization, as reported in Figure S7 (previously: Fig. S6). Note that these maps were generated with larger sub-volumes centered on the spokes to comprise hub, spokes and microtubule triplets, explaining the resulting lower resolution, as the missing wedge is not compensated. Despite these limitations, however, the unsymmetrized CID shown in Figure S7A and S7E resembles the one in the symmetrized maps of Figure 2, indicating that the CID indeed exhibits 9-fold radial symmetry. That this is the case is spelled out explicitly in the revised manuscript (lines 1145-1147).

      Fig.1C: Another cross-section from the distal region will be helpful. A longer scale bar is better for readers' understanding.

      We understand that the reviewer is curious about the distal region, and cross-section views of resin-embedded sections from T. agilis are available and could be provided if necessary. However, given that the focus of the manuscript is strictly on the cartwheel-bearing proximal region, we felt that featuring the distal region in detail would break the narrative. Therefore, we suggest to keep Figure 1 as in the original manuscript. Following the reviewer’s suggestion, we increased the size of the scale bars from 10 nm to 20 nm in Figure 1C as well as in the corresponding Figure S8C.

      Fig.S6F: It would be informative if the subclasses (25% and 20%) are distinguished in this mapping.

      As per the reviewer’s request, we provide in Rebuttal Figure 3 a side-by-side comparison of the T. agilis 25 % and 20 % classes centered on the spokes, which are noisier than the composite 45 % class due to the lower number of sub-volumes in each sub-class. Given that there are no notable differences between the two maps that would affect any of the conclusions of the manuscript, we feel it is best to keep what is now Figure S7F (previously: Fig. S6F) unchanged in the revised manuscript.

      A figure to explain the classification scheme will help readers understand. How many subtomograms did classification started? Were the 45% class classified into two (25% and 20%) groups by two-step classification or at once (the entire subtomograms were classified into three groups directly?

      We thank the reviewer for this useful suggestion. As a result, we have generated a new Supplemental Figure S1G-J that provides a graphical overview of the classification scheme, together with sub-volume numbers for all deposited maps, thus nicely complementing Table S1.

      Reviewer #1 (Significance (Required)):

      Nevertheless, this work demonstrated capability of cellular cryo-ET, especially analysis of structural heterogeneity. Thus, while biological topics handled are rather specialized for cilia from flagellate, this work will attract attention of any biologist interested in molecular structure in vivo. It is worth for publication in a high journal after addressing the points above. This reviewer believes that the authors can address these points easily with additional analysis.

      We reiterate our thanks to this reviewer for her/his favorable evaluation and detailed suggestions, which enabled us to generate a strengthened manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Here, Nazarov and colleagues report sub-tomogram average (STA) maps of centrioles with 16 to 40 Å resolution from Trichonympha spp., Trichonympha agilis, and Teranympha mirabilis. Even though the authors have previously described the centriole architecture of T. spp, these STA maps of higher resolution revealed new features of centrioles, like polarized Cartwheel Inner Density (CID) and the pinhead. They also observed Filament-like structure (FLS) from T. mirabilis which seems to correspond to the CID from other species. Interestingly, they suggest that one and two SASS6 rings are stacked in an alternative fashion to make the central hub in T. mirabilis (Figure 5). The following issue should be addressed:

      Major points

      • Figure 4E. Authors mentioned in the manuscript that "We observed that every other double hub units in the 36% T. mirabilis class appears to exhibit a slight tilt angle relative to the vertical axis". When I see the other side, it does not seem to be tilted. Could the authors explain this?*

      We apologize that this aspect was not explained in sufficient detail. The left and right sides of the hub indeed appeared different in transverse views across the cartwheel center (previous Fig. 4E). This was because the area we selected in the original submission was centered on one emanating spoke. Due to the 9-fold symmetry one spoke density was selected on the right side, while the region between two spokes was displayed on the left side (as was illustrated by the slice across the center in previous Figure 4A; dashed rectangles in 4.0 nm panel). We have now selected a larger area to include spokes from both sides of the hub and thus better visualize this offset as shown in the modified Figure 4D-E.

      Reviewer #2 (Significance (Required)):

      I believe these results are of interest for all centrosome researchers and would like to recommend this manuscript be published in the EMBO journal which is affiliated with the Review Commons.

      We thank the reviewer for the recommendation to submit the revised manuscript to EMBO Journal, which we have followed.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this manuscript Nazrov et al., use cryo-electron tomography (CET) to analyse the structure of the centriole cartwheel. The Gonczy lab have previously generated a ground-breaking structure of the cartwheel from Trichonympha spp (T. spp.) (Guichard et al., Science, 2012; Guichard et al., Curr. Biol., 2013). This work is a direct continuation of those studies but using modern technology to get higher resolution images of the T. spp. cartwheel and comparing this to the cartwheel from Trichonympha agilis and from another distantly related flagellate Teranympha mirabilis.

      The data is generally well presented and of high quality. I am not an expert in CET, so it would be advisable to get the opinion from a reviewer who is, but the Gonczy lab are experienced in these techniques so I would not anticipate any problems. I have to admit that the title of the paper did not excite me, and I expected this to be a very worthy, but incremental study. It was a pleasure to find out that the extra detail provided by the increased resolution has revealed several new and unexpected features that have important implications for our understanding of cartwheel assembly and function. Most important are the potential asymmetry of the cartwheel hub, apparent variations in the packing mechanism of the stacked rings (even within the same cartwheel), and the potential offsetting of ring stacking. These findings will be of great interest to the field, and so I am strongly supportive of publication in The EMBO Journal. I have only a few points that I think the authors should consider.

      We thank the reviewer for this positive feedback and the recommendation to submit to EMBO Journal, which we hereby follow.

      Prompted by the comment of the reviewer, we revised the title to make it more informative and appealing to readers: “Novel features of centriole polarity and cartwheel stacking revealed by cryo-tomography”.

      • Nazarov et al., conclude that the cartwheel structure is intrinsically asymmetric. This is most convincingly based on the displacement of the CID within the hub, but they state that the Discussion that the potential offset between the Sas-6 double rings generates an inherently polar structure. I didn't understand why this is the case. Looking at Fig.S9A,B I can see that the offset in B could tilt to the left (as shown here) or to the right (if the structure was flipped by 180o). But I couldn't see how this makes this structure polar in the sense that a molecule coming into dock with the structure could only bind to one side of the offset structure shown in B, but to both sides of the aligned structure shown in A. I think this needs to be explained better, as it is crucial to understand where any potential polarity in the cartwheel structure comes from.*

      We apologize for not having been sufficiently clear about how two SAS-6 rings with an offset could impart organelle polarity. The reviewer is correct that an offset between superimposed rings alone is not sufficient to generate polarity at a larger scale. The important point we would like to stress, however, is that we discovered concerted polarity in multiple locations, from the central hub to the peripheral elements as illustrated in Fig. S7C-D, S7G-H, S7K-L and S7O-P (previously: Fig. S6). Prompted by the reviewer’s comment, we now better emphasize the asymmetric tilt angles of merging spokes, as highlighted also in the improved Figure S7. This asymmetric spoke tilt angle allows one to discriminate the proximal and distal side of a double SAS-6 ring, which is now explained better in the text (lines 259-263 & 502-510).

      • Related to this last point, in a co-submitted paper Klena et al. do not report such an asymmetry in the hub structures they have solved from several different species (neither in the tilting of the hub, or the displacement of the CID). I think it would be worth both sets of authors commenting on this point.*

      We agree that comparing and contrasting the results of the two companion manuscripts is important and we have updated the text as a consequence in several places (lines 444, 467, 507, 536, 985, 1000). We know from our previous work (Guichard et al. 2013) that the asymmetry of the hub and spoke is not visible at lower resolution. In the accompanying manuscript by Klena et al., no offset in the hub or asymmetric CID localization is reported, probably due to lower resolution and differences between species.

      • The authors data strongly suggests that the T. ag. and Te. mir. hubs are composed of a mixture of single and double Sas-6 rings. In contrast, the T. spp. cartwheel only has a single class of rings, but it wasn't absolutely clear if the authors think this comprises a single or double ring. In the text it is presented as though the elongation of the hub densities in the vertical direction is a new feature of the T. ag cartwheel (Fig.2H,I), but to me it looks as though this is also apparent in the T. spp. cartwheel (Fig.2C,D). The authors should address this directly and, if they believe that T. spp. has a double ring, they should comment on whether this more regular structure seems to have offset rings. If not, then the offset rings are unlikely to be the source of asymmetry that leads to the asymmetric displacement of the CID. Finally, if the authors think these are double rings, they should also be clear that they would now slightly re-interpret their original T. spp. cartwheel model (Figure 2, Guichard et al., Curr. Biol.). There is no embarrassment in this-a higher resolution structure has simply revealed more detail.*

      We apologize if the conclusions drawn about T. spp. cartwheel hubs were not sufficiently clearly expressed. Like the reviewer, we think that elongated hub elements are also discernible in T. spp., something that is also illustrated by the intensity plot profile in Figure 2C (double peaks on light blue line). These points are spelled out more explicitly in the revised manuscript (lines 177-179). In addition, to emphasize the conservation of the double hub units in both Trichonympha species, we have likewise adapted the text for T. agilis (lines 198-201).

      As for the offset observed within T. spp. spoke densities in Figure S10H, we interpret this as evidence for an offset of the double ring at the level of the hub, although we have not observed such offset in T. spp. for reasons that are unclear. The fact that this revises our previous interpretation based on a lower resolution map of T. spp. was already mentioned in the initial submission but is now better emphasized (lines 171-172 & 179-181).

      • The authors conclude that T. mirabilis cartwheels lack a CID and instead have a filament-like structure (FLS). I wonder whether it is more likely that the FLS is really a highly derived CID that appears to be structurally distinct when analysed in this way, but that will ultimately have a similar molecular composition. This situation might be analogous to the central tube in C. elegans, which by EM appears to be distinct from the central cartwheel seen in most other species, but is of course still composed of Sas-6. This historical tube/cartwheel nomenclature is now cumbersome to deal with, so perhaps it would be better to be cautious and not give the T. mirabilis structure a completely new name-how about "unusual CID" (uCID).*

      We share the view that the CID and the “FLS” –the term used in the initial submission- may have a related molecular composition and function, as we had also speculated in the discussion of the original submission. Following the reviewer’s suggestion, and in an effort to have a more uniform nomenclature, we propose to dub the T. mirabilis structure “filamentous CID” (fCID). This highlights better the similar location of these two entities and their potential shared function, while stressing the filamentous nature of the fCID. We further emphasize this point by providing the new Figure 6A to compare the presence of the two entities in select species. The discussion has also been adapted accordingly (pages 13-14).

      Rebuttal Figure Legends

      Rebuttal Figure 1: Re-classification of major classes

      (A-D) Transverse (top) and longitudinal (bottom) views of T. agilis (A, B) and T. mirabilis (C, D) central cartwheel 3D maps. The final major classes reported in the manuscript (A: 55 % class, C: 64 % class) were subjected to re-classification, which again yielded one major class in each case, with no notable improvement (B, D).

      Rebuttal Figure 2: Reclassification with non-overlapping sub-volumes

      (A-F) Transverse (top) and longitudinal (bottom) views of T. spp. (A, B) T. agilis (C, D) and T. mirabilis (E, F) central cartwheel 3D maps. The final maps reported in the manuscript (A, C, E) were generated with a 25 nm step size, yielding overlapping sub-volumes, whereas the maps in (B, D, F) were generated from non-overlapping sub-volumes, with no notable differences between the two that would affect the conclusions of the manuscript.

      Rebuttal Figure 3: Polar centriolar cartwheel upon sub-classification

      (A-C) 3D transverse views of non-symmetrized STA centered on the spokes to jointly show the central cartwheel and peripheral elements in the T. agilis 45 % class (A), as well as separately in the 25 % class (B) and 20% class (C). No notable differences are apparent following such re-classification, apart from the output being noisier due to the lower number of sub-volumes in each sub-class.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this manuscript Nazrov et al., use cryo-electron tomography (CET) to analyse the structure of the centriole cartwheel. The Gonczy lab have previously generated a ground-breaking structure of the cartwheel from Trichonympha spp (T. spp.) (Guichard et al., Science, 2012; Guichard et al., Curr. Biol., 2013). This work is a direct continuation of those studies but using modern technology to get higher resolution images of the T. spp. cartwheel, and comparing this to the cartwheel from Triconympha agilis and from another distantly related flagellate Tetranympha mirabilis.

      The data is generally well presented and of high quality. I am not an expert in CET, so it would be advisable to get the opinion from a reviewer who is, but the Gonczy lab are experienced in these techniques so I would not anticipate any problems. I have to admit that the title of the paper did not excite me, and I expected this to be a very worthy, but incremental study. It was a pleasure to find out that the extra detail provided by the increased resolution has revealed several new and unexpected features that have important implications for our understanding of cartwheel assembly and function. Most important are the potential asymmetry of the cartwheel hub, apparent variations in the packing mechanism of the stacked rings (even within the same cartwheel), and the potential offsetting of ring stacking. These findings will be of great interest to the field, and so I am strongly supportive of publication in The EMBO Journal. I have only a few points that I think the authors should consider.

      1. Nazarov et al., conclude that the cartwheel structure is intrinsically asymmetric. This is most convincingly based on the displacement of the CID within the hub, but they state that the Discussion that the potential offset between the Sas-6 double rings generates an inherently polar structure. I didn't understand why this is the case. Looking at Fig.S9A,B I can see that the offset in B could tilt to the left (as shown here) or to the right (if the structure was flipped by 180o). But I couldn't see how this makes this structure polar in the sense that a molecule coming into dock with the structure could only bind to one side of the offset structure shown in B, but to both sides of the aligned structure shown in A. I think this needs to be explained better, as it is crucial to understand where any potential polarity in the cartwheel structure comes from.

      2. Related to this last point, in a co-submitted paper Klena et al. do not report such an asymmetry in the hub structures they have solved from several different species (neither in the tilting of the hub, or the displacement of the CID). I think it would be worth both sets of authors commenting on this point.

      3. The authors data strongly suggests that the T. agg. and Te. mir. hubs are composed of a mixture of single and double Sas-6 rings. In contrast, the T. spp. cartwheel only has a single class of rings, but it wasn't absolutely clear if the authors think this comprises a single or double ring. In the text it is presented as though the elongation of the hub densities in the vertical direction is a new feature of the T. agg cartwheel (Fig.2H,I), but to me it looks as though this is also apparent in the T. spp. cartwheel (Fig.2C,D). The authors should address this directly and, if they believe that T. spp. has a double ring, they should comment on whether this more regular structure seems to have offset rings. If not, then the offset rings are unlikely to be the source of asymmetry that leads to the asymmetric displacement of the CID. Finally, if the authors think these are double rings, they should also be clear that they would now slightly re-interpret their original T. spp. cartwheel model (Figure 2, Guichard et al., Curr. Biol.). There is no embarrassment in this-a higher resolution structure has simply revealed more detail.

      4. The authors conclude that T. mirabilis cartwheels lack a CID and instead have a filament-like structure (FLS). I wonder whether it is more likely that the FLS is really a highly derived CID that appears to be structurally distinct when analysed in this way, but that will ultimately have a similar molecular composition. This situation might be analogous to the central tube in C. elegans, which by EM appears to be distinct from the central cartwheel seen in most other species, but is of course still composed of Sas-6. This historical tube/cartwheel nomenclature is now cumbersome to deal with, so perhaps it would be better to be cautious and not give the T. mirabilis structure a completely new name-how about "unusual CID" (uCID).

      Significance

      see above

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Here, Nazarov and colleagues report sub-tomogram average (STA) maps of centrioles with 16 to 40 Å resolution from Trichonympha spp., Trichonympha agilis, and Teranympha mirabilis. Even though the authors have previously described the centriole architecture of T. spp, these STA maps of higher resolution revealed new features of centrioles, like polarized Cartwheel Inner Density (CID) and the pinhead. They also observed Filament-like structure (FLS) from T. mirabilis which seems to correspond to the CID from other species. Interestingly, they suggest that one and two SASS6 rings are stacked in an alternative fashion to make the central hub in T. mmirabilis (Figure 5). The following issue should be addressed:

      Major points

      1. Figure 4E. Authors mentioned in the manuscript that "We observed that every other double hub units in the 36% T. mirabilis class appears to exhibit a slight tilt angle relative to the vertical axis". When I see the other side, it does not seem to be tilted. Could the authors explain this?

      Minor Points

      1. Page 11, I think Fig. 9G indicates Fig. S9G.

      Significance

      I believe these results are of interest for all centrosome researchers, and would like to recommend this manuscript be published in the EMBO journal which is affiliated with the Review Commons.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Centriole structure has been an attractive but challenging research topic for years. Pierre Gonczy's group has been working on its structure using cryo-electron tomography (cryo-ET). While the axoneme, which has longitudinal periodicity, was analyzed by several groups by cryo-ET for more than a decade, cryo-ET study on the centriole suffers from poor signal to noise ratio due to its limited length and thus fewer periodicity. They chose the centriole of flagellate Trichonympha, which have exceptionally long centrioles and thus offer opportunity of relatively straightforward subtomogram averaging. Their approach has been successful and they revealed intermediate resolution structure of the cartwheel, key of 9-fold symmetry formation, and it's joint to triplet microtubules (Guichard et al. 2012, 2013, 2018). In this work, they employed modern state-of-art cryo-ET technique, such as direct electron detection and 3D image classification to upgrade our knowledge of centriole structure. In their past works, the central hub of the cartwheel, made of SAS-6 protein forming 9-fold complex, was described as an 8nm periodic object. With improved spatial resolution, they provided further detail with clear polarity, which will deepen our thought about the initial stage of ciliogenesis. They also compared two Trichonympha species (spp and agilis) as well as another flagellate, Teranympha micabilis, and extended their intriguing evolutional and mechanical hypotheses based on structural differences. Despite improved spatial resolution, it is still not possible to identify proteins in the cryo-ET map (cellular cryo-ET will not reach such high resolution in the near future). Therefore this work is rather geometrically descriptive, which will inspire molecular biologists to identify molecules by other methods. Nevertheless this work demonstrated capability of cellular cryo-ET, especially analysis of structural heterogeneity. Thus, while biological topics handled are rather specialized for cilia from flagellate, this work will attract attention of any biologist interested in molecular structure in vivo. It is worth for publication in a high Journal after addressing the points below. This reviewer believes that the authors can address these points easily with additional analysis.

      Major points:

      1. Entire scheme A graphic diagram of the entire cartwheel area, summarizing this work, is necessary for the readers' understanding (similar to Fig.6 of the other manuscript, Klena et al.). Then average scheme should be shown in more detail, especially assumption of periodicity, Materials and Methods. The cartwheel hub was averaged with 25nm periodicity (as discussed below). Was the pinhead averaged with 16nm (as detected by FFT in Fig.S2L)? How about the triplet? This reviewer is not completely sure if the longitudinal averaging strategy is justifiable. Since periodicity of each domain is not trivial, logically the initial average must be done with the size of least common multiple (or larger). It is likely 96nm, assuming 25nm of the central hub is 3 times of microtubule periodicity and 16nm of the pinhead is twice of MT. 96nm average should be possible with a long cartwheel in this work. Alternative, in case periodicity is independent of MT and thus there is no least common multiple, is random picking and classification mentioned in "4. Periodicity". This should also be possible, since they can pick enough number of particles from long cartwheels.

      2. Classification The authors analyzed structural heterogeneity inside the cartwheel hub, employing reference-free classification by Relion software. The program reveals multiple coexisting structures - two from Trichonympha agilis and three from Teranympha, respectively. Whereas this is an exciting finding and shows future research direction of this field, interpretation of this classification must be done carefully. It is puzzling that major (55%) population of T. agilis shows more ambiguous features than the minor population (45%), while spatial resolutions by FSC are not so different - for example, Fig.2H vs Fig.S5C. In case of Teranympha, it is even more drastic - Fig.4D (major class) seems blurred along the centriolar axis, compared to Fig. 4E (minor class). This reviewer is afraid that these "major" classes might contain more than one structure and after subaveraging be blurred in detailed features. The apparent good spatial resolution could be explained, when two structures coexist and subtomograms are aligned within each subclass. Probably lower resolution at the spoke region of the major class (Fig.S2A) than that of the minor class (Fig.S2D) is a sign of heterogeneity within this class. Another risk could be subtomograms with poorer S/N being categorized to one class (due to lack of feature to be properly classified). Fig.S5F (black dots localized in one tomogram) raised this concern. The following investigation will help to solve this issue. 1. Extract and re-classify subtomograms belonging to the major population. 2. Direct observation of tomograms. The authors could plot two classes of Teranympha (as they did for T. agilis in Fig.S5) and find features of the cylindrical cartwheel hub in two conformations (as shown Fig.4DE). Since such a feature was directly observed in tomograms from the other manuscript (left panels of Fig.S6AC in Klena et al.), it should be possible in this work as well.

      3. Periodicity mismatch In Fig. 2CD, periodicity of CID has discrepancy from that of the stacked SAS-6 ring (8.5nm and 8.0nm). Do the authors think this is a significant difference or within an error? The same question can occur to other subtomogram averages. It would be nice to show errors as shown in their other manuscript (Fig.3C of Klena et al.) and clarify their idea. If it is systematic difference of periodicity between the stacked ring and CID, this shift will be accumulated through the entire cartwheel region - after 100nm, 8.5nm/8.0nm difference can be accumulated to ~6nm, which should change the entire view of the subtomogram - and the main factor to be classified (periodicity mismatch). This artifact (or influence) should be removed (or separately evaluated) by masking CID (out and in) and run classification separately. By clarifying this, the quality of the major subaverages (mentioned in the previous paragraph) could be improved.

      4. Periodicity They averaged subtomograms extracted with spacing of 252A with initial average as the first template (p.18 Line22). This means they assumed 25nm periodicity from the beginning and excluded different or larger unit size (if they take search range wide, they could detect difference periodicity, but will still be biased by initially assumed 25nm). 25nm average allowed them to see more detail than before (when they assumed 8nm periodicity), but there is still a risk of bias from references. To avoid this risk, this reviewer would propose classification of randomly extracted (but of course along the cylindrical hub or along the triplet microtubules, so one-dimensionally random picking) subtomograms. This experiment will end up with multiple subaverages, which are 25nm (or multiple times of that) shifted from each other. Then it will prove their assumption.

      Minor points: They discussed difference of stacked SAS-6 rings in the cartwheel from various species. How much is the sequence difference of SAS-6 among these species? Are the authors sure that CID is nine-fold symmetric? It is not trivial. p.7 Line21 "Fig.S1D-O": D-L p.8 Line1: It would be nice if more detailed description about MIPs, correlating to recent high resolution works from Bui and Brown labs. p.9 Line6 "Focused 3D classification...": This sentence is unclear. p.18 5 lines from bottom "S6C, S6F": How can these panels be power spectra to measure spacing? Typo? Fig.1C: Another cross-section from the distal region will be helpful. A longer scale bar is better for readers' understanding. p.29 Line6: pin -> pink Fig.S6F: It would be informative if the subclasses (25% and 20%) are distinguished in this mapping. A figure to explain the classification scheme will help readers understand. How many subtomograms did classification started? Were the 45% class classified into two (25% and 20%) groups by two-step classification or at once (the entire subtomograms were classified into three groups directly?

      Significance

      Nevertheless this work demonstrated capability of cellular cryo-ET, especially analysis of structural heterogeneity. Thus, while biological topics handled are rather specialized for cilia from flagellate, this work will attract attention of any biologist interested in molecular structure in vivo. It is worth for publication in a high journal after addressing the points above. This reviewer believes that the authors can address these points easily with additional analysis.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Note from the authors (AU): This manuscript has been reviewed by subject experts for Review Commons. The authors would like to thank the reviewers for their comments to the manuscript, and the editor for patience with our response. Our reponse was delayed due to the COVID-19 lock-down situation in our institution. Now we are pleased to provide the following point-by-point response, as detailed below.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Suomalainen et al. describes a fluorescence-based approach combined with high-resolution confocal microscopy to study the heterogeneity of adenovirus infection in a population of human cells. The main focus of the authors is the detection of viral transcripts in infected cells, how this correlates with viral genomes, the cell state, and how it varies between different cells in a single population. The paper is generally well written and easy to read, with a few typos, although I found parts of it to be somewhat length and repetitive. Particularly the results section could be pruned somewhat for readability and clarity. The major limitation of the study as it stands is it's overall impact and novelty, which limits journal selection somewhat. A very similar study was recently published, which the authors cite (Krzywkowski et al, 2017). Nevertheless, I think the study design is rigorous and well executed, but I do have some specific comments which may enhance it's overall impact and novelty.

      **Major:**

      Results "Visualization of AdV-C5..." section:

      Why not also look at normal cells that can be synchronized? Cancer cells, such as A549 will by definition be highly heterogenous and at all phases of the cell cycle. Primary non-transformed cells can easily be synchronized by contact inhibition and are much more physiologically relevant.

      AU: In the current manuscript, we concentrated on the early phases of the AdV-C5 infection, on the question how virus gene expression is initiated and whether the cell cycle phase of the host cell impacts the initiation of virus gene expression. Answering these questions requires use of cells that express good amount of virus receptors so that viruses efficiently bind to the cells and infections can be synchronized so that extended time does not elapse between virus addition and accumulation of E1A transcripts; extended time between these two steps would make interpretation of the results more complex since cells could have progressed from one cell cycle stage to another during the experiment. Furthermore, having cells at all phases of the cell cycle is actually a benefit since then the experiment can be carried out under an “unperturbed” condition; all cell cycle synchronization methods have pleiotropic effects on the cells.

      It is true that primary non-transformed cells are physiologically more relevant than cancer cells, but primary cells have issues with donor-to-donor variability and many primary cells express rather low amounts of AdV-C5 receptors, so synchronized infections in these cells are not possible. Furthermore, the extended cell morphology of many normal fibroblast cell lines and the tendency of cell extensions from neighboring cells to overlap makes fluorescent images of these cells incompatible for automated cell segmentation.

      Here, we provide data also from HDF-TERT cells (nontransformed human diploid fibroblasts immortalized by human telomerase expression) to show that two of our key findings from A549 cells are not artefacts of cancer cells. This is, that akin to A549 cells, the infected HDF-TERT cells accumulate high number of E1A transcripts (Fig.1C), and also in these cells nuclear vDNA numbers do not predict the cytoplasmic E1A transcript counts during early phases of infection (S2C Fig). However, since HDF-TERT cells are rather inefficiently infected by AdV-C5, correlation of early E1A transcript accumulation to the cell cycle phase of the host cell could not been done in these cells. We have been unable to identify primary or normal immortalized cells that would be easily available and efficiently infected by AdV-C5 (synchronized infection with short time elapsed between virus addition and accumulation of E1A transcripts).

      "The virus particles bound..." - Can the spatial resolution of a confocal microscope truly differentiate individual particles that are sub-wavelength in size? What about the sensitivity for single particles? Some sort of experiment to show that single particles can be detected should be performed and shown to assure the readers that this is in fact possible. Furthermore, even when based on the particle to pfu ratio, the MOI would still be nearly 2000pfu/cell, so the actual number of observed particles is an order of magnitude lower than what was applied to the cells.

      AU: The fluorescence signal from individual fluorophore-tagged AdV or anti-hexon antibody-decorated particle is bright enough to be picked up by PMT or HyD detectors of the current confocal laser scanning microscopes. In fact, tracking fluorophore-tagged particles of the size of AdV has been a standard microscopy procedure since late 1990’s.

      Because the Reviewers were questioning the apparently high multiplicity of infection used in the experiments, we clarify the difference between “standard” MOI estimations and our infection set-up. First of all, as described in Material and Methods, we estimated the number of physical virus particles in our virus preparations using A260 measurements (J.A. Sweeney et al., Virol. 2002, doi: 10.1006/viro.2002.1406). This method, like all other methods used to estimate virus particle numbers, is likely not 100% reliable.

      Second, we incubated the virus inoculum with cells only for 60 min, after which the unbound viruses were washed away. During this short incubation time only a small fraction of input virus particles bind to cells, and indeed as shown in Fig.1A, a theoretical MOI of 54400 physical virus particles/cell or 13600 physical virus particles/cell yielded Median of 75 and 26 bound virus particles per cell, respectively. Interpretation of the results from the cell cycle assays required that there was a relatively short time between infection and analysis so that cells in a large scale did not change their cell cycle status during the experiment. This required use of a rather high MOI. Furthermore, for collection of a large data set, it is convenient that every cell is infected.

      Third, what exactly does one pfu mean in terms of physical adenovirus particles? There is no clear answer to this, since several parameters affect the pfu. In which cells was the titration carried out? How long was the input virus inoculum incubated with the cells? How many of the virus particles entering the cell actually established an infection? And, as described in A. Yakimovich et al. (J. Virol. 2012, DOI: 10.1128/JVI.01102-12), only a fraction of infected cells produce a plaque. The majority of papers stating that x pfu/cell was used for infection, usually incubate the cells with the virus inoculum for several hours at 37°C, and never make any attempts to estimate exactly how many virus particles entered into the cells.

      Fig. 4 - I am not certain that the observed difference is significant, at least looking at it, beyond the width difference of the peaks, highest expression for both is largely in G1. It would be nice to see this using a western blot of cell cycle sorted cells, which can easily be accomplished using FACS.

      AU: In the highest GFP expression bin, CMV-eGFP expressing cells have 43% cells in G1 and 50% in S/G2/M. In comparison, E1A-GFP expressing cells have 58% cells in G1 and 35% in S/G2/M. The difference in G1 cells in the highest eGFP bin is statistically significant (p Page 15, 2nd paragraph. It would be valuable and informative to determine whether there is heterogeneity in histone association with these different vDNAs and whether these histones exhibit divergent modifications (enabling or restricting transcription). Same as above. I am rather surprised that the DBP signal did not correlate well with vDNA signal, particularly for the larger replication centers. How can this be reconciled? Was there an increase in overall vDNA signal later in infection? It is important to know this as it determines whether the observed vDNA signal is real or could be caused by viral RNA or other background causes (non-infected controls notwithstanding). Can the signal be detected with inactivated viruses (via UV for example?)

      AU: Whether histone modifications impact the transcriptional output of adenovirus genomes early in infection is indeed an intriguing question, but unfortunately this is very challenging, if not impossible, to study at single-cell / single vDNA level with the existing technology. Techniques for single-cell measurements of chromatin states are still in infancy, although some notable advancements in this field were reported in 2019 (e.g. K. Grosselin et al. Nature Genetics, DOI: https://doi.org/10.1038/s41588-019-0424-9 and S. Ai et al. Nature Cell Biology, DOI: https://doi.org/10.1038/s41556-019-0383-5).

      Furthermore, current literature offers a confused picture as to when exactly protein VII on incoming virus genomes is replaced by histones (reviewed in the reference 39, Giberson et al.). Of note, the vast majority of incoming nuclear vDNA molecules scored protein VII-positive with anti-VII staining under the experimental conditions used for the Fig. 2C data. However, we did not include these results into the manuscript because VII-positive signal on vDNAs does not exclude these vDNAs having histones on certain parts of the genome.

      The Reviewer wonders why the DBP signal in Fig.6C does not correlate with vDNA signal. There is no discrepancy here because DBP signal in the figure is a proxy for replicating vDNA whereas the click vDNA signal reports incoming vDNA. The one DBP spot without an associated click vDNA signal could be due to a replication center originated from a replicated viral genome, not from incoming viral genome. The figure shows that incoming vDNAs within the same nucleus initiate replication asynchronously.

      Page 18, 1st paragraph. It would be interesting to determine whether there was association between pol II and those genomes that showed no E1A, similarly to the histone suggestion. What about things like viral chromatin organization? Soriano et al. 2019 showed how E1A and E4orf3 work in tandem to alter viral chromatin organization by varying histone loading on the viral genome.

      AU: This again would be technically very challenging to show. We actually tried to visualize active transcription using an antibody against RNA polymerase II CTD repeat YSPTSPS (phosphor S5), azide-alexa fluor488 and anti-alexa fluor488 antibody to mark EdC-labeled incoming vDNAs and proximity ligation assay for signal amplification. However, this method was not sensitive enough to detect RNA polymerase II association with individual viral genomes. We only detected the proximity ligation signal in replication centers when replicated viral genomes were tagged with EdC.

      Fig. 2. Can you really say that a single dot correlates with a single transcript? Has that been validated in any way?

      AU: Signal amplification with branched DNA technology leads to binding of a large number of fluorescent probes to a mRNA and thus enables detection of single nucleic acid molecules. This has been validated e.g. in A.N. Player et al. 2001. J. Histochem. Cytochem (https://doi.org/10.1177/002215540104900507) and N. Battich et al. 2013. Nature Methods (https://doi.org/10.1038/nmeth.2657).

      **Minor:**

      Page 5, last paragraph. "Transcirpts from the viral late transcription unit,..." This is not correct as recently shown by Crisostomo et al, 2019.

      AU: The data in Crisostomo et al. paper suggest that some late gene expression can occur before vDNA replication, but an abundant accumulation of late transcripts coincides with onset of vDNA replication. However, the Crisostomo et al. study did not test what the levels of late gene transcripts are if the vDNA replication was inhibited. But to acknowledge the possibility that there might be some level of late gene transcription prior to replication of the viral genomes, the sentence is modified as follows: “Transcripts from the viral late transcription unit, amongst them mRNAs for the viral structural proteins, vastly increase in abundance concomitant with the onset of vDNA replication”. Furthermore, we have added the Crisostomo et al. reference here as well.

      Page 10, "... because AdvV-infected cells are less well adherent..." This is not strictly true as loss of attachment only occurs later on in infection. It would be helpful to have statistical significance indicated directly in the figures.

      AU: Although clearly visible cell rounding indeed occurs only late in infection, also during early stages of infection the HAdV-C5-infected cells are less adherent than non-infected cells. In many assays this is not obvious, but the RNA FISH staining procedure includes several incubation and washing steps in rather harsh buffers, and we observed random, sometimes considerable, cell loss with infected cultures but not with non-infected cultures.

      In the revised manuscript we have included the statistical significance P values both into the main text and the figure legends, but not to the figures directly, because the P values were generated with different statistical tests and P values should not be shown/mentioned without stating which statistical test was used. However, we noticed that we had in some cases omitted to mention what was the number of pairs analyzed in some of the Spearman’s correlation tests. This has now been corrected in the revised manuscript.

      The very high MOIs used are concerning, could these have negative effects on the cell viability or overall state?

      AU: We refer to our explanation above about the theoretical MOI and the actual MOI. Furthermore, in the experiment described in Fig.2C (correlation of E1A transcripts per cell vs. viral genomes per cell), 42% of analyzed cells had ≤ 5 viral genomes/cell and 27.5% of analyzed cells had between 6-10 viral genomes per cell; these are not high numbers. We also provide controls that the EdC-labeled genomes are detected with good efficiency. Hence the EdC-labeled genomes per cell are a good estimate of the numbers of virus particles that indeed entered into the cells.

      There are a few typos and such that should be corrected. AU: We have tried to find and correct the typos.

      Reviewer #1 (Significance (Required)):

      As I stated above, the work is interesting and significant, to a degree. The major limitation is that the novelty is low as a paper published in 2017 (cited by the authors) used a very similar approach to investigate a similar problem. In addition, there are multiple other recent papers looking at cell populations in the context of adenovirus infection, and whether a single cell or population based approach is better is unclear. This is something the authors might want to strengthen prior to submission.

      AU: In the current study, we focused on the early phase of HAdV-C5 infection, on how viral gene expression is initiated and how individual nuclear viral genomes proceed to a replicative phase. The Krzywkowski et al. 2017 J. Virol. Paper that the reviewer refers to used padlock probe-based rolling circle amplification technique to simultaneously detect HAdV-C5 genomes and viral mRNAs in individual infected cells.

      The shortcoming of this method is inferior sensitivity compared to the branched DNA technology-based method used by us in the current study. Krzywkowski et al. were able to pick up signals from virus mRNAs and virus genome only relatively late in the infection, i.e. at the time when incoming genomes were expected to have multiplied by replication. Thus the study by Krzywkowski et al. was unable to provide information for the questions addressed in our study, i.e. do the levels of E1A transcripts early in infection correlate with viral vDNA counts in the nucleus and is there variability in the transcription output from individual vDNAs within the same nucleus, or variability in how individual vDNAs within the same nucleus proceed into the replication phase. We hence do provide novel information, and do not consider this as a limitation of our paper.

      We emphasize that population assays are done to attempt to understand molecular basis of a phenomenon by correlations. Instead, deep molecular insights require to-the-point-assays, in the case of transcription, single-molecule live cell assays at the level of single genes. Technically, we (and also the field) are not quite there yet.

      Regardless, our study is a first step towards understanding transcription output of nuclear HAdV-genome at single-cell, single-genome levels. It has revealed insight that was not apparent from population assays. It is clear that the next step will be time-resolved live cell assays with simultaneous detection of transcription output, genome detection and transcription factor clustering on the genomic loci. With current technology the simultaneous detection of all these events is challenging, and requires the development of further technology.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The authors show heterogeneity of AdV-C5 mRNA transcript quantity and dynamics in different cell types, which is regulated by the cell cycle phase and does not correlate to incoming viral DNA, using single molecule RNA FISH technologies and detection of incoming viral DNA by EdC labeling.

      **Major Comments:**

      The authors change the MOI used in their experiments (7 different MOIs are used throughout the paper) in a manner that appears randomly and without explanation. (54400 for Figure 1A, 1B, 3B, S3B; 37500 for Figure 1C; 23440 for Figure 2A, 2C, S5A; 13600 for Figure 1A, 1D; 36250 for Figure 3C, S3D; 11200 for Figure 4B; 23400 for Figure 6B). The authors should provide explanation, why these changes in MOIs are necessary.

      AU: The MOIs given are theoretical MOIs, and essentially all figures indicate what was the actual MOI, that is, the real number of virus particles entering into the cells. This is beyond what is commonly provided in virology. It is essential, however, since MOI differs between different cell types. Therefore, we prefer to use the actual MOI as shown in Fig.1A, or we indicate the number of vDNAs that were delivered to the cells of interest.

      Variable MOIs had to be used to ensure that different cell lines received comparable numbers of virions, in particular virus particle binding to and entering into the cells. Infection kinetics are different in different types of cells, but can be tuned by MOIs used. Furthermore, different virus preparations were used in the experiments and we performed analyses at different stages of the infection cycle. Due to all these different facettes provided by our experiments, it was impossible to choose one standard (theoretical) MOI for all the experiments.

      The authors use mean fluorescence intensity of E1A probes per cell as estimate for viral transcript abundance for some of their experiments (Figure 1D, E, 3B), and count E1A punctae as measure for E1A transcripts in other experiments (Figure 2C, 3C, 5), without showing data, that these measures correlate. Problematic is hereby, that not all E1A punctae have the same signal intensity, as can be seen in Figure S1, which makes the estimation of the correlation of E1A punctae (= number of transcripts) and fluorescence intensity difficult. The authors should provide both (E1A punctae counts and estimation via fluorescence intensity) for at least one experiment, to prove, that the estimation of E1A transcript levels via fluorescence intensity is feasible.

      AU: The quantification method had to be adjusted to the number of virus transcripts in the cell at the time of analysis. The best quantification method is segmentation and counting the individual fluorescent puncta per cell, but, as stated in the manuscript, this method does not accurately quantify the mRNA puncta from maximum projections of confocal or widefield image stacks when the number of puncta per cell exceeds ~ 200.

      On the other hand, as shown in the quantification below, mean fluorescence intensity measurements per cell do not of course distinguish between cells having one vs. two mRNA puncta. Yet, as shown in the figure below, a relatively good correlation between puncta counting and fluorescence intensity measurements is achieved when cells have ≥ 10 transcripts per cell. Subsets of randomly picked images of the Fig.2C/Fig.5 dataset were included into the analysis (rs is Spearman’s correlation rank coefficient, approximate P p.15: "The nuclear E1A signals in AraC-treated cells were resistant to RNase A, but they were dampened by treatment with S1 nuclease (S6B Fig)." The authors make this statement based on (i) two completely different timepoints (12 h.p.i. for RNaseA treatment, 24.5 h.p.i. for S1 nuclease treatment) and (ii) in different clones of the A549 cells as stated in the methods section on p.21 (Two different clones of human lung epithelial carcinoma A549 cells were used in the study: our laboratory's old A549 clone (experiments shown in Fig. 1, Fig. 3B and S1 Fig., S3B and S3C Fig., S6A and S6B Fig., RNase A treatment) and A549 from American Type Culture Collection (ATCC, experiments shown in Fig. 2 and Fig. 5, Fig. 6, S2B Fig., S4 Fig., S5 Fig., and S6B Fig. S1 nuclease-treatment)). This makes it difficult to interpret, if the data is due to differences in the timepoints or cell types, or if it is due to binding of the E1A probe to single stranded vDNA.

      AU: This is a fair criticism, thank you. We have replaced the RNase A figure S6B in the revised manuscript. A new RNase A experiment was repeated in ATCC A549 cells using the same infections conditions as with the S1 nuclease-treated cells.

      **Minor Comments:**

      p.4: "AdV are non-enveloped, double-stranded DNA viruses that cause mild respiratory infections in immuno-competent hosts, and establish persistent infections, which can develop into life-threatening infections if the host becomes immuno-compromised [reviewed in 6]." Not all AdV cause respiratory diseases, the disease outcome of human AdV depends on the site of primary infection, which differs between the different AdV types.

      AU: We have modified the text as follows: AdV are non-enveloped, double-stranded DNA viruses that cause mild respiratory, gastrointestinal or ocular infections…

      p.7: The authors state, that "At the 17 h time point, about half of the cells had high numbers of protein VI transcripts, and most of them very high numbers of E1A transcripts.", however, the picture shown in Figure 1F shows a different phenotype, with low transcript levels of VI in E1A high cells and high transcript levels of VI in E1A low cells.

      AU: This was perhaps a bit difficult to see in the overlay images since one has to distinguish between green and yellowish green. We have provided the individual channels along the overlay picture in Fig. S1D, and now it is clear that at 17h pi cells with high numbers of VI transcripts have also high numbers of E1A transcripts.

      p.8: "This nuclear E1A signal is due to binding of the E1A probe to single-stranded vDNA in the replication centers (see below)." The authors should state here, that due to the binding of the probes to the single stranded vDNA in the replication centers, the nucleus was excluded from the analysis for Figure 1F in late timepoints.

      AU: We have modified the text according to the Reviewer’s suggestion. The text is now as follows: ‘Due to further studies (see below), we assume that this nuclear E1A signal represents binding of the E1A probe to single-stranded vDNA in the replication centers. Accordingly, the nuclear area was excluded when quantifying the viral transcripts per cell in late timepoints (Fig. 1F).’

      Due to this time point the author cannot state that the E1A staining seen (Fig. 1F; indicated with white arrows) are replication centers; this is just an assumption, since there is no evidence in Fig 1 the author cannot be sure; the author should change the text: "taking the following experiments into account...", "due to further studies (see below)..... we assume that..."

      AU: We have modified the text according to the Reviewer’s suggestion; see also the previous comment above.

      p.8: The authors should mention the figure they refer to, since there is no E1B-55K staining in Fig. 1F

      AU: The text has been modified as follows: Whereas other time points showed relatively few E1A, E1B-55K or VI puncta over the nuclear area (Fig. 1B, 1F, S1A Fig.), clustered nuclear E1A signals were apparent at 23 h.

      p.9: Which test was used to calculate the additional p-values?

      AU: As stated in the Material and Methods section or the figure legends, the p-values were calculated either by a permutation test using custom-programmed R-script (the code has been deposited on Mendeley Data along with other data associated with this manuscript), or by Kolmogorov-Smirnov test using GraphPad Prism. GraphPad Prism was also used to calculate Spearman’s correlation coefficients and the associated approximate p values. In the revised manuscript, we have added the following sentense into the Material and Methods section / Statistical analyses: Spearman’s correlation tests were done using GraphPad Prism.

      p.10: For the experiment for the correlation of viral genomes per cell and E1A transcripts in HDF-TERT cells (Figure S2C), the MOI is missing in the description of the results, as well as in the corresponding figure legends.

      AU: We have indicated the theoretical MOI (~ 4800 virus particles per cell) in the figure legend and in the Material and Methods section. The actual MOI, i.e. the actual number of virus particles entering into the cells, could not be determined due to the long (15 h) incubation time of virus inoculum with the cells, which in turn was required because these cells bind AdV-C5 rather inefficiently. However, between 1 and 32 EdC-labeled virus genomes were detected per cell nucleus at 22 h pi.

      11: calculation of correlation? rs? Why does the author combine S and G2/M phase? Fig. S3A show different values for the phases

      AU: rs is the abbreviation for Spearman’s correlation coefficient, and, as indicated in the Material and Methods, we used GraphPad Prism to calculate the Spearman’s correlation coefficients.

      Different methods to estimate cell cycle stages. DNA content method cannot separate S and G2/M with great confidence, whereas Kusabira Orange-hCdt1 and Azami-Green-hGeminin expressions in HeLa-Fucci cells allow more fine-tuned assessment of the cell cycle phases.

      p.11: "Thus, the total intensity of nuclear DAPI signal can be used to accurately assign G1 vs S/G2/M stage to cells." The authors should also here refer to other papers, which showed that this correlation is feasible, as they did in the methods section (67. Roukos V, Pegoraro G, Voss TC, Misteli T. Cell cycle staging of individual cells by fluorescence microscopy. Nature protocols. 2015;10(2):334-48. Epub 2015/01/31. doi: 10.1038/nprot.2015.016. PubMed PMID: 25633629; PubMed Central PMCID:PMCPMC6318798.), and maybe also refer to a newer paper which deals with this technique: Ferro, A., Mestre, T., Carneiro, P. et al. Blue intensity matters for cell cycle profiling in fluorescence DAPI-stained images. Lab Invest 97, 615-625 (2017). https://doi.org/10.1038/labinvest.2017.13

      AU: The integrated nuclear DAPI signal intensity is indeed a widely used method to assign cell-cycle stage to individual cells. We have added the second reference suggested by the Reviewer to the reference list for this method.

      p.11: "Furthermore, when focusing on the highest E1A expressing cells, i.e. the cells with mean cytoplasmic E1A intensities larger than 1.5 × interquartile range from the 75th percentile, 71.9% of these cells were found to be in the G1 phase of cell cycle, whereas only 55.8% of cells in the total sampled cell population were G1 cells." The authors do not provide any reference to a figure within the manuscript or the supplements, which contains these data. Are these data not shown in the manuscript?

      AU: These values are calculated from the data shown in Fig.3B. The source data supporting findings of this study (maximum projection images, excel files of the CellProfiler and Knime workflows) have now been deposited to Mendeley Data as stated in the Material and Methods / Data availability section of the revised manuscript and listed in Supplementary tables.

      p.12: punctuation mistake; . instead of , To enrich G1 cells. AdV-C-5 (moi ~ 36250) was added. Why does the author switch between signal intensities and counting E1A puncta per cell (limited to 200) in the different experiments to illustrate accumulation of E1A transcripts?

      AU: The same answer as above: the quantification method had to be adjusted to the number of virus transcripts in the cell at the time of analysis. The best quantification method is segmentation and counting the individual fluorescent puncta per cell, but, as stated in the manuscript, this method does not accurately quantify the mRNA puncta from maximum projections of confocal or widefield image stacks when the number of puncta per cell exceeds ~ 200. On the other hand, as shown in the quantification in the new S1C Fig., mean fluorescence intensity measurements per cell do not of course distinquish between cells having one vs. two mRNA puncta, but a relatively good correlation between puncta counting and fluorescence intensity measurements is achieved when cells have ≥ 10 transcripts per cell.

      p.14: "For E1A (or E1B-55K), we did not detect transcriptional bursts with bDNA-FISH probes on nuclear vDNAs, either prior to or after accumulation of viral transcripts in the cell cytoplasm." The authors do not provide any reference to a figure within the manuscript or the supplements, which contains these data. Are these data not shown in the manuscript?

      AU: This statement is based on hundreds of images we have analyzed during the course of the study. It is impossible to show all of these images, so in principle, this is “data not shown”. We have modified the text as follows: With hundreds of images analyzed, we never unambiguously detected transcriptional bursts with E1A (or E1B-55K) bDNA-FISH probes on nuclear vDNAs, either prior to or after accumulation of viral transcripts in the cell cytoplasm.

      p.14: space between number and %

      AU: Thank you for pointing this out. It has been corrected.

      p.15: "This is was also seen in AdV-C5-EdC-infected cells" should be changed to "This was also seen in AdV-C5-EdC-infected cells"

      AU: Thank you for pointing this out. It has been corrected.

      Fig. 1B:

      −figure legend does not indicate how cells were staine −also no description in the continuous text −which E1A transcripts are stained? all? 12S? 13S?

      AU: The first sentence in Results section states that “We used fluorescent in situ hybridization (FISH) with probes targeting E1A, E1B-55K and protein VI transcripts followed by branched DNA (bDNA) signal amplification to visualize the appearance and abundance of viral transcripts in AdV-C5-infected A549 lung carcinoma cells.” Furthermore, the legend to Figure 1 starts with the title “Visualization of AdV-C5 E1A, E1B-55K and protein VI transcripts in infected cells by bDNA-FISH technique”, and the legend to Fig.1B mentions that “cells were stained with probes against E1A and E1B-55K mRNAs or E1A and protein VI mRNAs”. We are of the opinion that this is enough information to understand the figures.

      The main text to Fig.1 also states that “The E1A probes covered the entire E1A primary transcript region and thus all E1A splice variants. The temporal control of E1A primary transcript splicing and E1A mRNA stability give rise predominantly to 13S and 12S E1A mRNAs at 5 h pi (references)”.

      Fig. 1D: −difference in accumulation of viral transcripts is not that visible as in IF staining (Fig. 1B; Fig. 1S);

      Fig. 1 or S1 Fig. do not show IF staining but signals from FISH.

      −graph does not show any difference between E1A and E1B-55K

      AU: The y-axes values in Fig.1D graph are arbitrary units and thus E1A and E1B-55K graphs are not directly comparable to each other. We have included into the revised manuscript S1B Fig., which shows quantification of E1A and E1B-55K fluorescent puncta per cell at the 5 h pi; the difference between E1A and E1B-55K was statistically significant.

      Fig. 1F: −figure legend does not fit with labelling of IF images and continuous text −description says 22 h, while IF labeling and text (p. 7, last lane) mentions 23 h pi

      AU: The figure annotations state the time of analyses as total time after virus addition to cells, whereas text stated the time of analyses as x h post virus removal since we wanted to stress that the input virus was incubated only for 1 h with the cells. However, Reviewers found this confusing, so we have changed the text in the revised manuscript so that time of analysis is stated as total time after virus addition to cells (as in the figure annotations). Only in the Material and Methods section we maintain the original 1 h + x h statement for the time of analysis.

      Fig. 2A: −figure legend: lane 5 Punctuation wrong: azide-Alexa Fluor488. Alexa Fluor647

      AU: Thank you for pointing this out. It has been corrected.

      Fig. 4A: −difficulties to understand −author stated that promoter-driven EGFP expression is clearly dominated by G1 cells for E1A and by S/G2/M cells for CMV, however this is not clearly visible in the graph −no severe differences visible between CMV-eGFP and E1A-eGFP −author should include numbers for quantification and statistical calculations to illustrate the differences

      AU: In the highest GFP expression bin, CMV-eGFP expressing cells have 43% cells in G1 and 50% in S/G2/M (n=2149). In comparison, E1A-GFP expressing cells have 58% cells in G1 and 35% in S/G2/M (n=2258). The difference in G1 cells in the highest eGFP bin is statistically significant (p

      Fig. 4B: −amount of E1A protein levels calculated via IF (signal intensities) −immunofluorescence is not a suitable tool for protein quantification

      AU: It is true that not all antibodies are suitable for IF (or for Western blot), and we cannot be certain that the monoclonal anti-E1A antibody used by us detects all E1A forms with different post-translational modifications with equal efficiency. However, IF is a widely accepted method to estimate protein levels in the cell, especially if the proteins like E1A accumulate in the nucleus (makes segmentation of the signal easy) and give a rather uniform nuclear staining pattern.

      Fig. 5: −in A. it is stated, that E1A bDNA -FISH is not suitable, since it is too short to be detectable. However, in B E1A bDNA-FISH is used. is there a difference? −according to the method part just one E1A mRNA was used for the assays, why is it then not possible to use that one in Fig. 5A? −explanation of the procedure and the experiment is very confusing

      AU: The Reviewer probably refers to Fig.6 here, not to Fig.5. The E1A introns are short (about 100 bases) and cannot be picked up with bDNA FISH probes. In Fig. 6B we were using the E1A bDNA-FISH probes, which were made against the AdV-C5 genome map positions 551-1630 to detect vDNA single strands of the E1A region and these single strands were long enough to be picked out by our E1A probes.

      Fig. S6B: −authors want to show that it is RNase-insensitive, but S1 nuclease-sensitive

      −two different A549 cell clones and two different time points are used for the treatments → not compareable to each other

      AU: This is a fair criticism. We have replaced the RNase A figure in S6B Fig. in the revised manuscript. The new RNase A experiment was carried out in ATCC A549 cells using the same infections conditions as with the S1 nuclease-treated cells.

      Material and Methods: −headings do not indicate which methods are explained −no clear structure AU: We have made minor changes to the headings of Material and Methods section. We have first explained in detail the bDNA-FISH method, but otherwise the order is according to the order of the figures.

      Reviewer #2 (Significance (Required)):

      highly significant manuscript very important for the virology field

      my research topics are human adenoviruses and their replication cycle

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      **Summary:** Soumalainen et al have studied adenovirus viral gene expression and replication at a single-cell level. They explore the extent of correlation between incoming genome copy number and early gene expression and progression into the late phase, revealing substantial variation between cells in the numbers of E1A transcripts (the first gene expressed upon infection) that is not explained by differences in the numbers of viral genome templates in the cells. They also explore the relevance of cell cycle stage to this variability and show a positive correlation between G1 cell cycle stage and higher levels of gene activity, which explains at least part of the variation. To form these conclusions they have applied new methods to visualise and quantify single molecules of nucleic acid in single cells. The experiments are all carefully and fully described with full detail of materials. Overall the manuscript is well written and easy to follow.

      **Major comments:**

      All of the experiments appear to be done with rigour and their results reported with due regard to statistical significance etc. My major concern though is that they have been done, perhaps out of necessity to get detectable signals, at very high multiplicities of infection. A well-accepted standard to achieve infection of all cells in a culture is an MOI of 10 infectious units per cell. Even this is acknowledged not to represent the biology of natural infection and it is striking that, where technically feasible, lower MOI studies are more revealing of how a virus actually works. Here, the authors have used counts of particles rather than infectious units to determine MOI and for Ad5, the particle/pfu ratio is typically 20-100. Their MOIs though are 13,000 - 50,000 per cell, implying an infectious MOI of at least 130 for their A549 experiments, which are known to be readily infected by Ad5 from other work.

      AU: Unlike common experiments done by others, we used a synchronized infection and removed the input virus after 1h incubation at 37°C. This type of infection initiation requires high input virus amounts, as opposed to studies in which the virus inoculum is incubated with cells for several hours/days, as is typically done in studies determining the infectious or plaque forming units in virus inoculum. Hence, the MOI used by others involved incubation of inoculum with cells over extended periods of time, and they cannot be compared to our pulsed infection conditions.

      Although the calculated theoretical MOIs (physical particles/cell) were high in our experiments, only 0.1% – 0.2% of input virus particles bound to cells during the 1h incubation period (Fig. 1 A; this estimation is based on the ratios between Median values for the number of cell-associated viruses vs input virus numbers).

      Furthermore, in the experiment described in Fig.2C (correlation of E1A transcripts per cell vs. viral genomes per cell), 42% of analyzed cells had ≤ 5 viral genomes/cell and 27.5% of analyzed cells had between 6-10 viral genomes per cell. Please note, that these are not high numbers.

      The input virus amounts used were selected this way, because we aimed at getting a broader view of how virus transcription at early phases of infection responds to a varying number of virus genomes delivered to the nucleus. Therefore, we did not limit the analyses to a situation with 1 or less than 1 virus particles/genomes per cell.

      In addition, the analyses of how cell cycle phase impacts the initiation of virus gene expression requires a relatively short time between virus inoculation and time point of analysis (i.e. a rather high MOI). Otherwise, as also pointed out by the Reviewer, the cells could have experienced more than one cell cycle phase during the duration of the experiment. Furthermore, although the initial natural infection probably starts with a very low MOI, the second round of infection is a high MOI infection due to a large number of progeny virus particles released from an infected cell.

      Surprisingly, the authors do not see intracellular vDNA copy numbers that are fully reflective of this high MOI, with median intracellular vDNA of 75 /cell at the highest MOI. The authors should consider how the population distribution of vDNA /cell does or does not fit the predicted Poisson distribution. Nonetheless, at these high copy numbers / cell, there must surely be a risk that the variation in gene expression activity arises stochastically, out of competition between genomes for essential transcription factors. Given that multiple cellular factors are each required for E1A transcription, high genome copy numbers could actually inhibit E1A expression relative to cells with more modest copy numbers because limited supplies of individual factors are recruited to different viral genome copies.

      AU: The “discrepancy” between theoretical MOI and the actual observed number of cell-associated virus particles or cell-associated virus genomes is explained above. Furthermore, we would like to point out that we have directly estimated the number of virus particles bound to cells with the input virus amounts used, something that is usually not done in other studies.

      It is indeed theoretically possible that high nuclear genome numbers could lead to inhibition of transcription due to competition for limiting essential host factors. However, if we included only cells with ≤4 vDNA molecules per nucleus into the analysis (total number of cells analyzed was 258), then Spearman’s correlation coefficient for vDNA per nucleus vs E1A mRNAs per cell was 0.186 (p=0.0027). Thus, this would not support the notion that cells with moderate nuclear vDNA copy numbers would have a better correlation between the nuclear vDNA copies vs E1A mRNA counts per cell.

      The vDNA/cell in Fig.2C does not fit predicted Poisson distribution, var/mean=9.129.

      It is important for the analysis of correlation of gene expression with cell cycle that the virus has not, at the time point analysed, already perturbed the cell cycle (a well-known effect of infection) which the authors document in Suppl Fig3B. To my eye, the G1 peak in infected cells is somewhat narrower than in the control while the S/G2 bump is a little greater. The % of cells in each of the two gates needs to be shown to support the conclusion.

      AU: In non-infected sample G1= 54.63% and S/G2/M = 45.37%, in infected cells G1= 51.4% and S/G2/M= 48.6%. We have added this information into the S3B Fig.

      Turning to the experiments documenting a correlation between E1A expression and cell cycle stage, the authors interpret their findings in terms of the stage the cells are at when the analysis was done (G1 stage cells have more E1A transcripts). The key experiment (Fig 3B) is analysed at only 4 h pi, so substantial progression from G2/M back to G1 after virus addition can probably be discounted, but the point should be discussed. The authors also use release from G1 in another cell line to support their argument that G1 supports higher levels of E1A expression (Fig 3C). Here, they elect to exclude all cells with fewer than 50 E1A transcripts from their analysis. The reason for this is completely obscure and isn't obviously justified; conceivably it could bias the outcome of the experiment. At minimum, this decision needs to be carefully explained; ideally, the full data set should be used.

      AU: Fig.3B: As suggested by the Reviewer, we have added to the main text the following explanation: “We used a high MOI infection (median 75 cell-associated virus particles, Fig. 1A) in order to achieve a rapid onset of E1A expression so that the time between virus addition and analysis was short. Thus, it is not expected that a substantial number of cells would have changed their cell cycle status during the experiment.”

      Fig.3C: We show the results also from the full data set of infected cells, i.e., cells with ≥ 1 E1A puncta in S3D Fig. We excluded the cells without zero E1A puncta because with these cells it is impossible to know whether they received no virus or whether E1A transcription had not yet started. Permutation test indicated that the difference between the starved+starved and starved+FCS is statistically significant even in this case. Because both samples are dominated by cells with low E1A counts, we log-transformed the E1A values for the box plot figure.

      The authors note the highest level of E1A activity (as opposed to RNA) was in G1/S cells and suggest that high E1A cells advance preferentially into S. Whilst in line with the literature that E1A promotes progression into S, an alternative explanation is simply that there is a time lag between RNA accumulation and protein accumulation, during which progression through the cycle would be expected.

      AU: This is a valid point, and we have modified the text as follows: “… which could reflect the advancement of high E1A expressing cells into S-phase. However, considering the time between virus addition and analysis (10.5 h), we cannot exclude the possibility that the observed G1/S preference is at least partly due to time-dependent progression of G1 cells to G1/S.”

      **Minor comments:** Fig 1 and elsewhere. Given that the 1 h incubations with virus were done at 37 C, the convention would be to include this period in the time post-infection at which harvest / fix time points are quoted. There is inconsistency between text and legend with 12 h pi being sometimes represented as 11 h after virus removal; this is an unnecessary confusion.

      AU: We have modified the text so that hours pi always include the 1h incubation with the input virus. Only in the Material and Methods section we kept the original 1h virus binding – fixing at xh post virus removal.

      Results description prior to the ref to Fig 1B: unclear what this is supposed to mean.

      AU: We have now slightly modified the first paragraph of the Results section. We mention the benefits of the bDNA signal amplification method and explain the experimental set up, i.e. that the input virus was incubated with the cells only for 1h. We also justify why we used a short incubation for the virus inoculum.

      Fig 4A: provide % of cells in each gate in each histogram.

      AU: In the highest GFP expression bin, CMV-eGFP expressing cells have 43% of cells in G1 and 50% in S/G2/M. In comparison, E1A-GFP expressing cells have 58% of cells in G1 and 35% in S/G2/M. This has been added to the figure, and it is also mentioned in the main text. Furthermore, we added to the text the results from Two Proportion Z-test to show that the proportion difference of G1 cells in the highest bin was statistically significant (p

      Fig 5: bottom right panel x axis label is wrong

      AU: Thank you for pointing out this. This has been corrected.

      In the presentation of Fig 6, it would be much clearer for the reader if the detected replication foci (ss DNA detected as E1A puncta) were referred to as something other than E1A puncta. There is too much scope for confusion with the earlier experiments in which E1A RNA was detected.

      AU: We agree. In the revised manuscript, we refer to these puncta in the text as E1A ssDNA-foci.

      Reviewer #3 (Significance (Required)):

      The study represents the application of state of the art single-molecule visualization techniques to an as yet not understood aspect of virus infection. That said, there is prior experimentation in this area, which the authors fully acknowledge and build upon. The new work is largely descriptive, in that it reveals very clearly the discrepancy between genome copy number and amounts of mRNA without seeking to explain these, beyond the cell cycle analysis. Whilst there is a better correlation between vDNA number and transcript once the data are stratified by cell cycle stage, it is still not strong (Fig 5), indicating that other substantial contributing factors remain to be described.

      The work will be of interest certainly to adenovirologists, but also to others who study virus infections - particularly nuclear-replicating DNA viruses such as herpesviruses - where similar considerations are likely to apply.

      Expertise: adenovirus; gene expression; virus-host interactions; molecular biology

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary: Soumalainen et al have studied adenovirus viral gene expression and replication at a single-cell level. They explore the extent of correlation between incoming genome copy number and early gene expression and progression into the late phase, revealing substantial variation between cells in the numbers of E1A transcripts (the first gene expressed upon infection) that is not explained by differences in the numbers of viral genome templates in the cells. They also explore the relevance of cell cycle stage to this variability and show a positive correlation between G1 cell cycle stage and higher levels of gene activity, which explains at least part of the variation. To form these conclusions they have applied new methods to visualise and quantify single molecules of nucleic acid in single cells. The experiments are all carefully and fully described with full detail of materials. Overall the manuscript is well written and easy to follow.

      Major comments:

      All of the experiments appear to be done with rigour and their results reported with due regard to statistical significance etc. My major concern though is that they have been done, perhaps out of necessity to get detectable signals, at very high multiplicities of infection. A well-accepted standard to achieve infection of all cells in a culture is an MOI of 10 infectious units per cell. Even this is acknowledged not to represent the biology of natural infection and it is striking that, where technically feasible, lower MOI studies are more revealing of how a virus actually works. Here, the authors have used counts of particles rather than infectious units to determine MOI and for Ad5, the particle/pfu ratio is typically 20-100. Their MOIs though are 13,000 - 50,000 per cell, implying an infectious MOI of at least 130 for their A549 experiments, which are known to be readily infected by Ad5 from other work.

      Surprisingly, the authors do not see intracellular vDNA copy numbers that are fully reflective of this high MOI, with median intracellular vDNA of 75 /cell at the highest MOI. The authors should consider how the population distribution of vDNA /cell does or does not fit the predicted Poisson distribution. Nonetheless, at these high copy numbers / cell, there must surely be a risk that the variation in gene expression activity arises stochastically, out of competition between genomes for essential transcription factors. Given that multiple cellular factors are each required for E1A transcription, high genome copy numbers could actually inhibit E1A expression relative to cells with more modest copy numbers because limited supplies of individual factors are recruited to different viral genome copies. It is important for the analysis of correlation of gene expression with cell cycle that the virus has not, at the time point analysed, already perturbed the cell cycle (a well-known effect of infection) which the authors document in Suppl Fig3B. To my eye, the G1 peak in infected cells is somewhat narrower than in the control while the S/G2 bump is a little greater. The % of cells in each of the two gates needs to be shown to support the conclusion.

      Turning to the experiments documenting a correlation between E1A expression and cell cycle stage, the authors interpret their findings in terms of the stage the cells are at when the analysis was done (G1 stage cells have more E1A transcripts). The key experiment (Fig 3B) is analysed at only 4 h pi, so substantial progression from G2/M back to G1 after virus addition can probably be discounted, but the point should be discussed. The authors also use release from G1 in another cell line to support their argument that G1 supports higher levels of E1A expression (Fig 3C). Here, they elect to exclude all cells with fewer than 50 E1A transcripts from their analysis. The reason for this is completely obscure and isn't obviously justified; conceivably it could bias the outcome of the experiment. At minimum, this decision needs to be carefully explained; ideally, the full data set should be used.

      The authors note the highest level of E1A activity (as opposed to RNA) was in G1/S cells and suggest that high E1A cells advance preferentially into S. Whilst in line with the literature that E1A promotes progression into S, an alternative explanation is simply that there is a time lag between RNA accumulation and protein accumulation, during which progression through the cycle would be expected.

      Minor comments:

      Fig 1 and elsewhere. Given that the 1 h incubations with virus were done at 37 C, the convention would be to include this period in the time post-infection at which harvest / fix time points are quoted. There is inconsistency between text and legend with 12 h pi being sometimes represented as 11 h after virus removal; this is an unnecessary confusion.

      Results description prior to the ref to Fig 1B: unclear what this is supposed to mean.

      Fig 4A: provide % of cells in each gate in each histogram.

      Fig 5: bottom right panel x axis label is wrong

      In the presentation of Fig 6, it would be much clearer for the reader if the detected replication foci (ss DNA detected as E1A puncta) were referred to as something other than E1A puncta. There is too much scope for confusion with the earlier experiments in which E1A RNA was detected.

      Significance

      The study represents the application of state of the art single-molecule visualization techniques to an as yet not understood aspect of virus infection. That said, there is prior experimentation in this area, which the authors fully acknowledge and build upon. The new work is largely descriptive, in that it reveals very clearly the discrepancy between genome copy number and amounts of mRNA without seeking to explain these, beyond the cell cycle analysis. Whilst there is a better correlation between vDNA number and transcript once the data are stratified by cell cycle stage, it is still not strong (Fig 5), indicating that other substantial contributing factors remain to be described.

      The work will be of interest certainly to adenovirologists, but also to others who study virus infections - particularly nuclear-replicating DNA viruses such as herpesviruses - where similar considerations are likely to apply.

      Expertise: adenovirus; gene expression; virus-host interactions; molecular biology

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors show heterogeneity of AdV-C5 mRNA transcript quantity and dynamics in different cell types, which is regulated by the cell cycle phase and does not correlate to incoming viral DNA, using single molecule RNA FISH technologies and detection of incoming viral DNA by EdC labeling.

      Major Comments:

      The authors change the MOI used in their experiments (7 different MOIs are used throughout the paper) in a manner that appears randomly and without explanation. (54400 for Figure 1A, 1B, 3B, S3B; 37500 for Figure 1C; 23440 for Figure 2A, 2C, S5A; 13600 for Figure 1A, 1D; 36250 for Figure 3C, S3D; 11200 for Figure 4B; 23400 for Figure 6B). The authors should provide explanation, why these changes in MOIs are necessary. The authors use mean fluorescence intensity of E1A probes per cell as estimate for viral transcript abundance for some of their experiments (Figure 1D, E, 3B), and count E1A punctae as measure for E1A transcripts in other experiments (Figure 2C, 3C, 5), without showing data, that these measures correlate. Problematic is hereby, that not all E1A punctae have the same signal intensity, as can be seen in Figure S1, which makes the estimation of the correlation of E1A punctae (= number of transcripts) and fluorescence intensity difficult. The authors should provide both (E1A punctae counts and estimation via fluorescence intensity) for at least one experiment, to prove, that the estimation of E1A transcript levels via fluorescence intensity is feasible. p.15: "The nuclear E1A signals in AraC-treated cells were resistant to RNase A, but they were dampened by treatment with S1 nuclease (S6B Fig)." The authors make this statement based on (i) two completely different timepoints (12 h.p.i. for RNaseA treatment, 24.5 h.p.i. for S1 nuclease treatment) and (ii) in different clones of the A549 cells as stated in the methods section on p.21 (Two different clones of human lung epithelial carcinoma A549 cells were used in the study: our laboratory's old A549 clone (experiments shown in Fig. 1, Fig. 3B and S1 Fig., S3B and S3C Fig., S6A and S6B Fig., RNase A treatment) and A549 from American Type Culture Collection (ATCC, experiments shown in Fig. 2 and Fig. 5, Fig. 6, S2B Fig., S4 Fig., S5 Fig., and S6B Fig. S1 nuclease-treatment)). This makes it difficult to interpret, if the data is due to differences in the timepoints or cell types, or if it is due to binding of the E1A probe to single stranded vDNA.

      Minor Comments:

      p.4: "AdV are non-enveloped, double-stranded DNA viruses that cause mild respiratory infections in immuno-competent hosts, and establish persistent infections, which can develop into life-threatening infections if the host becomes immuno-compromised [reviewed in 6]." Not all AdV cause respiratory diseases, the disease outcome of human AdV depends on the site of primary infection, which differs between the different AdV types.

      p.7: The authors state, that "At the 17 h time point, about half of the cells had high numbers of protein VI transcripts, and most of them very high numbers of E1A transcripts.", however, the picture shown in Figure 1F shows a different phenotype, with low transcript levels of VI in E1A high cells and high transcript levels of VI in E1A low cells.

      p.8: "This nuclear E1A signal is due to binding of the E1A probe to single-stranded vDNA in the replication centers (see below)." The authors should state here, that due to the binding of the probes to the single stranded vDNA in the replication centers, the nucleus was excluded from the analysis for Figure 1F in late timepoints. Due to this time point the author cannot state that the E1A staining seen (Fig. 1F; indicated with white arrows) are replication centers; this is just an assumption, since there is no evidence in Fig 1 the author cannot be sure; the author should change the text: "taking the following experiments into account...", "due to further studies (see below)..... we assume that..." p.8: The authors should mention the figure they refer to, since there is no E1B-55K staining in Fig. 1F

      p.9: Which test was used to calculate the additional p-values?

      p.10: For the experiment for the correlation of viral genomes per cell and E1A transcripts in HDF-TERT cells (Figure S2C), the MOI is missing in the description of the results, as well as in the corresponding figure legends.

      p. 11: calculation of correlation? rs? Why does the author combine S and G2/M phase? Fig. S3A show different values for the phases

      p.11: "Thus, the total intensity of nuclear DAPI signal can be used to accurately assign G1 vs S/G2/M stage to cells." The authors should also here refer to other papers, which showed that this correlation is feasible, as they did in the methods section (67. Roukos V, Pegoraro G, Voss TC, Misteli T. Cell cycle staging of individual cells by fluorescence microscopy. Nature protocols. 2015;10(2):334-48. Epub 2015/01/31. doi: 10.1038/nprot.2015.016. PubMed PMID: 25633629; PubMed Central PMCID:PMCPMC6318798.), and maybe also refer to a newer paper which deals with this technique: Ferro, A., Mestre, T., Carneiro, P. et al. Blue intensity matters for cell cycle profiling in fluorescence DAPI-stained images. Lab Invest 97, 615-625 (2017). https://doi.org/10.1038/labinvest.2017.13

      p.11: "Furthermore, when focusing on the highest E1A expressing cells, i.e. the cells with mean cytoplasmic E1A intensities larger than 1.5 × interquartile range from the 75th percentile, 71.9% of these cells were found to be in the G1 phase of cell cycle, whereas only 55.8% of cells in the total sampled cell population were G1 cells." The authors do not provide any reference to a figure within the manuscript or the supplements, which contains these data. Are these data not shown in the manuscript?

      p.12: punctuation mistake; . instead of , To enrich G1 cells. AdV-C-5 (moi ~ 36250) was added. Why does the author switch between signal intensities and counting E1A puncta per cell (limited to 200) in the different experiments to illustrate accumulation of E1A transcripts?

      p.14: "For E1A (or E1B-55K), we did not detect transcriptional bursts with bDNA-FISH probes on nuclear vDNAs, either prior to or after accumulation of viral transcripts in the cell cytoplasm." The authors do not provide any reference to a figure within the manuscript or the supplements, which contains these data. Are these data not shown in the manuscript?

      p.14: space between number and %

      p.15: "This is was also seen in AdV-C5-EdC-infected cells" should be changed to "This was also seen in AdV-C5-EdC-infected cells"

      Fig. 1B:

      −figure legend does not indicate how cells were staine

      −also no description in the continuous text

      −which E1A transcripts are stained? all? 12S? 13S?

      Fig. 1D:

      −difference in accumulation of viral transcripts is not that visible as in IF staining (Fig. 1B; Fig. 1S);

      −graph does not show any difference between E1A and E1B-55K

      Fig. 1F:

      −figure legend does not fit with labelling of IF images and continuous text

      −description says 22 h, while IF labeling and text (p. 7, last lane) mentions 23 h pi

      Fig. 2A:

      −figure legend: lane 5 Punctuation wrong: azide-Alexa Fluor488. Alexa Fluor647

      Fig. 4A:

      −difficulties to understand

      −author stated that promoter-driven EGFP expression is clearly dominated by G1 cells for E1A and by S/G2/M cells for CMV, however this is not clearly visible in the graph

      −no severe differences visible between CMV-eGFP and E1A-eGFP

      −author should include numbers for quantification and statistical calculations to illustrate the differences

      Fig. 4B:

      −amount of E1A protein levels calculated via IF (signal intensities)

      −immunofluorescence is not a suitable tool for protein quantification

      Fig. 5:

      −in A. it is stated, that E1A bDNA -FISH is not suitable, since it is too short to be detectable. However, in B E1A bDNA-FISH is used. is there a difference?

      −according to the method part just one E1A mRNA was used for the assays, why is it then not possible to use that one in Fig. 5A?

      −explanation of the procedure and the experiment is very confusing

      Fig. S6B:

      −authors want to show that it is RNase-insensitive, but S1 nuclease-sensitive

      −two different A549 cell clones and two different time points are used for the treatments → not compareable to each other

      Material and Methods:

      −headings do not indicate which methods are explained

      −no clear structure

      Significance

      highly significant manuscript very important for the virology field

      my research topics are human adenoviruses and their replication cycle

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Suomalainen et al. describes a fluorescence-based approach combined with high-resolution confocal microscopy to study the heterogeneity of adenovirus infection in a population of human cells. The main focus of the authors is the detection of viral transcripts in infected cells, how this correlates with viral genomes, the cell state, and how it varies between different cells in a single population. The paper is generally well written and easy to read, with a few typos, although I found parts of it to be somewhat length and repetitive. Particularly the results section could be pruned somewhat for readability and clarity. The major limitation of the study as it stands is it's overall impact and novelty, which limits journal selection somewhat. A very similar study was recently published, which the authors cite (Krzywkowski et al, 2017). Nevertheless, I think the study design is rigorous and well executed, but I do have some specific comments which may enhance it's overall impact and novelty.

      Major: Results "Visualization of AdV-C5..." section:

      Why not also look at normal cells that can be synchronized? Cancer cells, such as A549 will by definition be highly heterogenous and at all phases of the cell cycle. Primary non-transformed cells can easily be synchronized by contact inhibition and are much more physiologically relevant. "The virus particles bound..." - Can the spatial resolution of a confocal microscope truly differentiate individual particles that are sub-wavelength in size? What about the sensitivity for single particles? Some sort of experiment to show that single particles can be detected should be performed and shown to assure the readers that this is in fact possible. Furthermore, even when based on the particle to pfu ratio, the MOI would still be nearly 2000pfu/cell, so the actual number of observed particles is an order of magnitude lower than what was applied to the cells.

      Fig. 4 - I am not certain that the observed difference is significant, at least looking at it, beyond the width difference of the peaks, highest expression for both is largely in G1. It would be nice to see this using a western blot of cell cycle sorted cells, which can easily be accomplished using FACS. Page 15, 2nd paragraph. It would be valuable and informative to determine whether there is heterogeneity in histone association with these different vDNAs and whether these histones exhibit divergent modifications (enabling or restricting transcription). Same as above. I am rather surprised that the DBP signal did not correlate well with vDNA signal, particularly for the larger replication centers. How can this be reconciled? Was there an increase in overall vDNA signal later in infection? It is important to know this as it determines whether the observed vDNA signal is real or could be caused by viral RNA or other background causes (non-infected controls notwithstanding). Can the signal be detected with inactivated viruses (via UV for example?)

      Page 18, 1st paragraph. It would be interesting to determine whether there was association between pol II and those genomes that showed no E1A, similarly to the histone suggestion. What about things like viral chromatin organization? Soriano et al. 2019 showed how E1A and E4orf3 work in tandem to alter viral chromatin organization by varying histone loading on the viral genome. Fig. 2. Can you really say that a single dot correlates with a single transcript? Has that been validated in any way?

      Minor:

      Page 5, last paragraph. "Transcirpts from the viral late transcription unit,..." This is not correct as recently shown by Crisostomo et al, 2019.

      Page 10, "... because AdvV-infected cells are less well adherent..." This is not strictly true as loss of attachment only occurs later on in infection. It would be helpful to have statistical significance indicated directly in the figures.

      The very high MOIs used are concerning, could these have negative effects on the cell viability or overall state?

      There are a few typos and such that should be corrected.

      Significance

      As I stated above, the work is interesting and significant, to a degree. The major limitation is that the novelty is low as a paper published in 2017 (cited by the authors) used a very similar approach to investigate a similar problem. In addition, there are multiple other recent papers looking at cell populations in the context of adenovirus infection, and whether a single cell or population based approach is better is unclear. This is something the authors might want to strengthen prior to submission.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      First of all, we thank all reviewers for their constructive suggestions and comments.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This group has been at the forefront recently of using imaging technologies to understand how chromosome segregation is coordinated in mammalian oocytes, and why errors occur. In the current paper they examine the dynamics of microtubule organising centres (which effectively replace centrioles/centrosomes in oocytes) in MI. The imaging of oocytes in this paper is beautiful. The major findings are (1) that MTOCs that are supposed to be at the spindle pole sometimes end up at the spindle equator, and this is documented very beautifully and (2) the correct positioning of MTOCs at the spindle pole appears to require kinetochore microtubules, as indicated by experiments manipulating the kinetochore component NDC80.

      We appreciate the reviewer’s comment and clear description of our study.

      **Major Comments**

      As such the major claims of the paper are basically well supported. However, the analyses are is almost entirely restricted to prometaphase/metaphase, and the conclusions are relatively limited. The salient omission is any analysis of MTOC/chromosome relationship during anaphase. Were the paper to be extended to determine whether the lingering of MTOCs at the spindle equator is related to chromosome segregation error, that would increase the reach and importance of the work substantially. Specifically:

      Can tracking experiments be performed to determine whether the chromosome that shows movement similarities to the errant MTOC is more/less likely to missegregate? Complete tracking as these authors are expert at should achieve this, or photo-labelling the desired chromosome.

      Thank you for your comment. In our experimental system, oocytes rarely exhibit chromosome segregation errors (

      Can the position of MTOCs (proportion that linger at the equator) be manipulated in the absence of other defects to determine whether this increases errors (lagging at anaphase, metaphase-II chromosome counting spreads)?

      We agree with the reviewer that a specific manipulation of MTOC positions is exactly what we would need to investigate the significance of central MTOCs. Unfortunately, there are currently no tools available to specifically manipulate MTOC positions without other defects. Therefore, the significance of central MTOCs is currently unclear. In the revised manuscript, we will state these points in Discussion.

      The above analysis would have to be well supported by controls showing that these constructs are having no impact on normal anaphase (proportion of oocytes completing meiosis-I, likelihood of lagging chromosomes etc).

      Thank you for the comment. As we answered above, control oocytes rarely exhibit chromosome segregation errors or lagging chromosomes (

      Related to the above, though I appreciate a fixed metaphase image of MTOC immunofluorescence is presented, the paper is about the dynamics of MTOCs and thus nonetheless relies heavily on the live imaging of cep192. The core results should be confirmed using another (substantially different) MTOC probe. *This final comment applies to the current metaphase data, regardless of whether the study is ultimately extended*

      Thank you for the suggestion. We will confirm the dynamics of MTOCs at metaphase with mEGFP-Cdk5Rap2, another established marker of MTOCs.

      Reviewer #1 (Significance (Required)):

      As explained above, as presented this paper is largely scientifically sound, but far more limited in scope than this groups other recent papers. As explained above, the paper would be made more impactful and the readership broadened if a relationship between MTOC position/movement and segregation problems were established. Or on the other hand if it were established why some MTOCs sometimes linger at the spindle equator. Whilst to my knowledge this is the first time that equator MTOCs have been documented so carefully, oocyte cell biologists may not find the core observation that MTOCs are occasionally at the spindle equator extremely surprising.

      Thank you for your helpful suggestions. Due to lack of tools to specifically manipulate MTOC positions, we are unfortunately not able to directly address whether MTOC position/movement contributes to chromosome segregation problems. On the other hand, we are currently investigating to answer your important question ‘why some MTOCs sometimes linger at the spindle equator’. We speculate that MTOCs become central due to unstable kinetochore-microtubule attachments, which are predominantly observed at early metaphase in normal oocytes. To test this idea, we are currently investigating whether the appearance of central MTOCs are prevented by forced stabilization of kinetochore-microtubule attachments with Ndc80-9A. Our pilot analysis thus far supports this idea. In light of your suggestions, we will incorporate the results into the revised manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      I am commenting on the work of Courtois et al. as an expert in the biochemistry of spindle formation with a focus on acentriolar assembly.

      First and foremost, this a technically excellent study with a number of very interesting and well-documented observations, which are highly relevant for our understanding of the mechanisms of acentriolar spindle formation in the mouse oocyte model. In principle, the manuscript is in a very mature state. However, my major concern at this point would be that there is a break in the story. It starts describing the (very interesting) observation of "central MTOCs". After thoroughly investigating how these behave, the authors stop and look at overall MTOCs distribution after loss of stable MT-kinetochore interactions based on oocytes expressing the Ndc80_9D mutant instead of wt Ndc80. The two parts are experimentally and conceptually not well connected.

      We appreciate your comments on our techniques and novel observations in this study, and thank you for your helpful suggestions.

      Answering the following questions may help to further develop the paper:

      If I understand the arguments correctly, central MTOCs are an "accident" on the way to complete meiosis I spindle formation, which will eventually be corrected and all MTOCs clustered at the poles. Thus, they may serve as an assay for spindle assembly fidelity and kinetics (?). At this point, the reader is left with the observation without efforts to explain the meaning of this observation, ideally experimentally, or at least in a valid discussion.

      Thank you for your thoughtful comment. We agree that we should clearly explain our view on central MTOCs. We indeed speculate that central MTOCs are an “accident” due to unstable kinetochore-microtubule attachments, which are normally pronounced at early metaphase.

      We will revise the manuscript as follows: (1) Following the section for the observation of central MTOCs, we will state our hypothesis that central MTOCs may appear due to unstable kinetochore–microtubule attachments. (2) We will introduce our experiment of the manipulation of kinetochore–microtubule attachment stability as a test for our hypothesis. (3) We will present new results of our analysis for the effects of kinetochore–microtubule attachment stability on the appearance of central MTOCs (please see below).

      Enthusiasm for the technically excellent experiments using the Ndc80 variants are somewhat reduced as conclusions from these experiments are published in the parallel paper of the same laboratory (Yoshida et al.). Due to my opinion, it may thus be even more important to connect these observations with the first part described central MTOCs and to clarify their significance.

      Thank you for the important suggestion.

      First, we agree that we should connect our observations of central MTOCs to the phenotypes of Ndc80 manipulations. To do this, we will reanalyze our dataset to quantify the effects of Ndc80 manipulations on central MTOCs. Our pilot analysis thus far suggests that the forced stabilization of kinetochore–microtubule attachments by Ndc80-9A reduces the appearance of central MTOCs. This would support our idea that central MTOCs appear due to unstable kinetochore–microtubule attachments.

      Second, we agree with the reviewer that experimental clarification of the significance of central MTOCs would be nice. However, as outlined above, we unfortunately have no tool to directly address the significance of MTOC positioning in the fidelity of spindle assembly and chromosome segregation. Although we assume that MTOC positioning is critical for spindle assembly fidelity, as generally thought based on previous studies (Breuer et al., 2010; Clift and Schuh, 2015; Schuh and Ellenberg, 2007), the significance of MTOC positioning in spindle assembly remains uncertain, as you (and also the reviewer 1) point out. We will discuss these points in the revised manuscript.

      Shown if in Fig. 3B but not fully explained: How does the distribution of what is defined as central MTOCs behave in Ndc80_wt and Ndc80_9A mutant oocytes? Do the variants differ, i.e. are there fewer, or less persistent central MTOCs in the 9A mutant? Would they differ in kinetics of appearance and "rescue" to the poles?

      Thank you for the question. As outlined above, we will reanalyze our dataset to quantify the effects of Ndc80-9A on the behavior of central MTOCs. Our pilot analysis suggests that the forced stabilization of kinetochore–microtubule attachments suppresses the appearance of central MTOCs.

      Similarly: is there a correlation of central MTOC appearance, Ndc80 phosphorylation/stability of kinetochore attachment and Anaphase I onset? The authors mention that oocytes expressing the 9A mutant go faster into Anaphase.

      Thank you for this comment. First, we will investigate whether the levels of Ndc80 phosphorylation at kinetochores has any correlations to the distance to central MTOCs. Second, we will address whether microtubules connect kinetochores to central MTOCs. Third, we will perform the tracking of chromosomes that showed correlated motions to closely positioned MTOCs until anaphase onset.

      The observation that "central MTOCs exhibited correlated motions with closely positioned kinetochores" is poorly defined, yet an important observation. Does this mean some sort of short k-fibers remain to connect central MTOCs and kinetochores? Wouldn't one expect that the loss of stable end-on-attachment causes MTOCs to become central? How does this fit into a/the model?

      We believe these concerns will be addressed by the experiments/analyses proposed above. First, we will check if central MTOCs are connected to kinetochores by microtubules. Second, we indeed speculate that loss of stable kinetochore-microtubule attachment allows MTOCs to become central. We will test this idea by quantifying the appearance of central MTOCs in Ndc80-9A-expressing oocytes.

      Along the same lines: The authors hype their conclusion that kinetochores dominate meiosis I spindle formation based on the observation that loss of kinetochore functions results in less well-organized spindle poles and worse MTOC "confinement". This may mean that kinetochores, together with MTOCs, maintain stable k-fibers in meiosis, as shown here and in Yoshida et al. When one or the other end of k-fibers is destabilized (loss of end-on-attachment, loss of MTOC attachment), the fibers collapse and the remaining minus-or-plus-end associated structure loses its destination. We then see central MTOCs and/or kinetochores at poles. In this respect, the interpretation / discussion should be less "kinetochore-centered".

      We agree with your thoughtful comment that the regulations of minus-ends (e.g. MTOCs) and of plus-ends (e.g. kinetochores) are equally relevant for spindle bipolarization. We will tone down our kinetochore-centered view in the Abstract and Discussion and revise them into more balanced statements.

      Is there any way to determine the efficiency of Ndc80 knockdown in the gene replacement respective experiment? I share the view of the authors that their method may be more efficient and may explain apparent discrepancies to previous studies on Ndc80-9A (Guy and Homer, 2013) with more dramatic effects on spindle geometry. However, at that point, this remains speculative. For instance, one may also speculate vice versa that the ko strategy used here is less efficient in a maternally dominated system and leaves behind more wt Ndc80, which better compensates defects seen in the 9A mutant.

      Our gene deletion strategy (Zp3-Cre Ndc80f/f) resulted in >90% depletion of the Ndc80 protein (estimated by Western blot; Supplementary Figure 1c in Yoshida et al, Nat Commun 2020). On the other hand, Gui and Homer report that their morpholino-based depletion strategy resulted in 60–70% depletion of the Ndc80 protein (estimated by Western blot; Figure 1B in Gui and Homer, Dev Cell 2013). Thus, the depletion was more efficient in our experimental system. We will add this information in the manuscript.

      Reviewer #2 (Significance (Required)):

      Courtois et al present data on mechanisms governing spindle assembly in mouse oocytes. Mouse oocytes serve as model system for spindle formation in the absence of centriole-based MTOCs. At the onset of meiosis I, numerous MTOCs form, which shape a mass ("ball") of MT nucleated around chromatin into a bipolar structure. Accumulating evidence indicates that kinetochores play an important role in acentriolar spindle formation in mouse oocytes, yet the mechanisms behind kinetochore action remains unclear.

      Here, Courtois et al. analyze spindle formation in live mouse oocytes using 3D-time-lapse imaging. They use fluorescently tagged Cep192 to track MTOCs and Histone H2B or CENP-C to visualize chromatin or kinetochores. In the first part, the authors deal with the appearance of "central MTOCs", i.e. aggregates of centrosomal protein(s) that, apparently, fail to remain stably integrated into the spindle pole clusters on MTOCs during spindle formation. The authors convincingly demonstrate that these central MTOCs can be seen in the majority of spindles investigated. They demonstrate that central MTOCs generally come from positions at poles from where they "fall back" towards chromosomes. Central MTOCs may even cross the spindle and end up at opposite poles from where they originated from. Interestingly, central MTOCs are often found next to kinetochores.

      In the second part, the authors focus on the role of kinetochores and their stable MT attachment for spindle formation in general and bipolarity/pole organization in particular. The same lab has published data on the role of kinetochores in meiosis I spindle very recently (Yoshida et al. Nat Comm, 2020). Here, they successfully exploit Ndc80 phospho-mutants to compare MTOC distribution in oocytes with reduced or increased end-on-attachment. The data show that stable end-on attachment determines stable MTOC clustering at spindle poles and governs the maintenance of bipolarity and spindle length.

      Thank you for your clear description of our study.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In order to assemble a bipolar structure, acentrosomal spindles relay on multiple non-centrosomal pathways. Mouse oocytes specifically build bipolar spindles by sorting and clustering of microtubule organizing centers (MTOCs). While microtubule cross-linkers, spindle motors and microtubule nucleators are involved; the role of kinetochores and kinetochore-microtubule attachments in meiotic spindle assembly and maintenance has not been thoroughly tested. Using an impressive combination of live cell imaging and semi-automated image analysis, Courtois et al. quantified MTOC behavior in bipolar mouse oocyte spindles and found an ongoing MTOC sorting in metaphase and instances of MTOC-kinetochore associations. The authors further employed an elegant genetic system to replace NDC80 in maturing oocytes with a mutant almost completely unable to form stable microtubule-kinetochore attachments. The data show lack of MTOC confinement at the spindle poles and increased spindle elongation while maintaining spindle bipolarity. The authors concluded that stable kinetochore-microtubule attachments are required to confine MTOCs at the poles, which in turn sets an optimal spindle length. Overall, the data are of very high quality and clearly presented, the manuscript is easy to follow, and the methods are comprehensively described. One concern is the lack of mechanistic link between the natural metaphase MTOC sorting (Fig. 1-2) and massive MTOC rearrangements observed with the NDC80-9D mutant (Fig. 3). A second concern is that deficient MTOC confinements and spindle elongation observed with the 9D mutant could be due unaligned chromosomes rather than lack of stable kinetochore-microtubule attachments, which is the authors' interpretation.

      **Major Points:**

      1) Massive MTOC rearrangements (Supplementary Video 6) are reminiscent of spindle assembly defects or spindle collapse. Since these spindles do not reach a normal metaphase and seem to change shape (Supplementary Video 6; 11:10), it is difficult to differentiate between spindle assembly and spindle maintenance defects. Is there a difference in the timing of bipolar spindle assembly for NDC80-9D vs WT? If so, one interpretation is that stable attachments not only ensure MTOC confinement but also contribute to bipolar spindle assembly.

      We apologize for the lack of explanation for the spindle dynamics seen in Supplementary Video 6, 11:10. At this time point, the spindle rotated in 3D, which appeared as if the spindle collapsed in the z-projection movie. We will add this explanation into the legend.

      Our quantitative analysis of spindle shape in 3D indicated no increased collapse in Ndc80-9D, based on the signals of the spindle marker EGFP-Map4. Moreover, we observed no detectable difference in the timing of the onset of bipolar spindle assembly, as long as we define it with EGFP-Map4 signals. These results are shown in Figure 4B.

      2) Fig. 1-2 vs Fig. 3 - It is not clear how the discrete MTOC sorting phenotype presented in Fig. 1-2 relates to the massive MTOC collapse shown in Fig. 3. The natural MTOC sorting and MTOC-kinetochore associations seem to be happening within the bipolar structure confined by the polar MTOCs. The MTOC rearrangements (e.g., Supplementary Video 6) are much more drastic, reminiscent of a spindle collapse. To make a mechanistic link between the phenotypes, it would be useful to use an intermediate NCD80 mutant (ex. NDC80-4D; Zaytsev et al., 2014 JCB) that may support chromosome alignment and maintenance of the canonical bipolar spindle structure, but still show effects on MTOC sorting.

      Thank you for your nice suggestion. We will test Ndc80-4D. The construct is ready.

      3) Fig. 4 - The authors should provide evidence that unstable kinetochore-microtubule attachments, rather than chromosome-derived signals of misaligned chromosomes (e.g., from Ran or Aurora B), limit spindle elongation. For example, the authors could measure spindle elongation in oocytes with misaligned chromosomes but stable attachments: for example, NDC80-9A oocytes released from an Eg5 inhibition block should carry a number of polar chromosomes with stable attachments. The expectation would be that such spindles form with confined MTOCs and do not elongate as much as NDC80-9D expressing oocytes.

      Thank you for this important suggestion. Following your suggestion, we have conducted a pilot experiment using monastrol washout. However, unfortunately, we did not observe increased chromosome misalignment in Ndc80-9A. We will play around experimental conditions.

      Moreover, we propose to perform an additional experiment. We will use cohesin depletion with Rec8 TRIM-Away, which will produce chromosome misalignment and reduce kinetochore-microtubule attachment stability. We expect that these oocytes exhibit excessive spindle elongation. Then, we ask if Ndc80-9A, which would force to stabilize kinetochore-microtubule attachment (but fail to align chromosomes due to loss of chromosome cohesion), can suppress excessive spindle elongation.

      These experiments will allow us to address direct contribution of kinetochore-microtubule attachment to proper spindle elongation. However, in our opinion, regardless of the results, we cannot exclude the possibility that chromosome alignment contributes to proper spindle elongation, which is indeed an intriguing hypothesis. We will discuss these possibilities in Discussion.

      4) Figure 5D - The authors' model suggests that MTOCs are confined due to their connection to stably attached k-fibers. It would be useful to speculate on the molecular mechanism behind the confinement. Does a maximal k-fiber length restrict the elongation, or is there a pulling force exerted by the kinetochores?

      Thank you for your thoughtful suggestion. As the reviewer suggests, we speculate that the length of k-fibers is critical for restricting MTOC position and spindle elongation. K-fibers may prevent excessive spindle elongation by anchoring MTOCs at their minus ends. Alternatively, k-fibers may act as a platform that inactivates spindle bipolarizers. We will discuss these possibilities in our revised manuscript.

      5) Discussion - Lines 203-204 - "The findings of this study, together with recent studies, suggest a model for how kinetochore-microtubule attachments contribute to acentrosomal spindle assembly (Figure 5D)". - Throughout the paper the authors underscore that biopolar spindles do assembly with the NDC80-9D mutant. The authors should clarify whether spindle assembly is affected by the NDC80-9D mutant or not?

      Thank you for your comment. We agree with the reviewer that we should clearly state our conclusion based on the phenotype of the Ndc80-9D mutant. Our conclusion is that stable kinetochore-microtubule attachment fine-tunes bipolar spindle assembly. If oocytes lack stable attachments, they can form a bipolar-shaped spindle composed of microtubule arrays that are largely bipolar, but the spindle becomes too much elongated and lacks MTOCs at its poles. We will explicitly state these ideas in our revised manuscript.

      **Minor Points:**

      1) Introduction - Lines 38-44 - The authors should cite the role of the Augmin complex in acentrosomal spindle assembly (Watanabe et al., 2016 Cell Reports).

      Thank you for your excellent suggestion. We will cite this relevant paper.

      2) Results - Lines 55-56 - "However, the precise manipulation of the stability of kinetochore-microtubule attachments has not been tested" - Gui et Homer 2013 studied the outcome of NDC80 depletion and tested the NDC80-9A mutant in the context of oocyte spindle assembly. Although, as the authors point out in the Discussion section, there might be differences in the experimental design that lead to different conclusions, it is not entirely accurate that precise manipulations of attachments stability have not been tested. A different wording (e.g., "has not been comprehensively tested") may be better.

      Thank you for your suggestion. We agree that “has not been comprehensively tested” fits better.

      3) Results - Lines 162-164 - "Ndc80-9D-expressing oocytes had no significant delay in the onset of spindle elongation, but had significantly faster kinetics of elongation compared to Ndc80-WT- and Ndc80-9D-expressing oocytes" - The authors probably meant "... Ndc80-9A expressing oocytes."

      Thank you for pointing out this mistake. We will correct it.

      4) Discussion - Lines 239-242 - "... microtubule nucleation in later stages may not be determined by MTOCs but are largely attributed to nucleation within the spindle, as observed by microtubule plus-end tracking in bipolar-shaped spindles (Supplementary Figure 4)." - Strictly speaking, EB3 comets indicate microtubule polymerization rather than nucleation. Microtubule nucleation within the spindle is, however, supported by studies of the Augmin complex (e.g., Watanabe et al., 2016 Cell Rep).

      Thank you for your comment. We will correct our wording for EB3 comets and discuss that microtubule nucleation within the spindle is shown in Watanabe et al., 2016 Cell Rep.

      5) Discussion - Lines 257-260 - "The lagging MTOCs can be positioned close to kinetochores on bi-oriented chromosomes, underscoring the importance of active error corrections of kinetochore-microtubule attachments during metaphase (Lane and Jones, 2014; Yoshida et al., 2015)." - The reasoning here is not clear. Does the number/persistence of lagging MTOCs correlate with chromosome mis-alignment or with the efficiency/timing of chromosome alignment in WT cells?

      We apologize that our discussion was not clear. Previous studies (Lane and Jones, 2014; Yoshida et al., 2015) show that kinetochore-microtubule attachment errors are found on aligned chromosomes during metaphase and must be corrected until anaphase onset in oocytes. We speculate that lagging (or central) MTOCs may be a source of such kinetochore-microtubule attachment errors, although we cannot directly test this hypothesis due to lack of tools to specifically manipulate MTOC positions. We will discuss these points in Discussion.

      To check if central MTOCs are correlated with chromosome misalignment, we will perform the tracking of chromosomes that were closely positioned to lagging MTOCs.

      6) Discussion - Line 266 - "Yoshida et al., 2020" - This article is cited elsewhere in the text as "Yoshida et al., in press".

      Thank you for pointing out these mistakes. We will correct them.

      Reviewer #3 (Significance (Required)):

      Courtois et al., have found a new mechanism contributing to acentrosomal spindle assembly in mouse oocytes. Although kinetochore-dependent spindle assembly occurs in mitotic cells (e.g., Toso et al., 2009 JCB), only the recent work from the Kitajima lab (Yoshida et al., 2020 Nat Comm; this manuscript) showed that kinetochores also impact acentrosomal spindle assembly in meiosis. The genetic model presented here brings a significant technical advance in dissecting relative contributions of spindle assembly pathways in mouse oocytes (ex. Schuh and Ellenberg 2007 Cell; Watanabe et al., 2016 Cell Rep; Drutovic et al., 2020 EMBO J) and complements current methods used to study meiotic error-correction (e.g., Chmatal et al., 2015 Curr Biol, Yoshida et al., 2015 Dev Cell; Vallot et al., 2018 Curr Biol and many others). This model expands an existing toolbox of techniques allowing complete elimination of the endogenous protein specifically in mature mouse oocytes (Clift et al., 2017 Cell; Clift et al., 2018 Nat Protocols), which is a difficult feat due to a limited capacity of ex-vivo culture (Pfender et al., 2015 Nature). Therefore, the work presented in this manuscript may encourage other researchers to establish similar systems for oocyte-specific manipulations, which will allow more precise insight into oocyte biology.

      Expertise keywords: spindle dynamics, chromosome segregation, mitosis, meiosis

      We appreciate your comments. Additional experiments following on your constructive comments will further improve our manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In order to assemble a bipolar structure, acentrosomal spindles relay on multiple non-centrosomal pathways. Mouse oocytes specifically build bipolar spindles by sorting and clustering of microtubule organizing centers (MTOCs). While microtubule cross-linkers, spindle motors and microtubule nucleators are involved; the role of kinetochores and kinetochore-microtubule attachments in meiotic spindle assembly and maintenance has not been thoroughly tested. Using an impressive combination of live cell imaging and semi-automated image analysis, Courtois et al. quantified MTOC behavior in bipolar mouse oocyte spindles and found an ongoing MTOC sorting in metaphase and instances of MTOC-kinetochore associations. The authors further employed an elegant genetic system to replace NDC80 in maturing oocytes with a mutant almost completely unable to form stable microtubule-kinetochore attachments. The data show lack of MTOC confinement at the spindle poles and increased spindle elongation while maintaining spindle bipolarity. The authors concluded that stable kinetochore-microtubule attachments are required to confine MTOCs at the poles, which in turn sets an optimal spindle length. Overall, the data are of very high quality and clearly presented, the manuscript is easy to follow, and the methods are comprehensively described. One concern is the lack of mechanistic link between the natural metaphase MTOC sorting (Fig. 1-2) and massive MTOC rearrangements observed with the NDC80-9D mutant (Fig. 3). A second concern is that deficient MTOC confinements and spindle elongation observed with the 9D mutant could be due unaligned chromosomes rather than lack of stable kinetochore-microtubule attachments, which is the authors' interpretation.

      Major Points:

      1) Massive MTOC rearrangements (Supplementary Video 6) are reminiscent of spindle assembly defects or spindle collapse. Since these spindles do not reach a normal metaphase and seem to change shape (Supplementary Video 6; 11:10), it is difficult to differentiate between spindle assembly and spindle maintenance defects. Is there a difference in the timing of bipolar spindle assembly for NDC80-9D vs WT? If so, one interpretation is that stable attachments not only ensure MTOC confinement but also contribute to bipolar spindle assembly.

      2) Fig. 1-2 vs Fig. 3 - It is not clear how the discrete MTOC sorting phenotype presented in Fig. 1-2 relates to the massive MTOC collapse shown in Fig. 3. The natural MTOC sorting and MTOC-kinetochore associations seem to be happening within the bipolar structure confined by the polar MTOCs. The MTOC rearrangements (e.g., Supplementary Video 6) are much more drastic, reminiscent of a spindle collapse. To make a mechanistic link between the phenotypes, it would be useful to use an intermediate NCD80 mutant (ex. NDC80-4D; Zaytsev et al., 2014 JCB) that may support chromosome alignment and maintenance of the canonical bipolar spindle structure, but still show effects on MTOC sorting.

      3) Fig. 4 - The authors should provide evidence that unstable kinetochore-microtubule attachments, rather than chromosome-derived signals of misaligned chromosomes (e.g., from Ran or Aurora B), limit spindle elongation. For example, the authors could measure spindle elongation in oocytes with misaligned chromosomes but stable attachments: for example, NDC80-9A oocytes released from an Eg5 inhibition block should carry a number of polar chromosomes with stable attachments. The expectation would be that such spindles form with confined MTOCs and do not elongate as much as NDC80-9D expressing oocytes.

      4) Figure 5D - The authors' model suggests that MTOCs are confined due to their connection to stably attached k-fibers. It would be useful to speculate on the molecular mechanism behind the confinement. Does a maximal k-fiber length restrict the elongation, or is there a pulling force exerted by the kinetochores?

      5) Discussion - Lines 203-204 - "The findings of this study, together with recent studies, suggest a model for how kinetochore-microtubule attachments contribute to acentrosomal spindle assembly (Figure 5D)". - Throughout the paper the authors underscore that biopolar spindles do assembly with the NDC80-9D mutant. The authors should clarify whether spindle assembly is affected by the NDC80-9D mutant or not?

      Minor Points:

      1) Introduction - Lines 38-44 - The authors should cite the role of the Augmin complex in acentrosomal spindle assembly (Watanabe et al., 2016 Cell Reports).

      2) Results - Lines 55-56 - "However, the precise manipulation of the stability of kinetochore-microtubule attachments has not been tested" - Gui et Homer 2013 studied the outcome of NDC80 depletion and tested the NDC80-9A mutant in the context of oocyte spindle assembly. Although, as the authors point out in the Discussion section, there might be differences in the experimental design that lead to different conclusions, it is not entirely accurate that precise manipulations of attachments stability have not been tested. A different wording (e.g., "has not been comprehensively tested") may be better.

      3) Results - Lines 162-164 - "Ndc80-9D-expressing oocytes had no significant delay in the onset of spindle elongation, but had significantly faster kinetics of elongation compared to Ndc80-WT- and Ndc80-9D-expressing oocytes" - The authors probably meant "... Ndc80-9A expressing oocytes."

      4) Discussion - Lines 239-242 - "... microtubule nucleation in later stages may not be determined by MTOCs but are largely attributed to nucleation within the spindle, as observed by microtubule plus-end tracking in bipolar-shaped spindles (Supplementary Figure 4)." - Strictly speaking, EB3 comets indicate microtubule polymerization rather than nucleation. Microtubule nucleation within the spindle is, however, supported by studies of the Augmin complex (e.g., Watanabe et al., 2016 Cell Rep).

      5) Discussion - Lines 257-260 - "The lagging MTOCs can be positioned close to kinetochores on bi-oriented chromosomes, underscoring the importance of active error corrections of kinetochore-microtubule attachments during metaphase (Lane and Jones, 2014; Yoshida et al., 2015)." - The reasoning here is not clear. Does the number/persistence of lagging MTOCs correlate with chromosome mis-alignment or with the efficiency/timing of chromosome alignment in WT cells?

      6) Discussion - Line 266 - "Yoshida et al., 2020" - This article is cited elsewhere in the text as "Yoshida et al., in press".

      Significance

      Courtois et al., have found a new mechanism contributing to acentrosomal spindle assembly in mouse oocytes. Although kinetochore-dependent spindle assembly occurs in mitotic cells (e.g., Toso et al., 2009 JCB), only the recent work from the Kitajima lab (Yoshida et al., 2020 Nat Comm; this manuscript) showed that kinetochores also impact acentrosomal spindle assembly in meiosis. The genetic model presented here brings a significant technical advance in dissecting relative contributions of spindle assembly pathways in mouse oocytes (ex. Schuh and Ellenberg 2007 Cell; Watanabe et al., 2016 Cell Rep; Drutovic et al., 2020 EMBO J) and complements current methods used to study meiotic error-correction (e.g., Chmatal et al., 2015 Curr Biol, Yoshida et al., 2015 Dev Cell; Vallot et al., 2018 Curr Biol and many others). This model expands an existing toolbox of techniques allowing complete elimination of the endogenous protein specifically in mature mouse oocytes (Clift et al., 2017 Cell; Clift et al., 2018 Nat Protocols), which is a difficult feat due to a limited capacity of ex-vivo culture (Pfender et al., 2015 Nature). Therefore, the work presented in this manuscript may encourage other researchers to establish similar systems for oocyte-specific manipulations, which will allow more precise insight into oocyte biology.

      Expertise keywords: spindle dynamics, chromosome segregation, mitosis, meiosis

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      I am commenting on the work of Courtois et al. as an expert in the biochemistry of spindle formation with a focus on acentriolar assembly.

      First and foremost, this a technically excellent study with a number of very interesting and well-documented observations, which are highly relevant for our understanding of the mechanisms of acentriolar spindle formation in the mouse oocyte model. In principle, the manuscript is in a very mature state. However, my major concern at this point would be that there is a break in the story. It starts describing the (very interesting) observation of "central MTOCs". After thoroughly investigating how these behave, the authors stop and look at overall MTOCs distribution after loss of stable MT-kinetochore interactions based on oocytes expressing the Ndc80_9D mutant instead of wt Ndc80. The two parts are experimentally and conceptually not well connected.

      Answering the following questions may help to further develop the paper:

      1. If I understand the arguments correctly, central MTOCs are an "accident" on the way to complete meiosis I spindle formation, which will eventually be corrected and all MTOCs clustered at the poles. Thus, they may serve as an assay for spindle assembly fidelity and kinetics (?). At this point, the reader is left with the observation without efforts to explain the meaning of this observation, ideally experimentally, or at least in a valid discussion.
      2. Enthusiasm for the technically excellent experiments using the Ndc80 variants are somewhat reduced as conclusions from these experiments are published in the parallel paper of the same laboratory (Yoshida et al.). Due to my opinion, it may thus be even more important to connect these observations with the first part described central MTOCs and to clarify their significance.
      3. Shown if in Fig. 3B but not fully explained: How does the distribution of what is defined as central MTOCs behave in Ndc80_wt and Ndc80_9A mutant oocytes? Do the variants differ, i.e. are there fewer, or less persistent central MTOCs in the 9A mutant? Would they differ in kinetics of appearance and "rescue" to the poles?
      4. Similarly: is there a correlation of central MTOC appearance, Ndc80 phosphorylation/stability of kinetochore attachment and Anaphase I onset? The authors mention that oocytes expressing the 9A mutant go faster into Anaphase.
      5. The observation that "central MTOCs exhibited correlated motions with closely positioned kinetochores" is poorly defined, yet an important observation. Does this mean some sort of short k-fibers remain to connect central MTOCs and kinetochores? Wouldn't one expect that the loss of stable end-on-attachment causes MTOCs to become central? How does this fit into a/the model?
      6. Along the same lines: The authors hype their conclusion that kinetochores dominate meiosis I spindle formation based on the observation that loss of kinetochore functions results in less well-organized spindle poles and worse MTOC "confinement". This may mean that kinetochores, together with MTOCs, maintain stable k-fibers in meiosis, as shown here and in Yoshida et al. When one or the other end of k-fibers is destabilized (loss of end-on-attachment, loss of MTOC attachment), the fibers collapse and the remaining minus-or-plus-end associated structure loses its destination. We then see central MTOCs and/or kinetochores at poles. In this respect, the interpretation / discussion should be less "kinetochore-centered".
      7. Is there any way to determine the efficiency of Ndc80 knockdown in the gene replacement respective experiment? I share the view of the authors that their method may be more efficient and may explain apparent discrepancies to previous studies on Ndc80-9A (Guy and Homer, 2013) with more dramatic effects on spindle geometry. However, at that point, this remains speculative. For instance, one may also speculate vice versa that the ko strategy used here is less efficient in a maternally dominated system and leaves behind more wt Ndc80, which better compensates defects seen in the 9A mutant.

      Significance

      Courtois et al present data on mechanisms governing spindle assembly in mouse oocytes. Mouse oocytes serve as model system for spindle formation in the absence of centriole-based MTOCs. At the onset of meiosis I, numerous MTOCs form, which shape a mass ("ball") of MT nucleated around chromatin into a bipolar structure. Accumulating evidence indicates that kinetochores play an important role in acentriolar spindle formation in mouse oocytes, yet the mechanisms behind kinetochore action remains unclear.

      Here, Courtois et al. analyze spindle formation in live mouse oocytes using 3D-time-lapse imaging. They use fluorescently tagged Cep192 to track MTOCs and Histone H2B or CENP-C to visualize chromatin or kinetochores. In the first part, the authors deal with the appearance of "central MTOCs", i.e. aggregates of centrosomal protein(s) that, apparently, fail to remain stably integrated into the spindle pole clusters on MTOCs during spindle formation. The authors convincingly demonstrate that these central MTOCs can be seen in the majority of spindles investigated. They demonstrate that central MTOCs generally come from positions at poles from where they "fall back" towards chromosomes. Central MTOCs may even cross the spindle and end up at opposite poles from where they originated from. Interestingly, central MTOCs are often found next to kinetochores.

      In the second part, the authors focus on the role of kinetochores and their stable MT attachment for spindle formation in general and bipolarity/pole organization in particular. The same lab has published data on the role of kinetochores in meiosis I spindle very recently (Yoshida et al. Nat Comm, 2020). Here, they successfully exploit Ndc80 phospho-mutants to compare MTOC distribution in oocytes with reduced or increased end-on-attachment. The data show that stable end-on attachment determines stable MTOC clustering at spindle poles and governs the maintenance of bipolarity and spindle length.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This group has been at the forefront recently of using imaging technologies to understand how chromosome segregation is coordinated in mammalian oocytes, and why errors occur. In the current paper they examine the dynamics of microtubule organising centres (which effectively replace centrioles/centrosomes in oocytes) in MI. The imaging of oocytes in this paper is beautiful. The major findings are (1) that MTOCs that are supposed to be at the spindle pole sometimes end up at the spindle equator, and this is documented very beautifully and (2) the correct positioning of MTOCs at the spindle pole appears to require kinetochore microtubules, as indicated by experiments manipulating the kinetochore component NDC80.

      Major Comments

      As such the major claims of the paper are basically well supported. However, the analyses are is almost entirely restricted to prometaphase/metaphase, and the conclusions are relatively limited. The salient omission is any analysis of MTOC/chromosome relationship during anaphase. Were the paper to be extended to determine whether the lingering of MTOCs at the spindle equator is related to chromosome segregation error, that would increase the reach and importance of the work substantially. Specifically:

      1. Can tracking experiments be performed to determine whether the chromosome that shows movement similarities to the errant MTOC is more/less likely to missegregate? Complete tracking as these authors are expert at should achieve this, or photo-labelling the desired chromosome.
      2. Can the position of MTOCs (proportion that linger at the equator) be manipulated in the absence of other defects to determine whether this increases errors (lagging at anaphase, metaphase-II chromosome counting spreads)?
      3. The above analysis would have to be well supported by controls showing that these constructs are having no impact on normal anaphase (proportion of oocytes completing meiosis-I, likelihood of lagging chromosomes etc).
      4. Related to the above, though I appreciate a fixed metaphase image of MTOC immunofluorescence is presented, the paper is about the dynamics of MTOCs and thus nonetheless relies heavily on the live imaging of cep192. The core results should be confirmed using another (substantially different) MTOC probe. This final comment applies to the current metaphase data, regardless of whether the study is ultimately extended

      Significance

      As explained above, as presented this paper is largely scientifically sound, but far more limited in scope than this groups other recent papers. As explained above, the paper would be made more impactful and the readership broadened if a relationship between MTOC position/movement and segregation problems were established. Or on the other hand if it were established why some MTOCs sometimes linger at the spindle equator. Whilst to my knowledge this is the first time that equator MTOCs have been documented so carefully, oocyte cell biologists may not find the core observation that MTOCs are occasionally at the spindle equator extremely surprising.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We would like to thank Reviewer #1 and #2 for the evaluation of our research and comments to our manuscript. Their comments are highly appreciated and addressed as described below.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      *Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).*

      Here Ha et al. has further developed their Pumilio RNA tagging methodology for the isolation of UV-crosslinked proteins that are suggested to associate with Xist RNA in mouse embryonic stem cells (mESCs). Within this study the authors claim to have found the Lupus antigen RNA binding protein (La) as a novel Xist interacting partner that influences the efficacy of X-chromosome inactivation (XCI). The authors use a number of different techniques such as qPCR, fluorescent imaging, ATAC-SEQ and SHAPE to show aberration of XCI upon La shRNA knockdown. However, this study has significant flaws in the efficient isolation and validation of Xist associated proteins using their FLAG-out methodology. Furthermore, later experiments predominantly focus on cell death/survival assays, which is somewhat troubling given the essential roles La plays in processes such as cell differentiation and proliferation, ribosome biogenesis, transcriptional control and tRNA maturation. I feel the authors need to robustly address the potential effects La knockdown may be having on their mESCs.

      Reviewer #1 did not fully understand the basic designs of the experimental systems (FLAG-out and iXist), and completely rejected these experimental systems. Reviewer #1 also ignored the majority of the functional analysis on the candidate protein, Ssb. These issues cannot be addressed by additional experiments

      **Major comments:**

      *-Are the key conclusions convincing?*

      My major concern is in their Xist RNA purification.

      First of all, I couldn't find any data on proving the enrichment of Xist RNA itself in their Pumilio pull-down experiment. It would have been useful to show Xist RNA enrichment before benzonase step. Secondly, it is hard to imagine the protocol would successfully isolated Xist RNA-protein complexes from the cell. An earlier report by Clemson et al., (J Cell Biol., 1996) has shown that majority of Xist RNA is still stuck in the nucleus after nuclear matrix prep protocol using detergent, which is not so different from the authors' protocol. Moreover, the authors used UV crosslink, which would have made even harder to purify Xist RNA without sonication. Thirdly, as the tag is located on 5' of Xist RNA, it is rather surprising to see that Spen is not detected in their pulldown. Spen is one of the main functional interactors with Xist, robustly detected by several previous reports. Similarly, other high-affinity binders of Xist such as hnRNP-K and Ciz1 were also lacking from this screen. Finally, the peptides found associated with FLAG-out Xist are extremely low in comparison with other data using glutaraldehyde or formaldehyde crosslinking. For example, HnRNP-M found in Chu et al 2015 has 1120 peptide counts in differentiated cells. The authors here use HnRNP-M as a baseline for specific interactions and show a total of 6 peptide counts in Xist expressing cells and 5 in i-Empty cells (Supplementary excel sheet 1). Similarly, the La protein of interest in this study has 8 counts in i-FLAG-Xist and 6 counts in i-Empty. I struggle to see how this result indicate specific Xist binding. Worryingly this is the starting rationale for the rest of their experiments, it is hard to therefore accept the rest of their conclusions either.

      We have detected Xist RNA after Pumilio pull-down, and added the data in the revised manuscript (Figure S1). The enrichment of Xist RNA by Pumilio pull-down is about 75-fold, comparable to the enrichment reported by Minajigi et al.

      Two out of three previous studies used similar protocols to prep cell lysates for co-IP, including UV cross-linking and detergent (McHugh et al. 2015 and Minajigi et al. 2015). The major difference between their protocols and ours is the co-IP step. They used antisense oligos to pull-down Xist RNA-protein complex, while we take advantage of the specific interaction between PUF and PBS to pull-down Xist RNA-protein complex. With the data in Figure S1, we are confident that our strategy is successful in isolating Xist RNA

      For systematic identification of Xist binding proteins, each method has its own strength and weakness. As we described in the introduction, only 4 proteins were commonly identified by all three studies to systematically identify Xist binding proteins. There is no doubt that our method also missed some authentic Xist binding proteins (false negative) and identified some false positive candidates. Thus, we have to be careful in balancing between the false negative and false positive calls. The reason that we applied the ranking gain to identify Xist binding protein candidates, is to minimize the false negative rate. Meanwhile, we compared our Xist binding protein candidate list with previous identified Xist-binding proteins to enhance the confidence in our candidate lists.

      Regardless the strength and weakness of our method, Ssb is also an Xist-binding protein identified by another study (Chu et al. 2015). More importantly, we have provided experimental validation to confirm Ssb’s involvement in XCI and extensive functional analysis to reveal the protein’s mechanistic role in XCI.

      The other key conclusion the authors make is from the use of numerous cell death/survival assays for both male and female cell lines. This is extremely troubling in the context of assessing their target protein La. La is involved in multiple RNA maturation events of rRNAs, tRNAs and other polIII transcripts. Furthermore, La has been implicated in binding to the mRNA for Cyclin D1 in both human cells and mouse fibroblasts (NIH/3T3 - male) which show a significant effect on cell proliferation upon siRNA knockdown https://www.nature.com/articles/onc2010425. This, along with the observation that La knock-out blastocysts fail to develop any mice or ES cell lines (male or female) show the effect observed in the authors results is most likely not X-linked cell death https://mcb.asm.org/content/mcb/26/4/1445.full.pdf. The authors need to show that their shRNA KD isn't affecting the proliferation and general fitness of their mESC lines.

      The cell death/survival assay was specially designed for analyzing the defect of XCI. The cell death of iXist ESCs upon adding Dox is due to the induction of Xist, which consequently initiates the silencing of the only X chromosome in male cells. Knockdown of genes involved in XCI compromises XCI, thus allowing cell survival. Given the diverse functions of Ssb in cell differentiation and proliferation, ribosome biogenesis, transcriptional control and tRNA maturation, one would expect slow growth and/or cell death of Ssb knockdown cells. Indeed, the result is consistent with our expectation (Figure 2C, without Dox). Nevertheless, more Ssb knockdown cells survive in the presence of Dox, compared with control cells (Figure 2C-E, with Dox), suggesting that Ssb plays an important role in XCI.

      *- Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?*

      As discussed above, I feel the authors have not clearly demonstrated Xist specific protein enrichment and haven't proven X-linked cell death. Due to the lack of necessary control experiments as discussed below, I feel the notion that La is involved directly in XCI as an RNA chaperone is currently preliminary/speculative.

      The FLAG-out experiment just provided an initial point for the study. We have demonstrated the interaction between Xist and Ssb by RIP. And, Ssb knockdown antagonizes the lethal effect of induced XCI in male cells, allowing more cell to survive. This is contradictory to the diverse house-keeping functions of Ssb, which should lead to slow proliferation or cell death. Therefore, the data here (Figure 2C-E) should suggest a role of Ssb in XCI. In addition, we showed that knockdown of Ssb compromises the silencing of X-linked genes (Figure 2F, 2G, and 3E), the compaction of X chromosome (Figure 3D), Xist cloud formation (Figure 4), epigenetic modifications on Xi (Figure 5), Xist RNA folding (Figure 6F-I), and Xist RNA stability (Figure 7C and D). All these data indicate that Ssb is involved in XCI by regulating Xist RNA folding.

      *- Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.*

      I would suggest them to show RT-qPCR results of Xist RNA enrichment from the sample after flagIP before benzonase treatment.

      We have the data, and added it to Figure S1.

      Also, it would have been more convincing if their negative control construct (i-Empty) would contain 25 copies of PBSb RNA at least.

      This is a good alternative design of the negative control. Using i-Empty expressing 25 copies of PBSb RNA will allow us subtract the background causing by proteins binding to PBSb RNA. Yet, as discussed above, regardless how we improve the experimental setting, we cannot completely avoid the issue of false positive and false negative. Our goal of the FLAG-out experiment is to generate a list of Xist binding protein candidates, and their binding to Xist and their functions in XCI should be validated by additional experiments. With our current experimental setting, a list of Xist binding protein candidates has been generated, and we have validated the role of Ssb in XCI with subsequent experiments.

      In Fig1b, the total amount of proteins loaded on the gel is not equivalent between two lanes. The gel should show equivalent amounts of proteins on the gel. It looks like if the negative control sample had been loaded at the same amount as the one with Xist, the band pattern wouldn't be distinguishable between the two samples. Furthermore, as these samples were used in the following mass spectrometry screen it may suggest that the minimal increase in peptide counts observed in the iXist FLAG-out were due to an increased amount of sample being loaded? No controls are conducted to account for this.

      IP samples of i-Empty and i-FLAG-Xist were loaded in the gel in Figure 1b. It is expected that IP sample of i-FLAG-Xist should pull down more proteins than IP samples of i-Empty. The FLAG-PUFb bands (the strongest band in each lane) are about the same amount in two samples, indicating roughly equal amount of loading. After normalization of gel loading according to the FLAG-PUFb bands, the upper part of the i-FLAG-Xist lane showed some unique bands.

      For mass spectrometry analysis, the loading of two samples are independent, therefore, to compare the absolute amount of each protein between the two samples does not always provide valuable information. Yet, the relative amount of different proteins within one sample is not affected by the loading amount, thus, more informative. Therefore, we used the ranking information to estimate the relative amount of different proteins in each sample and used the ranking gain to further identify protein candidates.

      The authors quantify cell death in figures 2C - E. It seems clear that shSsb 1 and 2 have an effect on cell count even in the absence of Dox. The rescue effect seen upon Dox addition is minimal when compared to Empty + Dox 2D. The authors ∆A-iXist line with and without Ssb KD/Dox would be an informative control on whether the increase in cell survival that they see is X-linked.

      As the reviewer pointed out earlier, Ssb plays multiple roles in cellular processes. Inevitably, KD of Ssb leads to slow growth and/or cell death with or without Dox. Thus, it is less meaningful to compare the surviving cell counts in Figure 2D. Rather, the survival rate (Figure 2E) reflects the rescuing effect more precisely. Shown in Figure 2E, both shSsb 1 and 2 increase the survival rate significantly, compared with Empty control.

      Moreover, the data in Figure 3B and C demonstrated that Ssb KD compromises the survival of female differentiating cells, but not the survival of male differentiating cells, also indicating a role of Ssb in XCI. With these experiments, it should be sufficient to conclude that Ssb KD affects X-linked cell death/survival in both iXist male ESCs and WT female differentiating cells

      The qPCR results used to validate silencing defects show minor changes in expression and also don't show significant silencing of X-linked genes sufficient for cell death. Could this be because only ~ 50 - 60% of Male iXist cells seem to be expressing in the movies and that this will have an effect on the observed qPCR results? Furthermore, it seems counterintuitive that expression in the Empty male cells increases in 48h compared to 14h. Is this due to cell death and positive selection of cells less able to silence their X-chromosome? How would these data look in the female XX line? How would the data look in a ∆A-iXist line in the presence and absence of shSsb/Dox?

      First, high-quality live-cell imaging can only be carried out for 2 hours with 2-min time interval. The movies are meant to show the onset of Xist RNA signals. Therefore, they were taken one hour after Dox treatment (figure legend of Figure 4B-D). After overnight Dox treatment, Xist clouds can be seen in majority of cells.

      Second, in Fig. 2F-G, we did not include uninduced iXist male ESCs. Therefore, it is impossible to judge whether induction of Xist in this male ESC line results in Xist-dependent silencing at 14 and 48 hr. However, in our previous publication (Li et al., JMB, 2018, 430: 2734-2746), it has been shown that Gpc4, Hprt, Mecp2, G418, and TomatoRed are silenced (4- to 16-fold reduction) at 24 and 48 hours after Dox induction.

      Third, the qRT-PCR results in 14 h and in 48 h are not normalized to the same internal control. Thus, they are not directly comparable.

      Confusingly, the male line in Fig 3C shows a drop in live cell count at day 6 of differentiation? Surely given their previous results in Fig 2 the Ssb KD should increase cell viability with +Dox? Ssb KD seems to have an adverse effect on ES cells during extended differentiation protocols. In Figure S1 the authors show ~ 8 - 10% survival of male lines during differentiation. Could the recombination of the Xist sequence around the loxP sites enable the cells to outcompete the dead cells? How would iEmpty and ∆A-iXist cells compare here? Have the differentiated cells been tested for their expression of Xist? Additionally, how are there similar live cell counts for male vs female lines when ~90% of male cells die during differentiation? Were more cells plated at day 4? If so, this would bias the competition of male cell survival and therefore make the male line an inappropriate control.

      Given the essential role of La during development a control is needed to prove that this death is X-linked in the female 3F1 line. For example, an XO cell line retaining the Cast allele and shSsb expression could show the amount of death caused from shSsb alone independent of X-linked cell death.

      The reviewer completely misunderstood the experiment. The severe cell death specifically observed in female differentiating ESCs is a strong evidence showing Ssb is involved in XCI (Figure 3).

      The male ESCs in Figure 3C is a WT ESC line without the inducible Xist transgene, in which no XCI occurs upon differentiation. It is completely different from iXist male ESCs with Dox, in which forced Xist induction leads to XCI. Thus, the diverse functions of Ssb might contribute to the slight decrease in live cell count of wild type male cells at day 6 of differentiation.

      Figure S2 shows the differentiation of iXist male ESCs with or without Dox. As explained above, forced Xist induction silences the only X chromosome in male cells, resulting in cell death. In addition, XCI occurs more efficiently in differentiation condition (Figure S2) than in pluripotent status (Figure 2C)

      During differentiation, female ESCs silence one X chromosome, and the other X chromosome remains active. KD of Ssb compromises XCI, and two X chromosomes in some female differentiating cells maintain active, leading to cell death. The reviewer is correct that we need a control to rule out that the essential role of Ssb during development affects cell survival and death. An XO cell line can be used as a control. Similarly, a male cell line (XY) is also a good control. We already included a male cell line as a control in Figure 3B and 3C.

      If I understood correctly, the RNA FISH used dsDNA probes ("Sx9") against 40 kb of the X-inactivation centre (Xic). Surely Tsix or other Xic transcripts will also be visible? Can the authors use their RNA FISH to determine the XX or XO status of their cells? In Figure S5 a number of cells appear to show a single pinpoint of transcription. This could either be low levels of Xist transcripts or Xic transcription from an XO line in which the 129 chromosome is missing. It would be best to solely quantify cells which have two x chromosomes and if a significant amount of X chromosomes have been kicked out, this should be discussed and controlled for.

      This is a valid concern, but this concern can be adequately addressed with the available data in the manuscript.

      First, if the female Ssb KD cell line is an “XO” cell line, in which the X129 allele is “kicked out”, the RNA allelotyping results should show an absolute “silencing” of the X129 allele. However, in complete contrast to this notion, RNA allelotyping detected “more” RNA transcripts from X129, showing the chromosome-wide XCI defects (Figure 3D).

      Second, overexpression of Ssb in Ssb KD female cells restores the Xist clouds and the polycomb marks (Figure S8), suggesting that the Ssb KD female cells are XX, but not XO.

      Third, the severe cell death specifically occurred in female Ssb KD lines is also against the “XO” argument (Figure 3B&C).

      In Fig6, the authors generated a number of Ssb constructs for a rescue assay. However, these results complicate the matter and raise more questions than they address. It seems odd that the ∆RRM1 does not rescue based on comparison with their putative negative control, ∆NLS. However, the ∆RRM1 + 2 and ∆LAM do rescue the phenotype better than the full length Ssb? This makes no logical sense and highlights the inherent variation in cell viability these generated cell lines seem to show.

      Following on from this, figure S7 quantifies the GFP tag mRNA levels, depicting all ∆RRM mutants with expression below ~30%? How can ∆RRM1 or 2 be rescuing in this scenario? Have these lines been tested for their XX or XO status? The loss of an X chromosome would lead to a rescue of the cell death phenotype, which is a process known to occur in XX lines that have been cultured for extended periods of time. Could it also be that the cell lines derived are more or less sensitive to exogenous shRNA expression? Also, further validation is needed to assess the efficiency of KD in these lines as theoretically most of these constructs will be targeted by shRNA? What is the endogenous Ssb expression level in these lines? Where in the mRNA sequence are the shRNAs targeted to? Does this make sense on the relative expression levels of ∆RRM1/2 for example? Further testing of GFP expression could also be assessed by quantitative western blot of GFP or even visualised in their RNA FISH/IF samples (Figure S8), currently neither are shown. In addition, some kind of information of stability of each Ssb protein constructs has not been demonstrated.

      Our shRNA targets the LAM domain, so the expression of ∆LAM is not affected by the shRNA. The reviewer is correct that the detected GFP expression levels of ∆RRM1 and ∆RRM2 are too low to be conclusive. We have removed the data point of ∆RRM1 and ∆RRM2. Meanwhile, it is clear that ∆RRM1&2 has a better rescuing effect than ∆NLS, when ∆RRM1&2 and ∆NLS are expressed at similar levels. Ssb is a well known RNA chaperone/RNA helicase. Identifying Ssb is an Xist-binding protein already suggests the functional role of Ssb in XCI. The data of the plasmid rescue experiments further suggests that Ssb is involved in XCI as a RNA chaperone/RNA helicase.

      As for the Western blot and GFP fluorescence (IF), we have tried both. Neither of them detected GFP signal, reflecting the low expression level of these GFP fusion proteins. As the reviewers pointed out that the shSsb is not targeting the 5’ or 3’-UTR region, therefore, interfering the exogenous Ssb as well. This might be a reason for the low expression of these GFP fusion proteins.

      For the data shown in Figure 7A and B the authors quantify the % of cells with Xist signal. The authors have already shown a defect in Xist visualisation in Ssb KD. Surely it is plausible to assume a faster loss of Xist signal below background in weaker expressing cells. A more appropriate quantification would be the % loss of Xist signal per cell over time.

      With Figure 7C and D, the samples have been treated with actinomycin D which globally affects the transcription of cells even the PolIII associated genes Ssb is needed to mature. This treatment could have an added effect on cell mortality and function. Data confirming that actinomycin D doesn't affect the cells disproportionately is needed. The difference in half-life could be attributed to such a treatment.

      We agree with the reviewer that monitoring Xist signal loss per cell would be a better way to analyze the data. However, in Xist signal loss experiment, snapshot images were taken at four time points (1h, 2h, 3h and 4h). This is not a time-lapse imaging. High-quality time-lapse imaging can only be done within a 2-hour time period with 2-min time interval. Therefore, cell-tracking cannot be done in this experiment. In addition, even though Ssb KD slows down the formation of Xist cloud within the early phase (3 hours) of Xist induction (Figure 4), prolonged (overnight) Xist induction leads to Xist cloud formation in a significant fraction of Ssb KD cells, and the Xist cloud signals are about the same in WT and Ssb KD cells (Figure 7A, 0 h). Similarly, qRT-PCR also revealed that Xist RNA are at the same level in WT and Ssb KD cells (Figure 7C, 0 h). These data argue against that a faster loss of Xist signal in Ssb KD cells is due to weaker initial Xist signal.

      Actinomycin D was added at the last 11 hours of the experiment. During this period, no obvious adverse effects on cells were observed.

      In summarising the authors claim that La binds Xist to facilitate folding and appropriate spreading of Xist along the X-chromosome. No direct interaction has been shown, CLIP-seq data would resolve this, however I do understand this is a challenging technique. The authors have instead opted for RIP followed by qPCR (Figure S2). However, this process has a greater potential for non-specific recovery of RNAs via indirect binding. Furthermore, qPCR may also amplify the relative abundance of the RNA detected. As multiple nucleolar proteins came down in the mass spec screen and FLAG-Ssb is being over expressed, it is plausible to assume some transient Xist interactions may arise from nucleolar association at which La will be in high abundance. Positive and negative nuclear RNA controls (e.g. 7SK and U1 snRNA respectively) could be used so to determine the amount of non-specific Protein-RNA interactions in their RIP pull downs. Cytoplasmic actin is not an appropriate control as it is cytosolic.

      We have to clarify one point that the mass spec screen analyzed samples pulled down by FLAG-PUFb, but not FLAG-Ssb.

      We did not intend to distinguish whether Ssb directly binds Xist or is just associated with Xist. RIP followed by qPCR is sufficient to prove the association between Ssb and Xist RNA.

      We can include nuclear RNA as controls, if the reviewer regards RIP as a valid method to show protein and RNA association

      Other than this the authors may want to probe (via IF) for the presence of La accumulation on the X? Many other know factors such as Ciz1, hnrnpK and PRC1/2 complexes show clear accumulation on the X. If I understand correctly, there are many La antibodies on the market and endogenous levels on the X could be assessed. These antibodies may be useful in IP's and pull downs also.

      Many XCI factors play extensive roles in the cell and are not clearly enriched on Xi, including Spen (Moindrot et al. 2015). We have tried the immunostaining and did not detect Ssb’s enrichment on Xi. Ssb shows a general distribution in the nucleus without a clear enrichment on Xi (data not shown).

      *-Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.*

      The experiments suggested above are centrally focussed on the cell lines that are currently in the authors possession with maybe exceptions with the ∆A-iXist-shSsb line suggested. However, this should be reasonably quick to obtain given their previous work for this paper. Most experiments suggested will focus on the validation of karyotype, Xist expression, rescue construct expression, further RNA FISH classification and repeating more appropriate positive and negative controls for a number of experiments. In theory this can be obtained relatively simply and quickly from current resources. But with the sheer volume of further experiments that are required here, this may take a significant amount of time.

      One vital improvement needed is the replication of mass spec data and the validation of Xist specific recovery and protein enrichment. As it stands this manuscript seems to not have any replicates of the FLAG-out methodology and mass spec data. This is troubling given the poor recovery and specificity of the protein samples obtained. Repeating these experiments would be costly in time and also financially. As it stands, I feel this is essential to conclusively validate their target of interest.

      *- Are the data and the methods presented in such a way that they can be reproduced?*

      The data is presented relatively well, however, it would be beneficial if deailed methods were in the main text and not in a supplementary file. Similarly, more information about the process of differentiation and how cell death/survival was quantified and validated is needed.

      The reviewer rejected the basic design of the experimental system and ignored the majority of the functional analysis data. No additional experiment can address these issues

      We can include more information in the main text, regarding Ssb. However, there is limited space for the main text, various depending on the journals. Meanwhile, the current citation on Ssb is adequate to emphasize that Ssb is a versatile RNA binding protein involved in a variety of fundamental RNA processing events in the cell.

      *- Are the experiments adequately replicated and statistical analysis adequate?*

      In the most part yes, however there seems to be no replicates of the FLAG-out mass spec screen which is worrying given the minimal specificity observed in the current data.

      As we mentioned above, the FLAG-out experiment only serves as a starting point to generate a list of Xist binding protein candidates. Rather than repeating the FLAG-out experiment, we compared the result of FLAG-out to previously published lists of Xist binding protein candidates. More importantly, additional experiments are carried out to validate the Xist binding proteins identified by FLAG-out.

      **Minor comments:**

      *- Specific experimental issues that are easily addressable.*

      Unfortunately, the majority of experimental issues need to be addressed with more robust data which are highlighted above. However, some image analysis, quantification and classification can be amended relatively easily. For example, the live-cell imaging data should be quantified as loss of signal as discussed and RNA FISH should be used to classify XX positive cells and the XO cells can be discarded from analysis.

      We have addressed these issue in the previous sections of this rebuttal.

      *- Are prior studies referenced appropriately?*

      Most papers regarding Xist pull down and biology are discussed and referenced appropriately. However, the role in which La plays during development and its aberrant affects upon KD are seemingly downplayed. I would like to see more discussion of potential defects that could be caused due to globally altering cellular RNA folding.

      We have tried to cite key references about Ssb in development and RNA folding. Due to length limitation, we cannot cite all references in the topic. If necessary, we could discuss the possibility of indirect effect of Ssb KD on XCI through globally altering cellular RNA folding.

      *- Are the text and figures clear and accurate?*

      For the most part, lots of the figures are clear and accurate. Apart from these exceptions.

      1.The Y-axis of Figure 2D is confusing. What does 0.3 as a "sum of area" equate to? 30% of the area was ES cells? This doesn't look to be the case from Fig 2C. Also, how does the intensity of the signal compare? The area may not be a good quantification due to ES cells growing in colonies.

      We have revised the Y-axis labelling of Figure 2D to “sum of area cm2”. Thus, “0.3” means that the area of ESCs is 0.3 cm2. ALPP is highly expressed on ES cell surface. ALPP stain usually produce saturated stains on ES cell colonies. Thoroughly stained ES cell colonies, big and small, show similar signal intensity levels. To analyze the “total signal intensity” will be not much different from “sum of area”.

      2.In the Movies S1-7 there are boxes around certain cells and marked with "Figure 5a - c". This seems to be incorrect as figure 5 is currently the IF staining of polycomb marks. I assume this is in relation to Figure 4b-d?

      We have corrected the labelling mistakes.

      3.Similarly, in Movies S1-7, the intensities of Xist foci seem by eye to be similar. In the paper it is claimed that the Xist clouds that do form are lower in intensity. Are the Movies depicting the same range of pixel intensities? If not, this should be amended. Similarly, figure 7 seems to show relatively equivalent RNA signal at 0 h?

      All the images were collected using a fixed standard of the microscope and camera setting, and these movies depict the same range of pixel intensities. Movies S1-S3 are WT control, and Movies S4-S7 are Ssb KD cells. The Xist cloud signals are weaker in Movie S4-S7 (also quantified in Figure 4E). For the Xist cloud signal, not only the intensity, but also the area of Xist cloud, have to be taken into account.

      The 0 h in Figure 7 is after overnight Dox treatment, and different from the time point in Movies S1-7 (maximum 3 hour Dox treatment, figure legend of Figure 4B-D). The discrepancy can be explained by that knockdown of Ssb only slows down the formation of Xist clouds. After overnight forced expression, the Xist RNA still shows an accumulation in the cells. Figure 7 shows the forced accumulation of Xist RNA after prolonged Dox treatment disappears faster after Dox withdraw.

      4.In figure 4A the data is from female XX cells, this should be highlighted to limit confusion with the male iXist data shown below in 4B-E. It would also be helpful to have the male/female icons (as in figure 3B), for each figure that has images of cells. Currently Figure 4, 5, 7, S5 and S8 are lacking these icons.

      We have revised the labelling on Figure 3, 4, 5, 7 S6 and S9 (S5 and S8 before revision).

      5.No explanation of the Flag-Ssb expression is given for figure S2. Furthermore, is it really necessary to express Flag-Ssb? There are reasonably good antibodies out there for Ssb as this was how it was originally found in Systemic Lupus patients. Also, no data showing the amount of Ssb being overexpressed is shown. This may have big implication to the validity of the RIP-qPCR analysis.

      We could perform qRT-PCR to quantify the overexpression level of Flag-Ssb. If required, we could use Ssb antibody to do Western blot to show the amount of Flag-Ssb protein.

      *- Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Most of the data is presented reasonably well, but the robustness of the data somewhat retracts from their conclusions. I feel the certainty of their conclusion regarding Xist specific La binding and RNA chaperone activity is still presumptive and should be rewritten unless more robust data can confirm Xist interaction. I would also suggest deciding on the nomenclature for the protein of interest and use either La or Ssb, the continued use of both through the figures and text can get a little confusing to the reader.

      In the current literatures, Ssb seems to be commonly used as a gene name and La is used as a protein name. We have revised the manuscript to use one name “Ssb” to describe both the gene and the protein.

      Reviewer #1 (Significance (Required)):

      *- Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.*

      It was a good trial to use PBSb-PUFb system to purify Xist RNA binding proteins, compared to previous reports had used anti-sense oligo purification using complementary sequence to Xist RNA sequences. But currently the purification still needs further validation and repeats to confirm its use. A potential complementary technique could be to isolate Xist directly by using biotinylated probes against the PBSb sequence.

      The authors further claim the identification of a novel Xist RNA chaperone (La/Ssb) which they say facilitates XCI progression. This would be a novel finding in the field; however, the data is currently not robust enough to support this

      *- Place the work in the context of the existing literature (provide references, where appropriate).*

      This work has focused on the development of a milder methodology for purifying Xist RNA during XCI. Others have published similar methodologies predominantly focusing on purifying Xist RNA directly with biotinylated probes (McHugh et al. 2015; Minaji et al. 2015; and Chu et al. 2015). Although this method boasts a milder purification method, it seems to be low yielding in Xist specific proteins. Others have shown a more robust identification of bona fide Xist binding proteins which are currently missing in this manuscript. A recent preprint from the Plath lab has identified new factors involved in XCI during differentiation and their tethering/rescue experiments are far more convincing than the ones shown in this manuscript https://www.biorxiv.org/content/10.1101/2020.03.09.979369v1. The candidate protein Ha et al. have identified has multiple roles in developing cells and has shown to be important during mouse development. However, Ha et al do not robustly show that the knockdown of Ssb causes X-linked cell mortality. Alternatively, as would be presumed from Ssb's essential role in many housekeeping short non-coding RNAs, the cell death seems more ubiquitous upon shRNA KD. Therefore, the link the authors are making here are relatively weak.

      Ssb KD rescues cell death caused by forced induction of Xist in male ESCs. In addition, Ssb KD leads to cell death in differentiating female ESCs, while it has a negligible effect on cell death in differentiating male ESCs. These data clearly demonstrated X-linked cell survival/mortality by Ssb KD.

      Plath lab’s work is different from ours. In their manuscript, the authors report the observation of a protein condensation which is assembled by Xist but sustains in absence of Xist. TDP-43 (a.k.a. Tardbp) happens to be one protein factor involved in the protein condensation and also one candidate protein selected for further validation in our study. In our study, Tardbp KD did not rescue cell death caused by induced XCI in male cells. Thus, Tardbp is not further studied. In the manuscript, we have discussed the possibility that low efficiency of knockdown and redundancy might contribute to the failure in validation of Tardbp

      *- State what audience might be interested in and influenced by the reported findings.*

      The audience may be interested in the novel technique and the finding of a novel Xist binding protein.

      *- Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.*

      RNA biochemistry and developmental biology

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      This manuscript describes a novel "FLAG-out" system, where the authors sought to identify Xist RNA binding proteins. The authors focused on a specific protein found in their screen and also identified in several other screens for Xist RNA binding proteins, Ssb/La, and further characterize the role of this protein in XCI. This manuscript describes the loss of Ssb/La and suggest that it predominately impacts the canonical 'cloud' formation of Xist RNA on the X chromosome during XCI initiation. Further, they determine that loss of Ssb/La decreases Xist RNA half-life and alters folding of Xist RNA transcripts. Based on their findings, the authors propose that Ssb/La functions to directly bind and fold Xist RNA transcripts in a manner that stabilizes Xist RNA, allowing for proper 'cloud' formation and successful initiation of XCI.

      **Major comments:**

      The authors made an interesting findings that the SLE-relevant autoantigen Ssb/La stabilizes Xist RNA transcripts, and there is some evidence that this occurs by binding and maintaining proper folding of Xist RNA. Despite these intriguing observations, there are many parts of the manuscript that need to be addressed in order to support the authors main conclusions.

      The most troubling aspect of this manuscript is the persistent use of an artificial XCI system in male cells to draw strong conclusions about the function of Ssb in XCI. This issue is prevalent throughout the manuscript, and I question why the authors chose to perform most of their experiments in male cells when the same experiments can be (and have previously been by other groups) performed in female cells. Using male ESCs and then making conclusions for XCI, which is a female-specific process, is a major concern.

      In addition to iXist male ESC line, many experiments, such as cell death/survival (Figure 3B, C), allelotype (Figure 3E), Xist could formation (Figure 4A), H3K27me3 and H2AK119ub IF (Figure 5), were performed in female ESC. We chose to do SHAPE and Xist RNA stability assays in iXist male ESC line, because the onset of XCI is much more synchronized in this system. Moreover, in female cells, Xa causes additional layers of complication/noise in the ATAC-sequencing which may not be fully cleared up by data analysis. On the other hand, inducible Xist expression in male ESCs can be used as an experimental system to recapitulate the silencing step of XCI (Ha et al. 2018; Wutz et al. 2002).

      • Out of the 138 identified binding proteins, the authors chose to only validate three: Mybbp1a, Tardbp, and Ssb/La. The logic for choosing these candidates is weak, and the authors are only able to validate 1 out of 3 of these proteins.

      In theory, all candidate proteins in the list are possibly involved in XCI. There is no method which can help to make accurate prediction. We did not follow a clear-cut logic in selecting candidates for validation, but we do consider the candidate gene’s knockout phenotype, “early embryonic lethality”, as a phenotype consistent with a critical role of the candidate gene in XCI. Meanwhile, in the manuscript, we have discussed why we chose the three proteins for validation as the following:

      “……From the candidate proteins, we shortlisted three proteins for individual validation. Myb-binding protein 1A (Mybbp1a, Q7TPV4) and TAR DNA-binding protein 43 (Tardbp, Q921F2) were selected because they are known transcription repressors (11, 12). The Lupus autoantigen La (P32067, encoding-gene name: Ssb) was selected because systemic lupus erythematosus (SLE) is an autoimmune disease characterized by a strikingly high female to male ratios of 9:1 (13). Moreover, its autoimmune antigen La is a ubiquitous and versatile RNA-binding protein and a known RNA chaperone (14). All the three selected candidates have also been identified as Xist-binding proteins in previous studies (2, 4). Moreover, the knockout of these three genes all lead to early embryonic death. Tardbp knockout causes embryonic lethality at the blastocyst implantation stage (15). Mybbp1a and Ssb knockout affect blastocyst formation (16, 17). Early embryonic lethality is a mutant phenotype consistent with a critical role of the mutated gene in XCI (1)** ……”

      We used cell death/survival assay to further validate the role of Xist binding protein candidates in XCI. This is a stringent assay. It requires not only that Xist binding protein candidates bind to Xist, but also that the candidates have to be functionally important in XCI.

      Indeed, it has been demonstrated by Plath lab (the BioRxix manuscript mentioned by reviewer 1) that Tardbp (also named TDP-43), together with other RBPs, bind to the E repeat of Xist to form a condensate and create an Xi-domain. Yet, Tardbp KD did not rescue cell death caused by forced XCI in male cells in our studies. Thus, only 1 out of 3 of these candidates is validated and further studied. In the manuscript, we also discussed that low efficiency of knockdown and redundancy might contribute to the failure in validation of Tardbp and Mybbp1a.

      • Use of the cell death assay is not strong enough to "confirm that La is involved in induced XCI" as stated by the authors. This is a huge overstatement.

      Given the diverse functions of Ssb in cell differentiation and proliferation, ribosome biogenesis, transcriptional control and tRNA maturation, one would expect less surviving Ssb knockdown cells. In contrast, more Ssb knockdown cells survives in the presence of Dox, suggesting that Ssb plays an important role in XCI. Considering the reviewer’s comment, we revised the sentence to “further suggest that Ssb is involved in induced XCI”.

      While the authors observed differences in X-linked gene expression after Ssb KD, they did not examine expression of these genes in after KD of either Mybbp1a or Tardbp. Are the changes observed in these genes specific to Ssb KD? Or could there still be alterations of X-linked gene expression in the non-validated KDs? This experiment should be performed and included in the manuscript, either within Fig 2 or in the supplemental. As well, inclusion of a well characterized positive control, for example Hnrnpu, as comparison to Ssb should be included.

      Mybbp1a and Tardbp were not validated by the cell death assay. Thus, compared with Ssb, Mybbp1a and Tardbp are less important for XCI functionally. We only focused on Ssb in the subsequent studies. Mybbp1a and Tardbp KD could be additional negative controls. Yet, we have used empty vector as a negative control. We do not need so many controls.

      As mentioned, Tardbp indeed binds to Xist RNA. It is very likely that Tardbp KD might alter some X-linked gene expression. This rules out Tardbp KD as a good negative control.

      If we do not see any effect of Ssb KD on X-linked gene expression, a positive control is absolutely required. However, we have detected that Ssb KD compromises the silencing of several X-linked gene. A positive control might not be essential.

      • The authors perform RIP to validate the interaction of Ssb with Xist, but this is performed in male ES cells with induced Xist RNA and with FLAG-tagged Ssb. Aside from these cells being male, in this system Xist RNA expression is much higher than would be found endogenously. RIP should have been done in female differentiated ESCs if there is in fact a role for XCI.

      • The authors need to include more details in the methods section to explain how the FLAG-Ssb is expressed in these cells, and why the authors chose to use a tagged contrast over endogenous Ssb. Due to these issues the result from this experiment is essentially meaningless and is not convincing of Ssb interaction with Xist RNA. There is no reason RIP cannot be performed in female cells, and the authors should repeat this experiment in the relevant experimental condition. As well, if a validated Ssb antibody exists the authors should perform RIP using the endogenous protein.

      If required, we could try to perform RIP and/or CLIP using Ssb antibody in female cells.

      The authors state in Fig 3A-C that the results of the cell death and differentiation experiments "...support a functional role of La in XCI". The authors state earlier that Ssb is a ubiquitous protein that is embryonic lethal (in both female and males). Based on this, the cell death results shown do not support a functional role of La in XCI as the Ssb KD could be having an indirect affect due to its other developmental functions. This manuscript lacks a direct functional link between Ssb and XCI; more data is necessary.

      Given the diverse functions of Ssb in cell differentiation and proliferation, ribosome biogenesis, transcriptional control and tRNA maturation, one would expect less surviving Ssb knockdown cells. In contrast, more Ssb knockdown cells survives in the presence of Dox, suggesting that Ssb plays an important role in XCI.

      For the data in Fig 3A-C, Ssb KD causes the death of female differentiating cells, but not male differentiating cells. Therefore, it rules out that the death of female cells is due to the general function of Ssb. Rather, the specific role of Ssb in XCI contributes to the female specific cell death.

      In Fig 3D, the authors perform ATAC-seq in inducible male ES cells. The authors claim that the extremely slight reduction in chromatin compaction of the Ssb KD compared to control iXist "directly connect La to the heterochromatinization of Xi, supporting a functional role of La in XCI". This is also an overstatement based on the minimal, and possibly indirect, change in compaction. The positive control i-detaA-Xist sample has significantly less compaction (and thus significantly higher compaction defect) than the Ssb KD again disputing the claim stated above. It is unclear why performing ATAC-seq is even necessary, as Ssb isn't stated to have a function in regulating chromatin architecture. In addition, why the authors performed ATAC-seq in the artificial male XCI system and not in the F1 female cells, and the N of the experiment is unclear. If the authors want to include the ATAC-seq in further revisions it should be repeated n=3 in the female system.

      The male induced XCI system provides a more synchronized onset of XCI. More importantly, in the male induced XCI system, only one X chromosome exists, avoiding the interference from the active X chromosome in female cells. If ATAC-seq was performed in female cells, only loci with SNPs can be distinguished. The sequencing reads from Xa will create additional layers of complication/noise which may not be cleared up fully by data analysis

      “i-delat-Xist” is a positive control to show the experimental system works. It is not justified to compare the chromatin accessibility of the mutant, which is only a Ssb “knockdown” mutant, and the control “i-delat-Xist”, in which the Repeat A is “deleted”. We admit that ATAC-Seq results did not reveal a drastic difference in chromatin accessibility between the wild type sample and the mutant sample. However, as what we discussed in the manuscript, clear difference can still be seen at the 14 h time point. This is shown clearly by the heatmap (Fig. 3E) and the sequencing coverage profile (Fig. S4A).

      • In Fig 6, the authors state in their methods that "The shRNA construct, which worked efficiently against Ssb, was not designed against the 3' UTR of the RNA. Therefore, the shRNA is against some of the rescue plasmid constructs. Nonetheless, transfecting the Ssb knockdown cells with the rescue plasmids should compensate the effect of Ssb knockdown and serve as a rescue assay to study the functional domains of La.". This is troubling and seems like a major experimental issue; the specific rescue constructs that may be impacted by this issue are not stated and should be explicitly mentioned. This becomes more confusing when examining the data from rescue experiments.

      We pointed out this issue in the original manuscript. We agree that the experiment was not perfectly designed. In the revision, we added in the information on the shRNA target site. Our shRNA targets the LAM domain, so the expression of ∆LAM is not affected by the shRNA. We agree that the detected GFP expression levels of ∆RRM1 and ∆RRM2 are too low to be conclusive. In the revision, we have removed the data point of ∆RRM1 and ∆RRM2. Meanwhile, it is clear that ∆RRM1&2 has a better rescuing effect than ∆NLS, when ∆RRM1&2 and ∆NLS are expressed at similar levels. Ssb is a well-known RNA chaperone/RNA helicase. Identifying Ssb is an Xist-binding protein already suggests the functional role of Ssb in XCI. The data of the plasmid rescue experiments further suggests that Ssb is involved in XCI as a RNA chaperone/RNA helicase.

      If it is necessary, we could redo this experiments using a shSsb targeting 3’-UTR or expressing GFP-Ssb immune to shSsb.

      In Figure S7, the expression of the rescue constructs deltaRRM1 and deltaRRM2 is extremely low, yet the authors observe a rescue of the cloud phenotype (fig 6D) from those constructs that reaches almost the level of full length Ssb. This is confusing, and the authors need to address this by performing a western blot to show the protein levels of these rescue constructs and discuss further how such a low level of expression can show a rescue phenotype. The results would also be stronger if the authors examined H3K27me3 and H2AK119ub1 enrichment since they observed decreased overlap of these marks with Xist RNA after Ssb KD. Finally, the authors state that "...all three RNA-binding domains are required for the functionality of La in XCI..." however I have trouble coming to this conclusion based on the above issues. As well, if the authors want to support direct function, they should repeat the RIP experiments with these rescues constructs to show that the domains capable of rescue can still bind to Xist RNA.

      Reviewer 1 raised similar concerns. In Figure 6C, the live cell counts of ∆RRM1 and ∆NLS are about the same. It might be due to the low expression level of ∆RRM1 (Figure S7). It is clear that ∆RRM1&2 has a better rescuing effect than ∆NLS, when ∆RRM1&2 and ∆NLS are expressed as similar levels. To make the data more straight forward, we removed the data point of ∆RRM1 and ∆RRM2, because of their low expression levels.

      As for the Western blot and GFP fluorescence (IF), we have tried both. Neither of them detected GFP signal, reflecting the low expression level of these GFP fusion proteins. The shSsb is not targeting the 5’ or 3’-UTR region, therefore interfering the exogenous Ssb as well. This might be a reason for the low expression of these GFP fusion proteins. If it is necessary, we could redo this experiments using a shSsb targeting 3’–UTR or expressing GFP-Ssb immune to shSsb.

      We deleted the sentence "all three RNA-binding domains are required for the functionality of La in XCI".

      **Minor comments:**

      The authors may want to consider better highlighting the strengths of their "FLAG-out" system. As written, is it difficult to tell how this system sets them apart from the previously published studies referenced in the text, especially as some of these studies used similar crosslinking conditions and cell types. Additionally, the logic and questions the authors pose in the introduction as to why they performed this project are too general and not very strong. For example, the authors mention how might protein machinery may assemble on Xist RNA, and how might Xist RNA may spread on the X chromosome. However neither of these topics are actually addressed in their experiments or discussion. These are interesting questions, but the authors should either discuss them further within the context of their results or take these questions out. It would also be helpful if the authors could better label Figure 4, as it is unclear in the figure itself that Fig 4A is in reference to female cells, but remaining panels are in male cells.

      The inducible XCI in male cells is a valid system to recapitulate the silencing step of XCI. It also provides unique advantages in many experiments, such as ATAC-seq. Meanwhile, we did perform extensive functional analysis on the endogenous XCI process using female cells. However, we do realize that presenting the data of induced XCI in male cells together with the data from female cells is confusing to many readers. We have revised the labelling on Figure 3, 4, 5, 7 S6 and S9 (S5 and S8 before revision).

      To understand “how the protein machinery is assembled by Xist” and “how Xist spreads along its host chromosome territory” are not specifically the initial aims of this study. We removed the sentences from the introduction section. However, we believe Ssb may provide clues for the future studies to fully address these questions, and we did provide the following thoughts in the discussion section:

      “……Secondly, as Ssb is able to utilize ATP to unwind RNA-RNA and RNA-DNA duplex, it may play a more active role in controlling the structural dynamics of Xist in living cells (14, 23). These structural dynamics may be important for recruiting proteins onto the RNA and spreading of the RNA along its host chromosome territory……”

      Reviewer #2 (Significance (Required)):

      I am not convinced the this manuscript, as written, has sufficient novelty. Ssb/La has been previously identified to be an Xist RNA binding protein with older/different approaches. However, there are some interesting observations in this manuscript. Major revisions are necessary.

      We agree with the reviewer that identification of Ssb as an Xist RNA binding protein is not novel. The novelty of our discovery lies in: 1) we developed a new method for isolating lincRNA associated proteins; 2) we confirmed that Ssb is an important player involved in XCI; 3) we showed that Ssb regulates the folding of Xist RNA, consequently the stability of Xist and the formation of Xist cloud.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      This manuscript describes a novel "FLAG-out" system, where the authors sought to identify Xist RNA binding proteins. The authors focused on a specific protein found in their screen and also identified in several other screens for Xist RNA binding proteins, Ssb/La, and further characterize the role of this protein in XCI. This manuscript describes the loss of Ssb/La and suggest that it predominately impacts the canonical 'cloud' formation of Xist RNA on the X chromosome during XCI initiation. Further, they determine that loss of Ssb/La decreases Xist RNA half-life and alters folding of Xist RNA transcripts. Based on their findings, the authors propose that Ssb/La functions to directly bind and fold Xist RNA transcripts in a manner that stabilizes Xist RNA, allowing for proper 'cloud' formation and successful initiation of XCI.

      Major comments:

      The authors made an interesting findings that the SLE-relevant autoantigen Ssb/La stabilizes Xist RNA transcripts, and there is some evidence that this occurs by binding and maintaining proper folding of Xist RNA. Despite these intriguing observations, there are many parts of the manuscript that need to be addressed in order to support the authors main conclusions.

      • The most troubling aspect of this manuscript is the persistent use of an artificial XCI system in male cells to draw strong conclusions about the function of Ssb in XCI. This issue is prevalent throughout the manuscript, and I question why the authors chose to perform most of their experiments in male cells when the same experiments can be (and have previously been by other groups) performed in female cells. Using male ESCs and then making conclusions for XCI, which is a female-specific process, is a major concern.

      • Out of the 138 identified binding proteins, the authors chose to only validate three: Mybbp1a, Tardbp, and Ssb/La. The logic for choosing these candidates is weak, and the authors are only able to validate 1 out of 3 of these proteins.

      • Use of the cell death assay is not strong enough to "confirm that La is involved in induced XCI" as stated by the authors. This is a huge overstatement.

      • While the authors observed differences in X-linked gene expression after Ssb KD, they did not examine expression of these genes in after KD of either Mybbp1a or Tardbp. Are the changes observed in these genes specific to Ssb KD? Or could there still be alterations of X-linked gene expression in the non-validated KDs? This experiment should be performed and included in the manuscript, either within Fig 2 or in the supplemental. As well, inclusion of a well characterized positive control, for example Hnrnpu, as comparison to Ssb should be included.

      • The authors perform RIP to validate the interaction of Ssb with Xist, but this is performed in male ES cells with induced Xist RNA and with FLAG-tagged Ssb. Aside from these cells being male, in this system Xist RNA expression is much higher than would be found endogenously. RIP should have been done in female differentiated ESCs if there is in fact a role for XCI.

      • The authors need to include more details in the methods section to explain how the FLAG-Ssb is expressed in these cells, and why the authors chose to use a tagged contrast over endogenous Ssb. Due to these issues the result from this experiment is essentially meaningless and is not convincing of Ssb interaction with Xist RNA. There is no reason RIP cannot be performed in female cells, and the authors should repeat this experiment in the relevant experimental condition. As well, if a validated Ssb antibody exists the authors should perform RIP using the endogenous protein.

      • The authors state in Fig 3A-C that the results of the cell death and differentiation experiments "...support a functional role of La in XCI". The authors state earlier that Ssb is a ubiquitous protein that is embryonic lethal (in both female and males). Based on this, the cell death results shown do not support a functional role of La in XCI as the Ssb KD could be having an indirect affect due to its other developmental functions. This manuscript lacks a direct functional link between Ssb and XCI; more data is necessary.

      • In Fig 3D, the authors perform ATAC-seq in inducible male ES cells. The authors claim that the extremely slight reduction in chromatin compaction of the Ssb KD compared to control iXist "directly connect La to the heterochromatinization of Xi, supporting a functional role of La in XCI". This is also an overstatement based on the minimal, and possibly indirect, change in compaction. The positive control i-detaA-Xist sample has significantly less compaction (and thus significantly higher compaction defect) than the Ssb KD again disputing the claim stated above. It is unclear why performing ATAC-seq is even necessary, as Ssb isn't stated to have a function in regulating chromatin architecture. In addition, why the authors performed ATAC-seq in the artificial male XCI system and not in the F1 female cells, and the N of the experiment is unclear. If the authors want to include the ATAC-seq in further revisions it should be repeated n=3 in the female system.

      • In Fig 6, the authors state in their methods that "The shRNA construct, which worked efficiently against Ssb, was not designed against the 3' UTR of the RNA. Therefore, the shRNA is against some of the rescue plasmid constructs. Nonetheless, transfecting the Ssb knockdown cells with the rescue plasmids should compensate the effect of Ssb knockdown and serve as a rescue assay to study the functional domains of La.". This is troubling and seems like a major experimental issue; the specific rescue constructs that may be impacted by this issue are not stated and should be explicitly mentioned. This becomes more confusing when examining the data from rescue experiments.

      • In Figure S7, the expression of the rescue constructs deltaRRM1 and deltaRRM2 is extremely low, yet the authors observe a rescue of the cloud phenotype (fig 6D) from those constructs that reaches almost the level of full length Ssb. This is confusing, and the authors need to address this by performing a western blot to show the protein levels of these rescue constructs and discuss further how such a low level of expression can show a rescue phenotype. The results would also be stronger if the authors examined H3K27me3 and H2AK119ub1 enrichment since they observed decreased overlap of these marks with Xist RNA after Ssb KD. Finally, the authors state that "...all three RNA-binding domains are required for the functionality of La in XCI..." however I have trouble coming to this conclusion based on the above issues. As well, if the authors want to support direct function, they should repeat the RIP experiments with these rescues constructs to show that the domains capable of rescue can still bind to Xist RNA.

      Minor comments:

      The authors may want to consider better highlighting the strengths of their "FLAG-out" system. As written, is it difficult to tell how this system sets them apart from the previously published studies referenced in the text, especially as some of these studies used similar crosslinking conditions and cell types. Additionally, the logic and questions the authors pose in the introduction as to why they performed this project are too general and not very strong. For example, the authors mention how might protein machinery may assemble on Xist RNA, and how might Xist RNA may spread on the X chromosome. However neither of these topics are actually addressed in their experiments or discussion. These are interesting questions, but the authors should either discuss them further within the context of their results or take these questions out. It would also be helpful if the authors could better label Figure 4, as it is unclear in the figure itself that Fig 4A is in reference to female cells, but remaining panels are in male cells.

      Significance

      I am not convinced the this manuscript, as written, has sufficient novelty. Ssb/La has been previously identified to be an Xist RNA binding protein with older/different approaches. However, there are some interesting observations in this manuscript. Major revisions are necessary.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      Here Ha et al. has further developed their Pumilio RNA tagging methodology for the isolation of UV-crosslinked proteins that are suggested to associate with Xist RNA in mouse embryonic stem cells (mESCs). Within this study the authors claim to have found the Lupus antigen RNA binding protein (La) as a novel Xist interacting partner that influences the efficacy of X-chromosome inactivation (XCI). The authors use a number of different techniques such as qPCR, fluorescent imaging, ATAC-SEQ and SHAPE to show aberration of XCI upon La shRNA knockdown. However, this study has significant flaws in the efficient isolation and validation of Xist associated proteins using their FLAG-out methodology. Furthermore, later experiments predominantly focus on cell death/survival assays, which is somewhat troubling given the essential roles La plays in processes such as cell differentiation and proliferation, ribosome biogenesis, transcriptional control and tRNA maturation. I feel the authors need to robustly address the potential effects La knockdown may be having on their mESCs.

      Major comments:

      -Are the key conclusions convincing?

      My major concern is in their Xist RNA purification. First of all, I couldn't find any data on proving the enrichment of Xist RNA itself in their Pumilio pull-down experiment. It would have been useful to show Xist RNA enrichment before benzonase step. Secondly, it is hard to imagine the protocol would successfully isolated Xist RNA-protein complexes from the cell. An earlier report by Clemson et al., (J Cell Biol., 1996) has shown that majority of Xist RNA is still stuck in the nucleus after nuclear matrix prep protocol using detergent, which is not so different from the authors' protocol. Moreover, the authors used UV crosslink, which would have made even harder to purify Xist RNA without sonication. Thirdly, as the tag is located on 5' of Xist RNA, it is rather surprising to see that Spen is not detected in their pulldown. Spen is one of the main functional interactors with Xist, robustly detected by several previous reports. Similarly, other high-affinity binders of Xist such as hnRNP-K and Ciz1 were also lacking from this screen. Finally, the peptides found associated with FLAG-out Xist are extremely low in comparison with other data using glutaraldehyde or formaldehyde crosslinking. For example, HnRNP-M found in Chu et al 2015 has 1120 peptide counts in differentiated cells. The authors here use HnRNP-M as a baseline for specific interactions and show a total of 6 peptide counts in Xist expressing cells and 5 in i-Empty cells (Supplementary excel sheet 1). Similarly, the La protein of interest in this study has 8 counts in i-FLAG-Xist and 6 counts in i-Empty. I struggle to see how this result indicate specific Xist binding. Worryingly this is the starting rationale for the rest of their experiments, it is hard to therefore accept the rest of their conclusions either.

      The other key conclusion the authors make is from the use of numerous cell death/survival assays for both male and female cell lines. This is extremely troubling in the context of assessing their target protein La. La is involved in multiple RNA maturation events of rRNAs, tRNAs and other polIII transcripts. Furthermore, La has been implicated in binding to the mRNA for Cyclin D1 in both human cells and mouse fibroblasts (NIH/3T3 - male) which show a significant effect on cell proliferation upon siRNA knockdown https://www.nature.com/articles/onc2010425. This, along with the observation that La knock-out blastocysts fail to develop any mice or ES cell lines (male or female) show the effect observed in the authors results is most likely not X-linked cell death https://mcb.asm.org/content/mcb/26/4/1445.full.pdf. The authors need to show that their shRNA KD isn't affecting the proliferation and general fitness of their mESC lines.

      - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      As discussed above, I feel the authors have not clearly demonstrated Xist specific protein enrichment and haven't proven X-linked cell death. Due to the lack of necessary control experiments as discussed below, I feel the notion that La is involved directly in XCI as an RNA chaperone is currently preliminary/speculative.

      - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      I would suggest them to show RT-qPCR results of Xist RNA enrichment from the sample after flagIP before benzonase treatment.

      Also, it would have been more convincing if their negative control construct (i-Empty) would contain 25 copies of PBSb RNA at least.

      In Fig1b, the total amount of proteins loaded on the gel is not equivalent between two lanes. The gel should show equivalent amounts of proteins on the gel. It looks like if the negative control sample had been loaded at the same amount as the one with Xist, the band pattern wouldn't be distinguishable between the two samples. Furthermore, as these samples were used in the following mass spectrometry screen it may suggest that the minimal increase in peptide counts observed in the iXist FLAG-out were due to an increased amount of sample being loaded? No controls are conducted to account for this.

      The authors quantify cell death in figures 2C - E. It seems clear that shSsb 1 and 2 have an effect on cell count even in the absence of Dox. The rescue effect seen upon Dox addition is minimal when compared to Empty + Dox 2D. The authors ∆A-iXist line with and without Ssb KD/Dox would be an informative control on whether the increase in cell survival that they see is X-linked.

      The qPCR results used to validate silencing defects show minor changes in expression and also don't show significant silencing of X-linked genes sufficient for cell death. Could this be because only ~ 50 - 60% of Male iXist cells seem to be expressing in the movies and that this will have an effect on the observed qPCR results? Furthermore, it seems counterintuitive that expression in the Empty male cells increases in 48h compared to 14h. Is this due to cell death and positive selection of cells less able to silence their X-chromosome? How would these data look in the female XX line? How would the data look in a ∆A-iXist line in the presence and absence of shSsb/Dox?

      Confusingly, the male line in Fig 3C shows a drop in live cell count at day 6 of differentiation? Surely given their previous results in Fig 2 the Ssb KD should increase cell viability with +Dox? Ssb KD seems to have an adverse effect on ES cells during extended differentiation protocols. In Figure S1 the authors show ~ 8 - 10% survival of male lines during differentiation. Could the recombination of the Xist sequence around the loxP sites enable the cells to outcompete the dead cells? How would iEmpty and ∆A-iXist cells compare here? Have the differentiated cells been tested for their expression of Xist? Additionally, how are there similar live cell counts for male vs female lines when ~90% of male cells die during differentiation? Were more cells plated at day 4? If so, this would bias the competition of male cell survival and therefore make the male line an inappropriate control. Given the essential role of La during development a control is needed to prove that this death is X-linked in the female 3F1 line. For example, an XO cell line retaining the Cast allele and shSsb expression could show the amount of death caused from shSsb alone independent of X-linked cell death.

      If I understood correctly, the RNA FISH used dsDNA probes ("Sx9") against 40 kb of the X-inactivation centre (Xic). Surely Tsix or other Xic transcripts will also be visible? Can the authors use their RNA FISH to determine the XX or XO status of their cells? In Figure S5 a number of cells appear to show a single pinpoint of transcription. This could either be low levels of Xist transcripts or Xic transcription from an XO line in which the 129 chromosome is missing. It would be best to solely quantify cells which have two x chromosomes and if a significant amount of X chromosomes have been kicked out, this should be discussed and controlled for.

      In Fig6, the authors generated a number of Ssb constructs for a rescue assay. However, these results complicate the matter and raise more questions than they address. It seems odd that the ∆RRM1 does not rescue based on comparison with their putative negative control, ∆NLS. However, the ∆RRM1 + 2 and ∆LAM do rescue the phenotype better than the full length Ssb? This makes no logical sense and highlights the inherent variation in cell viability these generated cell lines seem to show. Following on from this, figure S7 quantifies the GFP tag mRNA levels, depicting all ∆RRM mutants with expression below ~30%? How can ∆RRM1 or 2 be rescuing in this scenario? Have these lines been tested for their XX or XO status? The loss of an X chromosome would lead to a rescue of the cell death phenotype, which is a process known to occur in XX lines that have been cultured for extended periods of time. Could it also be that the cell lines derived are more or less sensitive to exogenous shRNA expression? Also, further validation is needed to assess the efficiency of KD in these lines as theoretically most of these constructs will be targeted by shRNA? What is the endogenous Ssb expression level in these lines? Where in the mRNA sequence are the shRNAs targeted to? Does this make sense on the relative expression levels of ∆RRM1/2 for example? Further testing of GFP expression could also be assessed by quantitative western blot of GFP or even visualised in their RNA FISH/IF samples (Figure S8), currently neither are shown. In addition, some kind of information of stability of each Ssb protein constructs has not been demonstrated.

      For the data shown in Figure 7A and B the authors quantify the % of cells with Xist signal. The authors have already shown a defect in Xist visualisation in Ssb KD. Surely it is plausible to assume a faster loss of Xist signal below background in weaker expressing cells. A more appropriate quantification would be the % loss of Xist signal per cell over time.

      With Figure 7C and D, the samples have been treated with actinomycin D which globally affects the transcription of cells even the PolIII associated genes Ssb is needed to mature. This treatment could have an added effect on cell mortality and function. Data confirming that actinomycin D doesn't affect the cells disproportionately is needed. The difference in half-life could be attributed to such a treatment.

      In summarising the authors claim that La binds Xist to facilitate folding and appropriate spreading of Xist along the X-chromosome. No direct interaction has been shown, CLIP-seq data would resolve this, however I do understand this is a challenging technique. The authors have instead opted for RIP followed by qPCR (Figure S2). However, this process has a greater potential for non-specific recovery of RNAs via indirect binding. Furthermore, qPCR may also amplify the relative abundance of the RNA detected. As multiple nucleolar proteins came down in the mass spec screen and FLAG-Ssb is being over expressed, it is plausible to assume some transient Xist interactions may arise from nucleolar association at which La will be in high abundance. Positive and negative nuclear RNA controls (e.g. 7SK and U1 snRNA respectively) could be used so to determine the amount of non-specific Protein-RNA interactions in their RIP pull downs. Cytoplasmic actin is not an appropriate control as it is cytosolic.

      Other than this the authors may want to probe (via IF) for the presence of La accumulation on the X? Many other know factors such as Ciz1, hnrnpK and PRC1/2 complexes show clear accumulation on the X. If I understand correctly, there are many La antibodies on the market and endogenous levels on the X could be assessed. These antibodies may be useful in IP's and pull downs also.

      -Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      The experiments suggested above are centrally focussed on the cell lines that are currently in the authors possession with maybe exceptions with the ∆A-iXist-shSsb line suggested. However, this should be reasonably quick to obtain given their previous work for this paper. Most experiments suggested will focus on the validation of karyotype, Xist expression, rescue construct expression, further RNA FISH classification and repeating more appropriate positive and negative controls for a number of experiments. In theory this can be obtained relatively simply and quickly from current resources. But with the sheer volume of further experiments that are required here, this may take a significant amount of time. One vital improvement needed is the replication of mass spec data and the validation of Xist specific recovery and protein enrichment. As it stands this manuscript seems to not have any replicates of the FLAG-out methodology and mass spec data. This is troubling given the poor recovery and specificity of the protein samples obtained. Repeating these experiments would be costly in time and also financially. As it stands, I feel this is essential to conclusively validate their target of interest.

      - Are the data and the methods presented in such a way that they can be reproduced?

      The data is presented relatively well, however, it would be beneficial if deailed methods were in the main text and not in a supplementary file. Similarly, more information about the process of differentiation and how cell death/survival was quantified and validated is needed.

      - Are the experiments adequately replicated and statistical analysis adequate?

      In the most part yes, however there seems to be no replicates of the FLAG-out mass spec screen which is worrying given the minimal specificity observed in the current data.

      Minor comments:

      - Specific experimental issues that are easily addressable.

      Unfortunately, the majority of experimental issues need to be addressed with more robust data which are highlighted above. However, some image analysis, quantification and classification can be amended relatively easily. For example, the live-cell imaging data should be quantified as loss of signal as discussed and RNA FISH should be used to classify XX positive cells and the XO cells can be discarded from analysis.

      - Are prior studies referenced appropriately?

      Most papers regarding Xist pull down and biology are discussed and referenced appropriately. However, the role in which La plays during development and its aberrant affects upon KD are seemingly downplayed. I would like to see more discussion of potential defects that could be caused due to globally altering cellular RNA folding.

      - Are the text and figures clear and accurate?

      For the most part, lots of the figures are clear and accurate. Apart from these exceptions.

      1.The Y-axis of Figure 2D is confusing. What does 0.3 as a "sum of area" equate to? 30% of the area was ES cells? This doesn't look to be the case from Fig 2C. Also, how does the intensity of the signal compare? The area may not be a good quantification due to ES cells growing in colonies.

      2.In the Movies S1-7 there are boxes around certain cells and marked with "Figure 5a - c". This seems to be incorrect as figure 5 is currently the IF staining of polycomb marks. I assume this is in relation to Figure 4b-d?

      3.Similarly, in Movies S1-7, the intensities of Xist foci seem by eye to be similar. In the paper it is claimed that the Xist clouds that do form are lower in intensity. Are the Movies depicting the same range of pixel intensities? If not, this should be amended. Similarly, figure 7 seems to show relatively equivalent RNA signal at 0 h?

      4.In figure 4A the data is from female XX cells, this should be highlighted to limit confusion with the male iXist data shown below in 4B-E. It would also be helpful to have the male/female icons (as in figure 3B), for each figure that has images of cells. Currently Figure 4, 5, 7, S5 and S8 are lacking these icons.

      5.No explanation of the Flag-Ssb expression is given for figure S2. Furthermore, is it really necessary to express Flag-Ssb? There are reasonably good antibodies out there for Ssb as this was how it was originally found in Systemic Lupus patients. Also, no data showing the amount of Ssb being overexpressed is shown. This may have big implication to the validity of the RIP-qPCR analysis.

      - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Most of the data is presented reasonably well, but the robustness of the data somewhat retracts from their conclusions. I feel the certainty of their conclusion regarding Xist specific La binding and RNA chaperone activity is still presumptive and should be rewritten unless more robust data can confirm Xist interaction. I would also suggest deciding on the nomenclature for the protein of interest and use either La or Ssb, the continued use of both through the figures and text can get a little confusing to the reader.

      Significance

      - Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      It was a good trial to use PBSb-PUFb system to purify Xist RNA binding proteins, compared to previous reports had used anti-sense oligo purification using complementary sequence to Xist RNA sequences. But currently the purification still needs further validation and repeats to confirm its use. A potential complementary technique could be to isolate Xist directly by using biotinylated probes against the PBSb sequence. The authors further claim the identification of a novel Xist RNA chaperone (La/Ssb) which they say facilitates XCI progression. This would be a novel finding in the field; however, the data is currently not robust enough to support this.

      - Place the work in the context of the existing literature (provide references, where appropriate).

      This work has focused on the development of a milder methodology for purifying Xist RNA during XCI. Others have published similar methodologies predominantly focusing on purifying Xist RNA directly with biotinylated probes (McHugh et al. Minaji et al and Chu et al.). Although this method boasts a milder purification method, it seems to be low yielding in Xist specific proteins. Others have shown a more robust identification of bona fide Xist binding proteins which are currently missing in this manuscript. A recent preprint from the Plath lab has identified new factors involved in XCI during differentiation and their tethering/rescue experiments are far more convincing than the ones shown in this manuscript https://www.biorxiv.org/content/10.1101/2020.03.09.979369v1. The candidate protein Ha et al have identified has multiple roles in developing cells and has shown to be important during mouse development. However, Ha et al do not robustly show that the knockdown of Ssb causes X-linked cell mortality. Alternatively, as would be presumed from Ssb's essential role in many housekeeping short non-coding RNAs, the cell death seems more ubiquitous upon shRNA KD. Therefore, the link the authors are making here are relatively weak.

      - State what audience might be interested in and influenced by the reported findings.

      The audience may be interested in the novel technique and the finding of a novel Xist binding protein.

      - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      RNA biochemistry and developmental biology

    1. Transparent Peer Review


      Download the complete Review Process [PDF] including:

      • reviews
      • authors' reply
      • editorial decisions

      </br>

    2. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      General comments

      We thank all three reviewers for providing their thoughtful and insightful review comments of our manuscript. We appreciate that the reviewers recognized the significance and impact of our work - “Very little imaging has been done on CAR synapses and to our knowledge this is the first live cell imaging study describing CAR microclustsers” (Reviewer 2); “This is an evolving field and little is known to date. Hence, this study could represent an insightful and important advance to the field” (Reviewer 3). A broad audience from both basic and clinical research sides will be interested in this work: “_This study will have a broad audience. Both scientists that study basic T cell signaling as well as clinicians that use CAR Ts will be interested in this study” (_Reviewer 2); “Audience is to both basic immunologist and cancer biologists” (Reviewer 3).

      Meanwhile, we understand that the reviewers have raised a few major and minor issues, which we attempted to address. Most importantly, as suggested by both reviewer 1 and 3, we performed new experiments showing that LAT is not required for microcluster formation of the 1st generation of CAR (new Fig 4 and EV5). This finding suggests that the CAR-independent signaling is due to the intrinsic CAR architecture, and is not dependent on the co-signaling domains of CD28 and 4-1BB.

      With the successful solutions to other issues, we believe the manuscript has been significantly improved and is ready for publication. Below we will provide point-to-point responses to each reviewer’s comments.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The authors compare the TCR alone to a CAR that contains signaling modules from three receptors- TCR, CD28 and 41BB. The data quality if good and the experiments done are. The difference is quite clear, and I would even like to see a little more of the evidence related to failure of the TCR system.

      We appreciate the general positive comment of this reviewer.

      More specifically:

      Su and colleagues show that a third generation CAR with TCR zeta, CD28 and 41BB signal transduction pathways can activate a T cell for microcluster formation and Gads/SLP-76 recruitment, but not IL-2 production, without LAT. This is surprising because LAT is generally considered, as is up held here, as an essential adapter protein for T cell activation. However, this is not a "fair" experiment as the CAR has sequences from TCR, and two co-stimulatory receptor- CD28 and 41BB. It would be important and very straight-forward to test first and second generation CARs to determine if LAT independence is a function of the CAR architecture itself, or the additional costimulatory sequences. If it turns out that a first generation CAR with only TCR sequences can trigger LAT independent clustering and SLP-76 recruitment then the comparison would be fair and no additional experiment would be needed to make the point that the CAR architecture is intrinsically LAT independent. If the CD28 and/or 41BB sequences are needed for LAT independence then the fair comparison would be to co-crosslink TCR, CD28 and 41BB (an inducible costimulator such that anti-CD27 might be substituted to have a constitutively expressed receptor with this similar motifs) should be cross-linked with the TCR to make this a fair comparison between the two architectures.

      We agree with the reviewer that it is critical to make a “fair” comparison between TCR and CAR by testing the 1st generation CAR, which only contains the TCR/CD3z domain. Our new data showed that LAT is not required for microcluster and synapse formation of the 1st generation of CAR, in both Jurkat and primary T cells (new Fig 4 and EV5). This result is similar to our previously reported result from the 3rd generation CAR, although the 1st generation CAR induced less IL-2 production and CD69 expression in LAT null cells than the 3rd generation CAR did (new Fig 6). This suggests that the LAT-independent signaling is intrinsic to the CAR architecture, as the reviewer suggested. The co-signaling domains from CD28 and 4-1BB contribute to, but are not required for bypassing LAT to transduce the CAR signaling.

      The authors may want to cite work from Vignali and colleagues that even the TCR has two signaling modules- the classical ZAP-70/LAT module that is responsible to IL-2 and a Vav/Notch dependent module that controls proliferation. Its not clear to me that the issue raised about distinct signaling by CARs is completely parallel to this, but its interesting that Vignali also associated the classical TCR signaling pathway as responsible for IL-2 with an alterive pathways that uses the same ITAMs to control distinct functions. See Guy CS, Vignali KM, Temirov J, Bettini ML, Overacre AE, Smeltzer M, Zhang H, Huppa JB, Tsai YH, Lobry C, Xie J, Dempsey PJ, Crawford HC, Aifantis I, Davis MM, Vignali DA. Distinct TCR signaling pathways drive proliferation and cytokine production in T cells. Nat Immunol. 2013;14(3):262-70.

      We appreciate the reviewer’s mentioning this paper from Vignali’s group. It provides insights into understanding LAT-independent signaling in CAR T cells. We cited this paper and added a discussion about the mechanism of LAT-independent signaling.

      I would be very interested to see a movie of the LAT deficient T cells interacting with the anti-CD3 coated bilayers in Figure 2A. Since OKT3 has a high affinity for CD3 and is coated on the surface at a density that should engage anti-CD3 I'm surprised there is no clustering even simply based on mass action. The result looks almost like a dominant negative effect of LAT deficiency on a high affinity extracellular interaction. It would be interesting to see how this interface evolves or if there is anti-adhesive behavior that emerges.

      We now presented a movie showing the detailed process of LAT deficient GFP-CAR T cells landing on the bilayers coated with OKT3 (new Movie EV5), in which the bright field images delineate the locations of the cells, the OKT3 signal marks TCR, and the GFP signal marks CAR proteins on the plasma membranes. No TCR clusters (as indicated by OKT3) were formed during the landing process. We think the binding of bilayer-presented OKT3 to TCR is not sufficient to trigger TCR microclusters. However, TCR microclusters could form in LAT-deficient cells if OKT3 is presented by glass surface. This point is raised by reviewer 2. We added a discussion on the difference between bilayer and glass-presented OKT3 in inducing microcluster formation.

      Reviewer #1 (Significance (Required)):

      While it interesting that the CAR is LAT independent, its obvious that the signalling networks are different as the CAR has two sets of motifs that are absent in the TCR, so the experiments as presented are not that insightful about the specific nature of the differences that lead to the different outcomes. At present its not a particularly well controlled experiment as the third gen CAR is changing too many things in relation to the TCR for the experiment to be interpreted. It would be easy to address this is a revised manuscript. To publish as is the discussion would need to acknowledge these limitations. The work is preliminary as science, but it might be useful to T cell engineering field to have this information as a preliminary report, which might be an argument for adding discussion of limitations, but going forward without more detailed analysis of mechanism.

      This is an excellent point and we have addressed it. See our response above on the new data of the 1st generation CAR.

      Reviewer #2 (Evidence, reproducibility and clarity):

      Summary:

      Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      In this study, the authors have interrogated CAR signaling by imaging CD19-CAR microclusters as well as T cell signaling molecules recruited to CAR microclusters. They report differences spatial assembly between CAR and TCR microclusters that form on a lipid bilayer containing ligand. They also report that LAT is not required for CAR microcluster formation, recruitment of downstream signaling molecules or IL-2 production in Jurkat cells, while in primary T cells IL-2 production by CARs show more of a LAT dependence. From these observations, they conclude that CAR T cells have a rewired signaling pathway as compared to T cells that signal through the TCR.

      Major comments:

      • Are the key conclusions convincing?

      The conclusions made by the authors about CAR microclusters are convincing. However, the conclusion that there is a "rewired signaling network" different from TCR microclusters needs to be more convincingly demonstrated in side-by-side comparisons of TCR and CAR microclusters and synapses.

      1. One of the key conclusions in this study is that CAR microclusters form in the absence of LAT, but TCR microclusters require LAT (in JCam2.5 cells in Fig. 2 and primary T cells in Fig. 4B). The requirement of LAT for formation of TCR microclusters is surprising, given multiple reports (one of which the authors have cited) that TCRz and ZAP70 clusters form normally in the absence of LAT (pZAP microclusters form normally in JCam2.5 cells Barda-Saad Nature Immunology 2005 Figure 1; TCRz clusters form normally in LAT CRISPR KO Jurkat cells Yi et al., Nature Communications, 2019 Figure 5). The authors should carefully evaluate TCRz and ZAP70 clusters (that form upstream of LAT) in their assays.

      We thank the reviewer for raising this excellent point. LAT-independent TCR clusters were reported in the two papers mentioned by the reviewer, which we think is convincing. However, there is a key difference in the experimental settings between these two papers and ours. We use supported lipid bilayer to present MOBILE TCR-activating antibody to activate T cells, whereas these two papers used IMMOBILE TCR-activating antibody attached to the cover glass. We reasoned that the mobile surface of supported lipid bilayer more closely mimics the antigen-presenting cell surface where antigens are mobile on the membrane. We added a new discussion about the difference between supported lipid bilayer and cover glass-based activation.

      We agree with the reviewer on the careful evaluation of TCR and ZAP70 clusters. We had showed the data of TCR clusters as marked by TCR-interacting OKT3 (Fig 3A). We performed new experiments on ZAP70 clusters (new Fig EV3). Our data suggest that, similar to TCR clusters, ZAP70 clusters are not formed in LAT-deficient T cells, if activated by OKT3, but are formed if activated by CD19.

      1. The authors make major conclusions about LAT dependence and independence of TCR and CAR microclusters respectively, by using JCam2.5 Jurkat cells and CRISPR/Cas9 edited primary cells. Of relevance to this conclusion, differences in the phosphorylation status of ZAP70 and SLP76 have been described between JCam2.5 cells lacking LAT (in which LAT was found to be deleted by gamma radiation) and J.LAT cells (in which LAT was specifically deleted by CRISPR/Cas9 in Lo et al Nature Immunology 2018). Of importance, pZAP and pSLP76 appeared fairly intact in J.LAT cells, but absent in JCam2.5 cells (Lo et al., Nat Immunol. 2018, Supp Fig 2). Therefore, the authors should evaluate TCRz, ZAP70, Gads and SLP76 in TCR and CAR microclusters in J.LAT cells. This may partly explain the discrepancy in LAT requirement for IL-2 production in JCam2.5 cells and primary cells with LAT CRISPRed out.

      Jcam2.5 is a classical well-characterized LAT-deficient cell line that has been continuously used in the T cell signaling field (Barda-Saad Nature Immunology 2005, Rouquette-Jazdanian A, Mol. Cell, 2012; Balagopalan L, J Imm. 2013; Carpier J, J Exp Med, 2018; Zucchetti A, Nat. Comm. 2019). We agreed with the concern that the reviewer raised on the absence of pZAP70 and pSLP76 in JCam2.5 cells. As the reviewer suggested, we obtained J.LAT, which is LAT null but has intact pZAP70 and pSLP76. We introduced CAR into J.LAT and the wild-type control and performed the clustering assay as we did for Jcam2.5. Our results showed that, similar to Jcam2.5, CAR forms robust microclusters in J.LAT cells (new Fig EV2). More importantly, we presented data confirming the LAT-independent CAR clustering, SLP76 phosphorylation, and IL-2 production in human primary T cells (Fig 7). Therefore, the data from three independent cell sources support our conclusion on LAT-independent CAR signal transduction.

      1. Since the authors are reporting differences between CAR synapses and TCR synapses, the authors should show side by side comparison of CAR and TCR synapses in Figure 1F.

      We focused on characterizing CAR synapse in this manuscript and did not make any conclusion on the difference between TCR and CAR synapse. We are cautious about comparing CAR synapse to TCR synapse for technical reasons: it is critical to use antigen-specific TCRs (e.g. mouse OTI as a common model) to study the TCR synapse pattern so that the study will be physiologically relevant. However, we use human T cell line and human primary T cells for the CAR study. The technical barrier to introduce an antigen-specific TCR complex into these cells, and to activate these cells by purified peptide-MHC complex, is very high. And the result is interesting, but beyond the scope of the current work.

      1. The authors should evaluate Gads microcluster formation in response to TCR stimulation via OKT3 (in Figure 4A). Given that it has been reported that TCRz, Grb2 and c-Cbl are recruited to microclusters in Jurkat cells lacking LAT by CRISPR deletion (Yi et al., Nature Communications, 2019), it is important to establish the differences between TCR microclusters and CAR microclusters in side by side comparisons in their assay system.

      As the reviewer suggested, we evaluated Gads microcluster formation with TCR stimulation and found that Gads did not form microclusters in LAT-deficient cells (new Fig 5A). Because we only made conclusions on the Gads-SLP76 pathway, we think investigating Grb2 and c-Cbl microcluster, though interesting, is beyond the scope of this manuscript.

      1. Similar to the comment about Gads above, the authors should evaluate pSLP76 microcluster formation in response to TCR stimulation via OKT3 in primary T cells lacking LAT in Figure 4C, i.e. side by side comparisons of pSLP76 in TCR and CAR synapses (with and without LAT) should be shown.

      We totally agree and performed new experiment on pSLP76 in human primary T cells. Our data suggested that, similar to Jurkat, pSLP76 microclusters remain intact in LAT null primary cells (new Fig 7D and 7E).

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?
      1. The data shown in Figure 3C shows a reduction in conjugate formation from 80% (WT) to 30% (LAT -). This is a severe reduction and does not support the authors' claim in the corresponding Figure legend that "LAT is dispensable for cell conjugate formation between Jurkat T cells expressing CAR and Raji B cells" and the Abstract that "LAT.....is not required for....immunological synapse formation". Statistical analysis for variance should be shown here.

      We agree with the reviewer’s judgement. This cell conjugation analysis was performed using Jcam2.5 cells. As pointed by the reviewer, Jcam2.5 has additional defects in ZAP70 and SLP76 in addition to the lack of LAT. Therefore, we performed the same analysis again using J.LAT cells, which was recommended by the reviewer. Our new data showed that J.LAT cells form conjugates with Raji B cells in a similar rate as the wild-type cells do, as evaluated by statistical analysis (new Fig 6A). Therefore, we think these new data support the claim that LAT is dispensable for cell conjugate formation.

      1. In a similar vein, based on data from Movie S5 (where in a single cell, CAR microclusters translocate from cell periphery to center), and Figure 3C where (as described above in point 1) conjugate formation appears to be severely reduced, the authors conclude in the Results and Abstract that "LAT....is not required for actin remodeling following CAR activation". This conclusion is not supported by the data and the authors should remove this claim. Alternatively, actin polymerization in CAR expressing cells (that are LAT sufficient and deficient) can be easily evaluated using phalloidin or F-Tractin.

      As suggested by the reviewer, we evaluated actin polymerization in TCR or CAR stimulated cells using a filamentous actin reporter F-tractin. Our data showed that LAT is required for TCR-induced but not CAR-induced actin polymerization (new Fig 5C). Therefore, our results support the claim that LAT is not required for actin remodeling following CAR activation.

      • Would additional experiments be essential to support the claims of the paper?<br> Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      Yes. Please see major comments above.

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Yes. It should take 3 months to complete these experiments, since reagents and experimental systems to do these experiments already exist.

      • Are the data and the methods presented in such a way that they can be reproduced?<br> Yes. Methods are clearly explained.

      We appreciate the reviewer’s recognition of the clarity of the methods part.

      • Are the experiments adequately replicated and statistical analysis adequate?

      There is no statistical analysis to evaluate differences between samples in Figures 3 and 4. These must be included.

      We now added statistical analysis in Fig 5B and 6A (old figure 3 and 4).

      Minor comments:

      • Specific experimental issues that are easily addressable.

      Please see Major Comments above. We believe that the recommended experiments are not difficult to execute since reagents exist and experimental systems are already set up.

      • Are prior studies referenced appropriately?

      Authors reference 13 and 14 for the following sentence in Results section 2: "Deletion or mutation of LAT impairs formation of T cell microclusters". However, in Reference 14 Barda-Saad et al., actually show that pZAP clusters are intact in JCam2.5 cells lacking LAT. Perhaps authors should clarify that LAT (and downstream signaling molecule) microclusters are impaired when LAT is deleted or mutated.

      As the reviewer suggested, we now clarified that clustering of LAT downstream binding partners is impaired when citing reference (Barda-Saad et al).

      • Are the text and figures clear and accurate?

      Yes. But would be helpful if authors specify what "control" is in Fig. 3B and C. In Figure 3B it is lipid bilayers without CD19, while in 3C it is K562 cells that do not express CD19.

      We now specified “control” in the figure.

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Would be helpful if authors specify in every Figure or at least Figure legend the experimental bilayer system/ligand used, since they use both OKT3 and CD19 as ligands in the paper.

      We now specified the ligand in the figure or legend.

      Reviewer #2 (Significance):

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      If CAR microclusters and synapses are appropriately compared in a side by side comparison with TCR microclusters and synapses (as described in comments above), this study will be a conceptual advance in the field of CAR signaling. CAR microclusters have not been studied previously.

      • Place the work in the context of the existing literature (provide references, where appropriate).

      Very little imaging has been done on CAR synapses and to our knowledge this is the first live cell imaging study describing CAR microclusters.

      We appreciate this reviewer’s comment on our work as a conceptual advance in understanding CAR signaling.

      • State what audience might be interested in and influenced by the reported findings.<br> This study will have a broad audience. Both scientists that study basic T cell signaling as well as clinicians that use CAR Ts will be interested in this study.

      We appreciate this reviewer’s recognition of the broad audience of this manuscript.

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      T cell signaling and imaging of proximal T cell signaling responses.

      Reviewer #3 (Evidence, reproducibility and clarity):

      This manuscript by Dong and colleagues characterizes the molecular requirements and consequences of engaging a third-generation chimeric antigen receptor (CAR) directed to CD19. Utilizing a biological system of JCaM2.5, a Jurkat T cell mutant with dramatically low levels of LAT, expressing a CAR directed to CD19 fused to the cytoplasmic tails of CD28, 4-1BB and CD3z that is activated by CD19/ICAM1 reconstituted lipid bilayers, the authors demonstrate LAT is not required for microcluster formation, immunologic synapse formation or recruitment of GADS and pSLP76 to the plasma membrane. In contrast, LAT was required for anti-CD3 mediated microcluster formation and pSLP76 recruitment to the plasma membrane. However, LAT does appear to contribute to efficient synapse formation, PIP2 hydrolysis and IL-2 secretion when CAR+ JCaM2.5 or primary T cells are presented with Raji B cells, respectively. These data provide intriguing insights into the molecular requirements for third-generation CAR-T cell functions. The authors have developed quite a nice system to understand the molecular contributions for CAR-T function. A few suggestions are provided here to further enhance the accuracy and significance of the findings:

      1. The authors can address whether the LAT-independent effects are due to the attributes of third generation CAR-Ts with inclusion of CD28 and 4-1BB cytoplasmic domains or whether these differences are intrinsic to all CAR-Ts (e.g., first and second generation CARs).

      This is an excellent point. We have included new data showing LAT-independent cluster formation of the 1st generation CAR in both Jurkat and primary T cells (new Fig 4 and EV5). Therefore, we favor the second possibility as pointed by the reviewer that LAT-independent effects are intrinsic to CAR architecture.

      1. Since a first-generation CAR-T forms non-conventional synapses (Davenport, et al., PNAS 2018), the authors should consider more detailed kinetic analysis to understand the formation and dissolution of the constituents of the synapse with their third generation CAR. This should include measurements of the duration of microcluster and synapse formation as well as further analysis of c- and p-SMAC constituents (e.g., LFA-1, TALIN, LCK and pSLP76) over time.

      We agree with the reviewer on a more detailed characterization of the CAR synapse. We measured the duration of the unstable CAR synapse and time from cell landing to the start of retrograde flow (new Fig 2C). We also determined the localization of CD45, a marker for d-SMAC (new Fig 2D). We found that the formation of dSMAC is also not common in CAR T synapse, strengthening our conclusion that CAR forms non-typical immunological synapse.

      1. The authors utilize two different activation platforms. While using CD19/ICAM1 reconstituted bilayers, CAR+ JCaM2.5 or CAR+ primary T cells demonstrate no differences compared to wildtype JCaM2.5 cells in the parameters studied. However, when using Raji B cells, the CAR+ JCaM2.5 cells or CAR+ primary T cells demonstrate a more intermediate phenotype with respect to cell conjugate formation (Figure 3C) and IL-2 production (Figure 4D). The authors should analyze whether the differences attributed to the different outcomes may be due to the stimulation mode. For example, is c-SMAC assembly and GADS or pSLP76 recruitment to the plasma membrane still LAT-independent when activated with Raji B cells?

      As the reviewer suggested, we examined c-SMAC assembly in Raji B cells conjugated with CAR T cells. We found that the majority of CAR do not form cSMAC (new Fig EV4), which is consistent with the result from the bilayer activation system. Since both Gads and SLP76 are cytosolic proteins, they keep largely in the cytosolic pool which obscures their recruitment and clustering on the plasma membrane when imaged by confocal microscopy at the cross-section of cell-cell synapse.

      1. The authors should consider whether CAR expression level affects their observations. For example, do lower levels of CAR expression make the system LAT-dependent? Further, what is the level of the CAR relative to endogenous TCR expression on their primary T cells.

      We agree with the reviewer that it is informative to determine if LAT-independent signaling is dose dependent. We tried to measure the CAR concentration relative to the endogenous TCR/CD3z. By western blot using two different antibodies against CD3z, we detected TCR/CD3z expression, but found no bands corresponding to CAR. We believe this reflects a low expression of CAR in our system, which is confirmed by FACS. The general low expression of CAR makes it challenging to sort an even lower CAR-expressing population. Therefore, we sought alternative ways to determine the dose-dependence; we titrated the CD19 concentrations on the bilayer. As shown in the new Figure EV1, CAR formed microclusters similarly in the wild-type versus LAT-deficient cells in a wide range of CD19 concentration. Therefore, we conclude that the LAT-independent cluster formation is robust at low antigen density as well.

      Minor comment:

      1. Since JCaM2.5 has differences when compared to the parental Jurkat E6.1 T cell line, the authors should utilize JCaM2.5 reconstituted with wildtype LAT as a comparator.<br> Agreeing with this reviewer, we recognized that Jcam2.5 was generated by mutagenesis which may result in protein expression difference for genes besides Lat. As suggested by reviewer1, we used J.LAT, a genuine LAT knockout cell line that is generated by CRISPR-mediated gene targeting, to perform the clustering assay (new Fig EV2). Our results showed that, similar to Jcam2.5, CAR but not the TCR formed microclusters in J.LAT cells.

      Reviewer #3 (Significance):

      The mechanism(s) by which CAR-Ts function is of high significance from both scientific and clinical viewpoints. From a scientific viewpoint, it provides important basic mechanistic information of how T cells are being activated to kill tumor cells. By understanding the molecular requirements, additional generations of CARs can be designed to provide greater efficacy, overcome resistance and possibly less toxicity.

      This is an evolving field and little is known to date. Hence, this study could represent an insightful and important advance to the field.

      Audience is to both basic immunologist and cancer biologists.

      We appreciate this reviewer’s comments on the high significance of our work to the field of both basic immunology and clinical application.

      My expertise is in T cell signaling, T cell biology and immunotherapy.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This manuscript by Dong and colleagues characterizes the molecular requirements and consequences of engaging a third-generation chimeric antigen receptor (CAR) directed to CD19. Utilizing a biological system of JCaM2.5, a Jurkat T cell mutant with dramatically low levels of LAT, expressing a CAR directed to CD19 fused to the cytoplasmic tails of CD28, 4-1BB and CD3 that is activated by CD19/ICAM1 reconstituted lipid bilayers, the authors demonstrate LAT is not required for microcluster formation, immunologic synapse formation or recruitment of GADS and pSLP76 to the plasma membrane. In contrast, LAT was required for anti-CD3 mediated microcluster formation and pSLP76 recruitment to the plasma membrane. However, LAT does appear to contribute to efficient synapse formation, PIP2 hydrolysis and IL-2 secretion when CAR+ JCaM2.5 or primary T cells are presented with Raji B cells, respectively. These data provide intriguing insights into the molecular requirements for third-generation CAR-T cell functions.

      The authors have developed quite a nice system to understand the molecular contributions for CAR-T function. A few suggestions are provided here to further enhance the accuracy and significance of the findings:

      1. The authors can address whether the LAT-independent effects are due to the attributes of third generation CAR-Ts with inclusion of CD28 and 4-1BB cytoplasmic domains or whether these differences are intrinsic to all CAR-Ts (e.g., first and second generation CARs).
      2. Since a first-generation CAR-T forms non-conventional synapses (Davenport, et al., PNAS 2018), the authors should consider more detailed kinetic analysis to understand the formation and dissolution of the constituents of the synapse with their third generation CAR. This should include measurements of the duration of microcluster and synapse formation as well as further analysis of c- and p-SMAC constituents (e.g., LFA-1, TALIN, LCK and pSLP76) over time.
      3. The authors utilize two different activation platforms. While using CD19/ICAM1 reconstituted bilayers, CAR+ JCaM2.5 or CAR+ primary T cells demonstrate no differences compared to wildtype JCaM2.5 cells in the parameters studied. However, when using Raji B cells, the CAR+ JCaM2.5 cells or CAR+ primary T cells demonstrate a more intermediate phenotype with respect to cell conjugate formation (Figure 3C) and IL-2 production (Figure 4D). The authors should analyze whether the differences attributed to the different outcomes may be due to the stimulation mode. For example, is c-SMAC assembly and GADS or pSLP76 recruitment to the plasma membrane still LAT-independent when activated with Raji B cells?
      4. The authors should consider whether CAR expression level affects their observations. For example, do lower levels of CAR expression make the system LAT-dependent? Further, what is the level of the CAR relative to endogenous TCR expression on their primary T cells.

      Minor comment:

      1. Since JCaM2.5 has differences when compared to the parental Jurkat E6.1 T cell line, the authors should utilize JCaM2.5 reconstituted with wildtype LAT as a comparator.

      Significance (Required)

      The mechanism(s) by which CAR-Ts function is of high significance from both scientific and clinical viewpoints. From a scientific viewpoint, it provides important basic mechanistic information of how T cells are being activated to kill tumor cells. By understanding the molecular requirements, additional generations of CARs can be designed to provide greater efficacy, overcome resistance and possibly less toxicity.

      This is an evolving field and little is known to date. Hence, this study could represent an insightful and important advance to the field.

      Audience is to both basic immunologist and cancer biologists.

      My expertise is in T cell signaling, T cell biology and immunotherapy.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      In this study, the authors have interrogated CAR signaling by imaging CD19-CAR microclusters as well as T cell signaling molecules recruited to CAR microclusters. They report differences spatial assembly between CAR and TCR microclusters that form on a lipid bilayer containing ligand. They also report that LAT is not required for CAR microcluster formation, recruitment of downstream signaling molecules or IL-2 production in Jurkat cells, while in primary T cells IL-2 production by CARs show more of a LAT dependence. From these observations, they conclude that CAR T cells have a rewired signaling pathway as compared to T cells that signal through the TCR.

      Major comments:

      Are the key conclusions convincing?

      The conclusions made by the authors about CAR microclusters are convincing. However, the conclusion that there is a "rewired signaling network" different from TCR microclusters needs to be more convincingly demonstrated in side-by-side comparisons of TCR and CAR microclusters and synapses.

      1. One of the key conclusions in this study is that CAR microclusters form in the absence of LAT, but TCR microclusters require LAT (in JCam2.5 cells in Fig. 2 and primary T cells in Fig. 4B). The requirement of LAT for formation of TCR microclusters is surprising, given multiple reports (one of which the authors have cited) that TCR and ZAP70 clusters form normally in the absence of LAT (pZAP microclusters form normally in JCam2.5 cells Barda-Saad Nature Immunology 2005 Figure 1; TCR clusters form normally in LAT CRISPR KO Jurkat cells Yi et al., Nature Communications, 2019 Figure 5). The authors should carefully evaluate TCR and ZAP70 clusters (that form upstream of LAT) in their assays.
      2. The authors make major conclusions about LAT dependence and independence of TCR and CAR microclusters respectively, by using JCam2.5 Jurkat cells and CRISPR/Cas9 edited primary cells. Of relevance to this conclusion, differences in the phosphorylation status of ZAP70 and SLP76 have been described between JCam2.5 cells lacking LAT (in which LAT was found to be deleted by gamma radiation) and J.LAT cells (in which LAT was specifically deleted by CRISPR/Cas9 in Lo et al Nature Immunology 2018). Of importance, pZAP and pSLP76 appeared fairly intact in J.LAT cells, but absent in JCam2.5 cells (Lo et al., Nat Immunol. 2018, Supp Fig 2). Therefore, the authors should evaluate TCR, ZAP70, Gads and SLP76 in TCR and CAR microclusters in J.LAT cells. This may partly explain the discrepancy in LAT requirement for IL-2 production in JCam2.5 cells and primary cells with LAT CRISPRed out.
      3. Since the authors are reporting differences between CAR synapses and TCR synapses, the authors should show side by side comparison of CAR and TCR synapses in Figure 1F.
      4. The authors should evaluate Gads microcluster formation in response to TCR stimulation via OKT3 (in Figure 4A). Given that it has been reported that TCR, Grb2 and c-Cbl are recruited to microclusters in Jurkat cells lacking LAT by CRISPR deletion (Yi et al., Nature Communications, 2019), it is important to establish the differences between TCR microclusters and CAR microclusters in side by side comparisons in their assay system.
      5. Similar to the comment about Gads above, the authors should evaluate pSLP76 microcluster formation in response to TCR stimulation via OKT3 in primary T cells lacking LAT in Figure 4C, i.e. side by side comparisons of pSLP76 in TCR and CAR synapses (with and without LAT) should be shown.

      Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      1. The data shown in Figure 3C shows a reduction in conjugate formation from 80% (WT) to 30% (LAT -). This is a severe reduction and does not support the authors' claim in the corresponding Figure legend that "LAT is dispensable for cell conjugate formation between Jurkat T cells expressing CAR and Raji B cells" and the Abstract that "LAT.....is not required for....immunological synapse formation". Statistical analysis for variance should be shown here.
      2. In a similar vein, based on data from Movie S5 (where in a single cell, CAR microclusters translocate from cell periphery to center), and Figure 3C where (as described above in point 1) conjugate formation appears to be severely reduced, the authors conclude in the Results and Abstract that "LAT....is not required for actin remodeling following CAR activation". This conclusion is not supported by the data and the authors should remove this claim. Alternatively, actin polymerization in CAR expressing cells (that are LAT sufficient and deficient) can be easily evaluated using phalloidin or F-Tractin.

      Would additional experiments be essential to support the claims of the paper?<br> Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      Yes. Please see major comments above.

      Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Yes. It should take 3 months to complete these experiments, since reagents and experimental systems to do these experiments already exist.

      Are the data and the methods presented in such a way that they can be reproduced?

      Yes. Methods are clearly explained.

      Are the experiments adequately replicated and statistical analysis adequate?

      There is no statistical analysis to evaluate differences between samples in Figures 3 and 4. These must be included.

      Minor comments:

      Specific experimental issues that are easily addressable.

      Please see Major Comments above. We believe that the recommended experiments are not difficult to execute since reagents exist and experimental systems are already set up.

      Are prior studies referenced appropriately?

      Authors reference 13 and 14 for the following sentence in Results section 2: "Deletion or mutation of LAT impairs formation of T cell microclusters". However, in Reference 14 Barda-Saad et al., actually show that pZAP clusters are intact in JCam2.5 cells lacking LAT. Perhaps authors should clarify that LAT (and downstream signaling molecule) microclusters are impaired when LAT is deleted or mutated.

      Are the text and figures clear and accurate?

      Yes. But would be helpful if authors specify what "control" is in Fig. 3B and C. In Figure 3B it is lipid bilayers without CD19, while in 3C it is K562 cells that do not express CD19.

      Do you have suggestions that would help the authors improve the presentation of their data and conclusions?<br> Would be helpful if authors specify in every Figure or at least Figure legend the experimental bilayer system/ligand used, since they use both OKT3 and CD19 as ligands in the paper.

      Significance (Required)

      Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      If CAR microclusters and synapses are appropriately compared in a side by side comparison with TCR microclusters and synapses (as described in comments above), this study will be a conceptual advance in the field of CAR signaling. CAR microclusters have not been studied previously.

      Place the work in the context of the existing literature (provide references, where appropriate).

      Very little imaging has been done on CAR synapses and to our knowledge this is the first live cell imaging study describing CAR microclusters.

      State what audience might be interested in and influenced by the reported findings.

      This study will have a broad audience. Both scientists that study basic T cell signaling as well as clinicians that use CAR Ts will be interested in this study.

      Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      T cell signaling and imaging of proximal T cell signaling responses.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors compare the TCR alone to a CAR that contains signaling modules from three receptors- TCR, CD28 and 41BB. The data quality if good and the experiments done are. The difference is quite clear, and I would even like to see a little more of the evidence related to failure of the TCR system.

      More specifically:

      Su and colleagues show that a third generation CAR with TCR zeta, CD28 and 41BB signal transduction pathways can activate a T cell for microcluster formation and Gads/SLP-76 recruitment, but not IL-2 production, without LAT. This is surprising because LAT is generally considered, as is up held here, as an essential adapter protein for T cell activation. However, this is not a "fair" experiment as the CAR has sequences from TCR, and two co-stimulatory receptor- CD28 and 41BB. It would be important and very straight-forward to test first and second generation CARs to determine if LAT independence is a function of the CAR architecture itself, or the additional costimulatory sequences. If it turns out that a first generation CAR with only TCR sequences can trigger LAT independent clustering and SLP-76 recruitment then the comparison would be fair and no additional experiment would be needed to make the point that the CAR architecture is intrinsically LAT independent. If the CD28 and/or 41BB sequences are needed for LAT independence then the fair comparison would be to co-crosslink TCR, CD28 and 41BB (an inducible costimulator such that anti-CD27 might be substituted to have a constitutively expressed receptor with this similar motifs) should be cross-linked with the TCR to make this a fair comparison between the two architectures.

      The authors may want to cite work from Vignali and colleagues that even the TCR has two signaling modules- the classical ZAP-70/LAT module that is responsible to IL-2 and a Vav/Notch dependent module that controls proliferation. Its not clear to me that the issue raised about distinct signaling by CARs is completely parallel to this, but its interesting that Vignali also associated the classical TCR signaling pathway as responsible for IL-2 with an alterive pathways that uses the same ITAMs to control distinct functions. See Guy CS, Vignali KM, Temirov J, Bettini ML, Overacre AE, Smeltzer M, Zhang H, Huppa JB, Tsai YH, Lobry C, Xie J, Dempsey PJ, Crawford HC, Aifantis I, Davis MM, Vignali DA. Distinct TCR signaling pathways drive proliferation and cytokine production in T cells. Nat Immunol. 2013;14(3):262-70.

      I would be very interested to see a movie of the LAT deficient T cells interacting with the anti-CD3 coated bilayers in Figure 2A. Since OKT3 has a high affinity for CD3 and is coated on the suface at a density that should engage anti-CD3 I'm surprised there is no clustering even simply based on mass action. The result looks almost like a dominant negative effect of LAT deficiency on a high affinity extracellular interaction. It would be interesting to see how this interface evolves or if there is anti-adhesive behavior that emerges.

      Significance

      While it interesting that the CAR is LAT independent, its obvious that the signalling networks are different as the CAR has two sets of motifs that are absent in the TCR, so the experiments as presented are not that insightful about the specific nature of the differences that lead to the different outcomes. At present its not a particularly well controlled experiment as the third gen CAR is changing too many things in relation to the TCR for the experiment to be interpreted. It would be easy to address this is a revised manuscript. To publish as is the discussion would need to acknowledge these limitations. The work is preliminary as science, but it might be useful to T cell engineering field to have this information as a preliminary report, which might be an argument for adding discussion of limitations, but going forward without more detailed analysis of mechanism.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Rebuttal to reviewers ReviewCommons manuscript # RC-2020-00281

      We would like to thank the reviewers and editors of Review Commons for evaluating our manuscript entitled “Transcriptional comparison of Testicular Adrenal Rest Tumors with fetal and adult tissues” and providing their valuable comments. We have listed the reviewers’ comments along with our response and amendments below.

      Board Advice on initial submission:

      This seems to be a study mainly relevant to the field of Testicular Adrenal Rest Tumors (TART). It presents the first RNAseq profiling of these tumors in multiple human samples at different stages. This has the potential to advance knowledge in this particular field. It would be less interesting to researchers interested in tissue spatial transcriptomics in general, since the experimental and computational tools are quite standard, but the findings may be important to the TART field.

      Response: Indeed, this is the first study using transcriptomics to characterize Testicular Adrenal Rest Tumors, a frequent occurrence in patients with Congenital Adrenal Hyperplasia. It is also the first to find that the reported adrenal and testicular features of these tumors can be found in a single cell. We therefore believe this study is not only of interest to those working in the TART field, but also in development, endocrinology and andrology in general.

      Comments Reviewer #1:

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      The manuscript by Schroder M., et al describes the whole transcriptome of testicular adrenal rest tumors (TART) and shows that TART tissue is characteristically similar to adult adrenal and testicular rather than fetal adrenal and testicular tissues. The authors propose that their previous claim that TART is derived from an undifferentiated pluripotent progenitor is likely untrue and claim that TART likely originates from a mature cell type with both adrenal and testicular characteristics. The authors describe a unique cell type most similar to the adult adrenal, but with variable testis-specific gene expression patterns. The finding of overexpressed genes associated with ECM remodeling is interesting and may provide insight into the natural history of these tumors. A strength of the study is the number of tissue samples since surgery for these rare tumors is usually not performed.

      **Major Comments:**

      • The key conclusions are mostly based on RNA studies, thus their claims are preliminary.

      Response: We agree that a major part of our conclusions is based on RNA studies. Although indeed primarily based on transcriptomics, this claim is, in our opinion, not preliminary as the identity of TART cells can definitively be deduced from their expression profile. Our second key conclusion, i.e. that TART cells comprise both adrenal and testicular features within the same, unique, TART specific cell, is based on immunohistochemistry of adrenal and testis-specific enzymes.

      In Figure 1/Result p. 3: Authors claim that there were no exclusive HSD17B3 staining cells without CYP11B1, however Figure 1 looks like there are exclusively green (HSD17B) areas (especially TART3). The authors need to address this. It appears as if there are mature Leydig cells. This is important because the presence of Leydig cells would affect the interpretation of the findings.

      Response: We do understand the concern of the reviewer. Aspecific background staining for HSD17B3 in TART samples complicated the differentiation between specific and background staining. This can be seen when comparing the staining in HSD17B3-positive (Leydig) cells with the background staining in non-Leydig cells in testis tissue and in a portion of TART cells. In TART, we found that cells with high intensity, specific HSD17B3 staining all also showed CYP11B1 staining, but not vice-versa. However, we do acknowledge that due to this -most likely background- staining, the occurrence of mature Leydig cells in TART cannot be completely excluded based on our results.

      Therefore, we have tried to be more careful in our claims in the results section (page 3; TART cells express adrenal- and Leydig cell-specific steroidogenic enzymes paragraph) and we have addressed this in the discussion section (page 5/6):

      High background staining for HSD17B3 complicated the differentiation between specific and background staining. For some cells this exclusive HSD17B3 staining might have been specific and therefore, despite that most HSD17B3-positive cells were positive for CYP11B1, the absence of mature Leydig cells in TART could not be guaranteed by these results.

      Discussion: authors state that based on their previous observations that fetal Leydig cells have both adrenal and testis developmental potential. It was speculated that TART might have been derived from a totipotent progenitor cell type, but the current study shows that these tumors lack similarities with fetal tissues. Thus, the authors claim that these tumors are not derived from the transdifferentiation of pluripotent cells. However what is the origin of this mature distinct cell type? Is it not possible that this distinctive cell type is derived from a common progenitor since the testis and adrenal gland are derived from the same adrenogonadal primordium? Lack of similarities with fetal tissues at this late stage of development does not necessarily rule out a common progenitor origin.

      Response: In this study, we compared the TART transcriptome with fetal tissues, as we hypothesized these might be similar considering the likely progenitor origin of TART cells. However, this was not the case, and we showed that the transcriptomic profile of TART resembles the transcriptomic profile of mature cell types, rather than their fetal counterparts. Therefore, we conclude that the hypothesis that TART arises from progenitor cells is not supported by our data. The reviewer is correct that we did not prove that it is not derived from pluripotent cells. We have therefore added the following text to the discussion:

      Although we here find that the transcriptome of TART tissues are clearly distinct from fetal tissues, we did not prove that TART does not originate from fetal Leydig cells. TART being derived from a multipotent progenitor cell is still possible as we initially hypothesized, given the fact that TART is likely already present in utero and its resemblance to both testis and adrenal tissues which derive from a common primordium. Therefore, we were surprised to find TART to be more like adult adrenal and testis tissue, raising the possibility of TART being derived from a ‘mature’ progenitor cell type, i.e. adult stem Leydig cells or adrenal progenitor cells, that under influence of high ACTH levels and/or the localization in the testicular region might differentiate into a distinct cell type that expresses both adrenal- and testis-specific markers. However, this remains to be established.

      **Minor Comment:**

      In Methods: Was RNA isolated from FFPE sections or frozen tissue?

      We agree that this was not clearly mentioned enough in our original manuscript, as both frozen (RNA isolation) and FFPE (IHC) material was used. We have now clarified in the methods section that the RNA was retrieved from frozen tissue samples (page 8; RNA isolation, library preparation, and sequencing paragraph).

      Reviewer #1 (Significance (Required)):

      This first study of transcriptome analysis of TART provides useful insight into the characteristics of these rare tumors that commonly develop in males with classic CAH. This study provides a foundation for further investigation of the biological pathways contributing to the development of TART, the most common cause of male infertility in CAH. This study is of interest to endocrinologists. Reviewed by a pediatric endocrinologist and molecular biologist - we are not completely aware of the sequencing analysis but are familiar with clustering and enrichment analysis.

      Comments Reviewer #3:

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Schröder et al describes the transcriptome sequencing of TARTs in CAH/CS in order to sort out the origin of TARTs. This is an interesting subject and the manuscript is well-written but I have a few comments that could be addressed.

      • Some parts of the Results should be in the Methods and some in the Discussion. In the Results only the results should be given.

      Response: We agree that we have incorporated some methodological sentences and some concluding remarks in the results sections to, in our opinion, improve the flow of the manuscript. As the manuscript guidelines differ between journals, we have for now decided not to change this. We will do so if this is wanted by the concerning journal.

      Normally TARTs are not removed or biopsied, if not by mistake... Thus, most centers would not have tissue samples of TARTs at all. How come you have so many samples available?

      Response: We thank the reviewer for highlighting this. As indeed TARTs are not routinely removed, the number of TART tissues included in our dataset is unique. Most of the TART samples were already obtained in 2004 because of reported pain and discomfort and in an attempt to improve semen quality in these patients. Removal of those particular TART samples have led to new insights that removal of longstanding TART did not improve semen parameters, nor parameters of pituitary-gonadal function (Claahsen-van der Grinten et al., 2007). Therefore, to date, the only indication for surgery for the removal of longstanding TART is the relief of pain or discomfort.

      Ref 2 and 3 are rather old and similar. Could newer review references be used instead?

      Response: We have changed those two references for a more recent review by Dr. Witchel on Congenital Adrenal Hyperplasia, who addresses both statements in a more recent review (Witchel, 2017).

      Reviewer #3 (Significance (Required)):

      New and significant study. Very interesting for people dealing with CAH patients.

      References

      Claahsen-van der Grinten, H. L., Otten, B. J., Takahashi, S., Meuleman, E. J. H., Hulsbergen-van de Kaa, C., Sweep, F. C. G. J., & Hermus, A. R. M. M. (2007). Testicular adrenal rest tumors in adult males with congenital adrenal hyperplasia: Evaluation of pituitary-gonadal function before and after successful testis-sparing surgery in eight patients. Journal of Clinical Endocrinology & Metabolism, 92(2), 612-615. doi:10.1210/jc.2006-1311

      Witchel, S. F. (2017). Congenital Adrenal Hyperplasia. J Pediatr Adolesc Gynecol, 30(5), 520-534. doi:10.1016/j.jpag.2017.04.001

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript by Schröder et al describes the transcriptome sequencing of TARTs in CAH/CS in order to sort out the origin of TARTs. This is an interesting subject and the manuscript is well-written but I have a few comments that could be addressed.

      1. Some parts of the Results should be in the Methods and some in the Discussion. In the Results only the results should be given.
      2. Normally TARTs are not removed or biopsied, if not by mistake... Thus, most centers would not have tissue samples of TARTs at all. How come you have so many samples available?
      3. Ref 2 and 3 are rather old and similar. Could newer review references be used instead?

      Significance

      New and significant study. Very interesting for people dealing with CAH patients.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The manuscript by Schroder M., et al describes the whole transcriptome of testicular adrenal rest tumors (TART) and shows that TART tissue is characteristically similar to adult adrenal and testicular rather than fetal adrenal and testicular tissues. The authors propose that their previous claim that TART is derived from an undifferentiated pluripotent progenitor is likely untrue and claim that TART likely originates from a mature cell type with both adrenal and testicular characteristics. The authors describe a unique cell type most similar to the adult adrenal, but with variable testis-specific gene expression patterns. The finding of overexpressed genes associated with ECM remodeling is interesting and may provide insight into the natural history of these tumors. A strength of the study is the number of tissue samples since surgery for these rare tumors is usually not performed.

      Major Comments:

      1. The key conclusions are mostly based on RNA studies, thus their claims are preliminary.

      2. In Figure 1/Result p. 3: Authors claim that there were no exclusive HSD17B3 staining cells without CYP11B1, however Figure 1 looks like there are exclusively green (HSD17B) areas (especially TART3). The authors need to address this. It appears as if there are mature Leydig cells. This is important because the presence of Leydig cells would affect the interpretation of the finidings

      3. Discussion: authors state that based on their previous observations that fetal Leydig cells have both adrenal and testis developmental potential. It was speculated that TART might have been derived from a totipotent progenitor cell type, but the current study shows that these tumors lack similarities with fetal tissues. Thus, the authors claim that these tumors are not derived from the transdifferentiation of pluripotent cells. However what is the origin of this mature distinct cell type? Is it not possible that this distinctive cell type is derived from a common progenitor since the testis and adrenal gland are derived from the same adrenogonadal primordium? Lack of similarities with fetal tissues at this late stage of development does not necessarily rule out a common progenitor origin.

      Minor Comment:

      In Methods: Was RNA isolated from FFPE sections or frozen tissue?

      Significance

      This first study of transcriptome analysis of TART provides useful insight into the characteristics of these rare tumors that commonly develop in males with classic CAH. This study provides a foundation for further investigation of the biological pathways contributing to the development of TART, the most common cause of male infertility in CAH. This study is of interest to endocrinologists. Reviewed by a pediatric endocrinologist and molecular biologist - we are not completely aware of the sequencing analysis but are familiar with clustering and enrichment analysis.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript entitled "Vasohibin-1 mediated tubulin detyrosination selectively regulates secondary sprouting and lymphangiogenesis in the zebrafish trunk" by de Oliveira investigates the function of the carboxylpeptidase Vasohibin during the formation of the zebrafish trunk vasculature and reports a requirement of Vasohibin for secondary sprout formation and in particular the formation the lymphatic vasculature.

      Having established the expression of Vasohibin in sorted ECs of 24 hpf embryos, the remaining study addresses the function of Vasohibin in this cell type. It is largely based on the use of a splice-site interfering morpholino. Particular commendable is the analysis, demonstrating that the KD of vash-1 indeed results in a significant reduction of detyrosination in endothelial tubulin. Findings in the vascular system then include: (i) the detection of increased division and hence supernumerous cells occurring selectively in 2nd sprouts from the PCV; (ii) an increased persistence of the initially formed 3 way connections with ISV and artery; (iii) reduced formation of parachordal lymphangioblasts and (iv) a reduced number of somites with a thoracic duct segment; (v) frequent formation of lumenized connections between PLs (where present) and ISV. To demonstrate specificity, the approach was repeated with a different morpholino and defects were partially rescued by MO-insensitive RNA.

      Possible additional and relevant information could include data on a vash-1 promotor mutant to independently verify the MO-based functional analysis. Mutants would also allow analysis of further development, are the defects leading to the demise of the fish or is a later regeneration and normalization of the lymphatic vasculature observed?

      We agree that a mutant would be desirable to validate the phenotypic analysis of the morpholinos used, and would also allow for further analysis. However, this is not achievable within a reasonnable time frame, especially in the context of current work restrictions.

      In addtion to the two splice morpholinos currently used to knockdown vash-1 expression, we will use an ATG morpholino to further investigate our observations and hypothesis regarding the role of vash-1 in lymphatic vessels formation. We will also validate it by westernblot and attempt to rescue it with mRNA.

      We have not investigated the phenotype past 4 dpf. We will add investigation of lymphatics and morphology at 5 dpf.

      In addition, are other lymphatic vessel beds like the cranial lymphatics affected?

      Using the Tg[fli1a:EGFP]y7 line, we have not been able to identify apparent differences in other vascular beds including the cranial lymphatics. However a detailed fine-grained investigation of the cranial vascular bed has not been performed. Given the focus of the present study on the trunk vasculature to understand the mechanisms of vash-1, we feel that a detailed analysis of cranial lymphatics would at this stage be somewhat out of scope.

      PLs have been demonstrated to be at least partially guided in their movement by the CXCR4/SDF1 system and SVEP1. Has the expression of these factors been tested in vash-1 KDs?

      We have not investigated the potential role of the CXCR4/SDF1 system and SVEP1 in vash-1 regulation of lymphangiogenesis. We will investigate the expression of cxcr4a, cxcl12a, cxcl12b and svep1 by in situ hibridization upon vash-1 knockdown.

      With regards to the frequently observed connections of PLs and ISVs in vash-1 morphants, can the proposed lumen formation of these shunts be demonstrated e.g. by injection of Q-dots or microbeads into the circulation?

      Although the lumenisation is very clear thanks to the membrane targeted expression of the label in this line, we will further analyse whether these abberant ISV to ISV connection can be perfused by Q-dots injections.

      Concerning the mechanisms of these defects, is it possible to analyse the asymmetric cell division leading to 2nd sprouts in greater detail? Is the same number or are more cells sprouting form PCV and can the fli1ep:EGFP-DCX cell line in fixed samples be used to identify the spindle orientation in dividing cells?

      We agree with the reviewer and plan to use the Tg[fli1ep:EGFP-DCX] fish line to investigate spindle asymmetry in uninjected embryos, as well as compare the spindle in control MO and vash-1 KD embryos. Vash-1 has been shown to regulate spindle formation in osteosarcoma cells (Liao et al., 2019). We will attempt to clarify whether this function is conserved in endothelial cells and contributes to the control of endothelial cell proliferation during initiation and formation of secondary sprouting.

      We also agree that it is important to look at the PCV in the begining of secondary sprouting and will clarify whether the sprouting is initiated by an increased number of cells.

      **Minor issues:** Page 5, Mat & Meth, please spell out PTU at its first mention.

      This has been corrected accordingly (see page 4).

      Page 6 Mat & Meth, Secondary sprout and 3-way connection parameters: The number of nuclei was assessed in each secondary sprouts (del s, singular) just prior...

      This has been corrected accordingly (see page 5).

      Page 16, 8th line from bottom: Recent work demonstrated that a secondary sprout either contributes (add s) to remodelling a pre-existing ISV into a vein, or forms (add s)a PLs (Geudens et al., 2019).

      This has been corrected accordingly (see page 16).

      Page 25, Legend to Fig. 2D-G: "...G,G' shows quantification of dTyr signal upon vash-1 KD..." Fig2 G,G' show immunostaining rather than quantification of the dTyr signal, which is shown Fig. 2H-J

      This has been corrected accordingly (see page 26).

      Fig. 1D / Fig. 2H-J please increase weight of the error intervals and / or change colour for improved visibility

      This has been corrected accordingly (Fig. 1D and 2H-J), and we added n.s. to Fig. 1D.

      Reviewer #1 (Significance (Required)):

      Taken together the manuscript is comprehensively written and the study provides a conclusive analysis of the MO-mediated KD of Vasohibin in zebrafish embryonic development presenting significant novel findings. Known was a generally inhibitory function of Vasohibin on vessel formation and its enzymatic activity as a carboxylpeptidase responsible for tubulin detyrosination, affecting spindle function and mitosis. New is the detailed analysis of the Vasohibin KD on zebrafish trunk vessel formation and the description of a selective impairment of 2nd sprout formation. The manuscript is of interest for vascular biologists.

      REFEREES CROSS-COMMENTING

      I fully concur with the comments of reviewer #2, all three reviews find that this study is of significant interest to the vascular biology community as the relevance of tubulin detyrosination for developmental angiogenesis has not been investigated. Also all three reviews highlight the potential limitations of the use of splice morpholinos (suggested alternatives include ATG morpholinos and CRIPR mutants), the requirement to provide further evidence for a endothelial cell autonomous defect and the need to clarify some of the data representation.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      The manuscript by Bastos de Oliveira et al. describes an important investigation of the endothelial tubulin detyrosination during vascular development. Namely, they found detyronised microtubules in secondary sprouts, which is absent in MO-vash-1 treated embryos. The authors use the vash-1 morpholino approach to uncover the developmental consequences of suppressed detyrosination in angiogenesis and lymphangiogenesis in vivo in zebrafish. By a combination of transgenic lines, immunohistochemistry and time-lapse imaging, Bastos de Oliveira et al., have found that Vash-1 is a negative regulator of secondary sprouting in zebrafish. The authors showed that in the absence of Vash-1 more cells are present in the secondary sprouts due to increased cell proliferation; however lymphatic vascular network fails to form. The current manuscript requires additional experimental evidence to support the conclusions. Please see below the major technical concerns and minor comments.

      **Major comments:**

      -This study is based on analysis of the phenotypes observed in embryos injected with vash-1 morpholino. The authors use two different types of splice morpholinos, perform rescue experiments with RNA, and validate one MO-vash-1 with western blot. Morpholinos are not trivial to work with, and the results are variable hence additional controls need to be included, as following the recommendation put together by the zebrafish community (Stainier et, al., Plos Genetics, 2017). As the severity of the phenotypes comparing MO1 with MO2 is different and MO-vash-1 embryos appear developmentally delayed (Figure 2D-F and 5E-F overall size seem to be affected), additional MO is required, for example, ATG-MO or generation of CRISPR mutant would be favourable. All the morpholino used need to be validated using an antibody, RT-PCR and qPCR. It is essential to carry out the rescue experiments for all the MO used in this study and following the guidelines. Including the dose-response curve, data would be informative.

      We agree with the reviewer and the recommendations of the zebrafish community. We will investigate the phenotypes with another KD strategy, such as the ATG-Morpholino suggested by the reviewer. We will also supply more validation of the MO2 including RNA rescue and westernblot (already included in Fig. 5 I).

      We added dose-response curves (Supp. Figure 1 E,G) and a developmental morphology assessment for the morpholino 1 (Supp. Figure 1 A,B).

      Given our extensive analysis of the effects of vash-1 KD, we believe the embryos in 2F are not developmentally delayed. However, the image in figure 2F does give that impression, and therefore may have triggered the reviewer’s concerns. We double checked and found that due to an oversight, we included a picture from a slightly different region of the trunk in comparision to Fig. 2D. We will add pictures of the same trunk region (Fig.2D-F) as we have done in all other figures. We nonetheless supply a supplementary figure 1 showing and quantifying the development of the analysed vash-1 morphants.

      -In addition to EC, the levels of dTyr are lower in MO-vash-1 in neural tube and neurons spanning the trunk (Figrue 2 D-G'). These have been previously shown to be important for secondary sprouting. Is it possible that the observed phenotypes in the secondary sprouting are due to defects in these neurons?

      We agree with the reviewer that a potential contribution of altered neuronal differentiation to the vascular phenotype should be clarified. We will assess the morphology of the neurons and their dendrites relevant for pathfinding (Lim et al., 2011) in vash-1 KD embryos, using a pan-neuronal zebrafish line, as well as via immunostaining against alpha-tubulin. Should we find evidence for changes in neuronal cells, we will attempt to clarify a cell autonomous role of vash-1 by transplantation experiments.

      -Embryo number used in this study appears to be low especially in figure 3G, 5D, 5G, to conclude draw conclusions from these experiments, the number of embryos used should be higher than 20. Figure 4J please specify how many embryos were used.

      We will increase the number of embryos per condition to a minimum of 20 embryos and update the averages in the text for 3G (control: 7 and vash-1 KD: 11 embryos).

      In 5D and 5G each point is an embryo and more than 20 embryos per condition were used (in 5D 23-35 embryos per condition, in 5G 60-63 embryos/condition), we corrected the legend 5D and 5G (see page 27) and made it clear that each point in the graph corresponds to one embryo (5D- percentage of PLs associated with veins in each embryo; 5G- percentage of somites with toraxic duct in each embryo).

      In 4J, 18 embryos were used for control (about 3 sprouts/embryo– 52 sprouts quantified) and 7 embryos for vash-1* KD condition (about 3 sprouts/embryo – 24 sprouts quantified). We corrected the number of control sprouts in the legend and added the number of embryos to increase clarity (see page 27).

      -The authors hypothesise that VASH acts in the sprouting endothelial cells, based on the Q-PCR in Figure 1. However, in this experiment all EC have been sorted thus this remains ambiguous in which cell types vash-1 is expressed. Please provide the expression pattern for vash-1 across the developmental stages the phenotypes are observed.

      We agree with the reviewer that it would be beneficial to understand the expression pattern of vash-1 in wild type embryos. We plan to perform in situ hybridization for vash-1 mRNA.

      -Throughout the manuscript the authors refer the lymphatic identity, however, there is no evidence in the paper that the identity status has been assessed. To support these claims Prox1 immunohistochemistry or analysis of prox1 expression in the reporter line would be appropriate.

      We agree with the reviewer and plan to perform a Prox1 immunostaining (Koltowska et al., 2015) in vash-1 KD embryos at 34-36 hpf (secondary sprouting) to investigate Prox1 levels upon vash-1 KD.

      **Minor comments:**

      -The authors refer to the literature where overexpression of VASH suppresses the angiogenesis. As the RNA injections were used in rescue experiments, the data of vash-1 RNA injections into the wild-type embryos would be beneficial.

      We have injected vash-1 RNA into a control morpholino injected embryos (28 control embryos, 14 Vash-1 RNA injected embryos) and we observed a significant decrease in PLs at 52 hpf (average of -control: 87,5% somites with PLs to 67% somites with PLs in vash-1 RNA embryos). This could be due to a decrease of secondary sprouting, which would be in accordance with the current literature that vash-1 overexpression is anti-angiogenic. We will further investigate and add the results to figure 5. Figure 1. vash-1* mRNA injection leads to a decrease in somites with PLs (preliminary).

      -In figures 2J, 3J, 3K, 3N, 4J, 5C, 5D and 5G the N number was set for examples as the number of sprouts, the number of somites with TD, number of ISV. To strengthen the observation in the manuscript quantification of the sprouts, PL, vISVs and lymphatic phenotypes with N set as the number of embryos would be more informative. Indicating the number of embryos used, in the graphs, would be helpful.

      We agree with the reviewer and have added embryo numbers in all legends and graphs. In 2J, 3J, 3K, 4J each point is a sprout, a cell division or an ISV, corresponding to the N. We agree that the number of embryos could be more clearly stated, so we added the number of embryos analysed in the figure legend and will add them in the graphs.

      In 5C, 5D and 5G each point corresponds to an embryo (clarified in the legend of Fig. 5- see page 27).

      Fig. 5C refers to the percentage of somites with PLs in each embryo, 5D refers to percentage of the existing PLs in one embryo connected to a venous ISV, 5G corresponds to percentage of somites with a TD segment in each embryo.

      -In Figure 5A, B and D the authors quantify what they refer to as a lumenised connection between the vISVs and PL. In the control image (second star), a somewhat lumenised structure is present, clarification of how the scores were set is missing.

      In Fig. 5C we show a quantification of the percentage of somites with PLs per embryo, by counting the PLs identified with an asterisk in Fig. 5A-B. PLs are normally not lumenised, with few exceptions also ocurring in wild-type – see Fig. 4 in (S Isogai et al., 2001).

      In Fig. 5D we quantified the proportion of PLs associated/connected with venous ISvs (see Methods section page 6), by 52 hpf in control and vash-1 morphants.

      In 5B and 5F,F‘, the arrowheads identify lumenised PLs present in vash-1 KD embryos. We will add a quantification of kdr-l:ras-Cherry positive ISV-to-ISV connections, corresponding to the lumenised endothelial connections, since kdr-l:ras-Cherry signal labels endothelial (and not lymphatic) cells and is particularly strong at the luminal endothelial membrane of the vessel.

      -In Figure 3 E and F the authors show the excessive sprouting phenotype between controls and Mo-vash-1. The images presented are taking from different parts of the embryos (middle of the trunk vs plexus region), hampering the comparison between the two groups. The quantification of the phenotypes in both experimental groups should be in the same region of the embryo, as the local difference can occur. It is key to provide representative images to support these observations.

      The images presented are representative of the phenotype quantified, and the time-lapses were done in comparable regions of the zebrafish trunk (+- 1-2 somites in both groups due to drift during image aquisition), making the comparison possible.

      -Figure 1D the vash-1 expression levels in EC seem very variable in this graph, therefore no conclusion can be drawn from this data, especially as the authors do not provide the p-values.

      We added n.s. in the graph, to make it clear that the difference between developmental stages is not significant, potentially due to high biological variability between embryos, as seen in two primer pairs. We believe that presenting this biological variability is of importance to the readers.

      We write on page 12 about this result: „During the sprouting phase (24hpf), vash-1 expression was 5-7 times higher in endothelial than in non-ECs, decreasing at 48 hpf (Fig. 1C-D). Although these results are not significant, they were independently confirmed with a second primer set.”. The only conclusion we made from this data is that Vash-1 is dynamically expressed in the zebrafish endothelium during development, as we now added in the discussion (page 14).

      -In the introduction, the authors state: 'Although primary and secondary sprouts appear morphologically similar, with tip and stalk cells' - Please provide the reference that supports the claim that secondary sprouts have tip-stalk cells morphology/organisation.

      Although many studies have investigated primary and secondary sprouting, identifying both shared as well as distinct molecular regulation, and show morphological details that are apparently similar, a formal claim that secondary sprouts show tip and stalk cell identities and behaviour is hard to find. Given that this is not relevant for the central findings of the work, we modified the sentence and added a reference “Although primary and secondary sprouts appear morphologically similar, with tip and stalk cells” (Sumio Isogai et al., 2003)…” See page 2.

      We also updated the discussion for consistency: “Although the cellular mechanisms of primary and secondary sprouting in zebrafish appear very similar, with tip cell selection and guided migration and stalk cell proliferation, secondary sprouting utilises alternative signalling pathways and entails a unique specification step that establishes both venous ISVs and lymphatic structures.” (see page 15)

      -The authors refer the increased cell division phenotypes observed in the movies, however, the movie files have not been available to the reviewers.

      We will provide the movies.

      Reviewer #2 (Significance (Required)):

      This is an important study as uncovering the mechanistic details of angiogenic and lymphangiogenic negative regulators is of high value with the potential for therapeutic developments. To date, Vash-1 has been only studied in the context of tumour angiogenesis, vasculature in diabetic nephropathy and pulmonary arterial hypertension, and it remains unclear what is its role during development and how does it regulate vascular network formation. The tyrosination status of microtubule in endothelial cells is understudied. This study revealed, previously uncharacterised detyrosinated microtubules in endothelial cells in vivo. And further dissects how this process might be regulated, brings unique insights into the vascular biology field and beyond. Thus, delving into the cell biological mechanism such as microtubule dynamics and modification in vivo in embryo context is a significant step forward in setting new standards in the field.

      I am developmental biologist who has experience in model organisms such as zebrafish and mouse. The main focus of my work is on developmental angiogenesis and lymphangiogenesis.

      REFEREES CROSS-COMMENTING

      After reading the other reviews comments, it seems that we all agree that this study is of high value to vascular biology field and beyond bringing novel findings.

      Importantly the reviewers' comments are in line with each other and have identified several commonalities that should be addressed. Such as: Further validation of Morpholinos, or using alternative methods to replicate the findings. additional evidence that the observed phenotypes are primary due to vash-1 requirement within EC, and not due to the secondary effect in other cells such as CXCR4/SDF1 system and SVEP1, neurons or general delay of the embryos Further evidence of for VASH expression pattern the number of embryos used in the experiments, and how the data is represented.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Vasohibin-1 (Vash-1) is known to detyrosinate microtubules (MTs) and limit angiogenesis. Using in vivo live imaging and whole mount immunofluorescence staining of zebrafish trunk vasculature, Bastos de Oliveira et al. show that the MT detyrosination role of Vash-1 is conserved in zebrafish and that Vash-1 is essential for limiting venous sprouting and subsequent formation of lymphatics. Their findings suggest a role for MT detyrosination in lympho-venous cell specification.

      **Major comments:**

      1 . The authors claim that Vash-1 regulates secondary sprouting and lymphangiogenesis by detyrosinating MTs. However, no direct evidence of this link is provided in the manuscript. The authors only separately show that knockdown of vash-1 affects MT detyrosination and secondary sprouting and lymphangiogenesis. They have not shown a causative effect. The authors should therefore qualify the above stated claim as speculative. In other words, the authors should mention that their data only suggests that disruption of MT detyrosination is the underlying cause for aberrant secondary sprouting and lymphangiogenesis in vash-1 KD embryos.

      We agree with the reviewer about the lack of evidence to state that the disruption of microtubule detyrosination leads to aberrant secondary sprouting. Although we believe this is the most parsimonius explanation for the secondary sprouts behavioural defects as cell division is disturbed and microtubule detyrosination is implicated in cell division (Barisic et al., 2015), we want to make clear that our data currently only suggest a specific role of microtubule detyrosination in secondary sprouting. Examples of this are page 14 of the discussion „These results suggest that Vash-1-driven microtubule detyrosination limits excessive venous EC sprouting and proliferation during lympho-venous development in zebrafish.” as well as the abstract.

      We also corrected the sentence in the discussion (page 14): “In this study, we identified Vash-1-mediated microtubule detyrosination as a cellular mechanism as a novel regulator of EC sprouting from the PCV and the subsequent formation of lymphatic vessels in the zebrafish trunk.”

      To avoid any overstatement, we also propose the following title change: Vasohibin-1 mediated tubulin detyrosination selectively regulates secondary sprouting and lymphangiogenesis in the zebrafish trunk.

      As detailed in response to comment 2 below, we will however attempt to investigate the direct connection. Depending on the outcome, we will adapt conclusions and title accordingly.

      2 . In order to provide more compelling evidence for a direct relationship between MT tyrosination and lymphangiogenesis, the authors could try mutating the carboxypeptidase domain of vash-1 or overexpressing a dominant negative transcript (that contains a mutated carboxypeptidase domain). If this gives the same phenotypes as the vash-1 morphants, it would indicate that the carboxypeptidase activity of Vash-1 (in detyrosinating MTs) is responsible for limiting secondary sprouting and promoting specification of lymphatics. This suggested experiment is fairly realistic in terms of both time and resources. For example, since the authors already have the human vash-1 cDNA cloned, making a dominant negative transcript from this would take around two weeks, imaging and analysis of embryos injected with this mRNA would take another four weeks. Therefore, in total, the suggested experiment would take around 6 weeks. Although the alternative experiment, that is, making a carboxypeptidase domain mutant of vash-1 would be a better choice in terms of reproducibility and long-term use of a stable line, it would admittedly take a relatively larger amount of time. Therefore, the ultimate choice would depend on the authors.

      We will investigate this further by cloning and expressing a mutated vash-1 cDNA which translates a validated catalytically dead Vash-1 (Nieuwenhuis et al., 2017). However, this mutant has not been shown to function as dominant negative, so it is unclear whether it can be used as a dominant negative mutant.

      3 . Both the data and methods are presented in a way that ensures reproducibility. The statistical analysis is very well done, in that the authors were very prudent in their choice of statistical tests. However, in many figures and subfigures (Fig. 2B, H-J; Fig. 3G, J, K, N; Fig. 4J; Fig. 5J), the number of replicates was not mentioned and instead only the sample size was stated. Whether this was just an oversight or if it should be taken to mean that the analysis was performed on just one replicate is unclear. The authors need to clarify this aspect of their analysis. Further, In Fig. 2H-J, Fig. 3G,J, K, N and Fig. 4J, the total number of data points in control MO vs vash-1 KD seem to be quite different. In other words, there seem to be a lot more data points in one experimental condition than the other. Does this difference fall within the acceptable range? If the authors were to compare a similar number of data points between the two experimental conditions, would the results of the statistical analysis still be the same?

      We apreciate this comment and clarified the replicate numbers in the figure legends: Fig. 2B- 3 replicates (page 25), Fig. 2 H-J- quantification is 1 replicate (page 26), Fig. 2 D-G is representative of 3 replicates (page 25). Fig. 3 G,J,K,N – quantification is from 1 replicate (page 26), Fig. 3 B,C,E,F,H,I are representative of 2 experimental replicates (page 26). Fig. 4J – quantification is 1 replicate (page 27), Fig. 4 A-F is representative of 3 replicates (page 27). Fig. 5 J correspondes to 1 replicate (page 28).

      We plan to increase replicates and numbers in quantifications shown in Fig. 3 G,J,K,N and Fig. 5 J as they are relevant for the conclusions of the manuscript, and adapt the text.

      The quantifications of immunostaining signals are comparable between different samples of the same experiment but technically not easy accross different experiments, due to some variability of the immunostaining. However, the pattern we report in the quantifications and representative pictures is consistentely detected (reduced dTyr signal upon vash-1 KD in Fig 2 D-G; higher dTyr intensity in secondary rather than primary sprouts in Fig. 4 A-F). We added in the legend that the pictures of the embryos in these figures are representative of 3 biological replicates (see page 25 and 27).

      We recognise the unequal sample size in control and vash-1 KD groups in Fig. 2H-J, Fig. 3G,J, K, N and Fig. 4J. Generally, the vash-1 KD group shows more variance than the control group (see Fig. 3 J-N, 4J for example), hence the reason why we analysed a higher sample size.

      In the planned experiments (repeating quantifications of Fig. 3 J-N), we will analyse a similar number of embryos.

      We corrected the figure legend of 2 H-J on the number of ISVs - 108 ISVs from 7 embryos for control and 150 ISVs for vash-1 KD, from 9 embryos (see page 26).

      4 . The authors only provide KD data on the function of vash-1 using morpholinos. According to several recent guidelines concerning the use of morpholinos, this is not widely accepted in the zebrafish community as sufficient to provide robust insight into gene function. Please refer for example to the following publication: Guidelines for morpholino use in zebrafish, Stainier et al., PLOS Genetics, 2017. The generation of a vash-1 mutant is a necessary requirement for backing up morpholino KD data. Further, even though the authors state that embryos were selected on the pre-established criteria that they have normal morphology, beating heart, and flowing blood, certain morphological differences between control MO injected and vash-1 KD embryos could be observed in some figures. In Fig. 2D, F and Fig. 5A, B, E, F the vash-1 KD embryos seem smaller (extend of the dorso-ventral axis) than control MO injected embryos. The authors need to provide images showing the overall morphology of morpholino injected embryos and need to provide evidence that morpholino injections do not cause developmental delays.

      We agree that a mutant would be desirable to validate the phenotypic analysis of the morpholinos used, and would also allow for further analysis. However, this is not achievable within a reasonnable time frame, especially in the context of current work restrictions. We have added a sentence about the need to confirm the loss of function phenotype with vash-1 mutants in the discussion (see page 14).

      In addtion to the two morpholinos currently used to knockdown vash-1 expression, we will use an ATG morpholino to further investigate our observations and hypothesis regarding the role of vash-1 in lymphatic vessels formation. We will also validate it by westernblot and attempt to rescue it with mRNA.

      We added a supplementary figure with pictures and quantifications of antero-posterior (Sup. Figure 1 C) and dorso-ventral length (Sup. Figure 1 D) of the analysed control and vash-1 morpholino injected embryos‘ development at 24, 34, 52 and 4dpf which shows no significant developmental delay and morphological defect. There is some occurrence of curvature of the tail at 34-52 hpf.

      We added a sentence in the Methods section (pages 10) to clarify the morphant’s morphology and dosage-response curves.

      We observe a 1-2 hour developmental delay of both the control and the vash-1 KD embryos compared to uninjected wild-type embryos, which led us to chose the 52 hpf time point to investigate the PLs. In uninjected embryos they are usually developed by 48hpf (Hogan et al., 2009).

      Fig. 2 D shows a more anterior region of the zebrafish trunk than Fig. 2F (the tail has a smaller dorso-ventral length)- we will provide more comparable pictures from the same region.

      Fig. 5B is slightly tilted – we will provide a picture with the same orientation.

      Fig. 5 E and F have a similar length from dorsal aorta to the dorsal longitudinal anastomotic vessel. However, we appreciate a difference in the sub intestinal vascular plexus (SIVP), which is consistently underdeveloped in the vash-1 KD embryos.

      Figure 2- vash-1 deficient embryos show underdeveloped intestinal vascular system at 4 dpf.

      **Minor comments:**

      a. The authors should back their qPCR data for vash-1 expression (Figure 1) by standard mRNA in situ hybridization, given the large degree of variability in vash-1 expression. Do they observe a dynamic expression in the vasculature using this technique?

      We agree with the reviewer that an in situ hybridization would be beneficial to understand the expression pattern of vash-1 in wild type embryos. Accordingly, we will look at vash-1 expression by in situ hybridization in WT embryos.

      The number of nuclei per sprout in Fig. 3J does not correspond with the number of divisions per sprout presented in Fig. 3K. The authors observe one or two cell divisions per sprout in ctr MO injected embryos (Fig. 3K), however, Fig. 3J shows that the majority of ctr. sprouts contains only one cell. This is even more dramatic for vash-1 MO injected embryos, which can have up to four divisions, therefore should contain six cells. However, the maximum number of cells the authors report is three to four cells. How do these observations go together?

      We believe these quantifications are not contradicting. The number of endothelial nuclei was assessed just prior to the connection to the ISV and the cell division quantification was done in a time-lapse from the time of secondary sprout emergence until the resolution of the 3-way connection. It is expected that there are more cell divisions during a longer time frame, as cells migrate dorsally or ventrally out of the sprout.

      Fig. 5I and J have the same data points for control MO and vash-1 MO1. Does this mean that both graphs are from the same experiment? If so, the authors could combine the two graphs into one. If the two graphs are not from the same experiment, both would need to have independent controls.

      Fig 5 I and J are indeed from the same experiment. They are now combined into one graph (see Fig. 5 J).

      d. The percentage of somites with PLs in vash-1 MO1 injected embryos in Fig. 5I is half the value shown in Fig. 5C. Although this kind of variability might be expected in biological samples, perhaps the authors could briefly discuss the issue and its implications on reproducibility in the manuscript so as to have the readers be aware of it, especially since the rescue of the vash-1 morpholino phenotype back to 50% from 25% is the same value the authors observed in the vash-1 KD alone in Fig. 5C. Here the value is 50% for the morpholino injection.

      We added a sentence discussing the phenotypic variability in the discussion (see page 16), and we added a dosage response curve for the PLs (Sup. Figure 1 F), showing that embryos injected with the same amount of morpholino show variability in the percentage of somites with PLs at 52hpf. We added a more representative picture of PLs for vash-1 morphant in Fig. 5I ( Y-axis of Fig. 2H and 4J correspond to ratios, which have no units. Nontheless, we added AU/AU to these graphs to make it clearer. We added the bars in Fig. 5D.

      It would help to have an inference or conclusion at the end of each results section.

      We added one conclusion sentence per results section (see pages 11-14).

      Reviewer #3 (Significance (Required)):

      Conceptual: As per my knowledge, this is the first study that looks at microtubule modifications in the context of a vertebrate organism past the gastrulation stage, as opposed to similar studies that have been done in cell culture or invertebrates (S. cerevisiae, C. elegans and D. melanogaster). Moreover, this study is one of few that address a novel link between the cytoskeleton and the process of cell fate specification.

      Previous studies have separately shown that Vash-1 limits angiogenesis and detyrosinates MTs. The current study combines the two observations in the context of endothelial cells, and hypothesizes that perhaps the function of Vash-1 in limiting angiogenesis and at the same time promoting lymphatic development could be due to its role in MT modification at the molecular level and the consequent effect of this on cell division and/or fate specification at the cellular level. In short, this study aims to connect the long-standing gap in knowledge between cytoskeletal modifications and cell dynamics (in particular, division and specification) in a vertebrate organism. I therefore believe that the current study would be an exciting finding for research communities that study cytoskeletal influence on cellular dynamics and also those in the broad area of vascular biology.

      My field of expertise relates to vascular biology, specifically developmental angiogenesis and the behavior of endothelial cells in zebrafish.

      References

      Barisic, M., Silva E Sousa, R., Tripathy, S. K., Magiera, M. M., Zaytsev, A. V., Pereira, A. L., Janke, C., Grishchuk, E. L., & Maiato, H. (2015). Microtubule detyrosination guides chromosomes during mitosis. Science, 348(6236), 799–803. https://doi.org/10.1126/science.aaa5175

      Hogan, B. M., Bos, F. L., Bussmann, J., Witte, M., Chi, N. C., Duckers, H. J., & Schulte-Merker, S. (2009). Ccbe1 is required for embryonic lymphangiogenesis and venous sprouting. Nature Genetics, 41(4), 396–398. https://doi.org/10.1038/ng.321

      Isogai, S, Horiguchi, M., & Weinstein, B. M. (2001). The vascular anatomy of the developing zebrafish: an atlas of embryonic and early larval development. Developmental Biology, 230(2), 278–301. https://doi.org/10.1006/dbio.2000.9995

      Isogai, Sumio, Lawson, N. D., Torrealday, S., Horiguchi, M., & Weinstein, B. M. (2003). Angiogenic network formation in the developing vertebrate trunk. Development, 130(21), 5281–5290. https://doi.org/10.1242/dev.00733

      Kimura, H., Miyashita, H., Suzuki, Y., Kobayashi, M., Watanabe, K., Sonoda, H., Ohta, H., Fujiwara, T., Shimosegawa, T., & Sato, Y. (2009). Distinctive localization and opposed roles of vasohibin-1 and vasohibin-2 in the regulation of angiogenesis. Blood, 113(19), 4810–4818. https://doi.org/10.1182/blood-2008-07-170316

      Koltowska, K., Lagendijk, A. K., Pichol-Thievend, C., Fischer, J. C., Francois, M., Ober, E. A., Yap, A. S., & Hogan, B. M. (2015). Vegfc Regulates Bipotential Precursor Division and Prox1 Expression to Promote Lymphatic Identity in Zebrafish. Cell Reports, 13(9), 1828–1841. https://doi.org/10.1016/j.celrep.2015.10.055

      Liao, S., Rajendraprasad, G., Wang, N., Eibes, S., Gao, J., Yu, H., Wu, G., Tu, X., Huang, H., Barisic, M., & Xu, C. (2019). Molecular basis of vasohibins-mediated detyrosination and its impact on spindle function and mitosis. Cell Research, June. https://doi.org/10.1038/s41422-019-0187-y

      Lim, A. H., Suli, A., Yaniv, K., Weinstein, B., Li, D. Y., & Chien, C. Bin. (2011). Motoneurons are essential for vascular pathfinding. Development, 138(21), 4813. https://doi.org/10.1242/dev.075044

      Nicenboim, J., Malkinson, G., Lupo, T., Asaf, L., Sela, Y., Mayseless, O., Gibbs-Bar, L., Senderovich, N., Hashimshony, T., Shin, M., Jerafi-Vider, A., Avraham-Davidi, I., Krupalnik, V., Hofi, R., Almog, G., Astin, J. W., Golani, O., Ben-Dor, S., Crosier, P. S., … Yaniv, K. (2015). Lymphatic vessels arise from specialized angioblasts within a venous niche. Nature, 522(7554), 56–61. https://doi.org/10.1038/nature14425

      Nieuwenhuis, J., Adamopoulos, A., Bleijerveld, O. B., Mazouzi, A., Stickel, E., Celie, P., Altelaar, M., Knipscheer, P., Perrakis, A., Blomen, V. A., & Brummelkamp, T. R. (2017). Vasohibins encode tubulin detyrosinating activity. Science, 358(6369), 1453–1456. https://doi.org/10.1126/science.aao5676

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Vasohibin-1 (Vash-1) is known to detyrosinate microtubules (MTs) and limit angiogenesis. Using in vivo live imaging and whole mount immunofluorescence staining of zebrafish trunk vasculature, Bastos de Oliveira et al. show that the MT detyrosination role of Vash-1 is conserved in zebrafish and that Vash-1 is essential for limiting venous sprouting and subsequent formation of lymphatics. Their findings suggest a role for MT detyrosination in lympho-venous cell specification.

      Major comments:

      1 . The authors claim that Vash-1 regulates secondary sprouting and lymphangiogenesis by detyrosinating MTs. However, no direct evidence of this link is provided in the manuscript. The authors only separately show that knockdown of vash-1 affects MT detyrosination and secondary sprouting and lymphangiogenesis. They have not shown a causative effect. The authors should therefore qualify the above stated claim as speculative. In other words, the authors should mention that their data only suggests that disruption of MT detyrosination is the underlying cause for aberrant secondary sprouting and lymphangiogenesis in vash-1 KD embryos.

      2 . In order to provide more compelling evidence for a direct relationship between MT tyrosination and lymphangiogenesis, the authors could try mutating the carboxypeptidase domain of vash-1 or overexpressing a dominant negative transcript (that contains a mutated carboxypeptidase domain). If this gives the same phenotypes as the vash-1 morphants, it would indicate that the carboxypeptidase activity of Vash-1 (in detyrosinating MTs) is responsible for limiting secondary sprouting and promoting specification of lymphatics. This suggested experiment is fairly realistic in terms of both time and resources. For example, since the authors already have the human vash-1 cDNA cloned, making a dominant negative transcript from this would take around two weeks, imaging and analysis of embryos injected with this mRNA would take another four weeks. Therefore, in total, the suggested experiment would take around 6 weeks. Although the alternative experiment, that is, making a carboxypeptidase domain mutant of vash-1 would be a better choice in terms of reproducibility and long-term use of a stable line, it would admittedly take a relatively larger amount of time. Therefore, the ultimate choice would depend on the authors.

      3 . Both the data and methods are presented in a way that ensures reproducibility. The statistical analysis is very well done, in that the authors were very prudent in their choice of statistical tests. However, in many figures and subfigures (Fig. 2B, H-J; Fig. 3G, J, K, N; Fig. 4J; Fig. 5J), the number of replicates was not mentioned and instead only the sample size was stated. Whether this was just an oversight or if it should be taken to mean that the analysis was performed on just one replicate is unclear. The authors need to clarify this aspect of their analysis. Further, In Fig. 2H-J, Fig. 3G,J, K, N and Fig. 4J, the total number of data points in control MO vs vash-1 KD seem to be quite different. In other words, there seem to be a lot more data points in one experimental condition than the other. Does this difference fall within the acceptable range? If the authors were to compare a similar number of data points between the two experimental conditions, would the results of the statistical analysis still be the same?

      4 . The authors only provide KD data on the function of vash-1 using morpholinos. According to several recent guidelines concerning the use of morpholinos, this is not widely accepted in the zebrafish community as sufficient to provide robust insight into gene function. Please refer for example to the following publication: Guidelines for morpholino use in zebrafish, Stainier et al., PLOS Genetics, 2017. The generation of a vash-1 mutant is a necessary requirement for backing up morpholino KD data. Further, even though the authors state that embryos were selected on the pre-established criteria that they have normal morphology, beating heart, and flowing blood, certain morphological differences between control MO injected and vash-1 KD embryos could be observed in some figures. In Fig. 2D, F and Fig. 5A, B, E, F the vash-1 KD embryos seem smaller (extend of the dorso-ventral axis) than control MO injected embryos. The authors need to provide images showing the overall morphology of morpholino injected embryos and need to provide evidence that morpholino injections do not cause developmental delays.

      Minor comments:

      a. The authors should back their qPCR data for vash-1 expression (Figure 1) by standard mRNA in situ hybridization, given the large degree of variability in vash-1 expression. Do they observe a dynamic expression in the vasculature using this technique?

      b. The number of nuclei per sprout in Fig. 3J does not correspond with the number of divisions per sprout presented in Fig. 3K. The authors observe one or two cell divisions per sprout in ctr MO injected embryos (Fig. 3K), however, Fig. 3J shows that the majority of ctr. sprouts contains only one cell. This is even more dramatic for vash-1 MO injected embryos, which can have up to four divisions, therefore should contain six cells. However, the maximum number of cells the authors report is three to four cells. How do these observations go together?

      c. Fig. 5I and J have the same data points for control MO and vash-1 MO1. Does this mean that both graphs are from the same experiment? If so, the authors could combine the two graphs into one. If the two graphs are not from the same experiment, both would need to have independent controls.

      d. The percentage of somites with PLs in vash-1 MO1 injected embryos in Fig. 5I is half the value shown in Fig. 5C. Although this kind of variability might be expected in biological samples, perhaps the authors could briefly discuss the issue and its implications on reproducibility in the manuscript so as to have the readers be aware of it, especially since the rescue of the vash-1 morpholino phenotype back to 50% from 25% is the same value the authors observed in the vash-1 KD alone in Fig. 5C. Here the value is 50% for the morpholino injection.

      e. The Y-axis label is missing in Fig. 2H and Fig. 4J. Figure 5D lacks bars showing median and standard deviation.

      f. It would help to have an inference or conclusion at the end of each results section.

      Significance

      Conceptual: As per my knowledge, this is the first study that looks at microtubule modifications in the context of a vertebrate organism past the gastrulation stage, as opposed to similar studies that have been done in cell culture or invertebrates (S. cerevisiae, C. elegans and D. melanogaster). Moreover, this study is one of few that address a novel link between the cytoskeleton and the process of cell fate specification.

      Previous studies have separately shown that Vash-1 limits angiogenesis and detyrosinates MTs. The current study combines the two observations in the context of endothelial cells, and hypothesizes that perhaps the function of Vash-1 in limiting angiogenesis and at the same time promoting lymphatic development could be due to its role in MT modification at the molecular level and the consequent effect of this on cell division and/or fate specification at the cellular level. In short, this study aims to connect the long-standing gap in knowledge between cytoskeletal modifications and cell dynamics (in particular, division and specification) in a vertebrate organism. I therefore believe that the current study would be an exciting finding for research communities that study cytoskeletal influence on cellular dynamics and also those in the broad area of vascular biology.

      My field of expertise relates to vascular biology, specifically developmental angiogenesis and the behavior of endothelial cells in zebrafish.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The manuscript by Bastos de Oliveira et al. describes an important investigation of the endothelial tubulin detyrosination during vascular development. Namely, they found detyronised microtubules in secondary sprouts, which is absent in MO-vash-1 treated embryos. The authors use the vash-1 morpholino approach to uncover the developmental consequences of suppressed detyrosination in angiogenesis and lymphangiogenesis in vivo in zebrafish. By a combination of transgenic lines, immunohistochemistry and time-lapse imaging, Bastos de Oliveira et al., have found that Vash-1 is a negative regulator of secondary sprouting in zebrafish. The authors showed that in the absence of Vash-1 more cells are present in the secondary sprouts due to increased cell proliferation; however lymphatic vascular network fails to form. The current manuscript requires additional experimental evidence to support the conclusions. Please see below the major technical concerns and minor comments.

      Major comments:

      -This study is based on analysis of the phenotypes observed in embryos injected with vash-1 morpholino. The authors use two different types of splice morpholinos, perform rescue experiments with RNA, and validate one MO-vash-1 with western blot. Morpholinos are not trivial to work with, and the results are variable hence additional controls need to be included, as following the recommendation put together by the zebrafish community (Stainier et, al., Plos Genetics, 2017). As the severity of the phenotypes comparing MO1 with MO2 is different and MO-vash-1 embryos appear developmentally delayed (Figure 2D-F and 5E-F overall size seem to be affected), additional MO is required, for example, ATG-MO or generation of CRISPR mutant would be favourable. All the morpholino used need to be validated using an antibody, RT-PCR and qPCR. It is essential to carry out the rescue experiments for all the MO used in this study and following the guidelines. Including the dose-response curve, data would be informative.

      -In addition to EC, the levels of dTyr are lower in MO-vash-1 in neural tube and neurons spanning the trunk (Figrue 2 D-G'). These have been previously shown to be important for secondary sprouting. Is it possible that the observed phenotypes in the secondary sprouting are due to defects in these neurons?

      -Embryo number used in this study appears to be low especially in figure 3G, 5D, 5G, to conclude draw conclusions from these experiments, the number of embryos used should be higher than 20. Figure 4J please specify how many embryos were used.

      -The authors hypothesise that VASH acts in the sprouting endothelial cells, based on the Q-PCR in Figure 1. However, in this experiment all EC have been sorted thus this remains ambiguous in which cell types vash-1 is expressed. Please provide the expression pattern for vash-1 across the developmental stages the phenotypes are observed.

      -Throughout the manuscript the authors refer the lymphatic identity, however, there is no evidence in the paper that the identity status has been assessed. To support these claims Prox1 immunohistochemistry or analysis of prox1 expression in the reporter line would be appropriate.

      Minor comments:

      -The authors refer to the literature where overexpression of VASH suppresses the angiogenesis. As the RNA injections were used in rescue experiments, the data of vash-1 RNA injections into the wild-type embryos would be beneficial.

      -In figures 2J, 3J, 3K, 3N, 4J, 5C, 5D and 5G the N number was set for examples as the number of sprouts, the number of somites with TD, number of ISV. To strengthen the observation in the manuscript quantification of the sprouts, PL, vISVs and lymphatic phenotypes with N set as the number of embryos would be more informative. Indicating the number of embryos used, in the graphs, would be helpful.

      -In Figure 5A, B and D the authors quantify what they refer to as a lumenised connection between the vISVs and PL. In the control image (second star), a somewhat lumenised structure is present, clarification of how the scores were set is missing.

      -In Figure 3 E and F the authors show the excessive sprouting phenotype between controls and Mo-vash-1. The images presented are taking from different parts of the embryos (middle of the trunk vs plexus region), hampering the comparison between the two groups. The quantification of the phenotypes in both experimental groups should be in the same region of the embryo, as the local difference can occur. It is key to provide representative images to support these observations.

      -Figure 1D the vash-1 expression levels in EC seem very variable in this graph, therefore no conclusion can be drawn from this data, especially as the authors do not provide the p-values.

      -In the introduction, the authors state: 'Although primary and secondary sprouts appear morphologically similar, with tip and stalk cells' - Please provide the reference that supports the claim that secondary sprouts have tip-stalk cells morphology/organisation.

      -The authors refer the increased cell division phenotypes observed in the movies, however, the movie files have not been available to the reviewers.

      Significance

      This is an important study as uncovering the mechanistic details of angiogenic and lymphangiogenic negative regulators is of high value with the potential for therapeutic developments. To date, Vash-1 has been only studied in the context of tumour angiogenesis, vasculature in diabetic nephropathy and pulmonary arterial hypertension, and it remains unclear what is its role during development and how does it regulate vascular network formation. The tyrosination status of microtubule in endothelial cells is understudied. This study revealed, previously uncharacterised detyrosinated microtubules in endothelial cells in vivo. And further dissects how this process might be regulated, brings unique insights into the vascular biology field and beyond. Thus, delving into the cell biological mechanism such as microtubule dynamics and modification in vivo in embryo context is a significant step forward in setting new standards in the field.

      I am developmental biologist who has experience in model organisms such as zebrafish and mouse. The main focus of my work is on developmental angiogenesis and lymphangiogenesis.

      REFEREES CROSS-COMMENTING

      After reading the other reviews comments, it seems that we all agree that this study is of high value to vascular biology field and beyond bringing novel findings.

      Importantly the reviewers' comments are in line with each other and have identified several commonalities that should be addressed. Such as: Further validation of Morpholinos, or using alternative methods to replicate the findings. additional evidence that the observed phenotypes are primary due to vash-1 requirement within EC, and not due to the secondary effect in other cells such as CXCR4/SDF1 system and SVEP1, neurons or general delay of the embryos Further evidence of for VASH expression pattern the number of embryos used in the experiments, and how the data is represented.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript entitled "Vasohibin-1 mediated tubulin detyrosination selectively regulates secondary sprouting and lymphangiogenesis in the zebrafish trunk" by de Oliveira investigates the function of the carboxylpeptidase Vasohibin during the formation of the zebrafish trunk vasculature and reports a requirement of Vasohibin for secondary sprout formation and in particular the formation the lymphatic vasculature.

      Having established the expression of Vasohibin in sorted ECs of 24 hpf embryos, the remaining study addresses the function of Vasohibin in this cell type. It is largely based on the use of a splice-site interfering morpholino. Particular commendable is the analysis, demonstrating that the KD of vash-1 indeed results in a significant reduction of detyrosination in endothelial tubulin. Findings in the vascular system then include: (i) the detection of increased division and hence supernumerous cells occurring selectively in 2nd sprouts from the PCV; (ii) an increased persistence of the initially formed 3 way connections with ISV and artery; (iii) reduced formation of parachordal lymphangioblasts and (iv) a reduced number of somites with a thoracic duct segment; (v) frequent formation of lumenized connections between PLs (where present) and ISV. To demonstrate specificity, the approach was repeated with a different morpholino and defects were partially rescued by MO-insensitive RNA.

      Possible additional and relevant information could include data on a vash-1 promotor mutant to independently verify the MO-based functional analysis. Mutants would also allow analysis of further development, are the defects leading to the demise of the fish or is a later regeneration and normalization of the lymphatic vasculature observed? In addition, are other lymphatic vessel beds like the cranial lymphatics affected? PLs have been demonstrated to be at least partially guided in their movement by the CXCR4/SDF1 system and SVEP1. Has the expression of these factors been tested in vash-1 KDs? With regards to the frequently observed connections of PLs and ISVs in vash-1 morphants, can the proposed lumen formation of these shunts be demonstrated e.g. by injection of Q-dots or microbeads into the circulation? Concerning the mechanisms of these defects, is it possible to analyse the asymmetric cell division leading to 2nd sprouts in greater detail? Is the same number or are more cells sprouting form PCV and can the fli1ep:EGFP-DCX cell line in fixed samples be used to identify the spindle orientation in dividing cells?

      Minor issues: Page 5, Mat & Meth, please spell out PTU at its first mention.

      Page 6 Mat & Meth, Secondary sprout and 3-way connection parameters: The number of nuclei was assessed in each secondary sprouts (del s, singular) just prior...

      Page 16, 8th line from bottom: Recent work demonstrated that a secondary sprout either contributes (add s) to remodelling a pre-existing ISV into a vein, or forms (add s)a PLs (Geudens et al., 2019).

      Page 25, Legend to Fig. 2D-G: "...G,G' shows quantification of dTyr signal upon vash-1 KD..." Fig2 G,G' show immunostaining rather than quantification of the dTyr signal, which is shown Fig. 2H-J

      Fig. 1D / Fig. 2H-J please increase weight of the error intervals and / or change colour for improved visibility

      Significance

      Taken together the manuscript is comprehensively written and the study provides a conclusive analysis of the MO-mediated KD of Vasohibin in zebrafish embryonic development presenting significant novel findings. Known was a generally inhibitory function of Vasohibin on vessel formation and its enzymatic activity as a carboxylpeptidase responsible for tubulin detyrosination, affecting spindle function and mitosis. New is the detailed analysis of the Vasohibin KD on zebrafish trunk vessel formation and the description of a selective impairment of 2nd sprout formation. The manuscript is of interest for vascular biologists.

      REFEREES CROSS-COMMENTING

      I fully concur with the comments of reviewer #2, all three reviews find that this study is of significant interest to the vascular biology community as the relevance of tubulin detyrosination for developmental angiogenesis has not been investigated. Also all three reviews highlight the potential limitations of the use of splice morpholinos (suggested alternatives include ATG morpholinos and CRIPR mutants), the requirement to provide further evidence for a endothelial cell autonomous defect and the need to clarify some of the data representation.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Response to Reviewers

      We are grateful to the Reviewers for their thoughtful and helpful assessment of our work. Below we include a point-by-point response to the Reviewers' critiques concerning the interpretation of our results and the power of our system to elucidate key dynamics of fission yeast homology-directed repair (HDR). We appreciate that the Reviewers judged our assay to be a valuable new tool for studying DSB repair in S. pombe. In general, the Reviewers also felt that our data provides new insights into homology search during HDR in fission yeast, including 1) that multiple DSB-donor encounters often precede repair and 2) that the activity of the helicase Rqh1, which dissolves strand invasion structures, alters the kinetics and efficiency of HDR in S. pombe. The Reviewers also raised several concerns with regards to 1) some technical aspects of the experimental approach, 2) the display of the data, and 3) the interpretation of the data. The Reviewers requested additional experiments to address the efficacy of our 5 minute observational time window and the rate of spontaneous damage in the Rqh1 null background, which we are able to provide in a resubmission. We will also clarify experimental details that the Reviewers found confusing in the original text. Lastly, the Reviewers highlighted minor needed figure adjustments that we will incorporate.

      Point-by-point Response:

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Vines et al adapted a system that has been used in S. cerevisiae to study the homology search and homologous recombination repair events by live cell imaging. The authors utilized a system they set up in a fission yeast strain that has a fluorescently tagged endonuclease induced DSB site and monitored RAD52 focus formation in both haploid and diploid cells. The main findings presented are that multiple strand invasion events occur during DSB repair and the role of Rqh1 in promoting these multiple events. For example, cells with Rqh1 loss either have a single strand invasion event that quickly leads to repair or a very long extensive repair time. Overall the results are intriguing with new insight into DSB repair being presented.*

      We appreciate the Reviewer’s recognition that our work provides new insights into homology-directed repair (HDR) in fission yeast.

      The manuscript would benefit from having another system to help to support or validate the key findings and/or the use of some mutants to help uncouple the different roles of Rad51 and/or Rqh1.

      While we agree with the Reviewer that using orthogonal approaches is always desirable, it is not clear what other experimental platform can address the dynamic events with single cell resolution that underlie our observations here; indeed, this was the motivation behind designing this new approach. However, we will provide additional, detailed context to support our findings in the revised manuscript that highlights how orthogonal experimental strategies (e.g. DSB repair outcome assays) already in the literature (e.g. Hope et al., PNAS, 2006) are consistent with our findings. Importantly, however, there is no other population-based system we are aware of that could demonstrate, for example, that Rqh1 shows two different behaviors in individual cells (repair failure and more rapid repair). See more in response to comment 7, below.

      \*Major comment:**

      1) In Figure 1C, and also Figure 2D, the RAD52 focus observed does not appear in the same location as the LacO cassette. I assume this is because of the way the images are cropped. It would be nice if the authors are saying that the RAD52 focus co-localizes with the inducible DSB location for this to be more readily apparent in the representative images. *

      Co-localization events, indicated with the yellow circles, are assessed within raw 3D data that is then flattened for representation in 2D in the figures. For Figure 1C, the two events in the example cell indeed overlap in 3D space. However, in Figure 2D (cells lacking Rad51) we do not observe any colocalization events in the example (and there are no time points annotated with yellow circles).

      2) In Figure 3A, the authors claim that the mean time to repair an endonuclease induced DSB is 50 min +/- 20 min. It is unclear whether or not this experiment is done in a diploid strain.

      We apologize if we were not clear. All experiments presented in the manuscript are carried out in diploid cells. What varies is whether there is a lac operator integrated at one copy of Chr II (all experiments except Fig. 2A) or on both copies (only Fig. 2A). This will be clarified in the revised text.

      3) In Figure 3, whether or not this experiment represents asynchronous cells can greatly influence the timing of DSB repair, as the cell cycle is a huge contributor to HDR repair.

      We agree with the Reviewer - the cell cycle has a critical influence on DSB repair mechanism. The diploid fission yeast in which we induced and observed DSBs are indeed asynchronous. However, in fission yeast, which spend over 80% of their cell in G2, we can assess cell cycle by morphology; cytokinesis coincides with the beginning of G2, which then persists until mitotic entry (which is also very obvious from the nuclear shape as visualized by Rad52-mCherry). Moreover, we previously found that HO endonuclease only induces DSBs during S phase (Leland et al., eLife, 2018). Given this, for individual cells we observe site-specific DSBs beginning in late S and early G2 phases and all of our analysis is done at this phase of the cell cycle. These observations are further validated by the observation that an HO-induced DSB undergoes very high rates of gene conversion in fission yeast (Prudden et al, EMBO J., 2003).

      4) In Figure 3D, since a major finding of the paper is that there are multiple invasion events, it would be nice to show some representative images of a few cells where multiple pairings occur.

      In Supplementary Figure 2A, we provided an example of a cell with multiple encounters between the DSB and donor. This will be more clearly highlighted in the revised text.

      5) It is known from Eric Greene's work that RAD51 mediated homology search can do multiple samplings of 8-9 nucleotide segments. Have the authors considered the area around the DSB site and how many potential pairing sites there might be in this region? Is it possible that having a LAC array with repeated segments might be influencing this the pairing since there would be multiple templates?

      We acknowledge that the homology of the region surrounding the DSB is important for faithful recognition of a homologous donor and that there could be many pairing sites surrounding our induced DSB after end resection. Such local sampling, however, would not be discernible due to the resolution of the light microscope (>0.2µm). We will address this noteworthy point during our discussion in the revision. Importantly, we placed the lacO array over 3 kb away from the locus where the HO recognition site is integrated on the homologous chromosome to attempt to avoid exactly the Reviewer’s concern.

      6) It would aid the reader if there were some picture schematics of what the authors think is occurring throughout the paper in the Figures. Since this is a results/discussion, this approach would be appropriate in lieu of a model figure at the end (which would also be very nice).

      We agree that diagrams would aid in communication of our hypotheses and interpretations, and these will be included in the revision.

      7) Since the multiple strand invasion events is a major finding of the paper, it is important to test the hypothesis that multiple strand invasion events are occurring a different way. A few ideas would be to examine Lorraine Symington's work on BIR where she observes multiple template switching events (Smith, CE, Llorente, B, Symington, LS (2007) Nature, 447(7140): 102-105) or something analogous to Wolf Heyer's recent study in Cell on template switching that the authors already cited. Another idea is to try a RAD51 mutant. For example, Doug Bishop's group has created a RAD51 mutant that uncouples the homology search from strand exchange, Rad51-II3A mutant (Cloud, V et al (2012) Science, 337(6099): 1222). Perhaps a mutant like this might be able to further support the key finding here.

      While our findings share parallels with the works raised by the Reviewer, we would argue that there is a fundamental difference between BIR-type assays and the one we present here, namely that we are visualizing multiple strand invasion events at the homologous chromosome in a normal, high fidelity repair event rather than multiple strand invasion events during BIR, which frequently result in translocations. Moreover, as the two chromosomes are perfectly homologous in our assay, we cannot leverage sequencing to reveal past strand invasion events that took place during HDR. We also cannot, unfortunately, access multiple simultaneous strand invasion events due to the diffraction limit of the light microscope. We concede that it would be informative to further dissect strand invasion using tools such as the Rad51-II3A mutant described in budding yeast in work referenced above by Reviewer #1 and developed in fission yeast by Sarah Lambert’s group (Ait Saada et al., Mol. Cell, 2017). However, with the present limitations on our laboratory access and the timeline necessary to carry out this experiment, we feel this is currently beyond the scope of this work.

      8) It is surprising that Rqh1 doesn't have a role in DNA end resection since this is a conserved function from budding yeast to man. Would similar results to what is observed in Figure 4 be observed in a Dna2 or Exo1 mutant?

      We acknowledge that Rqh1 orthologs in other organisms (BLM/Sgs1/etc.) have been shown to contribute to DSB end resection. However, previous work from our group indicates that Rqh1 is entirely dispensable for long-range resection in fission yeast (Leland et al., eLife, 2018). Interestingly, in this work we also demonstrated that it is only upon loss of either the 53BP1/Rad9 orthologue Crb2 or Rev7 that Rqh1 is able to compensate for loss of Exo1. It remains unclear whether this is a peculiarity of fission yeast (perhaps because they rely heavily on HR due to extensive time in G2) or if it is a direct consequence of the long G2 itself. Regardless, we demonstrated that cells lacking Exo1 cannot generate sufficient ssDNA tracts to load visualizable Rad52-mCherry (Leland et al., eLife, 2018). Given this, we cannot address this genetic background in this assay. The essential role for Dna2 in replication has also precluded its analysis.

      \*Minor comment:**

      1) As mentioned in the first line of the abstract, HDR is generally considered error-free as opposed to a pathway that "can be" error-free. *

      We acknowledge that HDR (and more specifically HR) is often error-free, but there are notable exceptions such as when a non-homologous donor is utilized for repair or when the polymerases engaged during repair incorporate errors (work from Haber and colleagues). We will expand and clarify this sentence in the revision.

      2) In Figure 2D, it is unclear whether this experiment is done in diploid cells. The rest of the figure is in diploid cells but two LacO cassette are not present past the first frame. Please clarify in the legend and/or figure panel. As mentioned above, this is also confusing in Figure 3.

      As above, we monitored repair events in diploid cells only – this will be clarified in the revised text.

      *Reviewer #1 (Significance (Required)):

      The most important advancement in this paper is that multiple strand invasion events occur during homologous recombination and the role of the Rqh1 in this process. Rqh1 is important protein whose mutation is implicated in human disease such as Bloom syndrome and cancer. In addition, misregulation of double-strand break repair and particularly of Rad51 is associate with cancer. Therefore, understanding the basic mechanisms of how Rad51 mediates double-strand break repair and the role of Rqh1 in this process is critical for understanding fundamental aspects of cancer development. * We appreciate the Reviewer’s assessment of the impact of this work.

      *Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this study, Vines et al developed a microscopy-based assay to determine the kinetics of a site-specific interhomolog repair event, in living fission yeast cells. They detect efficient homology search and homology-directed repair in the system. They also observe that repair is likely to involve multiple site-specific and Rad51-dependent co-localization events between the DSB and donor sequence, suggesting that efficient inter-homologue repair involves multiple strand invasion events. Loss of the RecQ helicase Rqh1 leads to repair through a single strand invasion event. However, failure to repair is more frequent in rqh1 mutants, which could reflect increased strand invasion at non-homologous sites.

      Overall, I find the approach to investigate homology search and homology-directed repair using live cell imaging interesting and potentially very informative. The ability to observe the process in living cells, and with high temporal resolution, complements a variety of previous studies that employ more indirect approaches to invoke similar models. In particular, previous work by the Heyer, Lichten and Hunter laboratories, in budding yeast, has established that Sgs1 promotes non-crossover recombination by acting as a quality control in the maturation of HR intermediates. In this sense, while newly described here for fission yeast, it is not unexpected that homology-directed repair involves multiple strand invasion cycles. In my opinion, the strength of the work is the method/approach, rather than the specific conclusions made (even though I think that it is important to know how fission yeast cells perform homology search).*

      We thank the Reviewer for their appreciation of the value that cell biology can bring to the study of homology-directed repair. We wholeheartedly agree that this work is consistent with prior work on Sgs1. With regards to multiple strand invasion cycles, while we agree that there may be many in the field who could be unsurprised by this result, we would argue that 1) demonstrating this by direct visualization of individual DNA repair invents has clear inherent value and 2) many studying homology search itself (or who have modeled homology search in silico, for example) do not incorporate multiple strand invasion cycles in their thinking. Thus, we would argue that this work goes beyond a technical feat and will have impact beyond the approach.

      *However, for the reasons detailed below, my general impression is that it isn't clear how robust the method is at delivering unambiguous information on the important questions asked:

      1) The authors state that they have developed a system to monitor the 'dynamics and kinetics' of an engineered, inter-homologue repair event. With this in mind, I was expecting a more detailed exploration of the process of homology search. For example, what happens at shorter time scales? Is it possible that by imaging at every 5 minutes many of the events are missed? Could the authors be missing very transient events (especially in rqh1 mutants) by using an inappropriate time scale? *

      We acknowledge that it would be ideal to observe DSB repair across a range of time scales in our system. For practical reasons we found it most valuable to choose the 5 minute time window since it was most amenable to observing the entire course of repair as often as possible in an asynchronous cell population (see our response to Reviewer #1’s comment 3 above) while mitigating photobleaching. However, we recognize that we sacrificed time resolution between acquired frames in order to do this. Like the Reviewer, we were also concerned that we were missing transient events due to an inappropriate timescale.

      To address this, we acquired additional data in WT cells with greater time resolution with a focus on encounter frequency rather than time to repair (as the overall length of the usable movie that we can obtain is shorter). When imaging WT cells with a site-specific DSB at 2 minute intervals (2.5 times more frequently), we do observe a shift (of ~ 1 encounter per 30 minute window) toward more colocalization events with the donor sequence. We also observe, however, that more sampling leads to an increase in random encounters as revealed by similar analysis of the two lacO control strain as described in the manuscript. These data will be included in the revision and suggest that we may be missing some transient encounter events while using 5 minute time points. As noted by Reviewer #2, this could account for repair in the subset of WT and Rqh1-null cells in which we observed no encounters. We will acknowledge these caveats in the revision but would argue that our data support the conclusion that loss of Rqh1 decreases the number and/or lifetime of strand invasion events.

      2) Another point relates to the Rad52 signal/foci, which is central to the study. While it is clear to me what the authors consider to be a focus of Rad52, I am not sure how to interpret what has happens when Rad52 is as enriched throughout the entire nucleus as it is in the repair focus in the still before. For example, Figure 1C, 40 min vs 45 min. How do the authors interpret what is being visualised? Similarly, is the level of colocalization at 90 min really reflecting a specific enrichment of Rad52 at the DSB site? Much more of the Rad52 signal is away from the DSB. In other words, are quantitative criteria being used to assign colocalization events?

      As described in our Methods and the text, we used specific criteria to define 1) whether DSBs are site-specific and 2) whether they are colocalized with the donor site. In the images indicated as “contrast adjusted” we have scaled each panel time point individually with respect to the pixel intensities (that is, the least and most intense pixels have been set the same value for each). This strategy allows us to convey relatively dim Rad52-mCherry foci, particularly early after DSB end resection. A consequence of this is that the apparent background for panels in which there is not a strong Rad52-mCherry focus will appear higher, while the background will appear relatively less at time points with a strong Rad52-mCherry focus. For this reason we also present the raw image (found above). It is important to emphasize that when we are applying co-localization criteria, we do so within a 3D stack of images to ensure that the Rad52-mCherry signal and lacO array GFP signal coincide. In 2D representation, however, we understand that this may appear less clear.

      In the particular case of the colocalization in Figure 1C at 90 minutes that the Reviewer points out, it is more evident in the 3-D Z stacks that the surrounding mCherry signal apart from the colocalization with the lacO array is due to inhomogeneity in the background signal. Another contribution is that the lacO array signal often becomes delocalized during colocalization events (as evident in that 90 minute time point). Although this is an interesting observation, we are still investigating what activity may explain this response. We will address the caveats of our colocalization analysis more fully in the revision.

      3) In the system described here, Rad52 foci form in only ~15% of cells. I think it would be important to rationalise this low number in the manuscript. Moreover, G2 Rad52 foci still form at considerable rates in cells without HO. I think it would be important that the authors provide some explanation on what this might reflect.

      There are several considerations that we believe contribute to this observation, which we also documented previously in haploid cells (Leland et al., eLife, 2018). First and foremost, this assay is quite different from endpoint assays that involve induction of HO nuclease because we analyze only those events that happen immediately after additional of uracil to elevate HO endonuclease expression under the control of the urg1 promoter. Combined with the efficient repair of any DSB induced by leaky HO expression (taking less than an hour according to our data), we likely miss events that have already taken place or would take place later in other assay systems. Lastly, it is established that nucleosomes can prevent HO cleavage in its intrinsic role in budding yeast (Laurenson and Rine, Microbiol. Rev., 1992; Haber, Ann. Rev. Genet., 1998); we cannot rule out that cleavage at this particular site is less efficient due to intrinsic nucleosome stability. With respect to spontaneous DNA damage, most of this is short-lived and occurs in S-phase, likely due to replication stress, although we occasionally observe long-lived Rad52 foci in a sub-population of cells – this is in line with previous publications (Coulon et al., MBoC, 2006; Lorenz et al. Mol. Cell Biol., 2009; Sanchez et al., Mol. Cell Biol., 2012; Schonbrun et al., J Biol. Chem., 2013). We will provide a greater explanation of the observed induction rate in the revision.

      \*Other issues to consider:**

      4) In Figure 2D, the overlay does not show any green. It is possible that the green channel was not overlaid with the pink? *

      We apologize for this error and very much appreciate the Reviewer noticing that it is missing from the merged image. This will be corrected.

      5) In Figure 2D, the unadjusted images for Rad52 are very sharp. Did the authors perform contrast adjustment in the top panels? If so, this should be indicated. My current impression is that the data was duplicated by mistake.

      The Rad52-mCherry data in Figure 2D was labelled correctly and not duplicated. Because cells lacking Rad51 accumulate extensively resected DSBs (and therefore abnormally high levels of Rad52 loading), the intensity of Rad52-mCherry is very high. For simplicity we will remove the contrast-adjusted Rad52-mCherry images in the revision.

      6) I don't understand why is the time since nuclear division different is every single figure. For simplicity, it would be much better to start every figure at T=0.

      We agree with the Reviewer. In the revision we will normalize all kymographs to begin at t=0 with the exception of the Fig. S1D (where we are visualizing the subsequent division).

      *Reviewer #2 (Significance (Required)):

      see above. Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, the authors describe a system to monitor an inducible site-specific double-strand break (DSB) and the undamaged homologous locus during homology-directed repair in S. pombe cells. The authors show that the Rad52 focus on the induced DSB is more persistent than spontaneous Rad52 foci that form throughout the cell cycle. The persistent Rad52 focus intermittently colocalizes with the donor sequence labeled with LacI-GFP, reflecting multiple strand invasion events, and this colocalization requires the Rad51 recombinase. The authors report that the time to repair is dependent on the number of strand invasion events (colocalization of Rad52 and homolog), and that the initial distance between the induced DSB and the homolog predicts the time to their first contact, but does not predict the time to repair. Lastly, the authors claim that repair in rqh1Δ cells is bimodal, either failing to repair within the experimental time frame, or being more efficient than WT cells (which often involves a single colocalization event).

      **These claims are supported by the data:**

      1) Rad52 focus on the induced DSB is more persistent than spontaneous Rad52 foci that form throughout the cell cycle.

      2) Multiple colocalization events between Rad52 focus and the donor sequence are frequent, and this colocalization is dependent on Rad51, which reflects multiple strand invasion events.

      3) rqh1Δ cells have a lower rate of productive repair compared to WT cells. *

      The key concern I have for this section is the noise in Rad52 images. For example, in Fig. 1C at 15 minutes, it looks like there is a Rad52 focus both before and after adjustment but the time point is labeled as not having a Rad52 focus. Conversely, in Fig. 2D at 60 minutes, it looks like there isn't a Rad52 focus but the time point is labeled as having a Rad52 focus. How did the authors determine the presence of a Rad52 focus? Additionally, it is difficult to assess colocalization of Rad52 and LacI-GFP in merged images (hard to see Rad52 focus in Fig. 1C merged and LacI-GFP in Fig. 2D merged).

      The criteria that we established to indicate a Rad52-mCherry focus (as annotated by a pink circle and as explained in the Methods) is that it persists for at least three frames (>15 minutes). This was chosen because it is a characteristic of the HO-induced DSB but not of spontaneous DNA damage that occurs frequently during S-phase. Indeed, the numerous, small, and short-lived foci at the 15 minute time point in Fig. 1C referred to by the Reviewer occurs just 15 minutes after nuclear division and is perfectly characteristic of replication stress that is independent of HO endonuclease expression. Thus, the pink circles indicate a specific type of Rad52-mCherry focus that is relevant for the assay. We agree that the Rad52-mCherry focus in Fig. 2D at ~60 minutes is poorly visualized in the flattened image, but would like to emphasize that we assess the foci in the true 3D volume. With regards to the merged images, we will adjust the individual signals to make it easier for the reader to assess colocalization in the revision.

      \*These claims are supported by weak data:**

      1) The initial distance between the induced DSB and donor sequence predicts the time to their first physical encounter (Line 60). *

      We agree with the Reviewer that our word choice (“predicts”) suggests a stronger relationship than is supported by the data. However, we also argue that there is nonetheless a meaningful correlation. We believe this is an important point to make because it supports prior work in budding yeast suggesting that relative position affects donor choice preference. We will edit this language in the revised text.

      2) Repair efficiency is dictated by the number of strand invasion events (Line 61-62). Figures 3E and 3F technically have positive correlations that support the authors' claims but there is a lot of noise. I think the data needs to be more robust, especially considering the strong wording used to describe the data. A minor comment on Fig. 3F: why is there a data point with 3.5 encounters?

      Again, we agree with the Reviewer that our word choice (“dictate”) is too strong given the data and we will edit the text accordingly. We thank the reviewer for noticing the error in Fig. 3F, which will be corrected.

      \*These claims are not supported by the data:**

      1) In the absence of Rqh1, successful repair requires a single strand invasion event (Line 63). *

      We acknowledge that this is too strong a claim to make based on our data and will amend this language in the revision text. Specifically, and as outlined in our response to Reviewer #2 with regards to our imaging frequency, we will revise the manuscript to state that cells lacking Rqh1 are more likely to repair without a visualized colocalization event and/or they possess shorter lived strand invasion events. Importantly, repair outcome assays indicate that cells lacking Rqh1 display elevated gene conversion rates rather than non-HDR-mediated repair (Hope et al., PNAS, 2006). Thus, we do not expect that the lack of colocalization reflects NHEJ but rather our inability to “catch” the colocalization event with the temporal resolution we can achieve.

      2) rqh1Δ cells that complete repair are more efficient than WT cells and often involve a single colocalization event (Line 178-179).

      As for the above, we agree that our claim that rqh1Δ cells “often” involve a single colocalization event is too strong a claim based on our data. We will amend this language in the revised text.

      Fig. 4A shows an example of a rqh1Δ cell with productive repair but without any colocalization with the homolog, which contradicts the statement that successful repair requires a single strand invasion event in the absence of Rqh1. If the authors interpreted the single continuous presence of Rad52 focus during time-lapse as evidence of a single strand invasion event, then it would nullify using multiple colocalization events as evidence for multiple strand invasion events. In other words, the data in Fig. 3D that clearly displays multiple colocalization events in individual cells during repair can no longer be evidence of multiple strand invasion events since those cells all had one continuous presence of Rad52 focus.

      We believe that we understand the confusion that the Reviewer is articulating in their comment and apologize that we have not been clearer in explaining our interpretation. For this site-specific DSB to be repaired, we expect that it must either 1) engage with the homologous chromosome to be repaired by HR/BIR or 2) be repaired through an alternative pathway – at this non-repetitive, resected locus this would likely be a microhomology-mediated (alt-) NHEJ mechanism. However, prior analysis of repair outcome in a model of interhomologue repair in the absence of Rqh1 (Hope et al., PNAS, 2006) demonstrates an increase in cross-over HR events rather than end joining events, arguing that interhomologue HR still dominates (and with increased CO to NCO frequency). We interpret the continuous presence of a Rad52 focus to only reflect that a DSB has been subjected to resection and has not yet been repaired. Taking these two points together, within the lifetime of a Rad52-loaded DSB it can either 1) never colocalize with the donor sequence and fail to repair (as in cells lacking Rad51, Fig. 2D-F) or 2) undergo strand invasion (and therefore colocalization) at least one time (but possibly multiple times) to allow for HDR to occur. However, we agree (and must clarify in the revision) that we often infer that at least one strand invasion event has taken place to support successful HDR when we do not capture the event at our experimental time resolution. Based on the additional data at shorter timescales that we will add to the revised manuscript (as outlined in the response to Reviewer 2, point 1), which demonstrates that we may in some cases be undercounting relevant colocalization events that are too brief to be accurately captured with 5 minute time resolution, we think the most parsimonious explanation is that cells lacking Rqh1 spend less time with the DSB and donor sequence colocalized prior to repair. We agree with the Reviewer, however, that we cannot say whether this reflects a shorter duration of interactions and/or a fewer number of interactions. We will therefore revise the manuscript to acknowledge this point.

      Regarding the second claim, I think Fig. 4D only shows rqh1Δ cells with successful repair (since the longest repair time is 55 minutes, but it is not clear from the figure legend). It is not shown how many colocalization events these cells had in Fig. 4D, but there are 16 cells in Fig. 4D while there are only 2 cells with a single encounter (shown in Fig. 4F). With these numbers, it seems like rqh1Δ cells that complete repair are more efficient than WT cells but only few of these cells involve a single colocalization event.

      The Reviewer is correct, Figure 4D does indeed show only rqh1Δ cells with the site-specific DSB that successfully repair – this will be clarified in the revision text. As described above in our response to Reviewer #2’s comment 1, it may be that we are missing colocalization events in rqh1Δ DSB cells. However, we would argue that our data do support that, for cells lacking Rqh1 that execute repair, there are fewer and/or shorter-lived colocalization events. Again, this will be made clear in the revision.

      Also, how often do Rad52 foci form spontaneously in rqh1Δ cells and what is the duration? This data was provided for WT but not for rqh1Δ.

      We agree that increased levels of genome instability (and therefore Rad52 foci) would present an issue – and indeed this has prevented us from analyzing some genetic backgrounds. However, we do not observe a significant increase in spontaneous Rad52-mCherry focus formation in rqh1Δ cells. This data will be included in the revision.

      All of the data would have been more supported if the homologous chromosome would have been tagged. Such a configuration would really have helped the interpretation of the rqh1∆ data.

      We agree that in theory it would be advantageous to have both copies of the chromosome tagged. Indeed, we attempted to leverage a different version of this experimental system with lacO arrays on both copies while inducing a DSB. However, the complexity of monitoring (and keeping the identity clear) for the two copies presented major challenges. Better would be two distinct arrays – an approach that has been used in budding yeast. However, to date many groups, including ours, have been unable to get TetO-TetR arrays to perform well in fission yeast.

      * Reviewer #3 (Significance (Required)):

      The significance of this work is the conceptual advance in the field of DNA repair. Homology search is an important process in homology-directed repair and is not fully understood. This study reports time-lapse data on the interaction between a DSB and its donor template during repair and provides insight into the kinetics of homology search. The audience for this manuscript is the field of DNA repair, and to a lesser extent, field of live-cell imaging.*


    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this manuscript, the authors describe a system to monitor an inducible site-specific double-strand break (DSB) and the undamaged homologous locus during homology-directed repair in S. pombe cells. The authors show that the Rad52 focus on the induced DSB is more persistent than spontaneous Rad52 foci that form throughout the cell cycle. The persistent Rad52 focus intermittently colocalizes with the donor sequence labeled with LacI-GFP, reflecting multiple strand invasion events, and this colocalization requires the Rad51 recombinase. The authors report that the time to repair is dependent on the number of strand invasion events (colocalization of Rad52 and homolog), and that the initial distance between the induced DSB and the homolog predicts the time to their first contact, but does not predict the time to repair. Lastly, the authors claim that repair in rqh1Δ cells is bimodal, either failing to repair within the experimental time frame, or being more efficient than WT cells (which often involves a single colocalization event).

      These claims are supported by the data:

      1) Rad52 focus on the induced DSB is more persistent than spontaneous Rad52 foci that form throughout the cell cycle.

      2) Multiple colocalization events between Rad52 focus and the donor sequence are frequent, and this colocalization is dependent on Rad51, which reflects multiple strand invasion events.

      3) rqh1Δ cells have a lower rate of productive repair compared to WT cells. The key concern I have for this section is the noise in Rad52 images. For example, in Fig. 1C at 15 minutes, it looks like there is a Rad52 focus both before and after adjustment but the time point is labeled as not having a Rad52 focus. Conversely, in Fig. 2D at 60 minutes, it looks like there isn't a Rad52 focus but the time point is labeled as having a Rad52 focus. How did the authors determine the presence of a Rad52 focus? Additionally, it is difficult to assess colocalization of Rad52 and LacI-GFP in merged images (hard to see Rad52 focus in Fig. 1C merged and LacI-GFP in Fig. 2D merged).

      These claims are supported by weak data:

      1) The initial distance between the induced DSB and donor sequence predicts the time to their first physical encounter (Line 60).

      2) Repair efficiency is dictated by the number of strand invasion events (Line 61-62). Figures 3E and 3F technically have positive correlations that support the authors' claims but there is a lot of noise. I think the data needs to be more robust, especially considering the strong wording used to describe the data. A minor comment on Fig. 3F: why is there a data point with 3.5 encounters?

      These claims are not supported by the data:

      1) In the absence of Rqh1, successful repair requires a single strand invasion event (Line 63).

      2) rqh1Δ cells that complete repair are more efficient than WT cells and often involve a single colocalization event (Line 178-179). Fig. 4A shows an example of a rqh1Δ cell with productive repair but without any colocalization with the homolog, which contradicts the statement that successful repair requires a single strand invasion event in the absence of Rqh1. If the authors interpreted the single continuous presence of Rad52 focus during time-lapse as evidence of a single strand invasion event, then it would nullify using multiple colocalization events as evidence for multiple strand invasion events. In other words, the data in Fig. 3D that clearly displays multiple colocalization events in individual cells during repair can no longer be evidence of multiple strand invasion events since those cells all had one continuous presence of Rad52 focus. Regarding the second claim, I think Fig. 4D only shows rqh1Δ cells with successful repair (since the longest repair time is 55 minutes, but it is not clear from the figure legend). It is not shown how many colocalization events these cells had in Fig. 4D, but there are 16 cells in Fig. 4D while there are only 2 cells with a single encounter (shown in Fig. 4F). With these numbers, it seems like rqh1Δ cells that complete repair are more efficient than WT cells but only few of these cells involve a single colocalization event. Also, how often do Rad52 foci form spontaneously in rqh1Δ cells and what is the duration? This data was provided for WT but not for rqh1Δ. All of the data would have been more supported if the homologous chromosome would have been tagged. Such a configuration would really have helped the interpretation of the rqh1∆ data.

      Significance

      The significance of this work is the conceptual advance in the field of DNA repair. Homology search is an important process in homology-directed repair and is not fully understood. This study reports time-lapse data on the interaction between a DSB and its donor template during repair and provides insight into the kinetics of homology search. The audience for this manuscript is the field of DNA repair, and to a lesser extent, field of live-cell imaging.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this study, Vines et al developed a microscopy-based assay to determine the kinetics of a site-specific interhomolog repair event, in living fission yeast cells. They detect efficient homology search and homology-directed repair in the system. They also observe that repair is likely to involve multiple site-specific and Rad51-dependent co-localization events between the DSB and donor sequence, suggesting that efficient inter-homologue repair involves multiple strand invasion events. Loss of the RecQ helicase Rqh1 leads to repair through a single strand invasion event. However, failure to repair is more frequent in rqh1 mutants, which could reflect increased strand invasion at non-homologous sites.

      Overall, I find the approach to investigate homology search and homology-directed repair using live cell imaging interesting and potentially very informative. The ability to observe the process in living cells, and with high temporal resolution, complements a variety of previous studies that employ more indirect approaches to invoke similar models. In particular, previous work by the Heyer, Lichten and Hunter laboratories, in budding yeast, has established that Sgs1 promotes non-crossover recombination by acting as a quality control in the maturation of HR intermediates. In this sense, while newly described here for fission yeast, it is not unexpected that homology-directed repair involves multiple strand invasion cycles. In my opinion, the strength of the work is the method/approach, rather than the specific conclusions made (even though I think that it is important to know how fission yeast cells perform homology search). However, for the reasons detailed below, my general impression is that it isn't clear how robust the method is at delivering unambiguous information on the important questions asked:

      1) The authors state that they have developed a system to monitor the 'dynamics and kinetics' of an engineered, inter-homologue repair event. With this in mind, I was expecting a more detailed exploration of the process of homology search. For example, what happens at shorter time scales? Is it possible that by imaging at every 5 minutes many of the events are missed? Could the authors be missing very transient events (especially in rqh1 mutants) by using an inappropriate time scale?

      2) Another point relates to the Rad52 signal/foci, which is central to the study. While it is clear to me what the authors consider to be a focus of Rad52, I am not sure how to interpret what has happens when Rad52 is as enriched throughout the entire nucleus as it is in the repair focus in the still before. For example, Figure 1C, 40 min vs 45 min. How do the authors interpret what is being visualised? Similarly, is the level of colocalization at 90 min really reflecting a specific enrichment of Rad52 at the DSB site? Much more of the Rad52 signal is away from the DSB. In other words, are quantitative criteria being used to assign colocalization events?

      3) In the system described here, Rad52 foci form in only ~15% of cells. I think it would be important to rationalise this low number in the manuscript. Moreover, G2 Rad52 foci still form at considerable rates in cells without HO. I think it would be important that the authors provide some explanation on what this might reflect.

      Other issues to consider:

      4) In Figure 2D, the overlay does not show any green. It is possible that the green channel was not overlaid with the pink?

      5) In Figure 2D, the unadjusted images for Rad52 are very sharp. Did the authors perform contrast adjustment in the top panels? If so, this should be indicated. My current impression is that the data was duplicated by mistake.

      6) I don't understand why is the time since nuclear division different is every single figure. For simplicity, it would be much better to start every figure at T=0.

      Significance

      see above.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Vines et al adapted a system that has been used in S. cerevisiae to study the homology search and homologous recombination repair events by live cell imaging. The authors utilized a system they set up in a fission yeast strain that has a fluorescently tagged endonuclease induced DSB site and monitored RAD52 focus formation in both haploid and diploid cells. The main findings presented are that multiple strand invasion events occur during DSB repair and the role of Rqh1 in promoting these multiple events. For example, cells with Rqh1 loss either have a single strand invasion event that quickly leads to repair or a very long extensive repair time. Overall the results are intriguing with new insight into DSB repair being presented. The manuscript would benefit from having another system to help to support or validate the key findings and/or the use of some mutants to help uncouple the different roles of Rad51 and/or Rqh1.

      Major comment:

      1) In Figure 1C, and also Figure 2D, the RAD52 focus observed does not appear in the same location as the LacO cassette. I assume this is because of the way the images are cropped. It would be nice if the authors are saying that the RAD52 focus co-localizes with the inducible DSB location for this to be more readily apparent in the representative images.

      2) In Figure 3A, the authors claim that the mean time to repair an endonuclease induced DSB is 50 min +/- 20 min. It is unclear whether or not this experiment is done in a diploid strain.

      3) In Figure 3, whether or not this experiment represents asynchronous cells can greatly influence the timing of DSB repair, as the cell cycle is a huge contributor to HDR repair.

      4) In Figure 3D, since a major finding of the paper is that there are multiple invasion events, it would be nice to show some representative images of a few cells where multiple pairings occur.

      5) It is known from Eric Greene's work that RAD51 mediated homology search can do multiple samplings of 8-9 nucleotide segments. Have the authors considered the area around the DSB site and how many potential pairing sites there might be in this region? Is it possible that having a LAC array with repeated segments might be influencing this the pairing since there would be multiple templates?

      6) It would aid the reader if there were some picture schematics of what the authors think is occurring throughout the paper in the Figures. Since this is a results/discussion, this approach would be appropriate in lieu of a model figure at the end (which would also be very nice).

      7) Since the multiple strand invasion events is a major finding of the paper, it is important to test the hypothesis that multiple strand invasion events are occurring a different way. A few ideas would be to examine Lorraine Symington's work on BIR where she observes multiple template switching events (Smith, CE, Llorente, B, Symington, LS (2007) Nature, 447(7140): 102-105) or something analogous to Wolf Heyer's recent study in Cell on template switching that the authors already cited. Another idea is to try a RAD51 mutant. For example, Doug Bishop's group has created a RAD51 mutant that uncouples the homology search from strand exchange, Rad51-II3A mutant (Cloud, V et al (2012) Science, 337(6099): 1222). Perhaps a mutant like this might be able to further support the key finding here.

      8) It is surprising that Rqh1 doesn't have a role in DNA end resection since this is a conserved function from budding yeast to man. Would similar results to what is observed in Figure 4 be observed in a Dna2 or Exo1 mutant?

      Minor comment:

      1) As mentioned in the first line of the abstract, HDR is generally considered error-free as opposed to a pathway that "can be" error-free.

      2) In Figure 2D, it is unclear whether this experiment is done in diploid cells. The rest of the figure is in diploid cells but two LacO cassette are not present past the first frame. Please clarify in the legend and/or figure panel. As mentioned above, this is also confusing in Figure 3.

      Significance

      The most important advancement in this paper is that multiple strand invasion events occur during homologous recombination and the role of the Rqh1 in this process. Rqh1 is important protein whose mutation is implicated in human disease such as Bloom syndrome and cancer. In addition, misregulation of double-strand break repair and particularly of Rad51 is associate with cancer. Therefore, understanding the basic mechanisms of how Rad51 mediates double-strand break repair and the role of Rqh1 in this process is critical for understanding fundamental aspects of cancer development.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their comments and outline below how we plan to address them.


      Reviewer #1 (Evidence, reproducibility and clarity (Required)): **Summary:** Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate). The authors here describe a method to modify bacterial artificial chromosomes (BAC) harbouring gene loci from eukaryotes. When wanting to modify a BAC an antibiotic selection cassette is often included alongside the desired mutation/modification to increase the number of successful recombinants in E.coli. Traditionally, this is removed in a second recombination process to leave only the desired modification. The novelty in the procedure described herein is to add a synthetic intron consensus sequence around the selection cassette, which eliminates the need for the subsequent removal of the antibiotic cassette from the BAC before transfection into mammalian cells, saving time and resources. The technique is clever in its simplicity and appears to function for a number of gene loci. The authors validated the correct functioning of the modified BACs for a number of genes using three main assays - transcript level, protein level and localisation. **Major comments:** *Are the key conclusions convincing?* The conclusion that the method described generates functional modified BACs is valid. *Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?* While the method is successfully employed in this study, its efficiency is not quantified in relation to the state-of-the-art as described in the introduction. One assumes it would be more efficient, but this has not been tested empirically in the paper. Does the inclusion of the synthetic intron sequence have an effect on the efficiency of modifying BACs compared to a more typical two-step positive/negative antibiotic selection cassette? *

      • *

      This is a good point that we did not directly address. In general, the efficiency is similar to that of integrating any cassette with selectable marker, as has been published (Poser et al 2008), and therefore also higher than the two-step counterselection method, which requires such a cassette integration in the first step alone. We will include new data specifically addressing the efficiency of our new method (see specifics below)

      The functionality of this approach rests entirely on the ability of the target cell to correctly splice out the synthetic intron. The authors are aware of this potential problem as highlighted in the lines below, but do not make efforts to explicitly test splicing. On lines 224-225, the authors state "We cannot exclude that a small portion of synthetic introns within individual cells are misspliced". On lines 230-231 it is stated that "mis-spliced mRNAs are probably minimal and degraded by nonsense-mediated decay". On lines 215-217, the authors describe an "investigation of transgenic lines at the single-cell level" that suggests "the synthetic intron is correctly spliced out in all the cells of the population". How do the authors reach this conclusion? U2OS and HeLa cells are considered very "robust" and may not show detectable consequences when stressed with an increased level of nonsense-mediated decay. Further, many genes maintain a high level of expression that buffers them against small changes in transcription/splicing. The synthetic intron might have a bigger impact on more tightly regulated genes, so assessing the splicing rate would be essential if the authors wish to advocate their technique as generally applicable.

      • *

      We will assay for splicing efficiency as outlined below.

      The ability of the synthetic intron to be removed from final transcripts depends on functioning splicing machinery. The authors might emphasise this issue, as spliceosome mutations are important fields of study and might not be compatible with this method.

      • *

      We can add this in the text

      The authors used un-directed integration of each BAC under study. Therefore, it is hard to assess what effect the synthetic intron has, as the authors only ever assess the downstream levels of the correctly spliced, translated and localised protein. The authors themselves state that this can lead to clonal variations in expression of up to 2-fold and on line 250 that this variation "could compensate for synthetic intron effects", but make no effort to test this. Again, lines 267-268 highlight the potential dangers of potential effects of the synthetic introns, but do not test these. \Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.* If not already performed, a large number of bacterial colonies should be screened for the correct modification and frequency of correct ones reported. This frequency - reported for at least three different modifications - would estimate what sort of efficiency this method provides. The modified region of each BAC should be sequenced and the results reported. The rate of exactly modified clones is important, in case of spontaneous or low fidelity integration of the antibiotic cassette. The percentage of transcripts that have the synthetic intron correctly spliced out should be measured for some of the BAC constructs used in the study. A direct head-to-head comparison of this newer method compared to other techniques, or even the authors' own previous two-step approach is necessary to assess the benefits of this method. Preferably, the experiment would be run in parallel with and without antibiotic selection applied, to show that it drastically improves chances of finding a correct clone. *

      We will generate 3 new mutations in BACs and analyze both the efficiency of integration by PCR and accuracy via sequencing. In practice, we have observed that the efficiency is similar to any other cassette integration, such as a GFP tag (Poser et al Nature Methods 2008) or a counterselection cassette (Bird et al Nature Methods 2012) (80-90%). Integrating a mutation via the second step of the counterselection method introduces a further 20% decrease in efficiencies on average.

      \Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.* Repeating the transformation of the BAC and targeting cassette and assessing the recombination efficiency and sequencing should only require existing reagents and take less than a week or two to complete. Quantitative RT-PCR to assess the percentage of transcripts that have the synthetic intron spliced out would take a little more work. However, this should not be a considerable investment in time or resources for a standard microbiology laboratory and could be completed within a few weeks using modern techniques, such as that described in Londoño et al. 2016. Repeating all the experiments in parallel would be considerable work and would only be strictly necessary if the authors wish to emphasise the benefits of their method over the many others already in wide use. *

      • *

      We will use quantitative PCR to estimate the fraction of transcripts that correctly splice out the artificial intron for two clonal cell lines characterized in the study: RNAi-resistant AurA-GFP (Fig 4), and GTSE1-14A (newly introduced; see below). While the exact method described in Londoño et al 2016 will not be applicable due to the larger size of the artificial intron, we believe we can adapt it to detect different splicing events.

      \Are the data and the methods presented in such a way that they can be reproduced?* Barring the omission of Table S1, which presumably includes exact information on the BACs modified and sequences used etc., there is sufficient other data and methods to allow the experiments to be repeated. Targeting the ESI procedure to the middle of exons is likely to have a bigger impact for smaller exons as the authors mention on lines 99-100. Making it clear which exon sizes for each gene were successfully targeted in this study would help give some idea of how significant a problem this might be. Perhaps Table S1 contains this information, but it was not provided. It would also help reviewers check the design strategies. *

      We apologize for inadvertently failing to upload Table S1 on bioRxiv. It has been uploaded now as part of this submission process. This table indeed contains BAC and target sequence information, including the size of the targeted exon (and the 2 “new” resulting exons). Targeted exons range in size from 138bp to 1537bp, and “new” exons are as small as 48bp.

      \Are the experiments adequately replicated and statistical analysis adequate?* The replication and statistically analysis of the data as presented appear adequate. Figure Legends should state the statistic used to generate error bars. *

      This will be updated

      \*Minor comments:** Specific experimental issues that are easily addressable. Are the promoters used in the vectors described universally functional? For example, is the PGK promoter functional in yeast? *

      • *

      The PGK promoter contained in the cassettes is a mammalian promoter, which has also been reported to work in flies.

      \Are prior studies referenced appropriately?* The manuscript may benefit from the referencing of BAC modification techniques from a wider variety of groups, such as those using CRISPR-guided recombineering (Pyne et al. 2015). *

      We will add citations of more techniques

      \Are the text and figures clear and accurate?* The body text is very clear save minor typographical or grammatical errors. Regarding figures, some of the coloured text in Figure 1 is somewhat illegible when printed in grayscale. Line 278 - The acronyms LAP and NLAP are not defined/explained. Antibody section starting Line 282 may fit better next to Western Blot section. Figure 2C - The blot images would benefit from arrows to indicate expected sizes of proteins. Figure 3A - the graph may benefit from a dashed line at 100% to highlight that values are normalised to controls. Figure 4 - The differences between panels B & C are unclear. Figure 4E - The legend could provide a little more detail on cell cycle stage/status of the captured cells. *

      All of the above will be addressed accordingly

      \Do you have suggestions that would help the authors improve the presentation of their data and conclusions?* Lines 23-27 are somewhat unclear and feel out of context. Perhaps the authors could clarify this as a further advantage of using BACs instead of endogenous gene modifications. *

      Thanks for the input, we will clarify this.

      While not affecting the factual content of the paper, I would advocate that the authors format the method described in Figure S3 into a more detailed text based layout similar to that seen in a typical Nature Methods article. However, this may depend on the format required by any eventual publishing journal.

      • *

      We prefer the graphical protocol, but will discuss whether to add a text protocol with the journal editor.

      That all of the work the paper was carried out in human cell lines and using human genes is a further caveat, but the authors admit this in the discussion and one would assume that most mammalian cells would respond similarly in their ability to splice out the synthetic intron. Reviewer #1 (Significance (Required)): \Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.* This work is a formal description of a newer method that could be useful for many of those employing bacterial artificial chromosomes in numerous studies, such as gene regulation. *Place the work in the context of the existing literature (provide references, where appropriate).* This work builds on methodology previously published by the authors - a counter-selection two-step procedure (Bird et al. 2011). It sets out to formally describe a method merely mentioned as "BAC intronization" in a later paper by some of the authors (Zheng et al. 2014). Other alternative one-step procedures are also available, but present a different set of challenges (Lyozin et al. 2014). Some newer approaches, such as those using CRISPR-guided recombineering (Pyne et al. 2015) or systems that combine CRISPR and positive/negative selection cassettes (Wang et al. 2016) may be slightly more efficient, but are also more complex in their design. Bird et al. 2011 DOI: 10/dv776q Pyne et al. 2015 DOI: 10/f7jx92 Wang et al. 2016 DOI: 10/f89db5 Zheng et al. 2014 DOI: 10/f5pkr6 *State what audience might be interested in and influenced by the reported findings.* As a technology paper this work should have interest from a broad field of research. While the use of BACs could sometimes be considered more traditional in light of the explosion in CRISPR-based genome editing capabilities, it is definitely seeing a resurgence as the limitations of CRISPR in modifying large regions of genome become more apparent. Therefore, technologies that accelerate the modification of BACs could prove increasingly useful. As category of audience, all those involved in significant recombineering or gene/genome engineering would potentially benefit. *Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.* Synthetic genomics, synthetic biology, cancer cell biology, gene and genome engineering REFEREES CROSS COMMENTING I would agree with reviewer two's assessment that we both view the paper in a similar light. Reviewer #2 (Evidence, reproducibility and clarity (Required)): This is a methods-focused paper that presents a strategy to efficiently introduce mutations into a bacterial artificial transgene using synthetic introns. BAC-based methods have been an effective strategy for introducing trans genes into human cells to achieve near-endogenous expression, including extensive work from these authors. However, generating mutations and changes within the internal coding sequence presents some challenges for how to target these mutations and select for the mutated form. Here, the authors describe a way to overcome this by introducing synthetic introns into an adjacent sequence. This allows them to introduce a selectable marker and conduct the molecular biology without creating complications downstream for the functionality of the protein. This method is carefully described and presented. The authors also provide clear validation by using this to create RNAi-resistant versions of multiple different mitotic factors as well as creating targeted mutants that alter the functional properties of a protein. This work clearly takes advantage of other ongoing studies from these labs (including mutants and cell lines that appear to also have been described elsewhere), but the ability to combine these in a single paper and clearly describe the method provides a helpful advance and validation. Based on the description and data presented, I think that things are clear and carefully validated. As such, I do not have technical comments or concerns and I would be comfortable with this paper appearing in an appropriate journal in its present form. Reviewer #2 (Significance (Required)): This is a solid methods paper, but for considering the nature of the impact and significance of this paper, there are several things to note: 1.The BAC-based method does appear to be a powerful and effective strategy. However, beyond the work of Mitocheck and the authors that are part of this paper, this has not seen widespread adoption. It is possible that this current method may increase its usage due to the value of the targeted mutations within the coding sequence, but at present it is not a broadly used strategy. *

      We agree that using BACs as transgenes has not seen widespread adoption as a tool on the broader cell biology community (although certainly beyond members of the Mitocheck consortium). This is likely because many erroneously think that it is a technique for specialist laboratories. We are trying to change this! For reasons outlined below, there is still an increasing desire for conditional analysis of mutated genes under physiological expression/regulation frequently not attainable via directed Cas9-based mutation. A major aim of this paper is thus to further simplify the methods for generating modified BAC transgenes.

      2.This BAC-based approach (and also RNAi) are becoming increasingly replaced by the use of CRISPR/Cas9 genome editing. The absence of Cas9-based strategies in this paper limits the potential impact and reach of this paper. The authors do mention the possibility of using a similar synthetic intron strategy for use with Cas9 in the Discussion, and appear to have conducted some experiments. If possible, it would substantially increase the value of this paper if this data and strategy were also included in the Results section (acknowledging that this may still be a work in progress).

      While some uses of BAC transgenes are in some cases better replaced by CRISPR/Cas9 techniques (i.e. GFP tagging), there are several occasions where using BACs are preferable: As stated in the text, RNAi-resistant BACs allow for conditional analysis of recessive mutations. Mutations in essential genes that are lethal will prevent growth and recovery of viable cells if integrated into the genome via Cas9. Additionally, deleterious mutations are prone to accumulate suppressive changes in chromosome integrity or gene expression during the procedure of selecting and expanding Cas9-modified cells for analysis, particularly in the genomically instable cancer cell lines frequently employed.

      We use both BACs and CRISPR/Cas9 in our lab according to our needs.

      We do have an ongoing project to apply this intronization technique to enable more efficient selection of CRISPR/Cas9 integrations. Preliminary results suggest that it works to allow selection of point mutations, but it is still being optimized, including a redesign of the cassette, and is not ready for publication.

      3.The method is solid and well-validated, but there are no new results or insights presented in this paper from the work that is described (this is fine, just commenting for considering the right journal fit).

      As “biological insights” gained as a result of this technique we had cited a couple studies that made use of the technique already (to functionally analyze a microcephaly-associated mutation in the centriolar protein CPAP at the single cell level in HeLa cells and neural progenitor cells (Zheng et al 2014, Gabirel et al 2016)). As a response to this critique to include “new biology” in this paper, we will add new unpublished data investigating a specific question: Is the cell-cycle-regulated disruption of the EB1-GTSE1 (microtubule plus-end tracking proteins) interaction in mitosis required for chromosome segregation fidelity? We have generated a GTSE1 mutant with 14 phosphosites mutated to alanine using this technique. We will present the effect on chromosome segregation.

      REFEREES CROSS COMMENTING It appears that both reviewers are largely on the same page regarding this paper.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This is a methods-focused paper that presents a strategy to efficiently introduce mutations into a bacterial artificial transgene using synthetic introns. BAC-based methods have been an effective strategy for introducing trans genes into human cells to achieve near-endogenous expression, including extensive work from these authors. However, generating mutations and changes within the internal coding sequence presents some challenges for how to target these mutations and select for the mutated form. Here, the authors describe a way to overcome this by introducing synthetic introns into an adjacent sequence. This allows them to introduce a selectable marker and conduct the molecular biology without creating complications downstream for the functionality of the protein.

      This method is carefully described and presented. The authors also provide clear validation by using this to create RNAi-resistant versions of multiple different mitotic factors as well as creating targeted mutants that alter the functional properties of a protein. This work clearly takes advantage of other ongoing studies from these labs (including mutants and cell lines that appear to also have been described elsewhere), but the ability to combine these in a single paper and clearly describe the method provides a helpful advance and validation.

      Based on the description and data presented, I think that things are clear and carefully validated. As such, I do not have technical comments or concerns and I would be comfortable with this paper appearing in an appropriate journal in its present form.

      Significance

      This is a solid methods paper, but for considering the nature of the impact and significance of this paper, there are several things to note:

      1.The BAC-based method does appear to be a powerful and effective strategy. However, beyond the work of Mitocheck and the authors that are part of this paper, this has not seen widespread adoption. It is possible that this current method may increase its usage due to the value of the targeted mutations within the coding sequence, but at present it is not a broadly used strategy.

      2.This BAC-based approach (and also RNAi) are becoming increasingly replaced by the use of CRISPR/Cas9 genome editing. The absence of Cas9-based strategies in this paper limits the potential impact and reach of this paper. The authors do mention the possibility of using a similar synthetic intron strategy for use with Cas9 in the Discussion, and appear to have conducted some experiments. If possible, it would substantially increase the value of this paper if this data and strategy were also included in the Results section (acknowledging that this may still be a work in progress).

      3.The method is solid and well-validated, but there are no new results or insights presented in this paper from the work that is described (this is fine, just commenting for considering the right journal fit).

      REFEREES CROSS COMMENTING

      It appears that both reviewers are largely on the same page regarding this paper.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate). The authors here describe a method to modify bacterial artificial chromosomes (BAC) harbouring gene loci from eukaryotes. When wanting to modify a BAC an antibiotic selection cassette is often included alongside the desired mutation/modification to increase the number of successful recombinants in E.coli. Traditionally, this is removed in a second recombination process to leave only the desired modification. The novelty in the procedure described herein is to add a synthetic intron consensus sequence around the selection cassette, which eliminates the need for the subsequent removal of the antibiotic cassette from the BAC before transfection into mammalian cells, saving time and resources. The technique is clever in its simplicity and appears to function for a number of gene loci. The authors validated the correct functioning of the modified BACs for a number of genes using three main assays - transcript level, protein level and localisation.

      Major comments:

      Are the key conclusions convincing?

      The conclusion that the method described generates functional modified BACs is valid.

      Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      While the method is successfully employed in this study, its efficiency is not quantified in relation to the state-of-the-art as described in the introduction. One assumes it would be more efficient, but this has not been tested empirically in the paper. Does the inclusion of the synthetic intron sequence have an effect on the efficiency of modifying BACs compared to a more typical two-step positive/negative antibiotic selection cassette? The functionality of this approach rests entirely on the ability of the target cell to correctly splice out the synthetic intron. The authors are aware of this potential problem as highlighted in the lines below, but do not make efforts to explicitly test splicing. On lines 224-225, the authors state "We cannot exclude that a small portion of synthetic introns within individual cells are misspliced". On lines 230-231 it is stated that "mis-spliced mRNAs are probably minimal and degraded by nonsense-mediated decay". On lines 215-217, the authors describe an "investigation of transgenic lines at the single-cell level" that suggests "the synthetic intron is correctly spliced out in all the cells of the population". How do the authors reach this conclusion? U2OS and HeLa cells are considered very "robust" and may not show detectable consequences when stressed with an increased level of nonsense-mediated decay. Further, many genes maintain a high level of expression that buffers them against small changes in transcription/splicing. The synthetic intron might have a bigger impact on more tightly regulated genes, so assessing the splicing rate would be essential if the authors wish to advocate their technique as generally applicable. The ability of the synthetic intron to be removed from final transcripts depends on functioning splicing machinery. The authors might emphasise this issue, as spliceosome mutations are important fields of study and might not be compatible with this method. The authors used un-directed integration of each BAC under study. Therefore, it is hard to assess what effect the synthetic intron has, as the authors only ever assess the downstream levels of the correctly spliced, translated and localised protein. The authors themselves state that this can lead to clonal variations in expression of up to 2-fold and on line 250 that this variation "could compensate for synthetic intron effects", but make no effort to test this. Again, lines 267-268 highlight the potential dangers of potential effects of the synthetic introns, but do not test these.

      Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      If not already performed, a large number of bacterial colonies should be screened for the correct modification and frequency of correct ones reported. This frequency - reported for at least three different modifications - would estimate what sort of efficiency this method provides. The modified region of each BAC should be sequenced and the results reported. The rate of exactly modified clones is important, in case of spontaneous or low fidelity integration of the antibiotic cassette. The percentage of transcripts that have the synthetic intron correctly spliced out should be measured for some of the BAC constructs used in the study. A direct head-to-head comparison of this newer method compared to other techniques, or even the authors' own previous two-step approach is necessary to assess the benefits of this method. Preferably, the experiment would be run in parallel with and without antibiotic selection applied, to show that it drastically improves chances of finding a correct clone.

      Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Repeating the transformation of the BAC and targeting cassette and assessing the recombination efficiency and sequencing should only require existing reagents and take less than a week or two to complete. Quantitative RT-PCR to assess the percentage of transcripts that have the synthetic intron spliced out would take a little more work. However, this should not be a considerable investment in time or resources for a standard microbiology laboratory and could be completed within a few weeks using modern techniques, such as that described in Londoño et al. 2016. Repeating all the experiments in parallel would be considerable work and would only be strictly necessary if the authors wish to emphasise the benefits of their method over the many others already in wide use.

      Are the data and the methods presented in such a way that they can be reproduced?

      Barring the omission of Table S1, which presumably includes exact information on the BACs modified and sequences used etc., there is sufficient other data and methods to allow the experiments to be repeated. Targeting the ESI procedure to the middle of exons is likely to have a bigger impact for smaller exons as the authors mention on lines 99-100. Making it clear which exon sizes for each gene were successfully targeted in this study would help give some idea of how significant a problem this might be. Perhaps Table S1 contains this information, but it was not provided. It would also help reviewers check the design strategies.

      Are the experiments adequately replicated and statistical analysis adequate?

      The replication and statistically analysis of the data as presented appear adequate. Figure Legends should state the statistic used to generate error bars.

      Minor comments:

      Specific experimental issues that are easily addressable. Are the promoters used in the vectors described universally functional? For example, is the PGK promoter functional in yeast?

      Are prior studies referenced appropriately?

      The manuscript may benefit from the referencing of BAC modification techniques from a wider variety of groups, such as those using CRISPR-guided recombineering (Pyne et al. 2015).

      Are the text and figures clear and accurate?

      The body text is very clear save minor typographical or grammatical errors. Regarding figures, some of the coloured text in Figure 1 is somewhat illegible when printed in grayscale.

      Line 278 - The acronyms LAP and NLAP are not defined/explained.

      Antibody section starting Line 282 may fit better next to Western Blot section.

      Figure 2C - The blot images would benefit from arrows to indicate expected sizes of proteins.

      Figure 3A - the graph may benefit from a dashed line at 100% to highlight that values are normalised to controls.

      Figure 4 - The differences between panels B & C are unclear.

      Figure 4E - The legend could provide a little more detail on cell cycle stage/status of the captured cells.

      Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Lines 23-27 are somewhat unclear and feel out of context. Perhaps the authors could clarify this as a further advantage of using BACs instead of endogenous gene modifications.

      While not affecting the factual content of the paper, I would advocate that the authors format the method described in Figure S3 into a more detailed text based layout similar to that seen in a typical Nature Methods article. However, this may depend on the format required by any eventual publishing journal. That all of the work the paper was carried out in human cell lines and using human genes is a further caveat, but the authors admit this in the discussion and one would assume that most mammalian cells would respond similarly in their ability to splice out the synthetic intron.

      Significance

      Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      This work is a formal description of a newer method that could be useful for many of those employing bacterial artificial chromosomes in numerous studies, such as gene regulation.

      Place the work in the context of the existing literature (provide references, where appropriate).

      This work builds on methodology previously published by the authors - a counter-selection two-step procedure (Bird et al. 2011). It sets out to formally describe a method merely mentioned as "BAC intronization" in a later paper by some of the authors (Zheng et al. 2014). Other alternative one-step procedures are also available, but present a different set of challenges (Lyozin et al. 2014). Some newer approaches, such as those using CRISPR-guided recombineering (Pyne et al. 2015) or systems that combine CRISPR and positive/negative selection cassettes (Wang et al. 2016) may be slightly more efficient, but are also more complex in their design.

      Bird et al. 2011 DOI: 10/dv776q

      Pyne et al. 2015 DOI: 10/f7jx92

      Wang et al. 2016 DOI: 10/f89db5

      Zheng et al. 2014 DOI: 10/f5pkr6

      State what audience might be interested in and influenced by the reported findings.

      As a technology paper this work should have interest from a broad field of research. While the use of BACs could sometimes be considered more traditional in light of the explosion in CRISPR-based genome editing capabilities, it is definitely seeing a resurgence as the limitations of CRISPR in modifying large regions of genome become more apparent. Therefore, technologies that accelerate the modification of BACs could prove increasingly useful. As category of audience, all those involved in significant recombineering or gene/genome engineering would potentially benefit.

      Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Synthetic genomics, synthetic biology, cancer cell biology, gene and genome engineering

  3. Jun 2020
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank all reviewers for their comments and suggestions, which will make our manuscript a much better one. Accordingly, we have already made changes to the manuscript (marked in yellow) and we will perform all the experiments requested. Below, we answer the reviewers point by point.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): This study provides solid evidences showing a role for the spectraplakin Short-stop (Shot) in subcellular lumen formation in the Drosophila embryonic and larval trachea. This subcellular morphogenetic process relies on an inward membrane growth that depends on the proper organization of actin and microtubules (MTs) in terminal cells (TCs). Shot depletion leads to a defective or absent lumen while conversely, Shot overexpression promotes excessive branching, independently on the regulation of centrosome numbers previously shown to be important for the regulation of the lumen formation process (Ricolo, D., Deligiannaki, M., Casanova, J. & Araújo, S. J. Centrosome Amplification Increases Single-Cell Branching in Post-mitotic Cells. Current Biology 26, 2805-2813 (2016)). Shot is rather important to regulate the organization of the cytoskeleton by crosslinking MTs and actin. Shot expression in TCs is controlled by the Drosophila Serum Response Factor (DSRF) transcription factor. Finally Shot functionally overlaps with the MT-stabilizing protein Tau to promote lumen morphogenesis. The figures are clear and the questions well addressed with carefully designed and controlled experiments. However, I would have few suggestions that will hopefully make some points clearer. **Major comments:** -Statistical analyses should be added for comparisons of proportions, including Fig. 1E, 1L, Fig. 2G-I, Fig. 6L, Fig. 7K, Fig. 8C-D and Fig. 9G.

      We agree with this and have now redone all graphs and revised all quantifications from this study. We have added error bars in all above mentioned graphs and have provided statistical analysis where appropriate. We have also redone all graphics and phenotype reporting, which is done now in relation to total TCs (rather than embryos or GBs and DBs TCs). This was suggested also by reviewer #2 and we agree because this is a more stringent and comparable way of quantifying our results.

      -It is not always clear what genotype has been used as the "wt" genotype, as in Fig. S2 or Fig. 3 for example, this should be added to figure legends.

      We have now clarified which flies are used as controls in each experiment throughout the paper. We have left wt where flies were wt, and changed all other cases to either the genotype or “control”.

      -Live imaging of Shot has been performed with ShotC-GFP, that cannot bind actin. Don't the authors think ShotA-GFP would reflect more accurately Shot endogenous behavior as it interacts both with actin and MTs? It would be better to show this, even if the results shown here tend to be consistent with Shot endogenous localization shown with Shot antibody staining.

      We agree and we will analyse movies with both ShotC and ShotA and present them in the revised version.

      -It is of course not possible to generate CRISPR mutant flies with mutations in putative DSRF binding sites in a reasonable amount of time, to confirm that Shot transcription is controlled by DSRF. It would thus be nice to reveal shot mRNA expression with in situ hybridization experiments in wt vs. bs embryos. This would confirm that Shot mRNA is downregulated upon DSRF inhibition and rule out a possible indirect effect on Shot protein stability for example.

      We believe the presented 3-way approach (in silico, protein quantification and phenotype rescue) is sufficient to show that Shot expression is regulated by DSRF. It is unlikely that we are dealing with protein stability or other issues, because we can rescue the lumen elongation phenotype by solely expressing Shot in TCs. However, we agree it would be nice to show this in an in situ hybridization experiment, and we will try to provide a conclusive one for resubmission. In situ detection methods, however, may not be accurate enough to detect such differences in single-cells.

      -In the same figure, it would also be interesting to show what happens to actin and MTs in bs TCs and to which extent their organization is rescued by Shot overexpression.

      We are working on this for resubmission. These experiments were frozen by the current COVID-19 pandemic and this is why they were not submitted with the first version.

      -UAS-EB1GFP does not seem to be an appropriate control in Figure 9 (A and B) since it can affect MT dynamics (Vitre, B. et al. EB1 regulates microtubule dynamics and tubulin sheet closure in vitro. Nat. Cell Biol. 10, 415-421 (2008)). Why not simply use an UAS-GFP?

      We have not detected any notorious larval TC phenotypes by overexpressing UASEB1GFP in TCs. Their branching is comparable to that in previous studies (for example Schotenfeld-Roames, et al Current Biology 2014) and there were no detectable luminal branching phenotypes. However, we agree it is more correct to analyse cells with a plain GFP and have repeated the controls for this experiment using DSRFGAL4UASGFP. This is now shown in figure 9.

      -Shot and probably Tau crosslinking activities are important for lumen morphogenesis with a striking increase in the number of embryos without lumen in shot3 and shot3 tauMR22 mutant embryos. The rescue experiments clearly show that Shot binding to both MT and actin is essential for efficient rescue. The same might apply to Tau since it is able to crosslink actin and MTs (Elie, A. et al. Tau co-organizes dynamic microtubule and actin networks. Sci Rep 5, 1-10 (2015)). I believe showing actin and MTs organization in these rescue experiments would be necessary.

      We agree and we will provide these experiments upon resubmission.

      Second, the overexpression experiments indicate that Shot is able to induce extra lumen formation even when unable to bind actin as shown with the increase in the number of supernumerary lumina (ESLs) under overexpression of ShotC and ShotCtail to a lesser extent. This phenotype is also observed under Tau overexpression. This suggest that not crosslinking anymore but rather making MTs more stable could be sufficient to promote extra lumen formation in a wt context. Stabilising MTs by treatment with Taxol might thus be sufficient to promote ESL formation. I am fully aware of the difficulty of treating Drosophila embryos with drugs, making this experiment hard to do, but I think this dual function of Shot and Tau (crosslinking actin and MTs to promote branching vs. stabilizing MTs leading to excessive branching) should be discussed.

      In Figure 2 we show not just that UASShotC is able to induce ESl but also that UAS-ShotCtail containing only the MT binding domain of Shot is enough to induce ESLs in TCs, whereas UAS-deltaCtail is not. We agree Taxol treatment would be a nice experiment to do, however we also think we provide enough evidence that MT stability is enough for ESL whereas de novo lumen formation requires crosslinking of MTs to actin. As advised, we will discuss better both Shot and Tau dual function in ESL generation and de novo lumen formation for resubmission.

      **Minor comments:**

      We have already addressed most these minor comments in the manuscript (text revised and changes in yellow). And we provide answers to some of the comments below.

      -p2 line 1: 'acentrosomal luminal branching points' may be better than 'acentrosomal branching points' to describe the phenotype. -p4, line 16: the reference 23 is not properly inserted (should be after 'closure'). -p5, line 16: Please mention what the abbreviations Bnl and Btn stand for. -p5, line 20: these 80% of TCs cells with defects in subcellular lumen formation should appear on the graph in Fig. 1E (as shown in graph 1L).

      We have added shot RNAi results to graph E in figure 1.

      -p5, line 26: this 36% value does not seem to correspond to anything on the graph in Fig. 1N. According to the figure legend, 20% of TCs did not elongate at all and the lumen was completely absent (class IV), which is consistent with the result shown in Fig. 1L. Also, I am not sure why only 25 TCs were analysed in Fig. 1N while there are the data to analyse more as shown in Fig. 1E (400 TCs), this would make the graph more representative.

      Figure 1 N represents a detail of the different phenotypes present in shot mutant embryos. Whereas for most of the paper we consider only complete lack of TC lumen, here we show the different types of affected TCs and not just the ones with a complete lack of subcellular lumen. We apologise because it was not explained in the original manuscript that types III and IV are the “no lumen” class (they were subdivided into 2 classes because they have different cell enlongation phenotypes). 36% of the total of affected TCs displayed the lack of lumen phenotype (this means a 22,5 % of the total number of TCs, because total affected TCs are 62,5% only). Numbers are similar but not exactly the same because this analysis was done using confocal microscopy and cells analysed one by one in detail, which is not possible using colorimetric methods and only luminal markers. This is also the reason we only analysed 25 TCs in this case. We thank the reviewer for pointing this out and have better described it in the manuscript.

      -p6, line 8: ShotA-GFP is indeed a long isoform but is not the full-length Shot, as it does not contain the plakin repeat exon which would add another ~3000aa.

      We have corrected this.

      -p6, lines 21-23: ShotA-GFP localisation is not shown in FigS1. The authors should refer to Fig. 2. Enlarged areas/arrows might help the reader to better visualise the different localisations of ShotA-GFP and ShotC-GFP.

      We thank the reviewer for this request and we will change the figure providing enlarged areas upon resubmission. In this version of the manuscript we have already changed the error in figure referral in the text.

      -p7, line 23: Rca1 mutants should be better introduced here.

      We have added one sentence of introduction to the Rca1 phenotype.

      -p8, line 6: Shot colocalizes/associates with stable MTs and actin would be a more appropriate title for this paragraph.

      We thank the reviewer for this alternative, and we have changed this title in the manuscript.

      -p16, line 18: 'Shot is able to mediate crosstalk' would be better than 'Shot is able to crosstalk'. -p40, lines 6 and 7: L, M and N should be K', K' and K' respectively. -p41, Fig 10D: It is quite hard to see on the cartoon what the phenotype is for Shot OE.

      We will make this clearer for resubmission.

      -The following reference shows an important role for Shot in crosslinking actin and MTs during morphogenesis of the Drosophila embryo and should be cited in this manuscript (Booth, A. J. R., Blanchard, G. B., Adams, R. J. & Röper, K. A Dynamic Microtubule Cytoskeleton Directs Medial Actomyosin Function during Tube Formation. Developmental Cell 29, 562-576 (2014)).

      We thank the reviewer for pointing this out, because this is of course an important reference known to us, which we forgot to add. We have now added this to the manuscript.

      -FigS3. It would be good to add the labels on the figure (ShotC-GFP in green, and MoeRFP/lifeActinRFP in Magenta).

      We will do this for resubmission.

      Reviewer #1 (Significance (Required)): The findings shown in this manuscript shed an important light on the way subcellular morphogenesis occurs. It was known that both actin and MTs were required in this process, particularly during the formation of Drosophila trachea (JayaNandanan, N., Mathew, R. & Leptin, M. Guidance of subcellular tubulogenesis by actin under the control of a synaptotagmin-like protein and Moesin. Nature Communications 1-10 (2019). doi:10.1038/ncomms4036; Gervais, L. & Casanova, J. In Vivo Coupling of Cell Elongation and Lumen Formation in a Single Cell. Current Biology 20, 359-366 (2010)). This work provides additional molecular insights into the way branching morphogenesis from a single cell occurs in vivo, clearly demonstrating a requirement for actin-MT crosslinking mediated by Shot and Tau. This could be of great interest in the field of branching morphogenesis and lumen formation, not only in invertebrates but also in vertebrates where such a crosslinking might occur in the vasculature, the lung, the kidney or the mammary gland for example (Ochoa-Espinosa, A. & Affolter, M. Branching Morphogenesis: From Cells to Organs and Back. Cold Spring Harb Perspect Biol 4, a008243-a008243 (2012)). *Field of expertise:* morphogenesis, Drosophila, cytoskeleton, microtubules. Reviewer #2 (Evidence, reproducibility and clarity (Required)): **Summary:** The development of branched structures with intracellular lumen is widely observed in single cells of circulatory systems. However the molecular and cellular mechanisms of this complex morphogenesis are largely unknown. In previous study, the authors revealed that centrosome as a microtubule organizing center (MTOC) located at the apical junction contributes subcellular lumen formation in the terminal cells of Drosophila tracheal system. The microtubule bundles organized by MTOC are suggested to serve as trafficking mediators and structural stabilizers for the newly elongated lumen. In this manuscript, they focused on a Drosophila spectraplakin, Shot, which have been reported to crosslink MT minus-ends to actin network, in the subcellular lumen formation. The paper started by description of lumen elongation defect of the tracheal terminal cells in the shot[3] null mutant. The overexpression of full-length and series of truncated form of shot exhibited extra-subcellular lumina (ESL) in TCs, suggesting that Shot is required for the lumen formation in dose dependent manner. They next addressed whether Shot overexpression induces ESL through the supernumerary centrosomes as in Rca1 mutant, however the number of centrosomes was not affected. Moreover, the ESL were sprouted distally from the apical junction, suggesting that Shot operate in different way from the Rca1-dependent microtubule organization. To get mechanistic insight of Shot in the luminal formation, they checked localization of the Shot and found it localized with stable MTs around the nascent lumen and with the F-actin at the tip of the cell during the cell elongation and subcellular lumen formation. In shot[3] mutant, the MT-bundles were no longer localized to apical region and the actin accumulation at the tip of the cell was also reduced. The rescue experiments using several truncated forms of Shot, and well-designed genetic analysis using various shot mutants revealed that both MT binding domain and actin binding domains are needed to develop the lumen. The expression of shot was under the regulation by terminal cell-specific transcription factor bs/DSRF, and the overexpression of shot in bs LOF mutant suppressed its phenotype, indicated that part of the luminal phenotype of bs mutant in terminal cells are due to lower levels of the activity of shot. Finally, they checked whether Tau can compensate the function of shot in the subcellular lumen formation. The lumen elongation defect in shot mutant was suppressed by tau expression, and tau overexpression phenocopied the shot overexpression-induced ESL. Although tau mutant did not show the lumen formation defects, the double mutant of shot and tau exhibited synergistic effect. Shot was also required for subcellular luminal branching at larval stages. Overall, this work highlighted the importance of Shot as a crosslinker between MT and actin that acts in downstream of the FGF signaling-induced bs/DSRF expression for the subcellular lumen formation. An excess of Shot is sufficient for ESL formation from ectopic acentrosomal branching points. Furthermore, the Tau protein can functionally replace Shot in this context. **Major comments:** *- Are the key conclusions convincing?* *- Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?* The conclusions were basically supported by the set of data presented in this article, but following points need to be clarified. The truncated form ShotC lacks only half of calponin domain that are essential for the actin binding, thus it is still possible to bind actin to some extent. Although the actin binding activity is reported as "very weak" in the cited references, the quantitative analysis has not been done. Thus, the interpretation and claims based on the experiments using ShotC should be reviewed carefully.

      We agree with the reviewer and will revise all the text for resubmission in order to make this unambiguous. However, we would like to remark that our claims are not only based on UAS-ShotC but also in the shotkakP2 allele, which does not contain one of the calponin domains and in isoforms such UAS-Shot C-tail which do not have any ABD.

      Data set in some places seems fragmented. For example, overexpression study of shot constructs (Fig. 2) lacks phenotypic comparison of control (btl Gal4 driven control FP) to compare if phenotypes of shot constructs expression are different from control. Different methods of phenotypic quantification are employed. One was counting embryo number with at least one abnormality among 20 TCs of DB or GB, or the other counting every TC for the presence of lumen/branching conditions. The latter is more stringent measure and is more appropriate for the study of single cell morphogenesis.

      We totally agree with the reviewer. We have now revised all quantifications and graphs:

      1) We have used btl>GFP as control to all overexpression experiments in embryos and DSRFGAL4UASGFP in control larvae.

      2) We have made the paper uniform regarding quantifications, which are now all done in relation to total TCs and not embryos.

      For this reason, many of the graphs, figure legends and quantification values in the the manuscript text are now changed.

      *- Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.* The all movies were using ShotC isoform which lacks half of the actin binding domain. The truncated isoform is not suitable to observe the localization, especially the colocalization with actin. The movies need to be retaken using full-length Shot at the dosage that does not interfere with normal TC development.

      We agree and we will analyse movies with both ShotC and ShotA for resubmission.

      Some statements on Moesin and Tau localization sound as if the authors studied Shot interaction with nascent Moe and Tau molecules. This is confusing because fragments of Moe and Tau, but not functional full length proteins, were used.

      We will revise the text to make this unambiguous fir resubmission.

      *- Are the suggested experiments realistic in terms of time and resources?* It would help if you could add an estimated cost and time investment for substantial experiments. Because the transgenic fly is already present, we assume it would be done in 4 weeks. However, it would be influnced under social circumstances whether the lab facilities are able to access or not. *- Are the data and the methods presented in such a way that they can be reproduced?* *- Are the experiments adequately replicated and statistical analysis adequate?* The methods provided seem to be sufficient for reproducing the data by competent researchers, and most of the data are solid and the sample numbers are sufficient for the claims. However, the criteria for phenotypic evaluation differs among graphs and figures, that possibly confuse the readers. Standardized measurement methods are desirable. **Minor comments:** *- Specific experimental issues that are easily addressable.* In the rescue experiments shown in Figure 6, only full-length Shot rescued the subcellular lumen formation, but either of truncated Shot did not. The localization study of MT and actin in those conditions will reveal whether proper localizations of actin and MT are critical for the lumen formation.

      We are working on this for resubmission. These experiments were stalled by the current COVID-19 pandemic and this is why they were not submitted with the first version. We will provide MT and actin localization for the rescue experiments with ShotA and ShotC.

      *- Are prior studies referenced appropriately?* The references are cited appropriately. *- Are the text and figures clear and accurate?* There are several typos: Remodelling -> remodeling, signalling -> signaling. In the figure 2, G and H seem redundant. Scale bars are missing in Fig1 F-K, Fig2 K-L, Fig6 A-I, Fig7 E-J and Fig8 E-J.

      We have changed the graphs in figure 2. Typos have been corrected. We will provide errors bars for resubmission.

      The author often called shot+ genotype as "wild type". They are transgenic strains with some mutations, and cannot be found in the wild. They should be simply called with genotype or "control" for experiments.

      We thank the reviewer for pointing these typos and incoherences with control genotypes. We have partly revise the text and figures and will finish for resubmission.

      *- Do you have suggestions that would help the authors improve the presentation of their data and conclusions?* In Figure 4, as the localization of Shot is difficult to see in detail, enlarged insets might help. In addition, the green and cyan in C'-E' is difficult to distinguish.

      We will change this for resubmission.

      With Figure 5, the authors claimed that Shot LOF leads to disorganized MT-bundles and actin localization. We feel this is an overstatement and the Figure should be backed up with better data, or removed. F-actin and microtubule localizations are highly dynamic and the snapshot pictures are insufficient for demonstrating defective localization. It is also possible that (potential) difference in the marker localization is due to indirect effect of Shot LOF in cell shape.

      We agree with the reviewer that fixed samples are not the best to analyse cytoskeletal components, but we observe clear differences in MT bundles and specially in actin localization in shot mutants as compared to controls and we believe it is important to show these results. Cell shape might of course alter the analysis which is why we present 3 different cell shapes in Figure 5. In addition, there are many previous studies where localization of MTs and actin was done in fixed mutant embryos, where cell shape is also affected, and revealed important steps in TC formation (Gervais and Casanova, 2010; JayanNadanan et al. 2014).Nonetheless, we have revised the text in order to avoid overstatements.

      Reviewer #2 (Significance (Required)): *- Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.* *- Place the work in the context of the existing literature (provide references, where appropriate).* In blood capillary and insect trachea, the branching process of single vessel cells involves sprouting of cell protrusions, followed by the lumen extension from the main vessels. The lumen formation involves assembly of plasma membrane components inside of the cytoplasm. Since the luminal membrane is associated with protein complexes common to apical cell membrane, lumen formation is believed to involve redirection of apical trafficking of membranes to intracellular sites (Sigurbjörnsdóttir, Mathew, Leptin 2014, 10.1038/nrm3871). The authors previously demonstrated that centrosome is an important link of preexisting lumen to de novo lumen formation, leading to the hypothesis that centrosome-derived microtubules organize lumen membrane assembly. *- State what audience might be interested in and influenced by the reported findings.* In this manuscript, the authors addressed this issue by looking at the function of Shot/Plakin that has both microtubule and actin binding activities. Shot is an ideal candidate for linking actin-rich cell protrusions in the leading edge to centrosome- associated lumen tip. Indeed the authors clearly showed that shot is required for lumen extension and overexpressed shot protein associates with intracellular tract rich in microtubules and F-actin. Their findings are definitely a progress in the field of Drosophila tracheal development. Having said that, how Shot links leading edge protrusions and centrosomes, how it is organized into pre-lumen tract, and how it contribute to further assembly of luminal membrane and directed secretion, are not well understood yet. Without clues to those fundamental questions, I believe this paper is most appropriate for experts readers of Drosophila cell biology and tracheal development. Finally I feel that the paper include many data sets and some pictures are not easy to grasp essential points, such as three movies showing localization of overexpressed shot-C, RFP-moesin, and Lifeact. *- Define your field of expertise with a few keywords to help the authors contextualize your point of view.* Drosophila, tracheal cell biology. *- Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.* No Reviewer #3 (Evidence, reproducibility and clarity (Required)): **Summary** In their manuscript entitled "Coordinated crosstalk between microtubules and actin by a spectraplakin regulates lumen formation and branching" Ricolo and Araujo characterize the requirement for Short Stop (Shot) in the formation of subcellular tubes in tracheal terminal cells. The authors examined embryos homozygous for shot3, a presumed null allele of shot. They found an 80% penetrant defect in seamless tube formation or growth. The phenotype resembles that reported for mutations in blistered, which encodes the Drosophila SRF ortholog. The authors find that expression of SRF is not blocked by mutations in shot and later find that bs mutants have decreased levels of shot expression and that shot overexpression can partly suppress the bs tube formation defects. The authors then examine whether the requirement for shot is autonomous to the trachea and find that it is, as pan-tracheal shot RNAi replicates the seamless tube defects. The authors find that overexpression of various Shot isoforms results in the formation of ectopic seamless tubes within terminal cells. Using the various transgenic constructs available for shot, the authors show that the overexpression phenotype is dependent upon the interaction between Shot and microtubules, and is dose-dependent. Previous work had shown that ectopic terminal cell tubes also can arise due to increased centrosome number; the authors show that centrosome number is not altered in shot mutants. Shot has well characterized actin and microtubule binding functions, and the authors show that Shot localization overlaps both with microtubules and with actin, and that both cytoskeletal elements are aberrant in shot mutant cells. In a series of experiments utilizing various shot mutant backgrounds and shot transgenes, the authors identify requirements for both Shot-cytoskeleton interactions in the formation and branching of seamless tubes in terminal cells. Finally, the authors examine the requirement for Tau in the same processes. Tau and Shot had previously been found to work together in neurons, and this seems to be true in terminal cells as well. Tau overexpression induces ectopic seamless tubes and can partially suppress shot loss of function. Embryos mutant for tau showed seamless tube directionality defects, but not lumen formation or branching. Embryos doubly mutant for tau and shot showed a more severe seamless tube defect than shot mutants alone - an increase in terminal cells with no lumen from 22% to 85%. Authors also examined terminal cells in larval stages using dsrf-Gal4 to knockdown shot in terminal cells (rather than pan-tracheal knockdown with breathless). The authors conclude from their studies that Shot, through its interactions with microtubules and the actin cytoskeleton coordinate the outgrowth and branching of subcellular tubes. Overlapping function of Tau and possibly other additional MAPs also act in these processes. The work is largely well done and the conclusions are supported by the data. **Minor concerns:** -If one were to start this work today, crispr knockout and knockins would be preferred. While shot^3 is widely considered a null allele, there are indications that some shot function is still present in shot^3 embryos. This would also be relevant to the penetrance of the defects. The transgenes are useful, but given the dosage effects noted in various of the authors experiments, interpretation of some experiments is complicated as compared to a knockin. For overexpression experiments, landing site constructs would be preferable. I do not mean to suggest that the authors necessarily go this route, but am just pointing out a limitation of the approach.

      We agree, but we also think that with the amount of data and tools generated by other labs over recent years, regarding shot function in the nervous system (Voelzmann et al 2017), we are in a position to be able to take the conclusions of this work based on these transgenic and different shot alleles.

      -Insight into function at higher resolution than altered microtubule and actin organization would significantly increase the impact. -cell autonomy (line 19, p5) is not the correct term. Pan-tracheal knockdown tests tissue autonomy. Mosaic analysis or terminal cell specific knockdown would address cell autonomy.

      We have changed the manuscript accordingly.

      -line 14 p6 acting should be actin -dsrf-Gal4 transgenes were made by Mark Metzstein

      We have corrected these.

      -there also appears to be rescue of the fusion cell defects of shot by Tau overexpression. Authors should comment on this and what it means for the seamless tubulogenesis program in terminal cells vs fusion cells.

      We will reanalyse shot rescued with tau embryos focusing on fusion phenotypes and discuss this in the revised version.

      Reviewer #3 (Significance (Required)): The findings will be of interest to a broad cell biology community as they provide a conceptual advance and may help to focus future work on seamless tubulogenesis. The authors do a good job of placing the results in the context of previous studies. *Field of expertise:* Drosophila, tracheal tubulogenesis, developmental biology

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      In their manuscript entitled "Coordinated crosstalk between microtubules and actin by a spectraplakin regulates lumen formation and branching" Ricolo and Araujo characterize the requirement for Short Stop (Shot) in the formation of subcellular tubes in tracheal terminal cells.

      The authors examined embryos homozygous for shot3, a presumed null allele of shot. They found an 80% penetrant defect in seamless tube formation or growth. The phenotype resembles that reported for mutations in blistered, which encodes the Drosophila SRF ortholog. The authors find that expression of SRF is not blocked by mutations in shot and later find that bs mutants have decreased levels of shot expression and that shot overexpression can partly suppress the bs tube formation defects.

      The authors then examine whether the requirement for shot is autonomous to the trachea and find that it is, as pan-tracheal shot RNAi replicates the seamless tube defects.

      The authors find that overexpression of various Shot isoforms results in the formation of ectopic seamless tubes within terminal cells. Using the various transgenic constructs available for shot, the authors show that the overexpression phenotype is dependent upon the interaction between Shot and microtubules, and is dose-dependent.

      Previous work had shown that ectopic terminal cell tubes also can arise due to increased centrosome number; the authors show that centrosome number is not altered in shot mutants.

      Shot has well characterized actin and microtubule binding functions, and the authors show that Shot localization overlaps both with microtubules and with actin, and that both cytoskeletal elements are aberrant in shot mutant cells. In a series of experiments utilizing various shot mutant backgrounds and shot transgenes, the authors identify requirements for both Shot-cytoskeleton interactions in the formation and branching of seamless tubes in terminal cells.

      Finally, the authors examine the requirement for Tau in the same processes. Tau and Shot had previously been found to work together in neurons, and this seems to be true in terminal cells as well. Tau overexpression induces ectopic seamless tubes and can partially suppress shot loss of function. Embryos mutant for tau showed seamless tube directionality defects, but not lumen formation or branching. Embryos doubly mutant for tau and shot showed a more severe seamless tube defect than shot mutants alone - an increase in terminal cells with no lumen from 22% to 85%.

      Authors also examined terminal cells in larval stages using dsrf-Gal4 to knockdown shot in terminal cells (rather than pan-tracheal knockdown with breathless).

      The authors conclude from their studies that Shot, through its interactions with microtubules and the actin cytoskeleton coordinate the outgrowth and branching of subcellular tubes. Overlapping function of Tau and possibly other additional MAPs also act in these processes.

      The work is largely well done and the conclusions are supported by the data.

      Minor concerns:

      -If one were to start this work today, crispr knockout and knockins would be preferred. While shot^3 is widely considered a null allele, there are indications that some shot function is still present in shot^3 embryos. This would also be relevant to the penetrance of the defects. The transgenes are useful, but given the dosage effects noted in various of the authors experiments, interpretation of some experiments is complicated as compared to a knockin. For overexpression experiments, landing site constructs would be preferable. I do not mean to suggest that the authors necessarily go this route, but am just pointing out a limitation of the approach.

      -Insight into function at higher resolution than altered microtubule and actin organization would significantly increase the impact.

      -cell autonomy (line 19, p5) is not the correct term. Pan-tracheal knockdown tests tissue autonomy. Mosaic analysis or terminal cell specific knockdown would address cell autonomy.

      -line 14 p6 acting should be actin

      -dsrf-Gal4 transgenes were made by Mark Metzstein

      -there also appears to be rescue of the fusion cell defects of shot by Tau overexpression. Authors should comment on this and what it means for the seamless tubulogenesis program in terminal cells vs fusion cells.

      Significance

      The findings will be of interest to a broad cell biology community as they provide a conceptual advance and may help to focus future work on seamless tubulogenesis. The authors do a good job of placing the results in the context of previous studies.

      Field of expertise: Drosophila, tracheal tubulogenesis, developmental biology

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The development of branched structures with intracellular lumen is widely observed in single cells of circulatory systems. However the molecular and cellular mechanisms of this complex morphogenesis are largely unknown. In previous study, the authors revealed that centrosome as a microtubule organizing center (MTOC) located at the apical junction contributes subcellular lumen formation in the terminal cells of Drosophila tracheal system. The microtubule bundles organized by MTOC are suggested to serve as trafficking mediators and structural stabilizers for the newly elongated lumen.

      In this manuscript, they focused on a Drosophila spectraplakin, Shot, which have been reported to crosslink MT minus-ends to actin network, in the subcellular lumen formation. The paper started by description of lumen elongation defect of the tracheal terminal cells in the shot[3] null mutant. The overexpression of full-length and series of truncated form of shot exhibited extra-subcellular lumina (ESL) in TCs, suggesting that Shot is required for the lumen formation in dose dependent manner. They next addressed whether Shot overexpression induces ESL through the supernumerary centrosomes as in Rca1 mutant, however the number of centrosomes was not affected. Moreover, the ESL were sprouted distally from the apical junction, suggesting that Shot operate in different way from the Rca1-dependent microtubule organization. To get mechanistic insight of Shot in the luminal formation, they checked localization of the Shot and found it localized with stable MTs around the nascent lumen and with the F-actin at the tip of the cell during the cell elongation and subcellular lumen formation. In shot[3] mutant, the MT-bundles were no longer localized to apical region and the actin accumulation at the tip of the cell was also reduced. The rescue experiments using several truncated forms of Shot, and well-designed genetic analysis using various shot mutants revealed that both MT binding domain and actin binding domains are needed to develop the lumen. The expression of shot was under the regulation by terminal cell-specific transcription factor bs/DSRF, and the overexpression of shot in bs LOF mutant suppressed its phenotype, indicated that part of the luminal phenotype of bs mutant in terminal cells are due to lower levels of the activity of shot. Finally, they checked whether Tau can compensate the function of shot in the subcellular lumen formation. The lumen elongation defect in shot mutant was suppressed by tau expression, and tau overexpression phenocopied the shot overexpression-induced ESL. Although tau mutant did not show the lumen formation defects, the double mutant of shot and tau exhibited synergistic effect. Shot was also required for subcellular luminal branching at larval stages.

      Overall, this work highlighted the importance of Shot as a crosslinker between MT and actin that acts in downstream of the FGF signaling-induced bs/DSRF expression for the subcellular lumen formation. An excess of Shot is sufficient for ESL formation from ectopic acentrosomal branching points. Furthermore, the Tau protein can functionally replace Shot in this context.

      Major comments:

      - Are the key conclusions convincing? - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      The conclusions were basically supported by the set of data presented in this article, but following points need to be clarified.

      The truncated form ShotC lacks only half of calponin domain that are essential for the actin binding, thus it is still possible to bind actin to some extent. Although the actin binding activity is reported as "very weak" in the cited references, the quantitative analysis has not been done. Thus, the interpretation and claims based on the experiments using ShotC should be reviewed carefully.

      Data set in some places seems fragmented. For example, overexpression study of shot constructs (Fig. 2) lacks phenotypic comparison of control (btl Gal4 driven control FP) to compare if phenotypes of shot constructs expression are different from control. Different methods of phenotypic quantification are employed. One was counting embryo number with at least one abnormality among 20 TCs of DB or GB, or the other counting every TC for the presence of lumen/branching conditions. The latter is more stringent measure and is more appropriate for the study of single cell morphogenesis.

      - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      The all movies were using ShotC isoform which lacks half of the actin binding domain. The truncated isoform is not suitable to observe the localization, especially the colocalization with actin. The movies need to be retaken using full-length Shot at the dosage that does not interfere with normal TC development.

      Some statements on Moesin and Tau localization sound as if the authors studied Shot interaction with nascent Moe and Tau molecules. This is confusing because fragments of Moe and Tau, but not functional full length proteins, were used.

      - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Because the transgenic fly is already present, we assume it would be done in 4 weeks. However, it would be influnced under social circumstances whether the lab facilities are able to access or not.

      - Are the data and the methods presented in such a way that they can be reproduced? - Are the experiments adequately replicated and statistical analysis adequate?

      The methods provided seem to be sufficient for reproducing the data by competent researchers, and most of the data are solid and the sample numbers are sufficient for the claims. However, the criteria for phenotypic evaluation differs among graphs and figures, that possibly confuse the readers. Standardized measurement methods are desirable.

      Minor comments:

      - Specific experimental issues that are easily addressable.

      In the rescue experiments shown in Figure 6, only full-length Shot rescued the subcellular lumen formation, but either of truncated Shot did not. The localization study of MT and actin in those conditions will reveal whether proper localizations of actin and MT are critical for the lumen formation.

      - Are prior studies referenced appropriately?

      The references are cited appropriately.

      - Are the text and figures clear and accurate?

      There are several typos: Remodelling -> remodeling, signalling -> signaling. In the figure 2, G and H seem redundant. Scale bars are missing in Fig1 F-K, Fig2 K-L, Fig6 A-I, Fig7 E-J and Fig8 E-J.

      The author often called shot+ genotype as "wild type". They are transgenic strains with some mutations, and cannot be found in the wild. They should be simply called with genotype or "control" for experiments.

      - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      In Figure 4, as the localization of Shot is difficult to see in detail, enlarged insets might help. In addition, the green and cyan in C'-E' is difficult to distinguish.

      With Figure 5, the authors claimed that Shot LOF leads to disorganized MT-bundles and actin localization. We feel this is an overstatement and the Figure should be backed up with better data, or removed. F-actin and microtubule localizations are highly dynamic and the snapshot pictures are insufficient for demonstrating defective localization. It is also possible that (potential) difference in the marker localization is due to indirect effect of Shot LOF in cell shape.

      Significance

      - Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      - Place the work in the context of the existing literature (provide references, where appropriate).

      In blood capillary and insect trachea, the branching process of single vessel cells involves sprouting of cell protrusions, followed by the lumen extension from the main vessels. The lumen formation involves assembly of plasma membrane components inside of the cytoplasm. Since the luminal membrane is associated with protein complexes common to apical cell membrane, lumen formation is believed to involve redirection of apical trafficking of membranes to intracellular sites (Sigurbjörnsdóttir, Mathew, Leptin 2014, 10.1038/nrm3871). The authors previously demonstrated that centrosome is an important link of preexisting lumen to de novo lumen formation, leading to the hypothesis that centrosome-derived microtubules organize lumen membrane assembly.

      - State what audience might be interested in and influenced by the reported findings.

      In this manuscript, the authors addressed this issue by looking at the function of Shot/Plakin that has both microtubule and actin binding activities. Shot is an ideal candidate for linking actin-rich cell protrusions in the leading edge to centrosome- associated lumen tip. Indeed the authors clearly showed that shot is required for lumen extension and overexpressed shot protein associates with intracellular tract rich in microtubules and F-actin. Their findings are definitely a progress in the field of Drosophila tracheal development. Having said that, how Shot links leading edge protrusions and centrosomes, how it is organized into pre-lumen tract, and how it contribute to further assembly of luminal membrane and directed secretion, are not well understood yet. Without clues to those fundamental questions, I believe this paper is most appropriate for experts readers of Drosophila cell biology and tracheal development.

      Finally I feel that the paper include many data sets and some pictures are not easy to grasp essential points, such as three movies showing localization of overexpressed shot-C, RFP-moesin, and Lifeact.

      - Define your field of expertise with a few keywords to help the authors contextualize your point of view.

      Drosophila, tracheal cell biology.

      - Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      No

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This study provides solid evidences showing a role for the spectraplakin Short-stop (Shot) in subcellular lumen formation in the Drosophila embryonic and larval trachea. This subcellular morphogenetic process relies on an inward membrane growth that depends on the proper organization of actin and microtubules (MTs) in terminal cells (TCs). Shot depletion leads to a defective or absent lumen while conversely, Shot overexpression promotes excessive branching, independently on the regulation of centrosome numbers previously shown to be important for the regulation of the lumen formation process (Ricolo, D., Deligiannaki, M., Casanova, J. & Araújo, S. J. Centrosome Amplification Increases Single-Cell Branching in Post-mitotic Cells. Current Biology 26, 2805-2813 (2016)). Shot is rather important to regulate the organization of the cytoskeleton by crosslinking MTs and actin. Shot expression in TCs is controlled by the Drosophila Serum Response Factor (DSRF) transcription factor. Finally Shot functionally overlaps with the MT-stabilizing protein Tau to promote lumen morphogenesis.

      The figures are clear and the questions well addressed with carefully designed and controlled experiments. However, I would have few suggestions that will hopefully make some points clearer.

      Major comments:

      -Statistical analyses should be added for comparisons of proportions, including Fig. 1E, 1L, Fig. 2G-I, Fig. 6L, Fig. 7K, Fig. 8C-D and Fig. 9G.

      -It is not always clear what genotype has been used as the "wt" genotype, as in Fig. S2 or Fig. 3 for example, this should be added to figure legends.

      -Live imaging of Shot has been performed with ShotC-GFP, that cannot bind actin. Don't the authors think ShotA-GFP would reflect more accurately Shot endogenous behavior as it interacts both with actin and MTs? It would be better to show this, even if the results shown here tend to be consistent with Shot endogenous localization shown with Shot antibody staining.

      -It is of course not possible to generate CRISPR mutant flies with mutations in putative DSRF binding sites in a reasonable amount of time, to confirm that Shot transcription is controlled by DSRF. It would thus be nice to reveal shot mRNA expression with in situ hybridization experiments in wt vs. bs embryos. This would confirm that Shot mRNA is downregulated upon DSRF inhibition and rule out a possible indirect effect on Shot protein stability for example.

      -In the same figure, it would also be interesting to show what happens to actin and MTs in bs TCs and to which extent their organization is rescued by Shot overexpression.

      -UAS-EB1GFP does not seem to be an appropriate control in Figure 9 (A and B) since it can affect MT dynamics (Vitre, B. et al. EB1 regulates microtubule dynamics and tubulin sheet closure in vitro. Nat. Cell Biol. 10, 415-421 (2008)). Why not simply use an UAS-GFP?

      -Shot and probably Tau crosslinking activities are important for lumen morphogenesis with a striking increase in the number of embryos without lumen in shot3 and shot3 tauMR22 mutant embryos. The rescue experiments clearly show that Shot binding to both MT and actin is essential for efficient rescue. The same might apply to Tau since it is able to crosslink actin and MTs (Elie, A. et al. Tau co-organizes dynamic microtubule and actin networks. Sci Rep 5, 1-10 (2015)). I believe showing actin and MTs organization in these rescue experiments would be necessary.

      Second, the overexpression experiments indicate that Shot is able to induce extra lumen formation even when unable to bind actin as shown with the increase in the number of supernumerary lumina (ESLs) under overexpression of ShotC and ShotCtail to a lesser extent. This phenotype is also observed under Tau overexpression. This suggest that not crosslinking anymore but rather making MTs more stable could be sufficient to promote extra lumen formation in a wt context. Stabilising MTs by treatment with Taxol might thus be sufficient to promote ESL formation. I am fully aware of the difficulty of treating Drosophila embryos with drugs, making this experiment hard to do, but I think this dual function of Shot and Tau (crosslinking actin and MTs to promote branching vs. stabilizing MTs leading to excessive branching) should be discussed.

      Minor comments:

      -p2 line 1: 'acentrosomal luminal branching points' may be better than 'acentrosomal branching points' to describe the phenotype.

      -p4, line 16: the reference 23 is not properly inserted (should be after 'closure').

      -p5, line 16: Please mention what the abbreviations Bnl and Btn stand for.

      -p5, line 20: these 80% of TCs cells with defects in subcellular lumen formation should appear on the graph in Fig. 1E (as shown in graph 1L).

      -p5, line 26: this 36% value does not seem to correspond to anything on the graph in Fig. 1N. According to the figure legend, 20% of TCs did not elongate at all and the lumen was completely absent (class IV), which is consistent with the result shown in Fig. 1L.

      Also, I am not sure why only 25 TCs were analysed in Fig. 1N while there are the data to analyse more as shown in Fig. 1E (400 TCs), this would make the graph more representative.

      -p6, line 8: ShotA-GFP is indeed a long isoform but is not the full-length Shot, as it does not contain the plakin repeat exon which would add another ~3000aa.

      -p6, lines 21-23: ShotA-GFP localisation is not shown in FigS1. The authors should refer to Fig. 2. Enlarged areas/arrows might help the reader to better visualise the different localisations of ShotA-GFP and ShotC-GFP.

      -p7, line 23: Rca1 mutants should be better introduced here.

      -p8, line 6: Shot colocalizes/associates with stable MTs and actin would be a more appropriate title for this paragraph.

      -p16, line 18: 'Shot is able to mediate crosstalk' would be better than 'Shot is able to crosstalk'.

      -p40, lines 6 and 7: L, M and N should be K', K' and K' respectively.

      -p41, Fig 10D: It is quite hard to see on the cartoon what the phenotype is for Shot OE.

      -The following reference shows an important role for Shot in crosslinking actin and MTs during morphogenesis of the Drosophila embryo and should be cited in this manuscript (Booth, A. J. R., Blanchard, G. B., Adams, R. J. & Röper, K. A Dynamic Microtubule Cytoskeleton Directs Medial Actomyosin Function during Tube Formation. Developmental Cell 29, 562-576 (2014)).

      -FigS3. It would be good to add the labels on the figure (ShotC-GFP in green, and MoeRFP/lifeActinRFP in Magenta).

      Significance

      The findings shown in this manuscript shed an important light on the way subcellular morphogenesis occurs. It was known that both actin and MTs were required in this process, particularly during the formation of Drosophila trachea (JayaNandanan, N., Mathew, R. & Leptin, M. Guidance of subcellular tubulogenesis by actin under the control of a synaptotagmin-like protein and Moesin. Nature Communications 1-10 (2019). doi:10.1038/ncomms4036; Gervais, L. & Casanova, J. In Vivo Coupling of Cell Elongation and Lumen Formation in a Single Cell. Current Biology 20, 359-366 (2010)). This work provides additional molecular insights into the way branching morphogenesis from a single cell occurs in vivo, clearly demonstrating a requirement for actin-MT crosslinking mediated by Shot and Tau.

      This could be of great interest in the field of branching morphogenesis and lumen formation, not only in invertebrates but also in vertebrates where such a crosslinking might occur in the vasculature, the lung, the kidney or the mammary gland for example (Ochoa-Espinosa, A. & Affolter, M. Branching Morphogenesis: From Cells to Organs and Back. Cold Spring Harb Perspect Biol 4, a008243-a008243 (2012)).

      Field of expertise: morphogenesis, Drosophila, cytoskeleton, microtubules.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The response to reviewers consists of three parts:

      1. A summary of the main points from the two reviews, and the authors' response to these points.
      2. A detailed revision plan for the preprint, taking into account both the main points of the reviews, and other comments made by the reviewers.
      3. A point-by-point response to the reviewers.

      For figure citations, OV = old version, i.e. bioRxiv preprint 2019-826180v2, and NV = new version, i.e. revised and re-submitted version.

      1. Summary of main points by the reviewers, and authors’ responses:

      • Both reviewers felt that the manuscript was overlong; Reviewer 1 recommended either shortening it or splitting it into two stories, while Reviewer 2 recommended cutting down the text.
        • We have considerably shortened the manuscript in accordance with this request (see revision plan below). We had already considered splitting the manuscript into two parts during the drafting stage, and had rejected this possibility as the data are intertwined - the retroactive validation of the dimer interface by the mutagenesis constructs (OV Fig. S3 [NV Fig. S4]) being a good example.
        • The revised manuscript features 7 main figures and 13 supplementals.
      • Both reviewers felt too much text and figure space was allocated to negative data, specifically the investigation of potential lipid binding by the TbMORN1 protein, and that there should be more focus on the positive parts of the story.
        • A key part of shortening the manuscript has been moving most of the negative data on lipid binding into the supplemental figures, and considerably shortening the associated text. This has allowed the main figures and associated text to focus more on the positive elements of the project, while still ensuring publication of all the data.
      • The reviewers appear to be in slight disagreement concerning discussion of the data. Reviewer 1 has encouraged more speculation on the physiological role of PE binding, a potential lipid transfer function, a role for calcium ions, the relevance of the observed disulphide bond, and the role of zinc ions in apicomplexan proteins; Reviewer 2 has recommended avoiding excessive speculation or inference.
        • Given that both reviewers have agreed that the original manuscript was overlong, we have implemented Reviewer 2's suggestion here and reduced the amount of speculation in the revised text.
      • The reviewers agreed that the technical quality of the data was high and that the conclusions drawn were robust.
        • We are glad that the reviewers were appreciative of the data quality. For this reason, we were reluctant to remove any of the data from the manuscript and would prefer instead to transfer it to the supplementals. We feel that the negative data still have considerable community value, given that they show that MORN repeats are not automatically lipid binding modules and can thus act as a caveat to other researchers.

      2. Detailed revision plan for the preprint:

      • We have implemented the reviewers' suggestions and substantially shortened the manuscript, primarily by trimming the (phospho)lipid-binding section, which contains a large amount of negative data. The following main figures have been moved into the supplemental section:
        • OV Fig. 2 ("TbMORN1 interacts with phospholipids but not liposomes") has become NV Fig. S2
        • OV Fig. 4 ("TbMORN1(2-15) does not bind to liposomes in vitro") has become NV Fig. S6
        • OV Fig. 8 ("Conservation and properties of residues in TbMORN1(7- 15)") has become NV Fig. S11
      • This has left a total of 7 main figures and 13 supplementals.
      • The text associated with the entirety of the lipid-binding part (OV lines 210- 530, OV Figs. 2-6 [NV Figs. 2-4, S2, S6], OV Supplemental Figs. 2-6 [NV Supplemental Figs. S3-S5, S7, S8]) has been condensed. The focus of this section is now on the positive parts of the data: the PE association (OV Fig. 3 [NV Fig. 2]) and the in vivo work (OV Figs. 5, 6 [NV Figs. 3, 4]).
      • We have additionally limited the amount of inference and speculation in the manuscript.

      3. Point-by-point responses to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      MORN (membrane occupation and recognition nexus) repeat proteins are found in prokaryotes and eukaryotes. They feature characteristic repeats in their primary sequence, have been assumed to play a role in lipid binding, but remain poorly characterized on the functional and structural level. This manuscript tries to address both these questions and is organized in major parts. In the first part the authors characterize a putative role of MORN repeat proteins in lipid binding and membrane association. In the second part, the authors use X-ray crystallography to establish the structure of MORN repeat proteins and to investigate the dimerization.

      As a cleverly chosen point of departure, they focus their study particularly on MORN1 from Trypanosoma brucei (TbMORN1), which is composed solely on MORN repeats. The structures of MORN repeats (from several species) in part two provide interesting insights into their mode of homotypic interactions and their role as dimerization or oligomerization devices. The lipid binding and membrane association of MORN proteins in the first part remains somewhat confusing and unclear, despite the use of a whole battery of techniques.

      We anticipate that the shortening and refocusing of the lipid binding data has addressed this issue.

      It is questionably, why the authors invest so many figures and words to inform the reader on negative results.

      We have chosen to publicise our negative data in full because, as noted in the manuscript, there is a widespread and erroneous assumption that MORN repeats are lipid binding modules. We feel that publishing these data will allow them to act as a caveat to other researchers working on MORN repeat proteins. We have, however, addressed the reviewer's request in that we have considerably shortened the text associated with these data and have moved the corresponding figures into the supplementals.

      The authors suggest that MORN proteins can bind to lipids via their hydrophobic acyl chainswhich is 'very hard to imagine under physiological conditions unless TbMORN1 is a lipid carrier and not a membrane-binding proteins. Unfortunately, a role as lipid carrier has not been rigorously tested.

      The reviewer is correct that we have not specifically tested for a function as a lipid carrier protein and although this was only speculation, it has been toned down accordingly.

      In this sense the first part remains somewhat immature and incoherent. Furthermore, they suggest based on the lack-of-evidence that MORN proteins do not bind membranes in vivo and in vitro.

      We are not clear where this suggestion was made. Our data indicate that TbMORN1 does not directly bind membranes in vivo or in vitro, and we therefore noted that putative lipid binding by other MORN repeat proteins should be viewed with caution. Specifically, we stated in the Discussion (OV lines 955-956) that "the presence of MORN repeats in a protein should not be taken as indicative of lipid binding or lipid membrane binding without experimental evidence". Again, our expectation is that the major changes planned for the data presentation in this section will make it more coherent.

      The main issue of this manuscript is, in my view, the way the data were presented.The manuscript is generally well-written, but much too long. The structural work is important and concise.

      We have considerably shortened the manuscript as per the reviewer's request, and especially the section on lipid binding.

      The first part, however, reports in five separate figures on a lack of membrane binding by a MORN protein and its ability to bind individual lipids. The physiologically relevance of this lipid binding is questionable as acknowledged by the authors.

      We have moved two of these figures (OV Figs. 2, 4) into the supplementals section [NV Figs. S2, S6], shortened the associated text, and limited the amount of speculation.

      Even though I find it important that the membrane/lipid binding ability of MORN proteins is rigorously tested, I would highly recommend to separate the current manuscript in two independent stories. Alternatively, I would recommend to reduce the first part into a single figure and to remove the most artifactual assays.

      We have implemented the second of these two suggestions for the manuscript. We had already considered splitting the manuscript during the drafting stage, but rejected this possibility as the data were too intertwined. Consequently, we have opted to considerably reduce the first part, and moved OV Figs. 2 and 4 into the supplementals [NV Figs. S2, S6]. We would prefer not to remove data altogether as they are likely to have community value even if they are negative and as noted, they are of good quality.

      In the current form, the first part and the second part of the manuscript remain somewhat detached from each other. The characterization of the lipid binding/membrane binding properties has a number of substantial weaknesses (e.g. use of quite different, nonphysiological buffers for membrane binding assays; use of deletion mutants for the binding assays, which do not show the full potential of oligomerization). This which makes it hard to read and confuses the reader. Even though I have no reason to doubt the conclusions by the authors, I do not think that all necessary caution has been invested to rule out other possibilities.

      We believe that the shortening and refocusing of the manuscript should address these issues. For consideration of the buffer and deletion mutant points, please see responses to Major Points below.

      In summary, even though the technical quality of the individual performed assays is high, there are some conceptual issues that make it hard to make a strong case based on a collection of individual, clear datasets. Even though I find the structures of the MORN proteins important, timely, and interesting, I would not recommend this study for publication in its current form. The manuscript would be more fun to read if both of the parts would be shortened substantially and more focused.

      We have implemented this suggestion: the manuscript has been considerably shortened (from 20,489/135,073 to 18,555/103,988 characters/words, focused on reducing the negative lipid-binding results).

      While I agree that most evidence provided on lipid/membrane binding of TbMORN1 argue against a direct role of MORN proteins in membrane binding, I feel that the experimental approach is not coherent enough. See a few major points of criticism below.

      Major Points:

      1. The authors decide to characterize the membrane binding of a MORN repeat protein using a deletion variant that lacks the N-terminal repeat. However, in Figure 1B they show that the N-terminal repeat is important for the formation of higher-order oligomers. While I fully understand that the presence of the most N-terminal repeat does hamper the structural work, I find it problematic to remove it for the lipid/membrane-binding assays. The formation of higher oligomeric species beyond the dimer, may be important for membrane binding/recruitment (avidity effects).

      As we explained in the manuscript, the reason for not using the full-length protein for in vitro work was because it was polydisperse, and that the yields were extremely low. See OV lines 178-179 ("The yields of TbMORN1(1-15) were always very low, making this construct not generally suitable for in vitro assays".) and OV lines 411-414 ("...TbMORN1(1-15), which was polydisperse in vitro and formed large oligomers (Fig. 1B). The membrane-binding activity of these polydisperse oligomers was not possible to test in vitro, as the purification yields of TbMORN1(1-15) were always low."). Consequently, we used the longest construct that was suitable in terms of chemical and oligomeric homogeneity. Using the full-length protein would have had inherent problems with aggregation, and consequently would have compromised the data and derived results. In order to make this clear in the manuscript we edited the sentence mentioned above as follows:

      “It was not possible to test the membrane-binding activity of these polydisperse oligomers in vitro however, as the purification yields of TbMORN1(1-15) were always low. As an alternative, the possible membrane association of TbMORN1(1-15) was examined in vivo."

      2) (Related to point 1) I do not understand the choice of the buffers used for some of the assays. The use of pH 8.5 and NaCl concentrations of 200 mM are non-physiological.

      These were the buffer conditions required to retain the protein in a monodisperse state, suitable for in vitro assays.

      For CD spectroscopy, a high ionic strength was obtained by the use of 200 mM NaF. If a high ionic strength is required to prevent the formation of higher oligomers of MORN, it raises the question if the formation of higher oligomers (under physiological conditions) may also contribute to their function.

      The oligomers of TbMORN1 may indeed be the most functionally relevant form of TbMORN1 but we do not currently have a means of testing this in vitro, as acknowledged in the text (OV lines 411-414, quoted above). The aim of CD spectroscopy was to assess fold integrity and stability of different constructs; we used buffers as recommended for the CD spectroscopy experiments by Kelly et al, 2005 (doi:10.1016/j.bbapap.2005.06.005) (Table 1 and section 4.2). Furthermore, the CD spectra of TbMORN(1-15) and TbMORN(2-15) (OV Fig. S1E [NV Fig. S1E]) are basically superimposable, suggesting identical secondary structure content at the concentration used for these experiments.

      It is unclear, in which buffer the fluorescence anisotropy measurements were performed.

      We have provided details on the buffer conditions for the fluorescence anisotropy experiments in the Materials and Methods section, NV page 23, lines 962-963.

      The sucrose-loaded vesicles were hydrated in a 20 mM HEPES pH 7.4, 0.3 M Sucrose. The composition of the buffer after the addition of MORN proteins is not clear.

      The Materials and Methods are now unambiguous on this point. Please see NV lines 1036- 1046: "6 μM Rhodamine B dihexadecanoyl phosphoethanolamine (Rh-DHPE) was added to all lipid mixtures to facilitate the visualisation of the SLVs. The lipid mixtures were dried under a nitrogen stream, and the lipid films hydrated in 20 mM HEPES pH 7.4; 0.3 M sucrose. The lipid mixtures were subjected to 4 cycles of freezing in liquid nitrogen followed by thawing in a sonicating water bath at RT. The vesicles were pelleted by centrifugation (250,000 × g, 30 min, RT) and resuspended in 20 mM HEPES pH 7.4, 100 mM KCl to a total lipid concentration of 1 mM. SLVs were incubated with 1.5 μM purified TbMORN1(2-15) in gel filtration buffer (20 mM Tris-HCl pH 8.5, 200 mM NaCl, 2% glycerol, 1 mM DTT) at a 1:1 ratio (30 min, RT)." The liposomes were at physiological pH and close to physiological ionic strength.

      Despite the use of an impressive array of techniques, this first part of the manuscript remains somewhat immature and incoherent. Due to the use of constructs that have not the full ability to oligomerize (point 1) and due to the inconsistent use of experimental conditions, it is hard to draw firm conclusions from this first part.

      Any biochemical study is conducted within the constraints of the choice of construct and the choice of buffer conditions, and the data are valid within those parameters. This applies as much to positive data as to negative data, so we are not clear why the reviewer is placing such emphasis on this point. In the case of the LiMA data, which are the most unbiased and comprehensive dataset in the manuscript, these experiments were well-controlled and there were also domains present that were recruited to membranes under the buffer conditions, allowing us to rule out that the assay conditions were completely unsuitable. Validating negative results should be done as carefully and with as many orthogonal approaches as the validation of positive results. The reviewer acknowledges below that "the data point in the direction that MORN proteins (or at least TbMORN1) does not directly bind to membranes". This is the conclusion that we wanted to communicate.

      For example: In Figure 2E TbMORN(2-15) does show some concentration-dependent binding, which -however- is interpreted as background binding. What are the results using this assay (or better: a liposome floatation assay) when using full-length TbMORN(1-15) in a more physiological buffer?

      As noted already, it is not possible to use the TbMORN1(1-15) construct for in vitro assays owing to the extremely low yields and polydisperse nature of the protein. The excess fulllength protein was associated with the cytosolic fraction and not the membrane fraction in vivo (OV Fig. 6B [NV Fig. 4B]).

      The statement that MORN proteins bind to lipids, but not to liposomes/membranes is -in my view- not sufficiently addressed to make a strong case.

      At no point do we suggest that MORN repeat proteins in general bind to lipids and not to liposomes/membranes. On the contrary, and as detailed in the manuscript, we set out to assay the lipid binding activity of TbMORN1, found that it appears to bind to lipids but not to liposomes/membranes, and have therefore cautioned that lipid or liposome/membrane binding of other MORN repeat proteins must be tested experimentally before claims of function are made.

      3) The physiological relevance of lipid binding to MORN proteins remains obscure (as also acknowledged by the authors). Does the binding of PE lipids to the MORN protein have a physiological role? Does the binding of fluorescent PI(4,5)P2 point to a physiological role of MORN proteins?

      These are interesting questions that we would like to address in future work.

      4) In light of recent data from the Chris Stefan lab (PMID: 31402097) a co-incidence detection of PI(4,5)P2, PS, and cholesterol seems possible. Can the authors address this possibility?

      Again, the involvement of cholesterol, PS, and PI(4,5)P2 would be interesting questions for subsequent work but are beyond the scope of the present study. We did partially address this issue in our use of PI(4,5)P2, POPC and cholesterol containing liposomes in liposome cosedimentation assays, which showed no binding (OV Fig. S3A [NV Fig. S4A]).

      Furthermore, the role of Ca2+ signaling / Ca2+ ions has not been addressed. In light of the important role of Ca2+ for the recognition of PI(4,5)P2 (PMID: 28177616), this point should be addressed.

      We carried out liposome pelleting assays in the presence of Ca2+ and Mg2+, and saw no binding by TbMORN1(2-15) in either condition (see data below). These data were not included in the MS because of the insufficient number of technical replicates available.

      5) For characterizing the binding of lipids to MORN proteins, the authors use nonphysiological fluorescent and short-chain lipid analogues at concentrations, which are unlikely to occur for endogenous PIPs in the cytosol of cells. Why choosing such an artificial system? Why introducing this system at length, if other -less artifact-prone- assays are available? I would recommend to not feature this assay as prominently as it was in the current study.

      Our aim was to stick to using the same fluorophore throughout all the experiments. The choice of short-chain lipids was constrained by what was commercially available with the BODIPY TMR fluorophore. We have implemented the reviewer's suggestion in the manuscript, and the text associated with the fluorescence anisotropy assays has been considerably shortened. We are aware that the chosen concentration of the fluorescent lipids was out of physiological range, but the requirements of the fluorescence anisotropy itself necessitated a compromise. The possible shortcomings of the fluorescence anisotropy assays are, we believe, more than amply compensated by the LiMA data.

      6) How would PE find its way to the lipid binding region in MORN? Would it diffuse to the MORN protein via the aqueous phase or would the MORN protein pickup PE form membranes up collision? The authors should address this point, by separating the lipiddepleted MORN protein from donor-vesicles containing PE by a dialysis membrane. If PE would not find its way to the lipid binding site of MORN, this would imply that MORN protein can extract lipids only upon colliding with the membrane. What is the stoichiometry of PE to MORN?

      These are all interesting questions that we would like to pursue in subsequent work, but we feel that they are beyond the scope of the present study. Until we have conditions suitable for obtaining high yields and monodisperse populations of the full-length protein, which probably also necessitates developing conditions for controlled oligomerisation, it would be premature to start this. As to how it picks up PE: it is well known that specific lipid binding/chaperoning proteins can deliver their lipid cargo to other proteins. Additionally, proteins that bind lipids use hydrophobic domains to both interact with and sequester fatty acids and/or lipids from membranes. The literature is populated with lots of such examples. https://www.sciencedirect.com/science/article/pii/S0092867416310765.

      Despite my critique raised above, I agree with the authors that the data point in the direction that MORN proteins (or at least TbMORN1) does not directly bind to membranes. Their data, however, would still be consistent with a role as lipid transfer protein and a recruitment of MORN proteins to the membrane by other proteins. Have the authors performed any additional experiments in this direction? Also, the potential role of palmitoylation is only mentioned in the discussion (page 22), while palmitoylation would provide a simple means for membrane recruitment.

      We are glad that the reviewer concurs with our main conclusion. We agree, as noted in the discussion, that a role as a lipid transfer protein might still be possible, and this is something that we would like to pursue in follow-up work. We have not yet performed any additional experiments in this direction. Concerning palmitoylation, the predictions using the CSS-Palm software were always weak and ambiguous, and in addition the best candidate cysteine residue was Cys351, which is in our structure engaged in the disulphide bond observed in the C2 crystal form. We feel that this is something to keep in mind, but is not yet a strong enough hypothesis to pursue intensively.

      Minor Points:

      Figure 1B: The authors should provide information on the void volume of the column.

      Implemented in the figure legend (7.2 ml).

      Page 17, line 696-701: The authors point out that the C2 crystal form is stabilized by two disulfide bridges. The authors should comment on the physiological relevance of these disulfide bridges.

      Given the reducing environment of the cytosol, it is an open question as to whether these disulphide bridges exist in vivo. We would prefer not to speculate on this point, as we do not feel it would be productive.

      Page 18, line 734-740: The authors should provide data on the potential role of Zn2+ on MORN function in a physiological context. The section describing that the dimer is stabilized by Zn2+ ions (pages 18 and 19) lacks a discussion if Zn2+ are functionally relevant. There is only a beautiful sequence analysis and a discussion of the conservation of the Zn2+ coordinating residues. Can the authors perform Zn2+ titrations and SEC-MALS experiments (or alternatives such as SAXS) to show that Zn2+ indeed affects the oligomeric state of only the PfMORN, but not the other MORN proteins that form alternative dimers?

      The known requirement for zinc ions in Plasmodium growth was already noted (OV lines 992- 993, Marvin et al., 2012), and is, we believe, sufficient to address the issue of physiological relevance at this stage. The zinc ions are predicted to affect the architecture of the apicomplexan (Plasmodium, Toxoplasma) MORN1 protein dimers, not their oligomeric state. For PfMORN1, SEC-MALS and SAXS were carried out in 20 mM Tris-HCl pH 7.5, 100 mM NaCl with no zinc present. When EDTA was added, no change in behaviour of the protein was seen by SEC-MALS. When “TPEN”, a strong zinc chelator, was added, the protein precipitated in SEC-MALS experiments.

      Reviewer #1 (Significance):

      A putative role of MORN proteins in membrane and lipid binding is addressed. The view the MORN proteins bind directly to membranes is challenged. Structures of dimeric MORN proteins provide important insight into the modes of dimerization.

      There is a recent structure of MORN proteins (which is referenced by the authors), but I feel that additional structural work is important and justified. The work on membrane vs. lipid binding is important, but not sufficiently addressed in the current manuscript.

      We are glad that the reviewer finds the structural work important and justified, although we disagree with the reviewer’s assessment of the lipid binding. As noted in the previous paragraph, our data challenge the assumption that MORN repeat proteins directly bind membranes, and we feel that this alone is a significant conceptual advance.

      I would recommend to separate the study in two parts. The audience is likely to confused (or bored) by the lengthy discussion on whether or not MORN proteins bind lipids and or membrane or not.

      We would prefer to implement the reviewer's other suggestion, namely that the manuscript is considerably shortened and less focus given to the negative data on lipid binding.

      I am not an expert in structural biology, but have a fair understanding of structural biology. I have worked on lipid binding proteins and have a very good understanding of lipid/membrane-binding assays.


      Reviewer #2 (Evidence, reproducibility and clarity):

      Summary

      The manuscript describes an extensive and detailed investigation into the structure and function(s) of MORN domains. It has to be acknowledged that, despite the considerable amount of work reported, the conclusions are rather limited. From a technical viewpoint, the experiments have been appropriately executed and, generally, I concur with the conclusions drawn. However, the manuscript is over-long: in general, I would recommend concentrating on positive conclusions which can be drawn from the data and avoid excessive speculation or inference (some examples given below).

      We are glad that the reviewer is satisfied with the technical quality of the work and (in general) the validity of the conclusions. We acknowledge that the original submission was fairly long, and have considerably shortened the revised manuscript and focused more on the positive conclusions in order to implement this suggestion.

      Major Comments

      There are three general- perhaps rather obvious- points to make. First, there is no particular reason to think that conservation of structure necessarily indicates conservation of a particular function. There seems to be an implicit assumption that MORN domains are associated with a specific, well-defined biological function. Given their diversity, are there particular reasons to think that this is the case?

      The reviewer is exactly right that there is an implicit assumption that MORN domains are associated with a specific, well-defined function: specifically, lipid binding. It is this assumption, which has been widely circulated in the almost complete absence of experimental evidence, that we are challenging. We agree that MORN repeats are likely to be capable of multiple functions, and protein-protein interactions are now better supported than protein-lipid interactions.

      Second, a strategy which examines the properties of just the recombinant MORN domains in vitro, removed from the context of the whole protein (eg junctophilin) or- importantly- its interacting partners in vivo, has obvious limitations. Frequently a reductionist approach is successful; however, in this case, MORN domains appear to be less tractable to that kind of approach. For all the in vitro binding and structural experiments presented, there is always a concern that the absence of other parts of the relevant MORN-containing protein or its partners could explain failure or inconsistency of in vitro biological activity measurements.

      Again, the reviewer is right that there is an inherent contextual limitation to any in vitro work that utilises a single protein, but this is a concern that - by definition - could be raised about any in vitro study utilising a single protein. It should be noted that we have also carried out in vivo experiments using TbMORN1 (OV Figs. 5, 6 [NV Figs. 3, 4]).

      Third, the possibility that MORN domains might mediate interactions with other proteins seems to be given little consideration, in spite of the Li et al (2019) paper. An experimental strategy which looked for binding partners (eg by pulldown assay) might have provided more insight.

      These data are already in the literature. A previous study by the same team (Morriswood et al., 2013) used proximity-dependent biotin identification to identify candidate binding partners and near neighbours of TbMORN1.

      In order to stress this point we added the following sentence in the discussion section, NV pages 18-19, lines 774-778.

      “The concluding data presented here suggest that TbMORN1 utilises this oligomerisation capacity to build mesh-like assemblies, which can reach considerable size in vitro (Fig. 7G). These mesh-like assemblies may reflect the endogenous organisation of the protein in vivo, where a number of binding partners have already been identified (Morriswood et al., 2013)”.

      Minor Comments

      1. In the abstract and elsewhere the authors refer to a possible function of MORN domains as 'dimerisation and oligomerisation devices' (line 53). What is the evidence that dimer formation is important for function in vivo?

      This is an interesting and important question and one that we would like to address in future work. We did attempt to generate trypanosome cell lines that inducibly expressed monomeric TbMORN1 (the double mutant, where the point mutations were simultaneously introduced in the dimerisation interface in repeats 13 and 14), but no expression of the ectopic protein was ever observed (9 separate clones obtained in 3 independent transfections). This might indicate the importance of the dimeric state in vivo, perhaps hinting that dimerisation is important for protection from degradation. In general, proteins assuming higher oligomeric states in homo- or heteromeric assemblies benefit from increased robustness in the cellular environment and optimised activity by the following means:

      • Increased stability by decreasing the surface area/volume ratio
      • Simple construction of larger complexes
      • Allosteric regulation
      • Co-localisation of distinct biological functions
      • Substrate channelling
      • Protection from aggregation or degradation

      Which or which combination of the factors is relevant for TbMORN1 being a functional dimer in vivo is difficult to say at this point.

      1. Did the authors attempt to co-crystallize TbMORN1(7-15) with PI(4,5)P2?

      No. For crystallisation, we used lysine methylated samples, and by doing this we neutralised positively-charged potential binding sites which would have interacted with the negatively charged lipid headgroup. We did not observe any bound lipids in the electron density maps obtained from the crystals.

      1. Fig 2C: did the authors also estimate binding stoichiometry as well as the equilibrium binding constants for these data? This should be determined by fitting a single binding site model to the data. Other methods (eg ITC) can probably determine this with more accuracy. The value of stoichiometry is sometimes forgotten in such binding measurements- is one ligand bound per monomer or dimer, for example?

      We discussed estimation of the binding stoichiometry in the fluorescence anisotropy assays at some length, but the conclusion was that the required experiments would contain too many approximations to provide high-confidence data. We did use ITC and also MST, but did not observe any binding with these assays.

      1. Lines 674-678 I found it hard to work out whether these constructs harbour the natural C-terminal sequence without truncation or addition of an affinity tag. I think the answer is 'yes' but it was difficult working this out from the details in M&M.

      TbMORN1(7-15) crystallisation was with a C-terminal Strep tag; TgMORN1(7-15) and PfMORN1(7-15) had their affinity tags removed by protease treatment prior to crystallisation. We have clarified this point in the M&M, page 29, lines 1189-1192: “Crystallisation of TbMORN1(7-15) (with a C-terminal Strep tag), TgMORN1(7-15) and PfMORN1(7-15) (both with affinity tags removed) was performed at 22 °C using a sitting-drop vapour diffusion technique and micro-dispensing liquid handling robots (Phoenix RE (Art Robbins Instruments) and Mosquito (TTP labtech).”

      1. Lines 688-694 The PISA interface analysis is useful here in distinguishing crystal contacts from those which persist in solution. The discussion of the results is unclear, however, on this critical point: were the dimer interfaces the only contacts which were significant in the various crystal forms?

      Yes, correct. PISA showed that the described dimerisation contacts were the only significant ones in the various crystal forms. Other crystals contacts had typically low P-values and poor ΔG and small “radar” surface in the complexive PISA analysis.

      In the case of both TbMORN1 crystal forms and in the case of the TgMORN1 P43212 crystal form we have a dimer in the asymmetric unit, while in the case of the PfMORN1 and TgMORN1 P6222 form we have one molecule in the asymmetric unit, and the dimer is created by the crystallographic twofold axis. In the latter cases the quaternary structure resulting from the symmetry operations was the top-scoring one considering either P-values and/or the number of stabilising interactions buried surface area.

      1. Lines 754-763 This paragraph seems rather speculative and is a good example where the text could be cut down.

      If the line citation is correct, then we disagree with this assessment and would prefer not to implement it. The paragraph in question concerns a detailed and very precise discussion of the side chain interactions that stabilise the V-shaped forms of TgMORN1 and PfMORN1.

      1. Line 765-788 This section is also rather overdone: such observations are only useful if they are subsequently tested by recording dimer conformation for a representative selection of MORN dimers from different species.

      Again, we disagree with the reviewer's assessment of this analysis. The analysis has considerable predictive power and already has some experimental validation via the SAXS observation that PfMORN1 is capable of forming extended dimers in solution (OV Fig. 10C [NV Fig. 7C]).

      1. Lines 800-801 I don't think this statement is strictly correct. The SAXS data show that PfMORN1(7-15) adopts an extended conformation, with no evidence of the 'V' shaped structure. Related to that point, from what I could glean from the SAXS Methods section, all solution conditions for these experiments were conducted without Zn2+? If some dimer interfaces require Zn2+, should it not be included?

      We have clarified this statement. The SAXS experiments were conducted without zinc, and, as we have stressed, the V-shaped form of TgMORN1 and PfMORN1 was only ever observed in the crystals. For PfMORN1, SEC-MALS and SAXS were carried out in 20 mM Tris-HCl pH 7.5, 100 mM NaCl with no zinc present. When EDTA was added, no change in behaviour of the protein was seen by SEC-MALS. When “TPEN”, a strong zinc chelator, was added, the protein precipitated in SEC-MALS experiments.

      Reviewer #2 (Significance):

      There is certainly value in establishing that MORN domains do not, in vitro, appear to bind to lipid vesicles, and to define their lipid binding capability (although it is rather complex). The crystal structures and SAXS data extend the rather limited structural data on MORN domains. Despite the effort involved, conclusions about likely functions of MORN domains in vivo are rather limited.

      We are glad that the reviewer acknowledges the value in challenging the assumption that MORN repeats are lipid binding devices, and that the structural data are important for expanding the knowledge base on this class of repeat motif proteins. In vivo functional work is being actively pursued at present.

      My expertise lies in X-ray crystallography and protein biochemistry.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      The manuscript describes an extensive and detailed investigation into the structure and function(s) of MORN domains. It has to be acknowledged that, despite the considerable amount of work reported, the conclusions are rather limited. From a technical viewpoint, the experiments have been appropriately executed and, generally, I concur with the conclusions drawn. However, the manuscript is over-long: in general, I would recommend concentrating on positive conclusions which can be drawn from the data and avoid excessive speculation or inference (some examples given below).

      Major Comments

      There are three general- perhaps rather obvious- points to make. First, there is no particular reason to think that conservation of structure necessarily indicates conservation of a particular function. There seems to be an implicit assumption that MORN domains are associated with a specific, well-defined biological function. Given their diversity, are there particular reasons to think that this is the case? Second, a strategy which examines the properties of just the recombinant MORN domains in vitro, removed from the context of the whole protein (eg junctophilin) or- importantly- its interacting partners in vivo, has obvious limitations. Frequently a reductionist approach is successful; however, in this case, MORN domains appear to be less tractable to that kind of approach. For all the in vitro binding and structural experiments presented, there is always a concern that the absence of other parts of the relevant MORN-containing protein or its partners could explain failure or inconsistency of in vitro biological activity measurements. Third, the possibility that MORN domains might mediate interactions with other proteins seems to be given little consideration, in spite of the Li et al (2019) paper. An experimental strategy which looked for binding partners (eg by pulldown assay) might have provided more insight.

      Minor Comments

      1. In the abstract and elsewhere the authors refer to a possible function of MORN domains as 'dimerisation and oligomerisation devices' (line 53). What is the evidence that dimer formation is important for function in vivo?
      2. Did the authors attempt to co-crystallize TbMORN1(7-15) with PI(4,5)P2?
      3. Fig 2C: did the authors also estimate binding stoichiometry as well as the equilibrium binding constants for these data? This should be determined by fitting a single binding site model to the data. Other methods (eg ITC) can probably determine this with more accuracy. The value of stoichiometry is sometimes forgotten in such binding measurements- is one ligand bound per monomer or dimer, for example?
      4. Lines 674-678 I found it hard to work out whether these constructs harbour the natural C-terminal sequence without truncation or addition of an affinity tag. I think the answer is 'yes' but it was difficult working this out from the details in M&M.
      5. Lines 688-694 The PISA interface analysis is useful here in distinguishing crystal contacts from those which persist in solution. The discussion of the results is unclear, however, on this critical point: were the dimer interfaces the only contacts which were significant in the various crystal forms?
      6. Lines 754-763 This paragraph seems rather speculative and is a good example where the text could be cut down.
      7. Line 765-788 This section is also rather overdone: such observations are only useful if they are subsequently tested by recording dimer conformation for a representative selection of MORN dimers from different species.
      8. Lines 800-801 I don't think this statement is strictly correct. The SAXS data show that PfMORN1(7-15) adopts an extended conformation, with no evidence of the 'V' shaped structure. Related to that point, from what I could glean from the SAXS Methods section, all solution conditions for these experiments were conducted without Zn2+? If some dimer interfaces require Zn2+, should it not be included?

      Significance

      There is certainly value in establishing that MORN domains do not, in vitro, appear to bind to lipid vesicles, and to define their lipid binding capability (although it is rather complex). The crystal structures and SAXS data extend the rather limited structural data on MORN domains. Despite the effort involved, conclusions about likely functions of MORN domains in vivo are rather limited. My expertise lies in X-ray crystallography and protein biochemistry.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      MORN (membrane occupation and recognition nexus) repeat proteins are found in prokaryotes and eukaryotes. They feature characteristic repeats in their primary sequence, have been assumed to play a role in lipid binding, but remain poorly characterized on the functional and structural level. This manuscript tries to address both these questions and is organized in major parts. In the first part the authors characterize a putative role of MORN repeat proteins in lipid binding and membrane association. In the second part, the authors use X-ray crystallography to establish the structure of MORN repeat proteins and to investigate the dimerization.

      As a cleverly chosen point of departure, they focus their study particularly on MORN1 from Trypanosoma brucei (TbMORN1), which is composed solely on MORN repeats. The structures of MORN repeats (from several species) in part two provide interesting insights into their mode of homotypic interactions and their role as dimerization or oligomerization devices. The lipid binding and membrane association of MORN proteins in the first part remains somewhat confusing and unclear, despite the use of a whole battery of techniques. It is questionably, why the authors invest so many figures and words to inform the reader on negative results. The authors suggest that MORN proteins can bind to lipids via their hydrophobic acyl chains- which is 'very hard to imagine under physiological conditions unless TbMORN1 is a lipid carrier and not a membrane-binding proteins.' Unfortunately, a role as lipid carrier has not been rigorously tested. In this sense the first part remains somewhat immature and incoherent. Furthermore, they suggest based on the lack-of-evidence that MORN proteins do not bind membranes in vivo and in vitro.

      The main issue of this manuscript is, in my view, the way the data were presented.The manuscript is generally well-written, but much too long. The structural work is important and concise. The first part, however, reports in five separate figures on a lack of membrane binding by a MORN protein and its ability to bind individual lipids. The physiologically relevance of this lipid binding is questionable as acknowledged by the authors. Even though I find it important that the membrane/lipid binding ability of MORN proteins is rigorously tested, I would highly recommend to separate the current manuscript in two independent stories. Alternatively, I would recommend to reduce the first part into a single figure and to remove the most artifactual assays. In the current form, the first part and the second part of the manuscript remain somewhat detached from each other. The characterization of the lipid binding/membrane binding properties has a number of substantial weaknesses (e.g. use of quite different, non-physiological buffers for membrane binding assays; use of deletion mutants for the binding assays, which do not show the full potential of oligomerization). This which makes it hard to read and confuses the reader. Even though I have no reason to doubt the conclusions by the authors, I do not think that all necessary caution has been invested to rule out other possibilities.

      In summary, even though the technical quality of the individual performed assays is high, there are some conceptual issues that make it hard to make a strong case based on a collection of individual, clear datasets. Even though I find the structures of the MORN proteins important, timely, and interesting, I would not recommend this study for publication in its current form. The manuscript would be more fun to read if both of the parts would be shortened substantially and more focused. While I agree that most evidence provided on lipid/membrane binding of TbMORN1 argue against a direct role of MORN proteins in membrane binding, I feel that the experimental approach is not coherent enough. See a few major points of criticism below.

      Major Points:

      1) The authors decide to characterize the membrane binding of a MORN repeat protein using a deletion variant that lacks the N-terminal repeat. However, in Figure 1B they show that the N-terminal repeat is important for the formation of higher-order oligomers. While I fully understand that the presence of the most N-terminal repeat does hamper the structural work, I find it problematic to remove it for the lipid/membrane-binding assays. The formation of higher oligomeric species beyond the dimer, may be important for membrane binding/recruitment (avidity effects).

      2) (Related to point 1) I do not understand the choice of the buffers used for some of the assays. The use of pH 8.5 and NaCl concentrations of 200 mM are non-physiological. For CD spectroscopy, a high ionic strength was obtained by the use of 200 mM NaF. If a high ionic strength is required to prevent the formation of higher oligomers of MORN, it raises the question if the formation of higher oligomers (under physiological conditions) may also contribute to their function. It is unclear, in which buffer the fluorescence anisotropy measurements were performed. The sucrose-loaded vesicles were hydrated in a 20 mM HEPES pH 7.4, 0.3 M Sucrose. The composition of the buffer after the addition of MORN proteins is not clear. Despite the use of an impressive array of techniques, this first part of the manuscript remains somewhat immature and incoherent. Due to the use of constructs that have not the full ability to oligomerize (point 1) and due to the inconsistent use of experimental conditions, it is hard to draw firm conclusions from this first part. For example: In Figure 2E TbMORN(2-15) does show some concentration-dependent binding, which -however- is interpreted as background binding. What are the results using this assay (or better: a liposome floatation assay) when using full-length TbMORN(1-15) in a more physiological buffer? The statement that MORN proteins bind to lipids, but not to liposomes/membranes is -in my view- not sufficiently addressed to make a strong case.

      3) The physiological relevance of lipid binding to MORN proteins remains obscure (as also acknowledged by the authors). Does the binding of PE lipids to the MORN protein have a physiological role? Does the binding of fluorescent PI(4,5)P2 point to a physiological role of MORN proteins?

      4) In light of recent data from the Chris Stefan lab (PMID: 31402097) a co-incidence detection of PI(4,5)P2, PS, and cholesterol seems possible. Can the authors address this possibility? Furthermore, the role of Ca2+ signaling / Ca2+ ions has not been addressed. In light of the important role of Ca2+ for the recognition of PI(4,5)P2 (PMID: 28177616), this point should be addressed.

      5) For characterizing the binding of lipids to MORN proteins, the authors use non-physiological fluorescent and short-chain lipid analogues at concentrations, which are unlikely to occur for endogenous PIPs in the cytosol of cells. Why choosing such an artificial system? Why introducing this system at length, if other -less artifact-prone- assays are available? I would recommend to not feature this assay as prominently as it was in the current study.

      6) How would PE find its way to the lipid binding region in MORN? Would it diffuse to the MORN protein via the aqueous phase or would the MORN protein pickup PE form membranes up collision? The authors should address this point, by separating the lipid-depleted MORN protein from donor-vesicles containing PE by a dialysis membrane. If PE would not find its way to the lipid binding site of MORN, this would imply that MORN protein can extract lipids only upon colliding with the membrane. What is the stoichiometry of PE to MORN?

      Despite my critique raised above, I agree with the authors that the data point in the direction that MORN proteins (or at least TbMORN1) does not directly bind to membranes. Their data, however, would still be consistent with a role as lipid transfer protein and a recruitment of MORN proteins to the membrane by other proteins. Have the authors performed any additional experiments in this direction? Also, the potential role of palmitoylation is only mentioned in the discussion (page 22), while palmitoylation would provide a simple means for membrane recruitment.

      Minor Points:

      Figure 1B: The authors should provide information on the void volume of the column.

      Page 17, line 696-701: The authors point out that the C2 crystal form is stabilized by two disulfide bridges. The authors should comment on the physiological relevance of these disulfide bridges.

      Page 18, line 734-740: The authors should provide data on the potential role of Zn2+ on MORN function in a physiological context. The section describing that the dimer is stabilized by Zn2+ ions (pages 18 and 19) lacks a discussion if Zn2+ are functionally relevant. There is only a beautiful sequence analysis and a discussion of the conservation of the Zn2+ coordinating residues. Can the authors perform Zn2+ titrations and SEC-MALS experiments (or alternatives such as SAXS) to show that Zn2+ indeed affects the oligomeric state of only the PfMORN, but not the other MORN proteins that form alternative dimers?

      Significance

      A putative role of MORN proteins in membrane and lipid binding is addressed. The view the MORN proteins bind directly to membranes is challenged. Structures of dimeric MORN proteins provide important insight into the modes of dimerization.

      There is a recent structure of MORN proteins (which is referenced by the authors), but I feel that additional structural work is important and justified. The work on membrane vs. lipid binding is important, but not sufficiently addressed in the current manuscript.

      I would recommend to separate the study in two parts. The audience is likely to confused (or bored) by the lengthy discussion on whether or not MORN proteins bind lipids and or membrane or not.

      I am not an expert in structural biology, but have a fair understanding of structural biology. I have worked on lipid binding proteins and have a very good understanding of lipid/membrane-binding assays.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      R-We would like to thank the reviewers for their constructive feedback. We respond to all the reviewers points below. We highlighted major changes introduced to the manuscript in response to both reviewers’ comments in the attached revised version of the manuscript.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The work described in this manuscript by Tan and Marques aims to address if the splicing of enhancer-associated long noncoding RNAs (elncRNAs) has a direct impact on enhancer activity or just reflects their cognate's enhancer's high activity.

      For this purpose, the authors started by integrating RNA-seq data for human lymphoblastoid cell lines with ENCODE enhancer annotations and ChIP-seq data for enhancer function-associated chromatin modifications to show that multi-exonic elncRNAs are more transcriptionally active than single-exonic elncRNAs and eRNAs. They then show that regions flanking elncRNA splice sites are enriched in splicing-associated sequence elements and that these are under stronger purifying selection (suggesting some functional relevance), both when compared to promoter-associated lncRNAs. They also show the concomitance of cis-disrupted splicing in elncRNAs and drops in expression in their target genes. Finally, they use causal inference analysis of joint seQTLs and joint scQTLs to investigate the causal relationship between splicing of elncRNAs and expression of putative gene targets and chromatin states at elncRNA cognate enhancers, respectively. They conclude that, in both cases, most associations are causally mediated by splicing of elncRNAs and therefore that this contributes to their enhancer activity.

      This manuscript is generally well written, targets an original question and potentially sets the seeds for a new exciting line of research on transcriptional regulation, by providing some evidence for the functional relevance of the splicing of elncRNAs. However, the overlooking of some important aspects of the regulation of RNA splicing led to biases in the design of data analyses and in the interpretation of the biological implications of some results that need to be dealt with before the described work can be considered sound enough for publication.

      R:We would like to thank the reviewer for taking the time to assess our manuscript and for the constructive comments.

      **Essential revisions:**

      1.Results, page 10, lines 21-24: The statement that "the impact of SS variants on gene splicing efficiency depends on the total number of alternative transcripts and exons" is not properly substantiated. The four examples given in Figure S3 do not illustrate any dependence or trend. If such dependence is "expected" the underlying concept must be explained, i.e. why the impact of SS variants on splicing efficiency should depend on the number of alternative transcripts and exons. The reader is unrealistically expected to be used to the chosen splicing efficiency metric to intuit its dependence on the number of exons. Moreover, it is not obvious where the dependence on the number of alternative transcripts comes from, particularly given that alternative splicing (e.g. the skipping of a neighbouring exon, if internal) is not profiled.

      R-In the previous version of the manuscript, we estimated gene level changes in splicing, which includes all alternative splicing events within a gene. Therefore, the more exons an elncRNA has, the more “diluted” we expect the overall impact of SS variant on elncRNA splicing efficiency to be. After considering the reviewer’s comment, we realized that only splicing events directly impacted by the SS variant should be considered in this analysis.

      In the revised version of the manuscript, we considered only alternative splicing events that include the splice donor acceptor site changed by the SS variant, and are therefore a direct consequence, of the SS variant. As suggested by the reviewer, for these splicing events, we report the fold difference in Percentage-Spliced-In (PSI) (estimated by Leafcutter (Li et al. 2018)) between samples that carry reference and alternative alleles at these SS variants. To further illustrate these changes, we now include a diagram, for each SS variant, with differential splicing information and the fold difference in PSI for each affected splicing events (Figure 3B,C, Supplementary Figure S3). In addition, the overall change across all affected splicing events is also plotted in Figure 3D and Supplementary Figure S4.

      We have modified this section to account for this and the next comment from the reviewer.

      To estimate the impact of SS variant on splicing efficiency, we calculated the Percentage-Spliced-In (PSI) (Li et al. 2018) per individual and for each elncRNA splicing event involving the splice donor or acceptor site disrupted by the SS variants (Figure 3B,C, Supplementary Figure S3). PSI measures exon inclusion and considers spliced reads spanning exon junctions (Li et al. 2018). We compared the average difference in PSI, as a proxy for change in splicing efficiency, of all affected splicing events between individuals that carry the reference and alternative canonical splice donor/acceptor sites (GT-AG). Alongside decreased exon inclusion, SS variants can also promote exon skipping events (Figure 3B,C, Figure S3). Despite some increase in exon skipping, SS variants are associated with an overall decrease in splicing efficiency (Figure 3D and Supplementary Figure S4).” (Page 10).

      Along these same lines, and more importantly, why haven't the authors looked at the possibility that a variant disrupting a splice site would lead to skipping of the neighbouring exon (if internal)? Given how the spliceosome operates (in terms of intron and exon recognition), wouldn't this be the most likely scenario? When calculating the splicing index, are reads spanning junctions between non-consecutive exons considered? Otherwise, not profiling alternative isoforms generated by exon skipping will necessarily bias splicing efficiency quantifications by overlooking fully efficient splicing associated with such isoforms. Similarly, how did the authors make sure that splicing changes did not bias elncRNA expression estimates? How was the effective transcript length determined for the calculation of RPKMs? The authors need to make these methodological clarifications, as well as why exon skipping was not considered as a splicing disruption with potential functional implications. Calculating the percent spliced-in (PSI) for all internal exons would be much informative.

      R-Regarding the methodology, what we refer to as splicing efficiency is Percentage-Spliced-In (PSI). We calculated PSI for all, including alternative, splicing events. We now make this clearer throughout the manuscript and in the figure axis/legends.

      As detailed in the methods section, to minimize the impact of alternative splicing on gene expression estimates, we quantified expression at the gene, and not at the transcript, level using HTSeq across all annotated exons. This approach allows us to assess elncRNA and target gene expression while masking differences in alternative transcript abundance, which are not relevant in the context of this analysis.

      As suggested by the reviewer, instead of considering PSI of all possible splicing events of the gene, in the revised version of the manuscript, we considered only splicing events that are directly impacted by the SS variant. This change does not impact our conclusions, but certainly provides a better understanding of how SS variants impact splicing and we would like to thank the reviewer for raising this point. As predicted by the reviewer and as expected given how the spliceosome operates, exon skipping is a frequent outcome of SS variants. However, the increase in exon skipping is not sufficient to compensate for the decrease in the inclusion of these exons, which is directly impacted by the SS variants. This is demonstrated by the lower overall splicing efficiency for each elncRNAs in individuals that carry SS variants that disrupt canonical splice/donor acceptor sites (Figure 3D and Supplementary Figure S4).

      3.All results in panels 3B-F are presented as fold differences. It is actually not clear what those differences refer to. For instance, the grey boxes are the distributions of the fold differences in splicing index / expression between individuals carrying reference alleles and what?

      R-The boxplots represent the distribution of the fold difference in PSI or expression for each individual relative to the median PSI or expression in individuals with the reference genotype. As expected, the distribution of log fold difference in either PSI or expression for individuals carrying the reference allele is centered at 0.

      We have clarified this in the methods section and figure legends.

      4.It is expectable that most joint seQTLs result from variants directly impacting splicing in cis. As the quantification of splicing is noisier than that of expression, a stronger effect is required for the detection of an sQTL than an eQTL. In other words, joint seQTLs are essentially sQTLs. This illustrated by the example in Figure 4A, with the SNP in an intronic region of the elncRNA being associated with strong differences in splicing and tiny (R-We agree with the reviewer that the quantification of splicing is noisier than that of expression. However, and in contrast with the reviewer’s hypothesis, higher “noise” in splicing quantification compared to expression led to weaker associations between splicing and seQTLs, as illustrated in the figure below. This is in line with splicing being measured with higher error rate, which would ultimately lead to smaller detectable sQTL effect than what they would be with perfect measurements. This also demonstrates that since a priori, eQTLs association are stronger, if a bias exists in the causal inference analysis, it should favour detection of non-causal associations. Therefore, our approach is not biased in detecting causal seQTLs.

      We agree with the reviewer that this potential bias may be a concern to readers and should be addressed. We have added this analysis to the text (Supplementary Figure S7E) and explained why the causal inference testing approach is not biased in detecting causal seQTLs.

      “To assess whether this approach was biased towards the detection of causal seQTLs we compared the slope and adjusted p-value of the associations between the variant and splicing or expression for all causal seQTLs. As illustrated in Supplementary Figure S7E this analysis revealed there is no evidence that stronger sQTLs would favour causal model predictions.” (Page 16).

      5.It is not totally clear what message the authors intend to convey with the result of panel 4D. Are they talking about the relative position of the variant to the elncRNA transcript or the target transcript? If the former, shouldn't the known synergy between transcription and 5´ end splicing reflect on elncRNA expression? If the latter, it is not obvious how the result connects to the mentioned synergy.

      R:In Figure 4D, we show the relative position, within a transcript, of the exonic splicing junction which is associated with causal seQTLs. The enrichment in associations with splicing junctions located at 5’ end of elncRNAs is consistent with the synergy between 5’ end splicing and transcription. We clarify this in the text:

      Importantly, 90% of seQTL associations that support elncRNA splicing as a mediator of target expression are associated with splicing junctions located at the 5´ end of the transcript, which is consistent with the known synergy between transcription and 5´ end splicing (Furger et al. 2002; Damgaard et al. 2008)(Figure 4D).” (Page 17).

      **Proofreading edits:**

      R:We would like to thank the reviewer for identifying all the typos listed below. We have corrected them in the revised version of the manuscript.

      6.Introduction, page 3, line 13: double "in".

      7.Figure S2A, leftmost panel X-axis label: "intrno" instead of "intron".

      8.Results, page 10, line 30: remove "of".

      9.It is 5´ and 3´(prime) not 5' and 3' (apostrophe).

      **Other suggestions:**

      10.Violin plots (with included boxplots) would more comprehensively convey the differences in distributions than the chosen notched boxplots.

      R-We thank the reviewer for this suggestion. Although we appreciate the added information a violin plot can provide, this also renders, in our opinion, their interpretation less intuitive. Because boxplots are simpler and easier to interpret, after consideration, we decided to continue using these to represent the distribution of the data.

      Reviewer #1 (Significance (Required)):

      It is hard for me to assess the significance of this work (beyond some evidence for the potential functional relevance of the splicing of elncRNAs) until the aforementioned concerns are addressed but it is of potential interest to the broad RNA research community.

      I am a computational biologist with experience in the analysis of high-throughput transcriptomic data and a focus on transcriptional and alternative splicing regulation.

      ========================================================================

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      This manuscript addresses an interesting question - whether the splicing of transcriptional enhancer-associated RNAs influences their transcriptional enhancement activity. The analyses appear carefully done, using appropriate datasets and statistical methods. The authors find, for example, that marks of active chromatin are enriched near spliced elncRNAs, that splicing-related motifs of elncRNAs are under selective constraint, and that splicing of elncRNAs is associated with higher elncRNA expression and to very slightly higher expression of target genes.

      R:We thank the reviewer for the constructive feedback on our manuscript. We have extended our analysis to address the reviewers concerns that we detail in our response to the comments below.

      However, I did not find the main results convincing of the main conclusion for the following reasons:

      1.The most direct evidence is shown in Figure 3, where SNPs that occur in 3' splice sites of elncRNA introns are explored, and it is shown that variants predicted to disrupt splicing of elncRNA introns are associated with reduced expression of target but not non-target genes. But the fold difference in expression of target genes is extremely small - a few percent - and is actually less than the fold difference in expression of the elncRNAs themselves (which appears closer to 10%), raising the question of whether elncRNA expression rather than splicing may be more important for activity. Furthermore, the entire analysis has an anecdotal quality, being based on only 4 splice-disrupting elncRNA variants. I did not find the figure at all convincing of the conclusion the authors draw from it.

      R:We agree with the reviewer that our analysis is limited by the available genotyping data that is restricted to common genetic variants. Our evolutionary constraint analysis (Figure 2) indicates that variants that disrupt elncRNA splicing are depleted by natural selection and so we expected to identify a relatively small number of elncRNAs (n=4) suitable for this analysis. Despite the anticipated challenges in identifying elncRNA splice site mutations, we nevertheless believe this unbiased natural mutational analysis is analogous to experimentally disrupting splice sites of these 4 elncRNA candidates.

      Regarding the strength of impact of splicing on target expression: in the absence of a comparable experiment, we could not anticipate the magnitude of the effect. We acknowledge that previous studies, which sought to completely remove splicing by either deleting all elncRNA introns (Yin et al. 2015) or terminating transcription after its 1st exon (Engreitz et al. 2016), were both associated with significantly stronger impact on elincRNA splicing and target expression than what we report here. The analysis we present here involves single nucleotide polymorphisms and so it is not surprising to have resulted in more moderate impact on overall splicing. Furthermore, whether the differences in the impact on target expression between this and previous analysis is the result of stronger effect of complete removal of splicing or a consequence of the genetic changes introduced remains unclear. The small yet consistent decrease in target expression we observed, even with minimal changes in splicing of an unbiased set of 4 candidates, is in our opinion strong evidence that modulation in elncRNA splicing is sufficient to impact, albeit moderately, target expression.

      Importantly, we replicated the impact of decreased splicing on target expression of the 4 elncRNA candidates using 89 samples of Yoruba (YRI) population from the Geuvadis dataset (Supplementary Figure S5). The robustness of the mutational study consistently supports the physiologically relevant effect of elncRNA splicing on cognate enhancer function.

      As pointed out by the reviewer, elncRNA SS variants led to stronger impact on the expression of the elncRNAs compared to that of their targets (Figure 3F,H and Supplementary Figure S4), which suggests that target expression regulation is likely a consequence of changes in elncRNA expression as a result of changes in its splicing. This is described as our working model in the discussion section of the manuscript.

      2.Figure 4 uses a causal inference approach and involves larger datasets. While causal inference can be a useful tool to identify candidate causal relationships, it does not prove causality, which still requires some sort of experimental perturbation. Thus, I found these results suggestive but still not satisfying to justify, e.g., the title of the paper or claims made in the abstract. As in Figure 3, the specific example shown in Fig. 4A again shows a relatively tiny effect on target gene expression, which again appears to be a few percent at most.

      R:For the reasons explained above, we had no expectation that the effect size of the association between elncRNA splicing and target expression would be high. It is nevertheless key that these associations are robust, which would provide reliable support for our hypothesis. To assess this, we used 2 independent datasets to replicate elncRNA target associations with sQTL variants associated with elncRNA splicing: 1) 147 LCL samples from GTEx and 2) 31,684 blood samples from eQTLgen. Using these datasets, we replicated the association between sQTL and target expression for targets of up to 77% of elncRNAs. As expected, replicated associations have significantly higher effect size in both datasets (Supplementary Figure S9) and 1.2 times more associations can be replicated in the eQTLgen blood samples with a larger cohort size. LCL-specific effect of elncRNA splicing likely explains why not all associations are replicated in these blood samples. We report these analyses in the manuscript (Supplementary Figure S9).

      We agree with the reviewer that the causal inference analysis is only suggestive per se. However, we would argue that conclusions of the present manuscript do not rely on this analysis alone, but instead on the combined evidence of several experiments, including the natural mutational analysis that is analogous to the experiment the reviewer proposes.

      Considering the reviewers concern, we realized that previous version of Figure 4A did not reflect the average strength of the association between seQTL variant and target expression (median=0.319, ranging from 0.16 to 0.81, Rebuttal Figure 1). For this reason, we replaced the previous illustration by a more representative example (Figure 4A).

      The text illustrating reproducibility of our results in GTEx and eQTLgen have been added to Page 17 of the manuscript.

      We used two independent datasets to assess the robustness of elncRNA target association with sQTL variants we predict to be associated with the splicing of these elncRNAs in LCLs. Using a smaller cohort of LCLs (n=147 (GTEx Consortium 2013)), we found a significant association in the same direction between sQTL and target expression for targets of 70% of elncRNAs (45% of variants). A larger fraction of associations (77% of elncRNAs and 52% of variants) could be replicated in a larger cohort of blood samples (n=31,684 (Võsa et al. 2018)). The difference in size between these two cohorts is likely to explain the difference in replication rate. The association between elncRNA splicing variants and target expression that were replicated have significantly higher effect size relative to non-replicated associations (Supplementary Figure S9). Furthermore, LCL-specific effect also likely explains why not all associations can be replicated in the large blood cohort.” (Page 17).

      3.Figure 2 shows that splicing-related signals are under selective constraint in spliced elncrRNAs, which is convincing. But this does not prove that splicing of elncRNAs is directly related to enhancer activity. It is equally plausible that elncRNA expression directly impacts enhancer activity and that elncRNA splicing is conserved because it boost elncRNA expression, for example.

      R:The reviewer is right and the sentence “If splicing of elncRNAs is important for enhancer function, …” does not faithfully describe the conclusions that can be drawn from the analysis reported in Figure 2. This portion of the text now reads: “If splicing of elncRNAs is functionally relevant, one would expect selection to have prevented the accumulation of deleterious mutations in their splicing-associated motifs during evolution” (Page 8). We would like to thank the reviewer for pointing this out.

      Other points:

      4.Are the ChiP profiles in Figures 1A-E significantly different from each other in a statistical sense? Probably yes, but a specific test should be done.

      R:We now added boxplots representing the distribution of read density centered at transcript promoters. Statistical difference in the distribution is also tested. We show this in the revised Figure 1A-E and Supplementary Figure S1B-C.

      5.This sentence (p. 10) was hard to follow and should be clarified: "As expected, the impact of SS variants on gene splicing efficiency depends on the total number of alternative transcripts and exons and ranges from 11% to 24% for elncRNA with 6 to 2 number of exons, respectively (Supplementary Figure S3)."

      R:We had previously estimated the average amount of change in splicing for all alternative splicing events at each elncRNA candidate. To calculate this, we considered the difference in Percentage-Spliced-In (PSI) for all splicing events and divided this by the total number of considered events. Given that only a subset of events is affected by a splice site variant, the more exons an elncRNA has, the more alternative splicing events are likely to occur and the lower the average impact of a SS variant on overall gene splicing efficiency is expected to be. Following a comment from reviewer 1 (comment 1), we now only consider splicing events directly disrupted by the SS variant. We agree this sentence was not clear and have removed it from the manuscript.

      6.Related to point 5 above, Supplementary Figure 3 is somewhat confusing because two splicing change and three expression change plots are shown for each locus, without labels of what each one is, or explanation of what the red and green colors mean.

      R:We apologize for the confusion and thank the reviewer for pointing this out. In the figure, we plot the fold difference in elncRNA splicing, target gene splicing, target gene expression, non-target gene expression, and elncRNA expression. elncRNA features are plotted in red and target gene features are plotted in green. We have added labels to clarify the relevant plots (Figure 3D-H, Supplementary Figure S4,5).

      **Minor points:**

      1.Top of p. 16: "90% of those that support elncRNA splicing as a mediator 3 of target expression are located at the 5' end of the transcript, which is 4 consistent with the known synergy between transcription and 5' end splicing" - a reference is needed

      R:We thank the reviewer for pointing this out and we have now added the appropriate reference.

      Importantly, 90% of seQTL associations that support elncRNA splicing as a mediator of target expression are associated with splicing junctions located at the 5´ end of the transcript, which is consistent with the known synergy between transcription and 5´ end splicing (Furger et al. 2002; Damgaard et al. 2008)(Figure 4D).” (Page 17)

      2.Figure 5B,C y-axes indicate "fold difference", but scales include negative numbers, which is confusing. Probably should redo the analysis showing log of fold difference.

      R:We thank the reviewer for the suggestion. Since the fold difference in Percentage-Spliced-In (PSI) used to estimate the amount of splicing at each exon junction can be of both positive and negative values, we now plot the log modulus transformation (John and Draper, 1980) of the data, which is equivalent to a log transformation while preserving the sign of the data. The analysis has been redone for Figure 3D-H, 5B,C, and Supplementary Figure S4, S5. This change does not impact the conclusions and makes the interpretation of the results more intuitive.

      3.p. 20 Describes U1 snRNP as "a protein essential for the 4 recognition of nascent RNA 5' splice site and assembly of the spliceosome". U1 is a large RNA-protein complex, not a protein.

      R:We thank the reviewer for pointing this out and this has now been corrected.

      Chromatin-bound lncRNAs have been recently shown to be enriched in U1 small nuclear ribonucleoprotein (snRNP) RNA-protein complex, a protein essential for the recognition of nascent RNA 5´ splice site and assembly of the spliceosome (Yin et al. 2020).” (Page 22)

      4.Typo: p. 11, l. 2 missing word (genes): "expression levels of other nearby was unaffected"

      R:This has been corrected.

      Reviewer #2 (Significance (Required)):

      The question addressed is very interesting, given recent work the significance of transcription from enhancers, and work addressing functional relationships between splicing and expression. The work is suggestive of effects of enhancer splicing on expression but I did not find it fully convincing as the effects observed are extremely small, and other explanations are not ruled out, as discussed above.

      Prior literature has shown that many active enhancers are transcribed, that enhancer transcription can preced and is positively correlated with target gene expression, and work from both Ulitsky and from the authors indicates that splicing of enhancer-associated lncRNAs is positively correlated with enhancer activity. A variety of studies have also shown that splicing of protein-coding genes generally has a strong positive effect on gene expression. Here, the authors attempt to go further and show that splicing of enhancers causes increased transcriptional enhancement of target genes. A variety of public genotype, expression, chromatin and other types of data are analyzed to address this question. The statistical genetics crowd may find the work of interest, but molecular biologists will not be convinced of the conclusions. My expertise is in computational biology, genomics and RNA biology.

      Engreitz JM, Haines JE, Perez EM, Munson G, Chen J, Kane M, McDonel PE, Guttman M, Lander ES. 2016. Local regulation of gene expression by lncRNA promoters, transcription and splicing. Nature 539: 452-455.

      John, J., & Draper, N. (1980). An Alternative Family of Transformations. Journal of the Royal Statistical Society. Series C (Applied Statistics), 29(2), 190-197. doi:10.2307/2986305

      Li YI, Knowles DA, Humphrey J, Barbeira AN, Dickinson SP, Im HK, Pritchard JK. 2018. Annotation-free quantification of RNA splicing using LeafCutter. Nature genetics 50: 151-158.

      Yin Y, Yan P, Lu J, Song G, Zhu Y, Li Z, Zhao Y, Shen B, Huang X, Zhu H et al. 2015. Opposing Roles for the lncRNA Haunt and Its Genomic Locus in Regulating HOXA Gene Activation during Embryonic Stem Cell Differentiation. Cell stem cell 16: 504-516.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript addresses an interesting question - whether the splicing of transcriptional enhancer-associated RNAs influences their transcriptional enhancement activity. The analyses appear carefully done, using appropriate datasets and statistical methods. The authors find, for example, that marks of active chromatin are enriched near spliced elncRNAs, that splicing-related motifs of elncRNAs are under selective constraint, and that splicing of elncRNAs is associated with higher elncRNA expression and to very slightly higher expression of target genes.

      However, I did not find the main results convincing of the main conclusion for the following reasons:

      1.The most direct evidence is shown in Figure 3, where SNPs that occur in 3' splice sites of elncRNA introns are explored, and it is shown that variants predicted to disrupt splicing of elncRNA introns are associated with reduced expression of target but not non-target genes. But the fold difference in expression of target genes is extremely small - a few percent - and is actually less than the fold difference in expression of the elncRNAs themselves (which appears closer to 10%), raising the question of whether elncRNA expression rather than splicing may be more important for activity. Furthermore, the entire analysis has an anecdotal quality, being based on only 4 splice-disrupting elncRNA variants. I did not find the figure at all convincing of the conclusion the authors draw from it.

      2.Figure 4 uses a causal inference approach and involves larger datasets. While causal inference can be a useful tool to identify candidate causal relationships, it does not prove causality, which still requires some sort of experimental perturbation. Thus, I found these results suggestive but still not satisfying to justify, e.g., the title of the paper or claims made in the abstract. As in Figure 3, the specific example shown in Fig. 4A again shows a relatively tiny effect on target gene expression, which again appears to be a few percent at most.

      3.Figure 2 shows that splicing-related signals are under selective constraint in spliced elncrRNAs, which is convincing. But this does not prove that splicing of elncRNAs is directly related to enhancer activity. It is equally plausible that elncRNA expression directly impacts enhancer activity and that elncRNA splicing is conserved because it boost elncRNA expression, for example.

      Other points:

      4.Are the ChiP profiles in Figures 1A-E significantly different from each other in a statistical sense? Probably yes, but a specific test should be done.

      5.This sentence (p. 10) was hard to follow and should be clarified: "As expected, the impact of SS variants on gene splicing efficiency depends on the total number of alternative transcripts and exons and ranges from 11% to 24% for elncRNA with 6 to 2 number of exons, respectively (Supplementary Figure S3)."

      6.Related to point 5 above, Supplementary Figure 3 is somewhat confusing because two splicing change and three expression change plots are shown for each locus, without labels of what each one is, or explanation of what the red and green colors mean.

      Minor points:

      1.Top of p. 16: "90% of those that support elncRNA splicing as a mediator 3 of target expression are located at the 5' end of the transcript, which is 4 consistent with the known synergy between transcription and 5' end splicing" - a reference is needed

      2.Figure 5B,C y-axes indicate "fold difference", but scales include negative numbers, which is confusing. Probably should redo the analysis showing log of fold difference.

      3.p. 20 Describes U1 snRNP as "a protein essential for the 4 recognition of nascent RNA 5' splice site and assembly of the spliceosome". U1 is a large RNA-protein complex, not a protein.

      4.Typo: p. 11, l. 2 missing word (genes): "expression levels of other nearby was unaffected"

      Significance

      The question addressed is very interesting, given recent work the significance of transcription from enhancers, and work addressing functional relationships between splicing and expression. The work is suggestive of effects of enhancer splicing on expression but I did not find it fully convincing as the effects observed are extremely small, and other explanations are not ruled out, as discussed above.

      Prior literature has shown that many active enhancers are transcribed, that enhancer transcription can preced and is positively correlated with target gene expression, and work from both Ulitsky and from the authors indicates that splicing of enhancer-associated lncRNAs is positively correlated with enhancer activity. A variety of studies have also shown that splicing of protein-coding genes generally has a strong positive effect on gene expression. Here, the authors attempt to go further and show that splicing of enhancers causes increased transcriptional enhancement of target genes. A variety of public genotype, expression, chromatin and other types of data are analyzed to address this question. The statistical genetics crowd may find the work of interest, but molecular biologists will not be convinced of the conclusions. My expertise is in computational biology, genomics and RNA biology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The work described in this manuscript by Tan and Marques aims to address if the splicing of enhancer-associated long noncoding RNAs (elncRNAs) has a direct impact on enhancer activity or just reflects their cognate's enhancer's high activity.

      For this purpose, the authors started by integrating RNA-seq data for human lymphoblastoid cell lines with ENCODE enhancer annotations and ChIP-seq data for enhancer function-associated chromatin modifications to show that multi-exonic elncRNAs are more transcriptionally active than single-exonic elncRNAs and eRNAs. They then show that regions flanking elncRNA splice sites are enriched in splicing-associated sequence elements and that these are under stronger purifying selection (suggesting some functional relevance), both when compared to promoter-associated lncRNAs. They also show the concomitance of cis-disrupted splicing in elncRNAs and drops in expression in their target genes. Finally, they use causal inference analysis of joint seQTLs and joint scQTLs to investigate the causal relationship between splicing of elncRNAs and expression of putative gene targets and chromatin states at elncRNA cognate enhancers, respectively. They conclude that, in both cases, most associations are causally mediated by splicing of elncRNAs and therefore that this contributes to their enhancer activity.

      This manuscript is generally well written, targets an original question and potentially sets the seeds for a new exciting line of research on transcriptional regulation, by providing some evidence for the functional relevance of the splicing of elncRNAs. However, the overlooking of some important aspects of the regulation of RNA splicing led to biases in the design of data analyses and in the interpretation of the biological implications of some results that need to be dealt with before the described work can be considered sound enough for publication.

      Essential revisions:

      1.Results, page 10, lines 21-24: The statement that "the impact of SS variants on gene splicing efficiency depends on the total number of alternative transcripts and exons" is not properly substantiated. The four examples given in Figure S3 do not illustrate any dependence or trend. If such dependence is "expected" the underlying concept must be explained, i.e. why the impact of SS variants on splicing efficiency should depend on the number of alternative transcripts and exons. The reader is unrealistically expected to be used to the chosen splicing efficiency metric to intuit its dependence on the number of exons. Moreover, it is not obvious where the dependence on the number of alternative transcripts comes from, particularly given that alternative splicing (e.g. the skipping of a neighbouring exon, if internal) is not profiled.

      2.Along these same lines, and more importantly, why haven't the authors looked at the possibility that a variant disrupting a splice site would lead to skipping of the neighbouring exon (if internal)? Given how the spliceosome operates (in terms of intron and exon recognition), wouldn't this be the most likely scenario? When calculating the splicing index, are reads spanning junctions between non-consecutive exons considered? Otherwise, not profiling alternative isoforms generated by exon skipping will necessarily bias splicing efficiency quantifications by overlooking fully efficient splicing associated with such isoforms. Similarly, how did the authors make sure that splicing changes did not bias elncRNA expression estimates? How was the effective transcript length determined for the calculation of RPKMs? The authors need to make these methodological clarifications, as well as why exon skipping was not considered as a splicing disruption with potential functional implications. Calculating the percent spliced-in (PSI) for all internal exons would be much informative.

      3.All results in panels 3B-F are presented as fold differences. It is actually not clear what those differences refer to. For instance, the grey boxes are the distributions of the fold differences in splicing index / expression between individuals carrying reference alleles and what?

      4.It is expectable that most joint seQTLs result from variants directly impacting splicing in cis. As the quantification of splicing is noisier than that of expression, a stronger effect is required for the detection of an sQTL than an eQTL. In other words, joint seQTLs are essentially sQTLs. This illustrated by the example in Figure 4A, with the SNP in an intronic region of the elncRNA being associated with strong differences in splicing and tiny (<1%) and barely significant differences in expression. Moreover, current knowledge and reported evidence strongly suggests that cis regulation of splicing is essentially "local", i.e. directly involves the processed sequences and not the interference of neighbouring RNAs. Similarly, to my knowledge there is no evidence suggesting a trend for genes encoding splicing factors being associated to the same eQTL variants as those of their target RNAs. I would therefore predict that most joint seQTLs result from variants within the elncRNA loci directly impacting their splicing. If this is the case, causal inference analysis will naturally be biased towards more strongly linking the variants with elncRNA splicing and thereby suggesting its causal role. The same rationale applies to scQTLs. The authors need to control for that potential bias in their analyses or explain why there is no bias.

      5.It is not totally clear what message the authors intend to convey with the result of panel 4D. Are they talking about the relative position of the variant to the elncRNA transcript or the target transcript? If the former, shouldn't the known synergy between transcription and 5´ end splicing reflect on elncRNA expression? If the latter, it is not obvious how the result connects to the mentioned synergy.

      Proofreading edits:

      6.Introduction, page 3, line 13: double "in".

      7.Figure S2A, leftmost panel X-axis label: "intrno" instead of "intron".

      8.Results, page 10, line 30: remove "of".

      9.It is 5´ and 3´(prime) not 5' and 3' (apostrophe).

      Other suggestions:

      10.Violin plots (with included boxplots) would more comprehensively convey the differences in distributions than the chosen notched boxplots.

      Significance

      It is hard for me to assess the significance of this work (beyond some evidence for the potential functional relevance of the splicing of elncRNAs) until the aforementioned concerns are addressed but it is of potential interest to the broad RNA research community.

      I am a computational biologist with experience in the analysis of high-throughput transcriptomic data and a focus on transcriptional and alternative splicing regulation.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to reviewer comment for manuscript RC-2020-00207

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      **Major Comments:**

      The authors of the paper start the paper with just one protein narrowed down ie. HRG. The rest of the paper uses affinity based proteomics, antibody validation, GWAS and survival analysis to validate this target and support their claim that HRG is an age associate protein linked to mortality and certain clinical outcomes. How did the authors conclude that HRG was the only target to explore further in this paper? What methods or analysis was done for this? What were the other proteins if any that showed up in these studies?

      We appreciate this comment which reveals unclear explanation how the protein was chosen for further analysis. The protein profile obtained using HPA045005 was the top and single hit out of 7258 protein profiles using a threshold of adjusted P-value below 0.01. In other words, only the profile of HRG was statistically significantly associated with age in the screening sample set (N = 156). The results of all protein profiles were attached as Supporting Table 1. Phrases about the alpha level were added to the text to make the threshold clear. Because antibody validation of these exploratory studies requires enormous efforts and time, we could not choose a more liberal and inclusive threshold.

      For mortality outcome, it is not clear which class of disease is most strongly associated with increased risk of mortality from elevated HRG levels. If cause-specific mortality exists among the cohorts, could authors provide a more exact breakdown of the type of associated mortality by a disease class?

      We thank the reviewer for the question and have now added cause-specific data in the manuscript. Using cause of death data, mortality risk by diseases in circulatory system were compared with the risk by neoplasm and others. ElevatedHPA045005-HRG profiles were found to associate with mortality risk by diseases of the circulatory system (HR = 1.46 per SD, P = 2.80 × 10‑4, ICD-10 code I00-I99). It was larger than the risk by malignant neoplasms (HR = 1.28 per SD, P = 1.73 × 10‑2, ICD-10 code C00-C97). We chose big categories as ICD-10 codes "I" and "C" because the number of events was too small to get enough power in the survival analysis.

      Page 4 Section 3 (Results)-

      The authors say "We found consistent age-associated trends with HPA045005 across all eight replication sets (Supporting Figure 3)". On examining the supporting figure we noticed that the slope for the set with the largest number of subjects (Set 3 with ~3000 people) is visually negligibly positive (showing weakest age associated trends with HPA045005). Some comments from the authors on why they think the largest data set showed the weakest association.

      The plot for each cohort (in Supporting Figure 3) had different ranges in the y-axes. To make those plots comparable, the ranges in the y-axes of the different panels in the figure were modified to be the same for all cohorts. In the new version of the plot, it is easier to notice that there in fact is an increasing trend of the profiles in set 3. As we briefly discussed in Discussion, weaker age-association of the sample set may be due to the set was near to a random sample of population in the age range. Set 1, however, had over-representation of older people by selecting equal number of people in every age-intervals.

      From Figure 2 C in the main manuscript one concludes that for HPA045005, binding for CC individuals is ~ 2 times higher than TT individuals. Is it possible the age association showing up for HPA045005 is primarily a function of changing/increase in allele frequency as a function of age?

      The authors could consider adding a clarifying plot of Age vs Allele frequency or adding an interaction term of Age and Allele Frequency in the regression and survival analysis to address this question.

      As suggested, we now added a test of age association, and average age was compared by genotype. The result was added in Supporting Table 3. The heterozygote (CT) group has slightly higher average age without statistical significance (ANOVA P = 0.096).

      It is interesting that the signals were significant with the HPA045005 antibody but not with the BSI037 antibody. This is in spite of the fact that the GWAS for BSI0137 signals had an even stronger hit to the same locus. Can the authors please comment on why the signals from HPA045005 and BSI0137 were not highly correlated with one another and why the better antibody could not replicate the survival analysis results?

      We thank the reviewer for the comments. We believe that our text about our findings were not clear enough, though it is a primary finding. We modified the main text to easily distinguish the HPA045005-derived profiles that were influenced by the 204th amino-acid of HRG protein, from the BSI0137-derived profiles influenced by the 493the amino-acid. The signals from those two antibodies were likely obtained by capturing different parts of HRG, which are schematically illustrated in Figure 2D. What we found is that only one binder's profiles, not the other's, had predictive power for mortality risk within about 8.5 years. That suggests some age-dependent changes around the 204th residue of HRG reflected biological aging rather than whole protein level. To make our finding clearer, the two binders were compared in Table 2.

      **Minor Comments:**

      Figure 1: The authors description of the figure could use more clarification. "For each sample set, the estimated effect from the linear regression model.." estimated effect of what on what? On reading the main text one concludes it is the effect of age on HPA045005. This needs to be clarified in the label.

      We agree with the reviewer and have added these words.

      Figure 3: The X axis for the Kaplan Meir survival curve is labelled as Age. Survival is usually time to event and time is usually the follow up time. Further clarification for the choice of this label might be helpful.

      We clarified the choice of the time scale in the figure legend with a reference, where it was further discussed (Thiébaut & Bénichou, 2004). We chose age as the time scale, seeing age is the strongest risk factor for all-cause mortality, as the suggestion in the reference. We attempted to use follow-up time as the time scale with age adjustment before, which gave us almost the same results but violated the proportionality assumption of COX models.

      Figure 3: it would be good to include a table with the number of individuals at risk at the bottom of the plot at defined time intervals. The figure currently compares the bottom and top quartiles of HRP for visual assessment of mortality risk, it would also be informative to include middle quantiles.

      The figure was updated accordingly. The risk table was included and the results of the middle group were presented.

      Supporting Table 5: The note at the bottom of this table states "standardized HRG values by linear regression and scaling." What does standardization by linear regression mean?

      A sentence that explains the standardization was added in the footnote of the table.

      Supporting Table 5: It would be useful to understand that HRG carries additional risk beyond known Age and known clinical biomarkers listed in Table 2 (APOA1, APOB, TC, TG, Glucose, LDL). Could authors include a multivariate CoxPH regression with just Age? and with Age + clinical covariates?

      The impact of those clinical variables on survival models was examined and the results were added to Supporting Table 6 (which was Table S5). It turned out that the addition of those variables barely changed the results of the model for the HRG profile affected by 202th amino-acid.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      **Summary**

      The manuscript by Hong et al. describes the identification and validation of histidine-rich glycoprotein (HRG) as a marker of chronological age and all-cause mortality. HRG was determined using proteomics of serum and plasma samples in 9 different cohorts (total sample size ~4,100). The association with mortality was tested in the largest available cohort (TwinGene), comprising ~3,000 samples. The association with mortality seems to be stronger in women in comparison to men and could not be explained by CRP or diabetes-related traits. The HRG levels determined using an alternative antibody, BSI0137, did not show any association with mortality, indicating that the effect on mortality is likely isoform-dependent. The performed analyses seem to be statistically solid. However, the association with mortality still needs to be replicated in independent studies and the HRG measurement does not yet seem to be ready for standardized high-throughput measurement, which is necessary to make it usable as biomarker.

      **Major comments**

      • Although the authors have convincingly identified HRG to be associated with chronological age and mortality, it will require quite some additional work (including replication of the observed association with mortality in independent cohorts, testing the predictive ability, and making the measurement standardized and high-throughput) to prove its use as potential biomarker. At the moment, this is not at all discussed in the manuscript. Moreover, there have been some recent large-scale studies that identified biomarkers at the metabolic level that are not at all mentioned by the authors. The authors only refer once to the recent proteomic study by Lehallier in the Introduction, but do not at all discuss their findings in relation to this paper. Last but not least, HRG has already been associated with mortality in a previous study (https://www.ncbi.nlm.nih.gov/pubmed/29303798), but there is no mention of this anywhere in the manuscript. Hence, I think it would be good if the authors perform a thorough literature search to place their findings into context and rewrite their Discussion accordingly.

      We appreciate the reviewer's comments on the limitation of our paper. We are aware of the requirement of further investigation on HPA045005-HRG profiles as a biomarker to confirm it with independent cohorts. Instead, we supported our findings with a set of confirmatory analyses; we validated and annotated age-associated profile applying GWAS, sandwich assays, peptide arrays and mass spectrometry. Comparing two antibody profiles, we narrowed down to age-associated region within the protein HRG. The approach and finding, we believe, is novel.

      We added some discussion about recent large-scale proteomic studies such as Tanaka et al, 2018 and Lehallier et al, 2019. Unexpectedly, HRG was found not measured in those studies despite of the protein is one of the abundant proteins in blood (Poon et al, 2011). It may reflect challenges in assay development and missing piece in those large studies. The papers lack further investigation for molecular targets, which is common in proteomic papers, and makes it difficult to compare between studies and technologies. In that sense, our approach is different from other proteomic studies, because we invested time and efforts to investigate the molecular target.

      We are though thankful for the introduction of the suggested HRG publication, which we did not know about. We concluded that there are substantial differences in the subjects and suggested functions for the protein. Kuroda et al. found HRG as a biomarker for sepsis of ICU patients, while our study was done on the general population. They were measuring HRG protein level, whereas we found one particular region in HRG as a biomarker for all-cause mortality. Hence, we briefly discussed the reference in the paragraph about general information about HRG.

      • The authors need to add a Supplementary Table showing the association of all their 7,258 HPA antibodies with chronological age. Although I trust the authors, I can currently not tell if it is indeed correct that only one antibody was significantly associated with age in set 1.

      We agree with the reviewer. The table of association test results of all 7258 antibody profiles was attached to the paper as Supporting Table 1. We were also surprised that only one passed a conventional P-value threshold 0.01 after Bonferroni correction. It might be due to the low number of samples in the sample set 1 (N=156), compared to the number of antibodies or tests.

      • According to description in the Supporting Information, several samples in set 3-5 were overlapping with set 1 (45 in total). These samples should be removed from datasets 3-5 to make sure that there are no overlapping samples in the meta-analysis. However, I am not sure if the authors have actually done this. For the GWAS the overlapping samples from set 3 could still be included, given that set 1 is not involved in that. The authors could actually use these 45 overlapping samples to provide additional details about the reproducibility of HPA045005 between different measurements, for example by showing a correlation plot.

      We agree with the reviewer. Those 45 overlapping samples were excluded in the meta-analysis. As the reviewer's comment, only the data of sample set 3 was used for the GWAS.

      We also appreciate the comment regarding reproducibility and acknowledge that there are limitations to the technical performance of our exploratory SBA method. The procedure is tailored to handle large number of antibodies and profile 384 sample in the analysis plates. This setup allowed us to process relatively large number of samples per batch but it might be affected by batch effects. In our study set 3, there were 2999 samples randomized and analyzed in 8 different 384-well plates. The 44 overlapping samples between sets 1 and 3 were added to one of these 8 plates. This resulted in 1-11 samples to be analyzed on the same plate, hence, comparing these 44 with previous assays might be influenced if not dominated by plate effects. We went back to the initial data set generated during 2011/2012 and compared the first data with replicated assays using the same freeze-thawed samples. For HPA045005 we found the data to correlate by r=0.45. The next analyses of these 44 samples were conducted during 2015 using different sample aliquots and preparations as well as different SBAs. The correlation to previous assays was r

      • When looking at the effect of the rs9898-stratified analysis (Table S2) it seems that there only is an effect in the presence of the C-allele. Have the authors considered the presence of a potential recessive effect of this variant when looking at mortality?

      Average age of the individuals of each genotype of the SNP was compared and added into Supporting Table 3 (which was Table S2). No significant difference between the genotypes was found. As the reviewer noted, the mortality association of the HRG profiles affected by 204th amino-acid in the TT genotype group of rs9898 was milder and did not reach statistical significance. We believe that it is due to substantially smaller sample size and number of deaths in the genetic group. To clarify the difference in numbers, those numbers were added into the Supporting Table 3 (which was Table S2).

      • The authors need to discuss in more detail the implications of the difference between the two HRG antibodies in their association with mortality, for example in light of the use of HRG levels as a potential biomarker (i.e. how should one deal with the fact the way the levels are measured influences the outcome).

      We appreciated this valuable comment, which clearly reveals that our claim was not explained sufficiently. We modified the main text to distinguish those two antibody profiles more clearly. We also added Figure 2D and changed the structure of Table 2 to highlight the difference between the two antibody profiles.

      • Why did the authors put part of their Discussion in the Supplement? This is not common practice. They should either move it to the manuscript or remove it completely.

      We moved the discussion in the supplement to main text as the reviewer's suggestion.

      Reviewer #2 (Significance (Required)):

      The manuscript is clearly written and the analyses seem to be solid. However, although the findings described in the manuscript are interesting for the ageing field, they only provide a small step in the process of the usability of HRG as biomarker, i.e. many validation and follow-up studies will be necessary to prove its value. There have been some recent biomarker studies that have been much more advanced in this respect, which limits the novelty of this manuscript. I therefore feel that this manuscript may be best suitable for a medium-impact ageing-specific journal. My fields of expertise are ageing, genetics, and molecular epidemiology. Given my limited expertise when it comes to proteomics, I was not able to provide detailed comments on the methodology concerning this part.

      We thank the reviewer for the honest and constructive assessment of our work and agree with the suggestion to transfer this work to a medium-impact journal covering aspects of ageing research.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      The manuscript by Hong et al. describes the identification and validation of histidine-rich glycoprotein (HRG) as a marker of chronological age and all-cause mortality. HRG was determined using proteomics of serum and plasma samples in 9 different cohorts (total sample size ~4,100). The association with mortality was tested in the largest available cohort (TwinGene), comprising ~3,000 samples. The association with mortality seems to be stronger in women in comparison to men and could not be explained by CRP or diabetes-related traits. The HRG levels determined using an alternative antibody, BSI0137, did not show any association with mortality, indicating that the effect on mortality is likely isoform-dependent. The performed analyses seem to be statistically solid. However, the association with mortality still needs to be replicated in independent studies and the HRG measurement does not yet seem to be ready for standardized high-throughput measurement, which is necessary to make it usable as biomarker.

      Major comments

      • Although the authors have convincingly identified HRG to be associated with chronological age and mortality, it will require quite some additional work (including replication of the observed association with mortality in independent cohorts, testing the predictive ability, and making the measurement standardized and high-throughput) to prove its use as potential biomarker. At the moment, this is not at all discussed in the manuscript. Moreover, there have been some recent large-scale studies that identified biomarkers at the metabolic level that are not at all mentioned by the authors. The authors only refer once to the recent proteomic study by Lehallier in the Introduction, but do not at all discuss their findings in relation to this paper. Last but not least, HRG has already been associated with mortality in a previous study (https://www.ncbi.nlm.nih.gov/pubmed/29303798), but there is no mention of this anywhere in the manuscript. Hence, I think it would be good if the authors perform a thorough literature search to place their findings into context and rewrite their Discussion accordingly.

      • The authors need to add a Supplementary Table showing the association of all their 7,258 HPA antibodies with chronological age. Although I trust the authors, I can currently not tell if it is indeed correct that only one antibody was significantly associated with age in set 1.

      • According to description in the Supporting Information, several samples in set 3-5 were overlapping with set 1 (45 in total). These samples should be removed from datasets 3-5 to make sure that there are no overlapping samples in the meta-analysis. However, I am not sure if the authors have actually done this. For the GWAS the overlapping samples from set 3 could still be included, given that set 1 is not involved in that. The authors could actually use these 45 overlapping samples to provide additional details about the reproducibility of HPA045005 between different measurements, for example by showing a correlation plot.

      Minor comments

      • When looking at the effect of the rs9898-stratified analysis (Table S2) it seems that there only is an effect in the presence of the C-allele. Have the authors considered the presence of a potential recessive effect of this variant when looking at mortality?

      • The authors need to discuss in more detail the implications of the difference between the two HRG antibodies in their association with mortality, for example in light of the use of HRG levels as a potential biomarker (i.e. how should one deal with the fact the way the levels are measured influences the outcome).

      • Why did the authors put part of their Discussion in the Supplement? This is not common practice. They should either move it to the manuscript or remove it completely.

      Significance

      The manuscript is clearly written and the analyses seem to be solid. However, although the findings described in the manuscript are interesting for the ageing field, they only provide a small step in the process of the usability of HRG as biomarker, i.e. many validation and follow-up studies will be necessary to prove its value. There have been some recent biomarker studies that have been much more advanced in this respect, which limits the novelty of this manuscript. I therefore feel that this manuscript may be best suitable for a medium-impact ageing-specific journal.

      My fields of expertise are ageing, genetics, and molecular epidemiology. Given my limited expertise when it comes to proteomics, I was not able to provide detailed comments on the methodology concerning this part.

      Joris Deelen

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The paper applied affinity based proteomics and antibody validation to choose and validate histidine-rich glycoprotein (HRG) as a protein/target of interest. Survival analysis techniques were used to show associations between this protein and certain biomarkers, age and all cause mortality.<br> These results and findings were used to conclude that HRG may serve as a molecular indicator of age and mortality risk.

      Major Comments:

      The authors of the paper start the paper with just one protein narrowed down ie. HRG. The rest of the paper uses affinity based proteomics, antibody validation, GWAS and survival analysis to validate this target and support their claim that HRG is an age associate protein linked to mortality and certain clinical outcomes. How did the authors conclude that HRG was the only target to explore further in this paper? What methods or analysis was done for this? What were the other proteins if any that showed up in these studies?

      For mortality outcome, it is not clear which class of disease is most strongly associated with increased risk of mortality from elevated HRG levels. If cause-specific mortality exists among the cohorts, could authors provide a more exact breakdown of the type of associated mortality by a disease class?

      Page 4 Section 3 (Results)-

      The authors say "We found consistent age-associated trends with HPA045005 across all eight replication sets (Supporting Figure 3)". On examining the supporting figure we noticed that the slope for the set with the largest number of subjects (Set 3 with ~3000 people) is visually negligibly positive (showing weakest age associated trends with HPA045005). Some comments from the authors on why they think the largest data set showed the weakest association.

      From Figure 2 C in the main manuscript one concludes that for HPA045005, binding for CC individuals is ~ 2 times higher than TT individuals. Is it possible the age association showing up for HPA045005 is primarily a function of changing/increase in allele frequency as a function of age? The authors could consider adding a clarifying plot of Age vs Allele frequency or adding an interaction term of Age and Allele Frequency in the regression and survival analysis to address this question.

      It is interesting that the signals were significant with the HPA045005 antibody but not with the BSI037 antibody. This is in spite of the fact that the GWAS for BSI0137 signals had an even stronger hit to the same locus. Can the authors please comment on why the signals from HPA045005 and BSI0137 were not highly correlated with one another and why the better antibody could not replicate the survival analysis results?

      Minor Comments:

      Figure 1: The authors description of the figure could use more clarification. "For each sample set, the estimated effect from the linear regression model.." estimated effect of what on what? On reading the main text one concludes it is the effect of age on HPA045005. This needs to be clarified in the label.

      Figure 3: The X axis for the Kaplan Meir survival curve is labelled as Age. Survival is usually time to event and time is usually the follow up time. Further clarification for the choice of this label might be helpful.

      Figure 3: it would be good to include a table with the number of individuals at risk at the bottom of the plot at defined time intervals. The figure currently compares the bottom and top quartiles of HRP for visual assessment of mortality risk, it would also be informative to include middle quantiles.

      Supporting Table 5: The note at the bottom of this table states "standardized HRG values by linear regression and scaling." What does standardization by linear regression mean?

      Supporting Table 5: It would be useful to understand that HRG carries additional risk beyond known Age and known clinical biomarkers listed in Table 2 (APOA1, APOB, TC, TG, Glucose, LDL). Could authors include a multivariate CoxPH regression with just Age? and with Age + clinical covariates?

      Significance

      The authors have identified a new biomarker for aging and mortality. Understanding the mechanism and pathways involved in HRG homeostasis and how aging causes dysregulation of this HRG could be a topic for further research. Overall, this pathway provides an opportunity of a new molecular target for aging-based drugs and research.

      This article should be of interest to researchers interested in the biology of aging and for researchers developing drugs to slow down the process of aging. In addition, it should be of interest to researchers studying the HRG as a biomarker (for example, in sepsis (https://ccforum.biomedcentral.com/articles/10.1186/s13054-018-2127-5, https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3437790).

      This paper was reviewed by 3 co-reviewers, a senior principal investigator with extensive bioinformatics, metabolomics/proteomics, epidemiological experience, a highly experienced computational biologist with a record of developing and applying methods in bioinformatics and computational biophysics and lastly an computational biologist with a background in applied mathematics and statistical analysis. All three scientists are interested in aging research and understanding how human physiology and biomarkers in specific, change as a function of age.

  4. May 2020
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      OVERALL RESPONSE

      We showed that chronic adaptation to mPOS, a new mechanism of cell stress initially discovered in yeast, induces muscle atrophy in a mouse model. We are pleased to see the overall enthusiasm from the reviewers about our work. The reviewers are unanimous in (1) that the work represents “a huge amount of work” that “has been well conducted regarding the characterization of the Ant1(TG/+) murine model that exhibits a muscle loss phenotype”, and (2) that our study linking mitochondria-induced proteostatic stress to muscle atrophy opens “a new field of investigation” and “is of interest to the scientific communities studying skeletal muscle pathophysiology and mitochondrial homeostasis”.

      Mitochondrial alterations are important hallmarks in skeletal muscles under many pathological conditions. Given that bioenergetic deficiency alone is not sufficient to explain muscle wasting as shown by other groups, the reviewers commented that our work “is original and opens new perspectives in the field of mitochondrial dysfunction related to myopathies”.

      Additional strengths noted by the reviewers include the demonstration of mPOS in an animal model, and the potential implication of our work for FSHD that is one of the most common muscle disease in humans.

      We are also pleased to learn that the reviewers reached “cross-referee” recommendations to improve the paper. We are excited to see these recommendations. We are motivated and have the capacity to implement all the four series of experiments recommended by the reviewers as outlined below.

      REVIEWERS’ COMMENTS AND OUR RESPONSE

      1. It would also be useful to perform cell fractionation and measure the accumulation of Ant1 and other unimported mitochondrial precursor proteins in the cytosol (reviewer 1). Characterizing the aggregates observed in muscles, to see whether they contain ANT1, ubiquitin, p62 and unimported mitochondrial proteins (reviewer 2 & 3). It would be informative to measure the formation of soluble and insoluble protein aggregates (reviewer 1&2).

      Response

              We propose to perform subcellular fractionation of muscle lysates using sucrose gradient centrifugation, coupled with western-blot. This will enable us to learn whether Ant1-induced stress increases the retention of unimported mitochondrial (pre)proteins (e.g., Ant1, Tom20, MDH2, TFAM, SDHA and Aco2) in the cytosol or extramitochondrial aggregates. We routinely practice this technique and we have all these antibodies validated in the lab.
      
              Our previous work showed that the giant aggresomes induced by ANT1 overexpression contain Ant1 and mitochondrial proteins in HEK293T cells (Liu et al., 2019, MBoC 30:1272-1284). However, the aggresomes we observed in the ANT1-transgenic muscles have sizes often comparable to mitochondria. Protein import stress may also lead to the accumulation and misfolding of precursors on the mitochondrial surface that are subject to ubiquitination and autophagic removal. It would be difficult to distinguish between aggresomes and mitochondria by IHC using antibodies against Ant1, ubiquitin, p62 and unimported mitochondrial proteins. To overcome this, we will take advantage of the subcellular fractionation technique described above.  This should enable us to clarify whether the cytosolic small aggregates co-fractionate with p62 and ubiquitin.
      
              As suggested by reviewer 1 & 2, we will determine whether NP-40 insoluble but SDS-soluble aggregates can be detected in the Ant1-transgenic muscles, using the dot blot technique as we previously published (Liu et al., 2019, MBoC 30:1272-1284). Antibodies against mitochondrial proteins, p62 and ubiquitin will be used to determine whether the aggregates are enriched in mitochondrial protein, p62 and ubiquitin.
      
              Collectively, the experiments proposed about will provide biochemical support for the retention, and possibly ubiquitination and p62-mediated aggregation of unimported mitochondrial proteins in the cytosol of Ant1-transgenic muscles.
      

      2. Finalizing the characterization of the EM analysis (reviewer 2 & 3) – The reviewers suggested that we should try to quantify the different aggresomal/autophagic/vacuolar structures in the transgenic and control muscles in the TEM experiments.

      Response

        Yes, we will perform the quantitation with the grids we prepared as suggested by the reviewers.
      

      3. Evidence reduced altered protein synthesis rate (Reviewer 1 & 3) – Reviewer 1 suggested that it may be helpful to provide biochemical evidence for potential changes to protein synthesis rate, in order the validate the RNA-Seq data. This is also echoed by reviewer 3. The non-radioactive SUnSET technique (FASEB J. 2011 Mar;25(3):1028-39. doi: 10.1096/fj.10-168799) was recommended for measuring protein synthesis rate in vivo.

      Response

        We appreciate reviewers’ suggestions and will be happy to set up this experiment. Briefly, we will inject the mice (n=3 for the transgenic and control mice) with puromycin to bind neosynthesized peptides. Muscle tissues will be collected and analyzed by western blot using an anti-puromycin antibody (#MABE343, Millipore Sigma). Quantitation of the western blot signals will inform whether relative protein synthesis rate is decreased in transgenic muscles compared with wild-type controls.
      

      4. Quantifying different lysosomal markers (reviewer 2) – Reviewer 2 suggested that we should quantify the levels of LC3I/II, Lmap2 and other lysosomal markers (e.g, Beclin1).

      Response

        We agree with this and the western blot experiments will be performed accordingly, using frozen muscle samples that we collected. We will also extend to the analysis using antibodies against CTSL (Abcam, #ab103574) and V-ATPase subunits such as ATP6V1H (Abcam, #ab187706) and ATP6V1G2 (Sigma, # WH0000534M2) that are upregulated in the transgenic muscles as revealed by RNA-Seq.
      

      ADDITIONAL RECOMMENDATIONS.

      The reviewers made additional minor recommendations that we found very constructive and helpful for improving the manuscript. These include providing for details of animal number and muscle types used in the experiments, statistical analysis, image analysis, and method sections for protein extraction and western blot. The reviewers also made suggestions for reorganization of some of the Figures. We have implemented some of these suggestions in the revised version of the manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In this study the authors generated a transgenic mouse model of impaired mitochondrial import/loading by overexpressing ANT1, a mitochondrial carrier protein. The moderate overexpression of ANT1 was sufficient to cause progressive muscle wasting, characterized by a reduction of myofiber size. By transmission electron microscopy, the authors observed that aggresome-like structures in the cytosol of ANT1 Tg muscle cells, suggesting that occurrence of mitochondrial Precursor Over-accumulation Stress (mPOS). The authors characterized the transcriptional profile of the muscle of ANT1 Tg and WT mice by RNA-sequencing. The data supports that mPOS leads to changes in gene expression to promote: 1) mitochondrial protein import and proteostasis; 2) inhibition of protein synthesis; and 3) protein degradation. The authors speculate that this is an adaptative response to cope with the cytosolic proteostatic stress induced by ANT1 overloading but that, if chronically sustained, leads to a reduction of protein content and consequent muscle wasting.

      Major comments:

      The key conclusions of this work are that ANT1 overexpression induces accumulation of proteins in aggresomes in the cytosol, supporting the existence of overaccumulation of unimported mitochondrial precursor proteins (or mPOS) in an in vivo animal model. As an adaptive response, the cell's transcription profile changes to inhibit protein synthesis and promote protein degradation. This adaptation creates a protein imbalance that leads to muscle wasting.

      The data and methods are clearly presented. However, this work would benefit from more evidence to strengthen the main conclusions. I have the following comments:

      1) By transmission electron microscopy, the authors detected aggresome-like structures in the cytosol of ANT1Tg/+ but not in that of WT muscles. Is it possible to quantify the frequency of each type of structure in WT and ANT1Tg/+? Does this frequency increase with age or correlate with the degree of myofiber phenotype (reduction in size)?

      2) mPOS is characterised by the accumulation of unimported mitochondrial precursor proteins. Is there any evidence either that the aggresome-like structures contain unimported mitochondrial proteins or that mitochondrial precursor proteins, still containing their mitochondrial targeting sequence, accumulate in the cytosol? Does unimported Ant1 precursor protein accumulate in ANT1Tg/+ mice?

      3) Previous studies have shown that expression of mutant Ant1 protein causes mitochondrial morphology defects and mtDNA deletions. Do ANT1Tg/+ mice have altered mitochondrial morphology or deletions/loss of mtDNA in muscle?

      4) The transcriptomic data and the amplification of the lysosomal compartment suggest an activation of multiple protein degradation processes, that could contribute to the reduced protein content in ANT1Tg/+ muscles. The increase in P-4E-BP and eIF2alpha expression in ANT1Tg/+ muscles suggest protein synthesis may be decreased. These data are consistent with unbalanced protein synthesis versus degradation. Figure 7A demonstrates a reduction in steady state protein levels in ANT1Tg/+ muscles. However, Figure 7A is insufficient to confirm a mechanistic explanation of the muscle wasting phenotype as the authors state at the end of the Results. This would require direct evidence of altered protein synthesis rates and protein degradation, which, although challenging in vivo, have not been directly demonstrated. The authors should therefore modify the final sentence in the Results.

      Minor comments:

      Suggestions to improve the presentation of data and conclusions: On page 7, it reads "No myofiber type grouping was observed in muscle samples stained for mitochondrial activities, suggesting the lack of chronic neuropathy." This sentence is lacking a reference to a figure (maybe Fig 2, D?).

      On page 12, the authors say "These genes are known to be activated as an important regulatory circuit in the Integrated Stress Response (ISR), an elaborating signaling network that is stimulated by divers cellular stresses to decrease global protein synthesis and to activate selected genes in the benefit of cellular recovery (42)." The word diverse is misspelled.

      On page 13, the authors wrote: "First, we found that the transcription of genes encoding proteasomal subunits, NFE2L1 and NFE2L2 are upregulated (Fig. 6A & 6B). NFE2L1 and NFE2L2 activate the transcription of proteasomal genes." This section could be rephrased for clarity. The first sentence suggests that NFE2L1 and NFE2L2 are proteasomal subunits. But then the authors say they activate transcription. I believe what the authors meant was that the genes encoding proteasomal subunits were upregulated (Fig. 6A), as well as NFE2L1 and NFE2L2 (Fig. 6B), that activate the transcription of proteasomal genes.

      On page 13 (last paragraph), the authors mention that "numerous genes involved in autophagy, cytoskeletal organization and intracellular trafficking are upregulated in ANT1Tg/+ muscles" but they only explain what STBD1, ARHGAP33 and ARHGEF2 do. The other genes are not mentioned. If the authors want to speculate that the differences in gene expression are relevant, they should explain the role of the different genes/proteins.

      On page 14, where it reads "we found that Lamp2-possitive lysosomes and/or lysosome-derived structures are amplified in the ANT1Tg/+ muscles", the word positive is misspelled.

      Regarding Fig 1: in C there are two Tg bars, do they represent the two independent transgenic mice referred in the text? Or are these males and females, like in the following graphs? The authors should clarify this. What do the values and error bars represent and which statistical tests were used?

      Regarding Fig 2: How was the coefficient of variability calculated? A sentence explaining this might be helpful in the methods' section. There is no scale bar in D. Which statistical test was used in I-J? The authors mention " P values were calculated by unpaired Student's t test." for E-H but no for I-J.

      Fig 3. is missing a scale bar.

      In Fig 5, B-C: the labelling of males and females is missing from the graphs. Authors should indicate which statistical test was performed for each graph.

      In Fig S1: Why split A from B? Why only showing 1 of the Tg for each timepoint and not both? If the images are representative of both transgenics, then it should be mentioned. Suggestion: adding the timepoint info (3, 6, 17 months) above each panel makes it easier to understand the figure without reading the legend. E-H: the graphs show grouped variables, wouldn't a Two-way ANOVA be more appropriate to test for statistical significance?

      Significance

      mPOS has been observed in yeast (Wang and Chen 2015) and human cells (Liu et al. 2019). This works shows, for the first time, evidence for mPOS is an in vivo animal model. This work suggests that a moderate overexpression of ANT1 is sufficient to impair mitochondrial import and loading, causing mPOS and an adaptative response to cytosolic proteostatic stress. These findings are relevant to understand pathologies where ANT1 is affected, such as facioscapulohumeral muscular dystrophy (FSHD) (Laoudj-Chenivesse et al 2005). mPOS represents a novel mechanism through which mitochondrial dysfunction impacts on muscle wasting. Moreover, since impairment of the mitochondrial import and loading machinery is observed in aging and disease (MacKenzie and Payne 2007), these findings are relevant to better understand the impact of mitochondrial dysfunction in different cell types.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The authors investigate the role of mitochondria - other than bioenergetic or oxidative stress - in the loss of muscle mass. They hypothesize that the accumulation of immature mitochondrial proteins in the cytosol is responsible for muscle atrophy, independently to mitochondria metabolism. For this purpose, they generated a murine model that over-expresses ANT1, a mitochondrial protein. The two-fold increase in Ant1 protein level leads to an overload in mitochondria import machinery, thus an accumulation of ANT1 in the cytosol, and thus to mitochondrial Precursor Over-accumulation Stress (mPOS). Consequently, the protein degradation pathway is stimulated, leading to an imbalance between protein synthesis/degradation, and in long term in muscle wasting.

      Major comments:

      The authors have executed a huge amount of work. Before fully reaching the conclusion that mPOS induce an imbalance between protein degradation/protein synthesis, by mainly increasing the lysosomal pathway, the authors should test/validate few more things:

      1-Quantify the increase in LAMP2 muscle (WB and/or immunolabeling quantifications

      2-Quantify other lysosomal markers

      3-Characterize better the aggresomes observed in ANT1Tg/+ muscles

      4-Add details for the methods (details are missing and make it difficult to judge this section, see comments below) These experiments should be easily done, 3 months of work: regular WB analysis, immunohistology and analyses EM images the authors already have, cost for supply <$2000.

      Results:

      Figure 1, movie 1-3 and paragraph "Moderate Ant1 overexpression causes progressive muscle wasting." The authors generated two independent hemizygous transgenic mice (ANT1Tg/+) and characterized them. The authors show a greater level of ANT1 in the transgenic mice. Could they show the localization of ANT1 in ANT1Tg/+ muscles: cytosol? Near the mitochondria? Sub-sarcolemmal mitochondria or else? Does ANT1 form aggregates? If yes, do the aggregates co-localised with ubiquitin? Proteasome? Lysosomal markers?

      Figure 3 and paragraph "Cytosolic aggresome formation supports mPOS." The authors show EM images of muscle section of ANT1Tg/+ muscles at 1 and 2 years old. The authors wrote that there is an increase of aggresomes: they show in figure 3Q and M structures that look like mature lysosomes, or in 3F and 3R early mitophagy ... The authors should try to classify the different structures they observed and quantify these structures (eg number of autophagic vacuole per sarcomere). They should then perform some immunostaining on muscle sections at same age to confirm an increase in lyosomal markers for example. They still should do the same analysis (quantification and immunostaining) in WT muscle tissue same age. Figure 3 B and C suggest lipid vacuoles. Can the authors check using Oil red-O staining for example (or another staining)? The accumulation of lipid drops in transgenic muscle would suggest an impact on the metabolism, and more specifically on the lipid metabolism. All these structures are classic and should be observable in WT muscle, but probably at a lower frequency. Attempting to quantify these parameters and confirm by histochemistry would help to characterise better the murine model.

      Figure 4, Supplemental Figure 4, Supplemental Table1 and paragraph "Ant1 overloading activates genes involved in mitochondrial protein import and proteostasis, and those encoding small heat shock family B chaperones consistent with mPOS." The authors generated Supplemental table 2 but never mentioned it in the text. Figure 4 and supplemental Figure 4, can the authors add the stat. The authors conclude from these figures (Figure 4 and Supplemental Figure 4) that ANT1 overexpression causes a protein import stress on the mitochondria. This is based on transcriptomic analysis and RTqPCR. They should validate at the protein level: eg level of HSPBs, NACA and HsP90 by WB and localisation in muscle section by immunostaining (counterstaining with mitochondria marker)

      Figure 6 and 7 and paragraph "Activation of multiple protein degradation processes and reduced protein content in ANT1Tg/+ muscles." Figure 6H: the authors should quantify LAMP2 level. Other markers of the lysosomal should be assessed at the protein level (LC3I/II, Beclin1 etc) The proteasome pathway does not seem strongly stimulated as no increase in ubiquitinylation nor in P62 are observed by Western blot. However, the authors should check whether The aggresomes observed, do they colocalise with ubiquitin and/or P62 proteins in muscle section (if yes try to find a way to quantify this if there is some colocalization). Are the aggresomes soluble or non-soluble proteins? The latter could interfere on the absence of detection n of increases in protein ubiquitinylation.

      Material and methods

      Paragraph describing the statistical analysis is missing. Number of mice, sample used for each experiment should be added in the Mat and Methods as well.Which muscle was used for which experiments (for histology, EM and RNAseq in Mat & Methods)? The procedure for image analysing is missing: objective used, number of images analysis per sample, how many muscle were studied? Protein extraction and Western blot procedures is missing

      Minor comments:

      -Figure 1I: typo in the x axe legend: quadriceps instead of quadrucep

      -Figure 2 and paragraph "Mitochondrial respiration is moderately decreased in ANT1Tg/+ muscles." Figure 2A-B-C: the authors should move the supplemental figure 2A and B in the main figure, and place figure 2Band C in sup data. To confirm that there is a difference in fiber size distribution, the authors should perform a Kolmogorov Smirnov test. Can the author clarify if whether they are using minimum diameter of fibres throughout the results (figure 2c and supplemental figure 2A and B), and if this is what is meant by the term "lesser diameter"? Figure 2I-J: It would be interesting to compare the different respiratory state at different age using an ANOVA2 factor and post-hoc test.

      -Page 13: full stop missing after the reference (49): "ligases respectively, are frequently upregulated (49),"

      -Supplemental Figure 4: reorganise the plot: put the reference ANT1 in first position, then organise per pathway involvement (eg: put together SLC7A1, SLC7A5 and ASNS for the acid transport, MTHFD2 and PSAT1 together for the one-carbon metabolism etc).

      -The authors describe figure 6E and F before A,B,C,D... they thus may need to switch them around.

      Significance

      Muscle loss associated with cachexia, sarcopenia, or neuromuscular disorders, if of current interest to the field, with much work ongoing to study the role of inflammation, denervation, REDOX homeostasis and proteostasis. The current paper suggests a new mechanism that could be involved in muscle atrophy: mitochondrial protein load and import. The authors generated a new murine model that would be useful to the muscle community to investigate pathways involved in muscle wasting, in different physiological and pathological context. Working on different neuromuscular disorders and muscle ageing, the existence of such a model would be an interesting tool to investigate the role of mitochondrial dysfunctions (dysfunctions other than mitochondrial metabolism) in muscle wasting.

      REFEREES CROSS COMMENTING

      Reading the comments from other reviewers, it seems that there is general agreement that this paper has been well conducted regarding the characterization of Ant1TG/+ murine model, and muscle loss.

      Similarly, all the reviewers seem to agree that: Finalizing the characterization of the EM analysis, characterizing the aggregates observed in muscles (containing ANT1?, ub?, p62?, soluble or non-soluble aggregates), as well as quantifying the protein synthesis and different lysosomal markers would improve the paper.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary In this manuscript, Wang and colleagues show that chronic adaptation to mitochondria-dependent proteostatic stress in the cytosol induces muscle atrophy. Although mitochondrial dysfunction is known to cause muscle wasting, the underlying mechanism is unclear. The authors generated a transgenic mouse (ANT1Tg/+) in which the expression of the nuclear-encoded mitochondrial carrier protein Ant1 is increased by two-fold. These mice are characterized by the progressive loss of body weight and muscle mass. As revealed by muscle histology and immunocytochemistry analysis, ANT1Tg/+ mice have decreased myofiber size and increased myofiber size variability. Consistent with muscle wasting, ANT1Tg/+ mice are characterized by decreased home cage activities and exercise tolerance. Mechanistically, the authors found that ANT1 overexpression has a relatively mild effect on mitochondrial respiration. However, ANT1 overexpression induces cytosolic proteostasis stress (mPOS), the formation of aggresome-like structures and the activation of small heat shock proteins in the cytosol accompanied by the upregulation of the stress-activated transcriptions factors. The drastically remodeled transcriptome of ANT1Tg/+ mice muscles is indicated by the authors as an adaptive response to counteract mPOS.

      Major comments

      The phenotypic characterization of the ANT1Tg/+ skeletal muscle is well conducted and detailed. However, the key conclusions are mainly based on the interpretation of RNA-seq data with little experimental evidence of the underlying mechanism. For this reason, the authors should qualify some of their claims as preliminary or speculative. For instance, they find upregulation of SENS2 gene, which has been demonstrated to take part in mitophagy. In addition, they corroborate this data with electron microscopy images where mitophagy structures are present. However, these two data are not enough to state that mitophagy is involved. It would be advisable to focus on fewer genes but with a stronger validation process.

      I recommend performing the following experiments:

      To better characterize the effects of Ant1 overexpression on mitochondrial function, the author should address whether ANT1Tg/+ mitochondria are more prone to depolarize or not.

      It would be informative to measure the formation of soluble and insoluble protein aggregates.

      It would also be useful to perform cell fractionation and measure the accumulation of Ant1 and other mitochondrial proteins in the cytosol.

      Finally, I would suggest to measure protein synthesis by the non-radioactive SUNSET technique (FASEB J. 2011 Mar;25(3):1028-39. doi: 10.1096/fj.10-168799).

      Minor comments: The author should mention how many muscles were used for the EM studies. I would encourage discussion of the atrophic (and not dystrophic) phenotype of the ANT1Tg/+ mice related to the connection between Ant1 and FSHD.

      Significance

      The authors investigate a still poorly explored mechanism underlying muscle wasting based on mitochondrial import machinery dysfunction. The work is original and opens new perspectives in the field of mitochondrial dysfunction related to myopathies.

      Mitochondria alterations play a key role in the context of muscle decline in many diseases and in aging. It is well known that the alteration of mitochondrial respiration and the oxidative stress increase are hallmarks of mitochondria dysfunction in skeletal muscles under pathological conditions. However, there is evidence that these features are not sufficient to explain the severe phenotype of muscle wasting. This work opens the way to the possibility that non-bioenergetics factors could take part in the pathological scenario. In detail, the involvement of the mitochondrial import mechanism, which causes a cytosolic proteostatic stress, is a new field of investigation. A few years ago, the same authors demonstrated that the mitochondrial precursor over-accumulation stress (mPOS) triggers a cytosolic proteostatic stress in yeast, however until now there was no evidence whether this phenomenon could occur in animals and which tissues would be involved. Thanks to this work, the authors demonstrated that mPOS occurs in skeletal muscle.

      This study is of interest to the scientific communities studying skeletal muscle pathophysiology and mitochondrial homeostasis. My main research field is the role of mitochondrial homeostasis in skeletal muscle function in health and disease.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1:

      In this manuscript the authors explore the requirements for centromere transcription using single-molecule FISH. Previous studies have found that centromeres are transcriptionally active in a wide variety of organisms. Centromere transcription has been proposed to facilitate Cenp-A deposition through chromatin remodeling and to directly contribute to centromere/kinetochore function by producing a functional ncRNA. However, we currently know almost nothing about how transcription is initiated at the centromere or how levels of centromere transcripts are controlled. This manuscript makes several major findings that are potentially of importance to groups studying centromere transcription. 1.) Centromere RNAs are produced by RNA Polymerase II and are localized in the nucleus of a wide-range of cell types. 2.) Centromere RNAs do not localize to the centromere, which is in contrast to several recent studies. 3.) Centromere proteins are not required for transcription of alpha-satellite sequences. 4.) Localization of centromeres to the nucleolus represses centromere transcription. Overall, this is a solid manuscript and has the potential to make a significant impact in the field. Below I suggest a couple of experiments and modification to the data presentation that could improve the manuscript.

      We thank this reviewer for their interest in this paper and agree with their clear articulation of the key points.

        • All of the experiments in this manuscript rely on detection of centromere RNAs using single molecule FISH probes. These probes are validated by showing the RNase treatment removes the FISH signal. A strength of this approach is that the authors use multiple different probe sets and achieve comparable results. However, there is no orthogonal validation that the probes detect alpha satellite RNA. All of the experiments in this manuscript would be significantly improved by showing that the results presented here can be confirmed by a different approach. I suggest that the authors use Q-RT-PCR to validate the smFISH results. * The smFISH probes provide a powerful and unique strategy to detect alpha-satellite transcripts. To ensure that these experiments are carefully controlled, we analyzed multiple distinct probe sequences that recognize alpha-satellite transcripts derived from different chromosomes, as this reviewer highlights. We also conducted an in-depth computational analysis to ensure that these probes do not match genomic sequences outside of alpha-satellite regions. However, we recognize and agree that a complementary method to detect these transcripts would be a useful addition to this paper. We are currently highly constrained in our ability to conduct these experiments due to COVID-19-related laboratory closures, but if feasible our goal for a revised manuscript would be to conduct qPCR experiments for a subset of the conditions that are the most central to the key results in this paper (focusing particularly on HeLa and Rpe1 control cell lines, CENP-C iKO, Ki67 KO, and RNA Polymerase I and RNA Pol II inhibitors).
      • Several results in this manuscript directly contradict results in published studies, but these discrepancies are not discussed. I believe the authors need to discuss the following discrepancies between their results and those in the literature: *
      • McNulty et al. Dev. Cell. 2017. Show that alpha-satellite RNA is transcribed from all centromeres and remains localized to the site of transcription. The different results and possible explanations for the differences should be discussed. *
        • Additionally, Rosic et al. JCB 2014, Blower Cell Reports 2016 and Bobkov et al. JCB 2018 all show that centromere RNAs localize to centromere regions. The differences between these studies and the authors results should be discussed. *
      • The authors show that satellite RNA cannot be detected on mitotic chromosomes. However, Johnson et al. Elife 2017, Bobkov et al. JCB. 2018, and Perea-Resa et al. Mol. Cell. 2020 show that EU-labeled RNA can be detected at the centromere during mitosis. The authors should discuss the discrepancy between their results and these studies. Is it possible that their smFISH probes do not detect nascent, chromatin-bound transcripts? *

      We believe that a strength of our paper is that it assesses alpha-satellite transcripts in individual intact cells using fixation conditions that preserve the native behaviors without disruptive and harsh extraction. As our results differ from those of other laboratories in some cases, we agree that it would be helpful to comment more directly on these differences with prior work. Points a, b, and c above all relate to the presence of alpha-satellite transcripts at centromeres. For the revised paper, we will include a discussion of these prior observations and some possible reasons for the differing results. In particular, we think that these discrepancies reflect two key differences:

      1. Other strategies with harsh extraction conditions likely eliminate soluble alpha-satellite transcripts that are not tightly associated with centromeres, whereas our work preserves these.
      2. It is possible that we are unable to detect nascent transcripts by smFISH as these are embedded within the RNA polymerase. Extraction conditions: An advantage of the smFISH probes used in our paper is that these require mild fixation conditions without prior extraction to better preserve cellular structures allowing us to analyzed intact cells, rather than chromosome spreads. Thus, our approach maintains the diverse alpha-satellite transcripts that are not bound to centromeres, and which may have been washed away in other studies. In contrast, some prior studies used stringent extraction conditions and primarily conducted experiments in chromosome spreads (not intact cells). Although it is not feasible to precisely determine the basis for differences without repeating this work the precise approaches and conditions from each paper and working closely with each group, we believe that these substantial technical differences explain our differing observations that reveal that the majority of alpha-satellite transcripts do not remain at centromeres..

      Nascent transcripts: As suggested by this reviewer, we agree that our differing conditions may mean that we are unable to detect nascent transcripts that are closely associated with the RNA polymerase, inaccessible due to their chromatin proximity, or that are not sufficiently elongated such that they are present to hybridize to multiple copies of the smFISH probes to be detectable. The alpha-satellite transcripts must be derived from centromeric and pericentromeric regions and so must exist there at some point (as also attested to the EU signals that this reviewer mentions in the work from our collaborative the Blower lab; we have also detected EU signal at centromeres). However, our work suggests that alpha-satellite transcripts do not persist at centromeres indefinitely once generated, with mature transcripts in the nucleoplasm and liberated from chromosomes during mitosis. We believe that the combination of the relative inability of our smFISH probes to detect nascent transcripts, but stringent conditions disrupting non-centromere bound transcripts for prior work likely explain these distinctions.

      • The authors show nicely that deletion of Ki-67 reduces centromere localization to the nucleolus and increases centromere transcription. However, this has no effect on centromere function. Studies from the Earnshaw lab (e.g. Nakano et al. Dev Cell 2008 and Bergmann et al. EMBO J. 2011) show that increasing or decreasing centromere transcription results in loss of kinetochore function on a human artificial chromosome. The authors should discuss the differences between their results and these studies. Is it possible that the small size of the HAC exaggerates the importance of the correct levels of centromere transcription? *

      We are big fans of the Earnshaw lab work. In this case, there are a couple of possibilities to explain the strong effect that the Earnshaw lab observed on kinetochore function by perturbing centromere transcription. First, the degree of the change in centromere transcription may make a big difference. The Ki-67 results in an approximately 2-fold increase in alpha-satellite smFISH foci, which may still be within a permissive range for normal kinetochore function. Second, the experiments from the Earnshaw lab rely on targeting activating or silencing proteins to the centromere region, and it is possible that changes in centromere chromatin downstream of these factors contribute to the observed phenotypes in addition to altering the amount of centromere transcription. We will include a brief discussion of the Earnshaw work in a revised paper.

      • The authors treat cells with transcriptional inhibitors for 24 hours. I am concerned that this may result in massive cell death. It would be helpful to include cell viability data from these experiments. *

      We appreciate this point and agree that cell lethality is an important consideration given the essential role of the RNA polymerases. For the inhibitors, we first treated the cells for a variety of different time points to evaluate these behaviors. For example, we found that we could treat cells with RNA Polymerase II inhibitors for as much 48-72 hours without detecting noticeable cell death. Thus, at the 24 hour time point, the cells remain viable and intact, as is also visible in the images showing DNA staining for these treatments in Figure 3. We also note that this timing is consistent with prior studies that block transcription or translation. However, we did additionally conduct these experiments at earlier time points (5 hours and 12 hours post-drug addition) and obtained similar results. For example, for the Cdk7 inhibitor using the ASAT probe, we observed the following smFISH foci/cell: Control (3.4 foci/cell), 5 h (1.5 foci/cell), 12 h (1.2 foci/cell), 24 h (0.9 foci/cell). There is a clear effect even at 5 hours of treatment and a continued downward trend. Both for simplicity and because the replicates and number of cells that were quantified were lower for these conditions, we chose not to include these in the paper. We will include a statement regarding these earlier time points in the revised version.

      • In Figure 3C the authors examine the effects of centromere protein knock outs on centromere transcription. To me this is the most important experiment in the manuscript and is a major step forward for the field. The authors use inducible CRISPR knock out cell lines that are not 100% penetrant. It would be helpful if the authors could describe how they ensured that cells included in the image quantification were knock out cells. *

      Based on this comment and the other questions from the other reviewers, we recognize that we need to provide a much better description of the CRISPR knockout strategy, the prior validation of these cell lines, and the strategies that allow us to use these cell lines in a robust manner to ensure that we are effectively eliminating the target genes. We have systematically tested this strategy in multiple cases and find that this strategy is superior to RNAi for its efficacy and the potency of the phenotype, particularly for this type of cell biological assay.

      The Cas9-based strategy is a highly effective way to conditionally eliminate essential genes. In this case, the efficiency of the Cas9 nuclease ensures that the genomic locus is cleaved in essentially 100% of cases. As this is repaired in an error prone manner and typically using non-homologous end joining, 66% of individual events result in frame shifts mutations that disrupt the coding sequence of a target gene, with ~50% of cells resulting in frame shifts in both copies of a gene. In addition, if a sgRNA targets a region of a gene that cannot tolerate mis-sense mutations, this will result in an even greater fraction of mutant cells. Thus, these inducible knockout cell lines result in robust and irreversible gene knockout, with a large fraction of cells (50% or more) displaying a clear phenotype. However, it is also true that there are a subset of cells within the population that will repair the DNA damage following Cas9 cleavage in a way that preserves protein function such that they behave similarly to control cells. Importantly, this means that there will be two classes of cells within a population – those that are unaffected, and those that are strongly affected. As we are analyzing each cell individually instead of creating a population average, this will capture this phenotypic diversity to reveal two populations of behaviors in cases where eliminating a gene results in a substantial change in smFISH foci. For example, the smFISH foci/cell data for the CENP-C inducible knockout (Fig. 3C and 3E) indicates that many cells have smFISH foci numbers that are comparable to control cells, but others that display substantial differences and highly increased numbers. An ideal control in these experiments would be to additionally analyze the levels of the target protein together with the smFISH analysis. Unfortunately, many of the antibodies are not compatible with the conditions needed for the smFISH. For CENP-C, the antibody that we have is not compatible with the conditions that we are using for the smFISH, so it is not feasible to co-stain these cells as suggested. Instead, for our analysis of the centromere-nucleoli localization (for example), we used the presence of a clear CENP-C interphase phenotype (“bag of grapes” resulting from chromosome mis-segregation) as an indication that the cells had been knocked out for CENP-C.

      The majority of the Cas9-based inducible knockouts that we used for this paper were generated previously in the lab (McKinley et al. 2015; McKinley et al. 2017). For the centromere protein knockouts (McKinley et al. 2015), these were analyzed previously with respect to phenotype and monitored for the depletion of each gene target over time. For the larger collection of cell cycle and cell division inducible knockouts, for our prior work we systematically validated each of these with respect to their phenotype (see http://cellcycleknockouts.wi.mit.edu). Thus, we are confident that each of these cell lines is functional and effective for eliminating the target gene.

      For conducting the experiments using the inducible Cas9 cell lines in this paper, we used the presence of these previously-defined phenotypes within the population as a validation that the strategy is working. Again, in general we find these knockouts are both penetrant and severe in their phenotypes. Importantly, for this diverse set of genes, we note that our goal was to broadly survey diverse factors to identify changes in alpha-satellite transcript levels. We intended this analysis as a “screen” where we would identify factors that resulted in a substantial change in the number of smFISH foci. As with any larger analysis, it is possible that there are false negatives where we did not detect a strong effect on transcript levels (such that they may contribute to centromere transcription). We have tried to use caution not to indicate that this data excludes any possible role for these factors in transcript levels, although in general the majority of the tested factors did not show a substantial change in smFISH foci. For the revised paper, we will make an explicit statement to this effect.

      • On p8. The authors cite Quenet and Dalal. eLife 2014 for the idea that transcription during G1 is important for new Cenp-A loading. They should also cite Chen et al. Dev. Cell 2015 and Bobkov et al. JCB. 2018. *

      Thank you for these helpful suggestions. We will update the text to incorporate these references.

      Reviewer #2:

      The study by Bury et al. investigates the formation of two different types of alpha-satellite transcripts (ASAT, SF1 and 3) in different human cell lines. Using smFISH they find that during the cell cycle these centromeric transcripts don’t stay at the centromere and are found in the cytoplasm after mitosis. Using specific inhibitors, they find that transcription is dependent on RNAPII, but not on various centromere and kinetochore proteins taking advantage of an inducible CRISPR-depletion system that the lab had previously developed. Interestingly, they find that CENP-C, a major component of the centromere and previously characterised as an RNA-binding protein, negatively regulates alpha-satellite transcript levels. Another regulator for transcript levels appears to be centromere-nucleolus interactions (as also indicated in the title) acting to suppress expression of these non-coding RNAs.

      This is overall a really interesting study and indeed, transcription at the centromere is little understood at this point. Given the importance of the centromere the findings in this manuscript will be of high interest to both researchers in the field and a general audience. There are novel and interesting insights into centromeric transcripts but the study still requires some controls.

      We appreciate this reviewer’s kind words and their clear description of our work.

      1) The authors state that the majority of smFISH foci do not colocalise with centromeres in a combined IF/FISH experiment (some quantification and a % of that subpopulation should be given somewhere). This is a bit concerning but of course could also be true. It either means that alpha-satellite transcripts leave the centromere as suggested by the authors (although some should be visible at the centromeres during the act of transcription). Alternatively, a trivial explanation would be that there is a lot of unspecific staining, which can occur in FISH-experiments to varying degrees. The RNase treatment to control for the absence of potential DNA hybridization is convincing, but the FISH probe could also interact with non-centromeric cellular RNA. With the centromere localisation as a reference point gone, some control is needed to validate that the RNA-FISH signals are indeed recognising alpha-satellite RNA that emerged from centromeres. The authors could try competition experiments titrating unlabelled specific or unspecific DNA probes alongside their labelled specific FISH probe into their FISH experiment to see if they lose or maintain the signal and the number of foci. The specific RNA FISH probes could also be used in DNA FISH, to demonstrate they are working and recognising specific centromeres.

      For understanding this behavior, we believe that an important feature of alpha-satellite transcripts is that they are relatively stable (protected from nucleases within the nucleus), but that their overall number is low, consistent with transcription of other non-coding regions across the genome. Thus, if a transcript were produced at centromeres, but subsequently diffuses away, only a small subset would be detectable at centromeres. In addition to our validation these probes using RNAse, we would like to highlight that we have analyzed multiple distinct sequences that recognize different subsets of alpha-satellite repeats. In each case, the observed behaviors are very similar. In addition, the nature of the oligo FISH method requires multiple individual probes to anneal to the same transcript such that a signal is only detected if a sufficient number of oligos bind to the same transcript. This makes nonspecific binding unlikely to contribute to a false signal. Finally, a subset of the perturbations that we tested that are relevant to centromere function (including the CENP-C inducible knockout) clearly affect the levels of these transcripts, supporting a centromere origin. The additional control experiments suggested by the reviewer could be useful, but are technically complex with their own caveats in interpretation and we do not feel that they would add substantially to the existing paper. Instead, as discussed in response to Reviewer #1, point #1, we plan to validate key results described in the paper using qRT-PCR (if possible based on current experimental constraints in the lab associated with COVD-19).

      As described above in response to Reviewer #1, point #2, we also believe that some differences with prior work suggesting that alpha-satellite transcripts localize to centromeres may be due to stringent extraction conditions that eliminated non-centromere bound transcripts, while at the same time reflecting our inability to detect nascent transcripts. Quantifying “colocalization” within the nucleus is limited by the resolution in light microscopy, and we would prefer to use caution in defining which transcripts in our smFISH analysis overlap with centromeres. However, we believe that our work clearly highlights the fact that a general feature of mature alpha-satellite transcripts is that they localize throughout the nucleoplasm and are not strongly associated with mitotic chromosomes.

      2) Apart from Figure 4, there is no analysis shown for statistical significance. This should be done for most if not all quantifications. Are indeed ASAT and antisense RNA Foci number not significantly different? The authors say that the levels of alpha-sat RNA in Rpe1 cells are not substantially different from other cell lines, but is it also not significant (Fig 1F)? In Figure 2D it is concluded that transcripts foci number are increased in S/G2 (from G1) and remain stable in mitosis, but it looks like there is an increase in mitosis. Again, it looks like the higher number of smFISH foci/Cell is significantly higher for both ASAT and SF1, so some statistical analysis would be required here.

      For this paper, we quantified hundreds of cells for each condition, measuring the number of foci/cell in each case. Because of these large n’s, even relatively small differences between samples become statistically significant when tested using standard statistical comparisons (unpaired T test and one-way ANOVA test amongst others). For our experiments, every sample condition included an analysis of control cells, allowing us to compare the control condition to any perturbations on the same day. However, there is some variability between these different replicates, with the average number of ASAT smFISH foci/cell in HeLa cells ranging from 3.4 to 5.6. When compared relative to each other, a subset of these control samples will appear to be statistically different from each other despite the fact that this is not a substantial difference between replicates. Similarly, the majority of the tested inducible knockout cell lines are statistically different from control cells, even when the differences are relatively minor. Therefore, we have tried to use caution when applying the double-edged sword of statistics to these analyses. Instead, we have tried to consider differences with a “substantial magnitude” instead of “statistically significant” differences that may make modest, but statistically significant differences seem artificially more important. We believe that the graphs in which every data point is represented, together with listing the average number of foci/cell in each condition allow the reader to evaluate this data for themselves. Many of the trends that this reviewer highlights are indeed interesting comparisons to consider for future work.

      3) Starting with the description of Figure 1E in the main text the paper equates foci count of smFISH per cell with RNA transcript levels. I'm not convinced that these are necessarily the same. You could have many weak foci or few very bright with the same amount of overall transcripts in both. The authors start out introducing smFISH as highly sensitive "for accurate characterisation of number ...of RNA transcripts". This suggests that foci intensity could be used as a read-out for transcript levels. It should be possible to measure individual intensity of the foci for a subset of images. Do foci intensity correlate or anti-correlate with foci numbers? Is the sum of the intensities of all the foci less variable than the foci number for an individual cell type?

      Due to the repetitive nature of alpha-satellite sequences, an increased intensity of a smFISH foci could reflect either the close proximity of multiple separable transcripts, or a longer transcript with multiple binding sites for the smFISH probes. Because of this, throughout the paper, we have referred to these as “foci” instead of stating a specific transcript number. As part of the automated computational analysis of the smFISH images, we additional analyzed foci intensity. In general, these values were similar across a cell population and between various perturbations with the key results and findings consistent whether we measured foci number or overall foci intensity per cell. However, foci intensity can vary slightly across a coverslip (technical constraints, not biological differences), and thus we have focused on foci number as a more consistent metric that correlates with the production of alpha-satellite transcripts.

      4) I really like the use of the inducible CRISPR system to remove various centromere factors. However, some validation would be required to show that the system is effective in removing the proteins of interest in these experiments. For instance it would be helpful to show in Figure 3D an additional panel with CENP-C staining. Also for a subset of factors, some antibody staining co-staining with the smFISH could be provided in the supplemental material.

      We appreciate this point. However, we feel that the existing experiments appropriately consider the nature of the knockout. First, we primarily used Cas9-based inducible knockouts that were generated previously in the lab (McKinley et al. 2015 and McKinley et al. 2017). As these knockouts have been described previously and extensively validated with respect to phenotype (in every case; see http://cellcycleknockouts.wi.mit.edu for example) and antibody staining (in selected cases), we have not repeated this here for the diverse cell cycle knockouts used. In general, we find these knockouts are both penetrant and severe in their phenotypes. Given the broad number of knockouts that we tested, this is not feasible in every case. We also intended this analysis as a type of “screen” where we could validate any “hits” that were observed, and will use caution in our wording not to imply that a negative result is decisive.

      The important exceptions to this are CENP-C (which we analyzed more closely) and Ki67 (for which both the inducible and stable knockouts were generated for this paper). For Ki67, the antibody staining is shown and we believe that this is clear. For CENP-C, the antibody that we have is unfortunately not compatible with the conditions that we are using for the smFISH, so it is not feasible to co-stain these cells as suggested. For the smFISH analysis in the inducible CENP-C knockout, we analyzed every single cell, including some cells that are likely to have intact CENP-C levels. Thus, if anything, the potent increase in smFISH foci underrepresents the dramatic effect of CENPC depletion. Based on our prior work (McKinley et al. 2015) we found that the CENP-C knockout results in a pervasive “bag of grapes” phenotype in which chromosomes mis-segregate during mitosis and are packaged into separable interphase nuclei. For the analysis of the nucleoli, we selected cells that displayed this clear phenotype (as shown in the figures).

      5) Since none of the CRISPR iKO has a particular inhibiting phenotype it would be useful to include some positive control in the CRISPR experiment. Would it be possible to use a CRISPR iKO target that affect some factor of the transcription machinery (RNA Pol II or similar) to reduce transcript levels?

      Generating additional Cas9 iKO cell lines is feasible, but would be time consuming. In this case, we are not convinced of the value of generating and validating these additional cell lines (particularly with the additional current constraints due to COVID-19). For evaluating the role of the RNA polymerases, we believe that the effect of the drug treatment is clear. For creating a positive control to assess whether the CRISPR iKO strategy is a feasible way to conduct these experiments, we would like to highlight the CENP-C iKO cell line, which has a potent effect in this assay.

      6) The authors find a negative correlation between the nucleolus-centromere association and the number of alpha sat foci. This is really interesting and they suggest that the nucleolus association could negatively regulate centromere transcription. However, this correlation is rather indirect in the sense that cells with a higher-degree of nucleolus-centromere localisation have fewer smFISH foci and the inverse, disruption of the nucleolus increases smFISH foci number as a whole. A model based on physical association would suggest that a nucleolus associated centromere produces less or no transcripts. Given that this is not a population-based assay, it should be possible to address this directly by analysing the location of individual centromeres and corresponding transcripts to strengthen the hypothesis. This could be done by either analysing the smaller subset of centromere-associated foci that colocalise with the smFISH signal and test whether the majority of these signals are proximal or distal to the nucleolus (this would not work or be less meaningful if the subpopulation is very small). Or doing a combined DNA/RNA FISH experiment. The expectation would be that DNA FISH signals of centromeres close to the nucleolus would not produce an RNA FISH signal somewhere else, and vice versa.

      We predict that centromere-nucleolar associations are dynamic. Thus, we anticipate that centromeres would be associated transiently with the nucleolus (perhaps for a few hours), and that a given centromere would not be associated with the nucleolus in every cell at a specific time point. Thus, we believe that analyzing these behaviors across a diverse range of cells, as we did for this paper, is appropriate. In addition, technical considerations make these suggested experiments prohibitive. Defining the relationship between a centromere RNA and its originating centromere would require combined DNA and RNA FISH. The repetitive nature of alpha-satellite repeats and the strong similarity of these sequences between chromosomes makes it highly complex to visualize an individual centromere. Even if we were able to do this, the conditions required to simultaneously detect nucleoli (immunofluorescence), RNA (smFISH), and DNA (requires denaturation and hybridization) make this such that it would be complex to correlate the localization of an individual centromere with the levels of the corresponding alpha-satellite transcripts. In addition, these RNAs are likely to persist for an extended duration (possibly throughout the course of an entire cell cycle), such that they would not necessarily correlate with the current localization behavior of the centromere from which they are derived. For future work (beyond the scope of this paper), we plan to create cell lines expressing both centromere (CENP-A) and nucleolar markers (for example, Ki67) to conduct time lapse imaging to assess the dynamic associations between these structures.

      7) At the end of the abstract, the authors conclude that the control of centromere transcription might be regulated by the centromere-nucleolar contacts to modulate chromatin dynamics. What does that really mean? One possibility they give in the discussion is rejuvenating centromeric chromatin. It would be nice if they could show some effect along those lines at the centromere in one of the manipulations they did (either through inhibiting or increase transcription). At least as discussed in the paper (Supp. Fig 3 D) it appears that overall levels of CENP-A are not affected. Is this different for newly loaded CENP-A? Or some other aspect of chromatin dynamics that is modulated? I realise that this might have been difficult to detect and therefore missing in the current study.

      In a separate study from our lab as part of our recent work (Swartz et al. 2018), we found that CENP-A is gradually incorporated at centromeres in non-dividing quiescent cells, including non-transformed human Rpe1 cells and starfish oocytes. In the case of oocytes, which contain a substantial pool of mRNAs such that they do not require ongoing transcription for viability, we found that inhibiting RNA Polymerase II and preventing ongoing transcription blocked the incorporation of newly synthesized histones, including both canonical histone H3 and CENP-A. We realize that our description of this prior work was not sufficient to understand our integrated model, which relies on information from both papers. For the revised paper, we will update our discussion to better describe this data and present our model.

      • Page 8: The authors state that as cells entered mitosis, dissociation of smFISH foci from chromatin was observed. While the absence of co-localisation of DAPI and smFISH signals is obvious in mitotic cells, what evidence is there that smFISH foci are chromatin associated in interphase nuclei? Rephrasing this bit might avoid confusion here. *

      We appreciate this point. We did not mean to imply that the smFISH foci are bound to (or associate with) chromatin in the interphase nucleus. We will reword this as suggested.

      Reviewer #3:

      The manuscript of Bury et al. addresses how alpha-satellite transcription around centromeres is regulated. Using smFISH to detect alpha-satellite RNA transcripts, the authors find that alpha satellites are transcribed by RNA pol II, but their transcription is independent of centromeric proteins. In addition, they present evidence that nucleolar association represses alpha-satellite transcription. The data is convincing, solid and generally supports the conclusions. The manuscript includes appropriate control experiments, such as test for the validity of the RNA FISH probes. The manuscript is well-written and easy to follow, also for someone who is not directly an expert in the field.

      The authors use a single-cell technique (smFISH) to look at the localization and transcription of alpha-satellite transcription from centromeres. The technical advance of this paper is limited, as smFISH is a well-established technique by now. Nevertheless, applying this single-cell approach to these repetitive regions has resulted in new insights regarding the regulation of alpha-satellite transcription, especially their localization of centromeres to nucleoli. Regarding the significance of these insights in the context of centromere biology/regulation and its literature is hard to evaluate for me, because this is not my field of expertise (my background is in single-cell transcription regulation). As a researcher from a related research field, I think the findings of this manuscript are mostly relevant for the direct research community of centromere and alpha-satellite biology, but not for researchers outside the field.

      We appreciate these comments regarding the carefully controlled nature of our paper and the value of the advances for understanding alpha-satellite transcription. We also agree that smFISH is an established technique, although it has not been applied to these repetitive alpha-satellite sequences in prior work, allowing us to make important new observations usng the studies in this paper.

        • The description of the inducible knock out cell lines is very limited. My main concern is how is checked that the gene is actually knocked out. I went back to the referenced paper, but it is still is not clear to me whether the new knockouts are sufficiently checked. It would be more convincing if the authors could show western blots or other evidence that their knockouts are working. In any case, the description of the knockout generation should be more elaborate. * This important point was also noted by the other reviewers. Please see our responses to Reviewer #1 point 4 and Reviewer #2 point 4. As described above, for a revised paper, we will provide an improved description of these knockout cell lines, our validation of these tools, and how we conducted the experiments in this paper.
      • The authors nicely show that there is an inverse correlation between nucleolar association of the centromere and alpha-satellite transcription. The data supports this claim, but given the many knockouts and cell lines that were tested, with many intermediate phenotypes (such as CENP-B), I find the correlation based on 4 points a bit sparse. I would recommend filling up figure 4C with a few more mutants, to show that the inverse correlation holds for all mutants. These experiments would be straightforward for the authors, as the knockout/cell lines and techniques are already available. *

      We see a compelling general correlation between the fraction of nucleolar-localized centromeres and alpha-satellite transcript levels. Our goal for Figure 4C was to highlight this correlation for a selected subset of conditions. However, we do not believe that there will be a precise linear correlation between transcript levels and nucleolar centromeres under every condition. Indeed, it is quite possible that some perturbations would affect transcript levels without altering nucleolar associations. This is particularly true for perturbations that cause subtle phenotypes. Systematically analyzing centromere-nucleolar co-localization for each of the knockouts represents a substantial undertaking that we do not feel would contribute substantially to this existing paper.

      • The nucleolar repression is also supported by the Fibrillarin and Ki67 knockout. These are nice experiments which support their findings. What I am missing is whether these data quantitatively agree with the inverse correlation. Are these mutants completely lacking nucleoli, and if so, would you not expect both mutants to show the same upregulation? Similar to my point above, where do these mutants fall in the graph of figure 4C? *

      For the perturbations described in this paper, we believe that inhibiting RNA Polymerase I most closely approximates the condition where nucleolar function is eliminated. Although Ki67 is a nucleolar protein in interphase, loss of Ki67 does not cause lethality indicating that nucleolar function is largely intact. We agree that it would be a good experiment to assess nucleolar-centromere associations in the Ki67 knockout. In fact, we have tried these experiments several times. However, due to the absence of Ki67 (for which we have the best localization tools), we instead needed to use Fibrillarin to monitor nucleoli. We have found this antibody to be much more finicky and not as readily compatible with the fixation conditions needed to detect centromeres. Thus far, we have not been able to generate clear data for this behavior.

      • Related to this, since their imaging techniques have single-cell resolution, I wonder if cells that contain many centromeres in the nucleolus have less alpha satellite transcripts than cell with few centromeres. *

      The correlation between centromere-nucleolar associations and alpha-satellite transcript numbers is strongly supported by our data across a population. However, analyzing this in individual cells is additionally complicated by the fact that we found that transcript levels vary over the cell cycle (low in G1, higher in S/G2). In addition, monitoring each of these markers in individual cells is technically complicated. Thus, while we appreciate this suggestion, we believe that our data stands on its own.

      • One claim that is a bit speculative is the suggestion that transcription itself and not the RNA may be required for the function of the alpha-satellites. This is indeed supported by the fact that most transcripts are not localized at the centromeres. However, this contrasts to the findings of the papers that increasing alpha-satellite transcription in different mutants does not appear to result in any phenotype on centromere function. For a non-expert, the function of these transcripts/transcription itself is not clear from the current manuscript, so I would recommend discussing the nuances of its functions in more detail in the discussion. *

      We agree that our model is speculative, but have chosen to include this to provide our perspective on the possible roles for centromere transcription based on this paper and our other recent work (Swartz et al. 2018). We believe that our data provide a context and set of constraints for potential roles of centromere transcription, but also agree that future work is needed to resolve these. Based on this comment and those from the other reviewers, we will also provide a better description of the data in the Swartz et al. paper, which analyzed different features of centromere transcription.

      • To quantify the smFISH data, the authors count the number of foci. From the images, it looks like the different foci have very different intensities. This may occur if the transcripts are different length when transcribed from different genomic regions. However, this may also occur if several RNA co-localize to the same spot, i.e. if one spot contains several RNAs. Can the authors verify that the distribution of spot intensities matches the expected intensities based on the different transcribed alpha-satellite regions? *

      Please see our response to Reviewer #2, point #3.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The manuscript of Bury et al. addresses how alpha-satellite transcription around centromeres is regulated. Using smFISH to detect alpha-satellite RNA transcripts, the authors find that alpha satellites are transcribed by RNA pol II, but their transcription is independent of centromeric proteins. In addition, they present evidence that nucleolar association represses alpha-satellite transcription.

      The data is convincing, solid and generally supports the conclusions. The manuscript includes appropriate control experiments, such as test for the validity of the RNA FISH probes. The manuscript is well-written and easy to follow, also for someone who is not directly an expert in the field.

      Major comments:

      The description of the inducible knock out cell lines is very limited. My main concern is how is checked that the gene is actually knocked out. I went back to the referenced paper, but it is still is not clear to me whether the new knockouts are sufficiently checked. It would be more convincing if the authors could show western blots or other evidence that their knockouts are working. In any case, the description of the knockout generation should be more elaborate.

      The authors nicely show that there is an inverse correlation between nucleolar association of the centromere and alpha-satellite transcription. The data supports this claim, but given the many knockouts and cell lines that were tested, with many intermediate phenotypes (such as CENP-B), I find the correlation based on 4 points a bit sparse. I would recommend filling up figure 4C with a few more mutants, to show that the inverse correlation holds for all mutants. These experiments would be straightforward for the authors, as the knockout/cell lines and techniques are already available.

      The nucleolar repression is also supported by the Fibrillarin and Ki67 knockout. These are nice experiments which support their findings. What I am missing is whether these data quantitatively agree with the inverse correlation. Are these mutants completely lacking nucleoli, and if so, would you not expect both mutants to show the same upregulation? Similar to my point above, where do these mutants fall in the graph of figure 4C?

      Related to this, since their imaging techniques have single-cell resolution, I wonder if cells that contain many centromeres in the nucleolus have less alpha satellite transcripts than cell with few centromeres.

      Minor comments

      One claim that is a bit speculative is the suggestion that transcription itself and not the RNA may be required for the function of the alpha-satellites. This is indeed supported by the fact that most transcripts are not localized at the centromeres. However, this contrasts to the findings of the papers that increasing alpha-satellite transcription in different mutants does not appear to result in any phenotype on centromere function. For a non-expert, the function of these transcripts/transcription itself is not clear from the current manuscript, so I would recommend discussing the nuances of its functions in more detail in the discussion.

      To quantify the smFISH data, the authors count the number of foci. From the images, it looks like the different foci have very different intensities. This may occur if the transcripts are different length when transcribed from different genomic regions. However, this may also occur if several RNA co-localize to the same spot, i.e. if one spot contains several RNAs. Can the authors verify that the distribution of spot intensities matches the expected intensities based on the different transcribed alpha-satellite regions?

      Significance

      The authors use a single-cell technique (smFISH) to look at the localization and transcription of alpha-satellite transcription from centromeres. The technical advance of this paper is limited, as smFISH is a well-established technique by now. Nevertheless, applying this single-cell approach to these repetitive regions has resulted in new insights regarding the regulation of alpha-satellite transcription, especially their localization of centromeres to nucleoli. Regarding the significance of these insights in the context of centromere biology/regulation and its literature is hard to evaluate for me, because this is not my field of expertise (my background is in single-cell transcription regulation). As a researcher from a related research field, I think the findings of this manuscript are mostly relevant for the direct research community of centromere and alpha-satellite biology, but not for researchers outside the field.

      REFEREES CROSS COMMENTING

      I agree with all the points raised. There is indeed a lot of overlap. The experiments should not take long in normal circumstances, but given the current situation, some extra time may indeed be required.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The study by Bury et al. investigates the formation of two different types of alpha-satellite transcripts (ASAT, SF1 and 3) in different human cell lines. Using smFISH they find that during the cell cycle these centromeric transcripts don't stay at the centromere and are found in the cytoplasm after mitosis. Using specific inhibitors, they find that transcription is dependent on RNAPII, but not on various centromere and kinetochore proteins taking advantage of an inducible CRISPR-depletion system that the lab had previously developed. Interestingly, they find that CENP-C, a major component of the centromere and previously characterised as an RNA-binding protein, negatively regulates alpha-satellite transcript levels. Another regulator for transcript levels appears to be centromere-nucleolus interactions (as also indicated in the title) acting to suppress expression of these non-coding RNAs.

      Major points

      1) The authors state that the majority of smFISH foci do not colocalise with centromeres in a combined IF/FISH experiment (some quantification and a % of that subpopulation should be given somewhere). This is a bit concerning but of course could also be true. It either means that alpha-satellite transcripts leave the centromere as suggested by the authors (although some should be visible at the centromeres during the act of transcription). Alternatively, a trivial explanation would be that there is a lot of unspecific staining, which can occur in FISH-experiments to varying degrees. The RNase treatment to control for the absence of potential DNA hybridization is convincing, but the FISH probe could also interact with non-centromeric cellular RNA. With the centromere localisation as a reference point gone, some control is needed to validate that the RNA-FISH signals are indeed recognising alpha-satellite RNA that emerged from centromeres.¬ The authors could try competition experiments titrating unlabelled specific or unspecific DNA probes alongside their labelled specific FISH probe into their FISH experiment to see if they lose or maintain the signal and the number of foci. The specific RNA FISH probes could also be used in DNA FISH, to demonstrate they are working and recognising specific centromeres.

      2) Apart from Figure 4, there is no analysis shown for statistical significance. This should be done for most if not all quantifications. Are indeed ASAT and antisense RNA Foci number not significantly different? The authors say that the levels of alpha-sat RNA in Rpe1 cells are not substantially different from other cell lines, but is it also not significant (Fig 1F)? In Figure 2D it is concluded that transcripts foci number are increased in S/G2 (from G1) and remain stable in mitosis, but it looks like there is an increase in mitosis. Again, it looks like the higher number of smFISH foci/Cell is significantly higher for both ASAT and SF1, so some statistical analysis would be required here.

      3) Starting with the description of Figure 1E in the main text the paper equates foci count of smFISH per cell with RNA transcript levels. I'm not convinced that these are necessarily the same. You could have many weak foci or few very bright with the same amount of overall transcripts in both. The authors start out introducing smFISH as highly sensitive "for accurate characterisation of number ...of RNA transcripts". This suggests that foci intensity could be used as a read-out for transcript levels. It should be possible to measure individual intensity of the foci for a subset of images. Do foci intensity correlate or anti-correlate with foci numbers? Is the sum of the intensities of all the foci less variable than the foci number for an individual cell type?

      4) I really like the use of the inducible CRISPR system to remove various centromere factors. However, some validation would be required to show that the system is effective in removing the proteins of interest in these experiments. For instance it would be helpful to show in Figure 3D an additional panel with CENP-C staining. Also for a subset of factors, some antibody staining co-staining with the smFISH could be provided in the supplemental material.

      5) Since none of the CRISPR iKO has a particular inhibiting phenotype it would be useful to include some positive control in the CRISPR experiment. Would it be possible to use a CRISPR iKO target that affect some factor of the transcription machinery (RNA Pol II or similar) to reduce transcript levels?

      6) The authors find a negative correlation between the nucleolus-centromere association and the number of alpha sat foci. This is really interesting and they suggest that the nucleolus association could negatively regulate centromere transcription. However, this correlation is rather indirect in the sense that cells with a higher-degree of nucleolus-centromere localisation have fewer smFISH foci and the inverse, disruption of the nucleolus increases smFISH foci number as a whole. A model based on physical association would suggest that a nucleolus associated centromere produces less or no transcripts. Given that this is not a population-based assay, it should be possible to address this directly by analysing the location of individual centromeres and corresponding transcripts to strengthen the hypothesis. This could be done by either analysing the smaller subset of centromere-associated foci that colocalise with the smFISH signal and test whether the majority of these signals are proximal or distal to the nucleolus (this would not work or be less meaningful if the subpopulation is very small). Or doing a combined DNA/RNA FISH experiment. The expectation would be that DNA FISH signals of centromeres close to the nucleolus would not produce an RNA FISH signal somewhere else, and vice versa.

      7) At the end of the abstract, the authors conclude that the control of centromere transcription might be regulated by the centromere-nucleolar contacts to modulate chromatin dynamics. What does that really mean? One possibility they give in the discussion is rejuvenating centromeric chromatin. It would be nice if they could show some effect along those lines at the centromere in one of the manipulations they did (either through inhibiting or increase transcription). At least as discussed in the paper (Supp. Fig 3 D) it appears that overall levels of CENP-A are not affected. Is this different for newly loaded CENP-A? Or some other aspect of chromatin dynamics that is modulated? I realise that this might have been difficult to detect and therefore missing in the current study.

      Minor points

      Page 8: The authors state that as cells entered mitosis, dissociation of smFISH foci from chromatin was observed. While the absence of co-localisation of DAPI and smFISH signals is obvious in mitotic cells, what evidence is there that smFISH foci are chromatin associated in interphase nuclei? Rephrasing this bit might avoid confusion here.

      Significance

      This is overall a really interesting study and indeed, transcription at the centromere is little understood at this point. Given the importance of the centromere the findings in this manuscript will be of high interest to both researchers in the field and a general audience. There are novel and interesting insights into centromeric transcripts but the study still requires some controls.

      REFEREES CROSS COMMENTING

      I agree with the comments of the other reviewers. I appears that there is a lot of overlap between the referees regarding the exciting parts and those that raise concerns. In particular I share the view of Reviewer 1 on the imporantance of validating the FISH probes and the knockouts (also raised by R3) and the concern that a 24h transcription inhibition is prone to secondary effects. I would agree with both that less time might be required to complete revisions (may be 1-3 months) but was factoring in some extra time for wet experiments which likely take longer under the current conditions.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      In this manuscript the authors explore the requirements for centromere transcription using single-molecule FISH. Previous studies have found that centromeres are transcriptionally active in a wide variety of organisms. Centromere transcription has been proposed to facilitate Cenp-A deposition through chromatin remodeling and to directly contribute to centromere/kinetochore function by producing a functional ncRNA. However, we currently know almost nothing about how transcription is initiated at the centromere or how levels of centromere transcripts are controlled. This manuscript makes several major findings that are potentially of importance to groups studying centromere transcription. 1.) Centromere RNAs are produced by RNA Polymerase II and are localized in the nucleus of a wide-range of cell types. 2.) Centromere RNAs do not localize to the centromere, which is in contrast to several recent studies. 3.) Centromere proteins are not required for transcription of alpha-satellite sequences. 4.) Localization of centromeres to the nucleolus represses centromere transcription. Overall, this is a solid manuscript and has the potential to make a significant impact in the field. Below I suggest a couple of experiments and modification to the data presentation that could improve the manuscript.

      Major points:

      1.All of the experiments in this manuscript rely on detection of centromere RNAs using single molecule FISH probes. These probes are validated by showing the RNase treatment removes the FISH signal. A strength of this approach is that the authors use multiple different probe sets and achieve comparable results. However, there is no orthogonal validation that the probes detect alpha satellite RNA. All of the experiments in this manuscript would be significantly improved by showing that the results presented here can be confirmed by a different approach. I suggest that the authors use Q-RT-PCR to validate the smFISH results.

      2.Several results in this manuscript directly contradict results in published studies, but these discrepancies are not discussed. I believe the authors need to discuss the following discrepancies between their results and those in the literature:

      a. NcNulty et al. Dev. Cell. 2017. Show that alpha-satellite RNA is transcribed from all centromeres and remains localized to the site of transcription. The different results and possible explanations for the differences should be discussed.

      b. Additionally, Rosic et al. JCB 2014, Blower Cell Reports 2016 and Bobkov et al. JCB 2018 all show that centromere RNAs localize to centromere regions. The differences between these studies and the authors results should be discussed.

      c. The authors show that satellite RNA cannot be detected on mitotic chromosomes. However, Johnson et al. Elife 2017, Bobkov et al. JCB. 2018, and Perea-Resa et al. Mol. Cell. 2020 show that EU-labeled RNA can be detected at the centromere during mitosis. The authors should discuss the discrepancy between their results and these studies. Is it possible that their smFISH probes do not detect nascent, chromatin-bound transcripts?

      d. The authors show nicely that deletion of Ki-67 reduces centromere localization to the nucleolus and increases centromere transcription. However, this has no effect on centromere function. Studies from the Earnshaw lab (e.g. Nakano et al. Dev Cell 2008 and Bergmann et al. EMBO J. 2011) show that increasing or decreasing centromere transcription results in loss of kinetochore function on a human artificial chromosome. The authors should discuss the differences between their results and these studies. Is it possible that the small size of the HAC exaggerates the importance of the correct levels of centromere transcription?

      Minor Points

      1.The authors treat cells with transcriptional inhibitors for 24 hours. I am concerned that this may result in massive cell death. It would be helpful to include cell viability data from these experiments.

      2.In Figure 3C the authors examine the effects of centromere protein knock outs on centromere transcription. To me this is the most important experiment in the manuscript and is a major step forward for the field. The authors use inducible CRISPR knock out cell lines that are not 100% penetrant. It would be helpful if the authors could describe how they ensured that cells included in the image quantification were knock out cells.

      3.On p8. The authors cite Quenet and Dalal. eLife 2014 for the idea that transcription during G1 is important for new Cenp-A loading. They should also cite Chen et al. Dev. Cell 2015 and Bobkov et al. JCB. 2018.

      Significance

      Describes the requirements of transcription of centromere RNAs. Identifies factors that regulate centromere transcription.

      Audience: centromere biologists.

      REFEREES CROSS COMMENTING

      I agree with all the concerns raised by the other reviewers. I think that all three reviews taken together are a fair and constructive review of this manuscript.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to the Reviewers

      We thank the reviewers for their thoughtful comments and suggestions how to improve our manuscript. Most of the remarks are now addressed in a new version of the manuscript with modifications marked in blue. We found especially interesting the idea to explore the changes in dynamics of microtubules that make up bridging fibers, which we will do in revision. In addition, we will perform Western blot analysis of PRC1 and acquire better images of cells with SiR-DNA for Fig. 2 A.

      ** Major issues: **

      Reviewer #1:

      The use of blue light necessary to relocate opto-PRC1 from the spindle to the membrane is a concern, specially given the strongest phenotype associated with acute vs. constitutive inactivation of PRC1. While these differences may indeed reflect distinct cellular adaptation responses to each procedure, the authors must rule out that phototoxicity caused by blue light (e.g. see Douthwright S, Sluder G. Live Cell Imaging: Assessing the Phototoxicity of 488 and 546 nm Light and Methods to Alleviate it. J Cell Physiol. 2017 Sep;232(9):2461-2468. doi: 10.1002/jcp.25588. PubMed PMID:27608139) is not responsible for the observed stronger phenotypes. A control of U2OS cells expressing the centromere marker (without opto-PRC1) in metaphase after exposure to the same blue light regimen (i.e. 200 ms every 10 sec for 20 min and same laser power) should be provided.

      Response: We thank the reviewer for raising this important point. We added the suggested experiments on U2OS cells without opto-PRC1 filmed with same blue light regimen, and updated Fig. S2 A-C, E to contain also the new measurements of inter-kinetochore distance (dKC), distance from equatorial plane (dEQ), corresponding time-lapse images, and angle between sister kinetochore axis and spindle long axis (αKC), respectively. We added the following text in Results: “The effects of PRC1 removal were found neither in control experiments without iLID, nor in a different set of control experiments where cells without opto-PRC1 and without iLID were exposed to the same laser illumination protocol (Fig. 2 B,E,F,H; Fig. S2, A-C, E), suggesting that the observed effects were not a consequence of laser photodamage (Douthwright and Sluder, 2017).”

      Reviewer #1:

      I could not find in the manuscript whether opto-PRC1 is RNAi resistant. I would assume so, as the authors are targeting the 3'-UTR of endogenous PRC1, but at least a western blot should be provided: 1) to properly ascertain depletion efficiency of the endogenous protein; and 2) the levels of opto-PRC1 after depletion.

      Response: We added a note in Methods that opto-PRC1 is RNAi-resistant. We have assessed the depletion efficiency of the endogenous PRC1 and the levels of opto-PRC1 after depletion by using immunofluorescence of PRC1 on the spindle (Fig. S1 A and B). Additionally, we will perform Western blot analysis to show depletion efficiency of endogenous PRC1 and the levels of transfected opto-PRC1 after depletion of endogenous PRC1. However, as we observed that the efficiency of opto-PRC1 plasmid transfection is low, Western blot analysis may provide biased levels of PRC1 in the complete population, not specific to the analyzed opto cells.

      Reviewer #1:

      One aspect related with data interpretation and the proposed model: if PRC1 selectively bundles anti-parallel microtubules, how could it mechanically couple sister k-fibers that are made of parallel MTs? This should be explained in detail, ideally supported by data.

      Response: This is an important issue, which we now explain in detail in Discussion: “As midzone-crossing microtubules associate with k-fibers on either side of the metaphase plate (O'Toole et al., 2020), PRC1 and probably also other microtubule-associated proteins crosslink antiparallel overlaps between k-fiber microtubules extending from one pole and bridging microtubules extending from the opposite spindle half, as well as antiparallel overlaps within the bridging fiber.”

      Reviewer #1:

      The author should find a way to unequivocally demonstrate that opto-PRC1 is fully functional and can rescue depletion of endogenous PRC1. The fact that recovery of PRC1 on spindles never fully rescue spindle architecture and chromosome properties might indicate that opto-PRC1 is not fully functional. For example, can it rescue anaphase or cytokinesis roles of PRC1?

      Response: To demonstrate functionality of opto-PRC1, we added images of cell's progression to cytokinesis in both control and opto cells in new Fig. S1D and added the following text to Results: “Importantly, after exposure to the blue light, opto cells were able to progress to cytokinesis (Fig. S1 D)”. Furthermore, as PRC1's major binding partners, Kif4A and MKLP1 (Fig. S4A, and new Fig. S4J, respectively), which depend on its localization in the spindle midzone in anaphase, are found to co-localize with opto-PRC1 in anaphase, opto-PRC1 is fully functional and rescues depletion of endogenous PRC1. We added the following text to Results: “In anaphase, MKLP1 also co-localized with opto-PRC1 in the spindle midzone (Fig. S4 J) (Gruneberg et al., 2006; Kurasawa et al., 2004)”.

      ** Minor issues: **

      Reviewer #1:

      Abstract: the authors introduce the problem by stating that chromosome position at the spindle equator is mainly regulated by forces by kMTs. We do not know this, actually there is evidence in the literature that kif4a on chromosome arms is required to maintain chromosomes aligned by exerting forces on ipMTs (e.g. Wandke et al., JCB, 2012). Along the same line, there is evidence from the Dumont lab that sister k-fibers are not mechanically coupled. These alternative views should be discussed and taken into account when formulating the problem under investigation in the present study.

      Response: We changed the sentence in Abstract to include polar ejection forces: “During metaphase, chromosome position at the spindle equator is regulated by the forces exerted by kinetochore microtubules and polar ejection forces”. When formulating the problem in Introduction, we discuss polar ejection forces and cite Wandke et al., 2012, and several other papers. We also discuss the findings about PRC1-mediated coupling of sister k-fibers from the Dumont lab in relation to our local effect of PRC1 removal on a fraction of sister kinetochore pairs: “This local effect is in line with weak mechanical coupling between neighboring k-fibers, yet strong coupling between sister k-fibers (Elting et al., 2017; Suresh et al., 2020).” In addition, we mention the Dumont lab results when we suggest that the persistent misorientation of kinetochores after PRC1 return to the spindle is due to perturbed overlap geometry during the absence of PRC1: “This is in agreement with a recent finding that PRC1 restricts pivoting of k-fibers near kinetochores by promoting tight coupling between sister k-fibers (Suresh et al., 2020).”

      Reviewer #1:

      The authors refer to kinetochore alignment or lagging kinetochores throughout the text. Although this is unquestionable, it might be more appropriate to refer to chromosome alignment or lagging chromosomes instead, as this is the object to me moved.

      Response: We agree with the reviewer and changed this at several places throughout the text.

      Reviewer #1:

      page 2: "...PRC1 regulates forces acting on kinetochores". The authors should mention that this would be indirect, as PRC1 is not at kinetochores itself.

      Response: This is true and therefore we added the word indirectly in this sentence.

      Reviewer #1:

      page 6: "PRC1 removal did not activate the spindle assembly checkpoint". Although this might be considered semantics, given that the SAC is constitutively active and needs to be satisfied, the authors might adopt a more accurate description such as "PRC1 removal did not prevent spindle assembly checkpoint satisfaction".

      Response: We changed the sentence into the suggested one.

      Reviewer #1:

      page 13: the authors mention about the localization of Kif18a on bridging fibers. Was this known? From the images it is unclear if we are looking at bridging fibers or k-fibers. Co-localization with PRC1 would help clarifying this issue. If indeed associated with bridging fibers, this would raise an alternative interpretation of how Kif18a contributes to maintain chromosome alignment.

      Response: We thank the reviewer for raising this important point. Kif18A localization in the bridge is a new observation, and to make it clearer we introduced merged images where both Kif18A and PRC1 are shown during optogenetic experiment (Fig. S4E) and four examples of enlarged regions around kinetochores with Kif18A-GFP to show its localization in the bridging fiber in mock treated cells and its lack of localization in the bridging fiber after PRC1 siRNA (Fig. S4F). We also added a discussion of a new potential role of Kif18A (and Kif4A and MKLP1) in chromosome alignment: “Interestingly, we found that Kif4A, MKLP1, and Kif18A localize in the bridging fibers in metaphase and this localization was lost after optogenetic or siRNA-mediated PRC1 removal. During anaphase, the PRC1-dependent Kif4A and MKLP1 in the bridging fibers are involved in sliding of antiparallel microtubules to elongate the spindle (Vukušić et al., 2019). Kif4A and MKLP1 may have a similar role in metaphase, and thus Kif4A removal from the bridging fibers induced by PRC1 removal may affect chromosome alignment by affecting microtubule sliding in the bridging fiber. This possibility is in agreement with previous work showing that Kif4A depletion reduces microtubule flux (Wandke et al., 2012). Similarly, Kif18A in the bridging fiber may have microtubule-sliding and crosslinking activities similar to those of the yeast kinesin-8 (Su et al., 2013), which may promote chromosome alignment. The roles of these and other motors within bridging fibers in chromosome alignment will be an intriguing topic for future studies.”

      ** Major concerns: **

      Reviewer #2:

      Regarding lagging chromosomes: Page 6 and Fig. 2F: "Kinetochore remains displaced even after opto-PRC1 return": Why is this? The reasoning in the discussion is not clear/convincing. Is it possible that these irreversible changes reflect light-induced deactivation of protein? Or, could these irreversible changes arise from a perturbation in the structure of microtubules at the end of the 'light' period? Discussion or additional supportive evidence to address this will be helpful.

      Response: We thank the reviewer for raising this question. As we have not observed these effects in control cells with opto-PRC1 and without iLID that relocates opto-PRC1 to the membrane, we do not find light-induced deactivation of opto-PRC1 likely. We find the latter possibility more realistic, thus we added the following text to Discussion: “Kinetochore positions and orientations did not revert to the initial values within 10 min of PRC1 return. We speculate that upon PRC1 removal the geometry of the overlap structures is perturbed due to a change in the force balance in the spindle. When PRC1 returns to the perturbed overlaps, it likely confines the chromosomes in new positions and orientations. This is in agreement with a recent finding that PRC1 restricts pivoting of k-fibers near kinetochores by promoting tight coupling between sister k-fibers (Suresh et al., 2020).”

      Reviewer #2:

      The correlation between misaligned kinetochores and lagging chromosomes is not clear. Are lagging chromosomes more frequently attached to kinetochores that show high deq values (Fig. S2G) in metaphase?

      Response: This is an interesting point. To clarify our results, we rewrote the text: “Opto cells that showed lagging kinetochores in anaphase had a slightly smaller inter-kinetochore distance before anaphase than opto cells without lagging kinetochores, but we did not find correlation between lagging kinetochores in anaphase and kinetochore misalignment in metaphase (Fig. S2 H).” Fig. S2 H is the old Fig. S2 G. Please note that we were not able to backtrack individual lagging kinetochores to metaphase to see if they had a higher value of d_eq. Instead, we measured mean d_eq of all kinetochores in opto cells that had a lagging chromosome and in those that did not.

      Reviewer #2:

      Regarding the contribution of other motors: Looking at the contributions of various other microtubule-associated proteins in accounting for effects of PRC1 removal is a good addition to the paper (Fig. 4). However, the consequences of the depletion of Kif18A and MKLP1 from the bridging fiber are not elaborated upon. Is the presence of these motors at the bridging fiber functionally important? It would be good to incorporate their known activity in the final model for how PRC1-crosslinked fibers align chromosomes. In particular, a recent biorxiv submission from this group has a thorough examination of the consequences of motor removal in anaphase, and perhaps some of their findings and other literature can be used to draw some insights into if and how the presence of these motors on PRC1-crosslinked fibers contribute to chromosome alignment.

      Response: We thank the reviewer for this important idea, which we now elaborate in Discussion: “Interestingly, we found that Kif4A, MKLP1, and Kif18A localize in the bridging fibers in metaphase and this localization was lost after optogenetic or siRNA-mediated PRC1 removal. During anaphase, the PRC1-dependent Kif4A and MKLP1 in the bridging fibers are involved in sliding of antiparallel microtubules to elongate the spindle (Vukušić et al., 2019). Kif4A and MKLP1 may have a similar role in metaphase, and thus Kif4A removal from the bridging fibers induced by PRC1 removal may affect chromosome alignment by affecting microtubule sliding in the bridging fiber. This possibility is in agreement with previous work showing that Kif4A depletion reduces microtubule flux (Wandke et al., 2012). Similarly, Kif18A in the bridging fiber may have microtubule-sliding and crosslinking activities similar to those of the yeast kinesin-8 (Su et al., 2013), which may promote chromosome alignment. The roles of these and other motors within bridging fibers in chromosome alignment will be an intriguing topic for future studies.”

      Reviewer #2:

      Page 13 and Fig. 4A & 4B: "The localization of Kif18A in the bridge was perturbed by both acute and long-term PRC1 removal." However, this is not apparent from Figures 4A and 4B. It would be helpful to clarify how this interpretation was made from the data in the figure.

      Response: We thank the reviewer for pointing this out. To clarify this issue, we added merged-channel images of the cell from Fig. 4 A to Fig. S4 E to show colocalization of Kif18A and PRC1. Moreover, we added enlargements from spindles without and with PRC1 depletion in Fig. S4 F to show presence and absence of Kif18A in the bridging fibers, respectively. To clarify how the images were evaluated, we added the following text in Methods: “Localization test of GFP-Kif4A or Kif4A-GFP, MKLP1-GFP, Kif18A-GFP, EGFP-CLASP1, and CENP-E-GFP in the bridging fibers of either opto cells or cells treated with mock siRNA or PRC1 siRNA was performed by visually inspecting the GFP signal through the z-stack, in the region where PRC1-labeled fibers were found, i.e., in the region that spans between sister kinetochores and continues ~2 µm laterally from sister kinetochores.”

      Additional suggested experiment and analysis:

      Reviewer #2:

      One factor that could potentially contribute to the changes in chromosome alignment and increase in lagging chromosomes upon PRC1 removal, is changes in dynamics of microtubules that make up bridging fibers. This may also provide insights on the role of associated proteins. One possible experiment is to look at tubulin turnover in the bundles (example by FRAP). Another alternative possibility is to examine EB3 comets in the presence and absence of PRC1 (note: these are just some potential suggestions; the authors may have other ways of addressing the question). Examining the dynamics would help in addressing if bridging fibers is dynamically remodeled through metaphase and early anaphase and whether the loss of PRC1 causes a change in the dynamics of these microtubules.

      Response: We thank the reviewer for this exciting suggestion. We will perform experiments to examine EB3 comets (their numbers and velocities) in the bridging fibers in the presence and absence of PRC1.

      Reviewer #2:

      Do kinetochores oscillate / fluctuate about the metaphase plate over time? Does the absence of PRC1 affect these fluctuations? Since the authors already have the data (Fig. 2), they can track the trajectories of sister kinetochore displacement from the equatorial plane as a function of time from prometaphase on, both in the presence and absence of PRC1. This analysis will be informative in understanding how kinetochores and bridging fibers act together to maintain force balance in the spindle and how misalignments are corrected.

      Response: This is an interesting point. We added the following results: “Kinetochore displacement was not a result of higher oscillation amplitude because kinetochores fluctuated to a similar extent in the presence and absence of opto-PRC1, but in its absence the displaced kinetochores fluctuated within a region that was offset from the equatorial plane (Fig. S2 D).”

      ** Minor points: ** Reviewer #2:

      Is the number of microtubules that make up the bridging fiber the same for outermost and inner kinetochores?

      Response: This is an interesting question. Even though we could not measure this here, we suggest that the outermost bridges may have more microtubules: “The misaligned kinetochores were found in the inner part of the spindle, where PRC1 signal disappeared faster than on the outer part, which indicates that the inner bridging fibers were more severely affected by PRC1 removal and/or that they are made up of fewer microtubules than the outer bridges.”

      Reviewer #2:

      The quantity dax that is plotted in fig. S2F has not been defined in the text.

      Response: We now define dAX in the caption of Fig. S2 F: “Graphs show aKC versus corresponding dEQ and the distance from the midpoint between sister kinetochores to the long spindle axis, dAX (left), ...”

      Reviewer #2:

      Discussion of these findings in the context of recent work from the lab of Sophie Dumont will be interesting (Suresh et al. eLife 2020;9:e53807).

      Response: We thank the reviewer for reminding us to discuss this highly relevant recent paper by the Dumont lab. We included a discussion of the findings about PRC1-mediated coupling of sister k-fibers in relation to our local effect of PRC1 removal on a fraction of sister kinetochore pairs: “This local effect is in line with weak mechanical coupling between neighboring k-fibers, yet strong coupling between sister k-fibers (Elting et al., 2017; Suresh et al., 2020).” In addition, we mention these results when we suggest that the persistent misorientation of kinetochores after PRC1 return to the spindle is due to perturbed overlap geometry during the absence of PRC1: “This is in agreement with a recent finding that PRC1 restricts pivoting of k-fibers near kinetochores by promoting tight coupling between sister k-fibers (Suresh et al., 2020).”

      Reviewer #3:

      Some of the images are sub-optimal. For example Fig 2A, there doesn't seem to be much/any PRC1 on the spindle in the "Dark 0 min" condition, although some is visible after the reversal. Do the authors have a better example to show here? In Figures 1 and 2 we can see the removal clearly yet in later images, the spindle is zoomed such that the relocation cannot be observed.

      Response: We agree that PRC1 is not properly visible in Fig. 2 A and we will do new experiments to obtain better images. Regarding the later images in Figs. 3 and 4 that are zoomed, they are displayed in this manner to show the localization of proteins in the bridging fiber and/or at the ends of kinetochore fibers. In these experiments, the removal of PRC1 was the same as in earlier images, which is visible in examples shown in Figs. S3 and S4.

      Reviewer #3:

      Have the authors looked at whether the cells progress normally after removal and reversal of PRC1? In the paper the authors describe how the knockdown and re-expression of opto-PRC1 does not interfere with mitotic progression, but we wondered whether cells recover after the optogenetic operation, compared to a control with similar illumination.

      Response: We found that cells were able to progress to cytokinesis and added an example to Fig. S1 D and the following text to Results: “Importantly, after exposure to the blue light, opto cells were able to progress to cytokinesis (Fig. S1 D).”

      Reviewer #3:

      Is there a reason why no Dark-state images are shown in Fig 2C and I?

      Response: We swapped those images with Dark-state images.

      Reviewer #3:

      For some of the plots, the y-axis is not shown scaled from 0. This is misleading because it exaggerates differences. Examples are 2E,F,G,H, 3G,J, S3E,G,H, S5C,D,E,F,G.

      Response: We agree and we now show graphs with the y-axis starting at 0 for Fig. 2 F,H, Fig. S2 B,E, Fig. 3 G,J, Fig. S3 G, and Fig. S5 C,F,G. However, we did not change the graphs for d_kc, theta, spindle length and width, and widths of bridging and k-fibers, because these values span a rather narrow range, which is far from zero. Because the differences are statistically significant, we chose a scale at which they can be easily visualized.

      Reviewer #3:

      In the legend, formulae should be written in correct notation.

      Response: Corrected: Formulae y=A*exp(-τ*x) and y=A*exp(-τ*x)+c were used for opto-PRC1 removal and return, respectively.

      Reviewer #3:

      In Fig 2 legend it says that a 0.5-pixel-radius Gaussian blur is applied. Doesn't the kernel for transformation result in an identity matrix?

      Response: To clarify this, we replaced “0.5-pixel-radius Gaussian blur” with “0.5-pixel-sigma Gaussian blur” in figure captions and added the following to Methods: “To remove high frequency noise in displayed images a Gaussian blur filter with a 0.5-pixel sigma (radius) was applied where stated”.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This manuscript by Jagrić et al. shows a role for PRC1 at metaphase by using an optogenetic method to rapidly remove PRC1 from bridging fibres in the mitotic spindle. They show that this methodology (which uses light to relocate PRC1 temporarily on the plasma membrane) is superior to long-term depletion by siRNA and, because it is reversible, has advantages over chemically-induced protein translocation. They put the method to use to examine PRC1's role in bridging fibres the results are consistent with siRNA approaches but cleaner due to the acute nature of the method. Overall the paper is convincing and is likely to be of interest to cell biologists working on mitosis.

      We have only minor comments that can be easily addressed during the current crisis. Note that we covered this paper in our lab journal club when it went up on bioRxiv and our comments in that pre-pandemic time were the same as now.

      1.Some of the images are sub-optimal. For example Fig 2A, there doesn't seem to be much/any PRC1 on the spindle in the "Dark 0 min" condition, although some is visible after the reversal. Do the authors have a better example to show here? In Figures 1 and 2 we can see the removal clearly yet in later images, the spindle is zoomed such that the relocation cannot be observed.

      2.Have the authors looked at whether the cells progress normally after removal and reversal of PRC1? In the paper the authors describe how the knockdown and re-expression of opto-PRC1 does not interfere with mitotic progression, but we wondered whether cells recover after the optogenetic operation, compared to a control with similar illumination.

      3 Is there a reason why no Dark-state images are shown in Fig 2C and I?

      4.For some of the plots, the y-axis is not shown scaled from 0. This is misleading because it exaggerates differences. Examples are 2E,F,G,H, 3G,J, S3E,G,H, S5C,D,E,F,G

      5.In the legend, formulae should be written in correct notation.

      6.In Fig 2 legend it says that a 0.5-pixel-radius Gaussian blur is applied. Doesn't the kernel for transformation result in an identity matrix?

      Significance

      We thought the paper is likely to be of significant interest to cell biologists working on mitosis.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, Jagric and colleagues adapt an optogenetic method for acute and reversible removal of spindle associated proteins to the cell membrane. They apply this technique to deplete the microtubule crosslinking protein PRC1 from the metaphase spindle with high temporal accuracy. They establish that the spindle localization of PRC1 can be perturbed in a fast and reversible manner, on a timescale of minutes, using this method.

      Next, they use this system to show that acute depletion of PRC1, which has previously been shown to localize to the bridging fibers that link kinetochore pairs. They find that PRC1 depletion modestly disrupts chromosome alignment on the metaphase plate and results in an increased frequency of lagging kinetochores during anaphase. The advantage of the optogenetic system is that they can look at the reversibility and compare the effects of acute depletion to long-time course methods such as RNAi. This comparison is well done and well presented in the paper. The authors further probe the mechanism underlying the defects associated with PRC1 depletion and find a decrease in the number of microtubules that make up a bridging fiber. The localization of other proteins to the kinetochores are not affected by the removal of PRC1, but the localization of Kif18a and MKLP1 to bridging fibers is disrupted. Together, the authors propose a model where the movement of bi-oriented chromosomes is restricted to the region containing PRC1-crosslinked bridging fibers, and this buffering is important in maintaining chromosome alignment.

      Overall, the paper is well written, and the schematics, figures and descriptions of experiments are easy to follow. The microscopy experiments and data analysis are carefully performed and thorough, and the representation of data in figures and tables is very clear. The comparison between RNAi and opto-depletion has been well executed and a great addition. The advance in this paper is the establishment of an optogenetics system to selectively and reversibly perturb PRC1. While the method is not novel (Guntas et al., 2015), its development and application to a spindle protein will be of interest to researchers in the field, and I expect this work to be a major resource in that regard. I am less enthusiastic about the biological findings as the effects of PRC1-removal from the bridging fiber are modest. In addition, some effects, such as kinetochore misalignment and decrease in the number of microtubules in the bridging fiber, are not reversible, which raises some concerns about whether these effects are directly mediated by specific protein depletion. I have outlined my specific comments below:

      Major concerns:

      1 . Regarding lagging chromosomes:

      •Page 6 and Fig. 2F: "Kinetochore remains displaced even after opto-PRC1 return": Why is this? The reasoning in the discussion is not clear/convincing. Is it possible that these irreversible changes reflect light-induced deactivation of protein? Or, could these irreversible changes arise from a perturbation in the structure of microtubules at the end of the 'light' period? Discussion or additional supportive evidence to address this will be helpful.

      •The correlation between misaligned kinetochores and lagging chromosomes is not clear. Are lagging chromosomes more frequently attached to kinetochores that show high deq values (Fig. S2G) in metaphase?

      2 . Regarding the contribution of other motors

      •Looking at the contributions of various other microtubule-associated proteins in accounting for effects of PRC1 removal is a good addition to the paper (Fig. 4). However, the consequences of the depletion of Kif18A and MKLP1 from the bridging fiber are not elaborated upon. Is the presence of these motors at the bridging fiber functionally important? It would be good to incorporate their known activity in the final model for how PRC1-crosslinked fibers align chromosomes. In particular, a recent biorxiv submission from this group has a thorough examination of the consequences of motor removal in anaphase, and perhaps some of their findings and other literature can be used to draw some insights into if and how the presence of these motors on PRC1-crosslinked fibers contribute to chromosome alignment.

      •Page 13 and Fig. 4A & 4B: "The localization of Kif18A in the bridge was perturbed by both acute and long-term PRC1 removal." However, this is not apparent from Figures 4A and 4B. It would be helpful to clarify how this interpretation was made from the data in the figure.

      Additional suggested experiment and analysis:

      1 . One factor that could potentially contribute to the changes in chromosome alignment and increase in lagging chromosomes upon PRC1 removal, is changes in dynamics of microtubules that make up bridging fibers. This may also provide insights on the role of associated proteins. One possible experiment is to look at tubulin turnover in the bundles (example by FRAP). Another alternative possibility is to examine EB3 comets in the presence and absence of PRC1 (note: these are just some potential suggestions; the authors may have other ways of addressing the question). Examining the dynamics would help in addressing if bridging fibers is dynamically remodeled through metaphase and early anaphase and whether the loss of PRC1 causes a change in the dynamics of these microtubules.

      2 . Do kinetochores oscillate / fluctuate about the metaphase plate over time? Does the absence of PRC1 affect these fluctuations? Since the authors already have the data (Fig. 2), they can track the trajectories of sister kinetochore displacement from the equatorial plane as a function of time from prometaphase on, both in the presence and absence of PRC1. This analysis will be informative in understanding how kinetochores and bridging fibers act together to maintain force balance in the spindle and how misalignments are corrected.

      Minor points:

      1 . Is the number of microtubules that make up the bridging fiber the same for outermost and inner kinetochores?

      2 . The quantity dax that is plotted in fig. S2F has not been defined in the text.

      3 . Discussion of these findings in the context of recent work from the lab of Sophie Dumont will be interesting (Suresh et al. eLife 2020;9:e53807).

      Significance

      The advance in this paper is the establishment of an optogenetics system to selectively and reversibly perturb PRC1. While the method is not novel (Guntas et al., 2015), its development and application to a spindle protein will be of interest to researchers in the field, and I expect this work to be a major resource in that regard. I am less enthusiastic about the biological findings as the effects of PRC1-removal from the bridging fiber are modest.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Jagric et al. investigates the role of PRC1 in the maintenance of chromosome alignment at the spindle equator using acute inactivation by optogenetic control. This is an elegant system based on the iLID system that recruits a protein of interest to the cell membrane in a reversible way. Accordingly, acute removal of PRC1 resulted in reduction of bridging fibers and decreased inter-kinetochore distances, while widening the metaphase plate and increasing the frequency of lagging chromosomes in anaphase. The authors investigate whether acute PRC1 removal from bridging fibers compromise other proteins and conclude that PRC1 acts essentially by coupling bridging and kinetochore fibers. They propose that PRC1 uses this role to buffer kinetochore movements to promote chromosome alignment. Overall, this is a very high quality study that adds to our knowledge about the roles of PRC1 and bridging fibers in spindle mechanics and will be of interest to a specialized readership of mitosis researchers. Nevertheless, there are still few remaining issues, mostly concerning additional controls and interpretations that should be addressed prior to publication.

      Major issues:

      1- The use of blue light necessary to relocate opto-PRC1 from the spindle to the membrane is a concern, specially given the strongest phenotype associated with acute vs. constitutive inactivation of PRC1. While these differences may indeed reflect distinct cellular adaptation responses to each procedure, the authors must rule out that phototoxicity caused by blue light (e.g. see Douthwright S, Sluder G. Live Cell Imaging: Assessing the Phototoxicity of 488 and 546 nm Light and Methods to Alleviate it. J Cell Physiol. 2017 Sep;232(9):2461-2468. doi: 10.1002/jcp.25588. PubMed PMID:27608139) is not responsible for the observed stronger phenotypes. A control of U2OS cells expressing the centromere marker (without opto-PRC1) in metaphase after exposure to the same blue light regimen (i.e. 200 ms every 10 sec for 20 min and same laser power) should be provided.

      2- I could not find in the manuscript whether opto-PRC1 is RNAi resistant. I would assume so, as the authors are targeting the 3'-UTR of endogenous PRC1, but at least a western blot should be provided: 1) to properly ascertain depletion efficiency of the endogenous protein; and 2) the levels of opto-PRC1 after depletion.

      3- One aspect related with data interpretation and the proposed model: if PRC1 selectively bundles anti-parallel microtubules, how could it mechanically couple sister k-fibers that are made of parallel MTs? This should be explained in detail, ideally supported by data.

      4- The author should find a way to unequivocally demonstrate that opto-PRC1 is fully functional and can rescue depletion of endogenous PRC1. The fact that recovery of PRC1 on spindles never fully rescue spindle architecture and chromosome properties might indicate that opto-PRC1 is not fully functional. For example, can it rescue anaphase or cytokinesis roles of PRC1?

      Minor issues:

      1- Abstract: the authors introduce the problem by stating that chromosome position at the spindle equator is mainly regulated by forces by kMTs. We do not know this, actually there is evidence in the literature that kif4a on chromosome arms is required to maintain chromosomes aligned by exerting forces on ipMTs (e.g. Wandke et al., JCB, 2012). Along the same line, there is evidence from the Dumont lab that sister k-fibers are not mechanically coupled. These alternative views should be discussed and taken into account when formulating the problem under investigation in the present study.

      2- The authors refer to kinetochore alignment or lagging kinetochores throughout the text. Although this is unquestionable, it might be more appropriate to refer to chromosome alignment or lagging chromosomes instead, as this is the object to me moved.

      3- page 2: "...PRC1 regulates forces acting on kinetochores". The authors should mention that this would be indirect, as PRC1 is not at kinetochores itself.

      4- page 6: "PRC1 removal did not activate the spindle assembly checkpoint". Although this might be considered semantics, given that the SAC is constitutively active and needs to be satisfied, the authors might adopt a more accurate description such as "PRC1 removal did not prevent spindle assembly checkpoint satisfaction".

      5- page 13: the authors mention about the localization of Kif18a on bridging fibers. Was this known? From the images it is unclear if we are looking at bridging fibers or k-fibers. Co-localization with PRC1 would help clarifying this issue. If indeed associated with bridging fibers, this would raise an alternative interpretation of how Kif18a contributes to maintain chromosome alignment.

      Significance

      If additional controls are provided, this manuscript represents a significant technical advance in the study of PRC1 function. The results however are just incremental relative to previous state-of-the-art and will be of interest to more specialized researchers working on mitosis and spindle architecture. The concept of buffer for kinetochore movement is interesting, but how exactly PRC1 contributes to this is not addressed in the present work. Maybe some modeling would help test some ideas.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We are grateful for the reviewers’ appreciation and comments. We have tried to address all concerns, and believe that those changes have greatly ameliorated the precision and presentation of our findings. All of our responses are in green in this document, and so are the changes in the manuscript and figure legends.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In "An asymmetry in the frequency and position of mitosis in the epiblast precedes gastrulation and suggests a role for mitotic rounding in cell delamination during primitive streak epithelial-mesenchymal transition", Mathiah, Despin-Guitard and colleagues study divisions during mouse gastrulation. They perform ex vivo culture, live imaging and immunostaining to observe the frequency and position of mitosis within the embryo as well as the destiny of daughter cells after their divisions. The find that divisions on the posterior side of the embryo tend to be more basally located and could contribute to cell delamination into the mesodermal layer. Authors also affect antero-posterior signaling by genetically preventing the migration of the anterior visceral endoderm, which leads to mitosis away from the apical side of the epithelium on all lateral parts of the embryo.

      This study tackles a key developmental process which is poorly understood in mammals due to its concomitance with the implantation phase. Therefore, any carefully-made description of this process has the capacity to be eye-opening. This is potentially the case for this report, which provides nice images that most likely required skills and important efforts to obtain. The authors have written a clear manuscript with an interesting narrative. However, the quantifications are very poorly described, which makes it impossible for anyone to reproduce these results. I describe below a number of suggestions to clarify the quantifications, which is in my opinion a prerequisite to consider the conclusions from the authors.

      Fig1: Authors describe differences in the formation of rosettes between the anterior and posterior sides of the embryo. The microscopy images and movies provided are overlaid with drawings from the authors but without this visual help, I, and I assume other readers, see more rosettes than highlighted and fail to see some of the rosettes that are marked. To avoid this subjectivity, a clear methodology is required. In the methods, the authors state: "For quantification, rosettes were manually annotated and counted on Z sections located 5 μm from the basal side of the epiblast." And that is all. What defines a rosette? How many cells need to share a vertex to be considered as part of a rosette? How long do they need to persist not to be considered as occurring by chance? What about cells part of multiple rosettes? Does the rosette organization need to be apical, basal or all the way? Having those clearly defined criteria would be essential for anyone else to reproduce this quantification and would also offer a much more comprehensive description of the phenomenon and allow for more powerful conclusions.

      A rosette is defined as a multicellular transient structure composed of at least 5 cells converging to a central vertex. Practically, a region of interest where cell contours are in focus is determined on the Z-section located 5 mm from the epiblast basal side, which is easily identified as the epithelial architecture changes radically when one enters the visceral endoderm. Only rosettes that are visible throughout the epiblast layer, from the basal to the apical side, are counted. To ensure this, manual segmentation of all cells (for all Z plan acquired, from the basal plane to the apical side) contributing to a rosette was performed for lightsheet imaging. This is illustrated in Video 2. For confocal imaging, segmentation was annotated only at the basal plane, but visual verification that the rosette structure is persistent throughout the layer was performed. One cell could be part of several rosettes, and rosette events were counted even when visible only on one timeframe, but this was consistent for all embryo sides. Due to the time resolution of confocal imaging, rosettes could not be followed overtime. However, the time resolution of lightsheet imaging allowed observing rosettes lifespan and resolution. The protocol for image analysis has been better detailed in the results (lines 162-163) and the methods section of the revised version of the manuscript (lines 452-467, copied hereunder).

      "Rosettes: For lightsheet imaging, embryos were dissected at E5.75. Images were acquired for 10 to 12 hours. Quantification focused on the first 20 to 30 frames (around 3 hours) to capture pregastrulation events and reduce the risk of bias from imaging. The rest of the frames showed that the embryo continued growing for several hours. Z-stacks from 4 sides were fused using Zeiss plugin for lightsheet Imaging. Images were then processed using Arivis Vision4D v2.12.3 (Arivis, Germany). Embryo contours were segmented manually on each Z-slice and time point, in order to adjust for embryo rotation manually if necessary. For each side of the embryo, Z stack was cropped to an average of 30 Z slices, from the basal side (5 microns from VE layer, which can be morphologically distinguished due to cell shape and membrane Tomato distribution) to the cavity, marking the apical side. Rosettes were identified and counted on Z sections located 5 µm from the basal side of the epiblast. Practically, vertices were systematically scanned to find those in which 5 cells or more met. Cells contributing to a rosette were then manually segmented on each Z-slice and time point by highlighting cellular membranes using Wacom’s Cintiq 13HD, to create a 3D reconstruction. For confocal imaging, rosettes were identified using the same method, and counted on Z sections located 5 µm from the basal side of the epiblast after visual verification that it was present throughout the Z-stack. For both techniques, presence of associated apical rounding was assessed for each vertex. Cells could contribute to several rosettes."

      In addition, the data are given as "rosette/frame" and as "rosette/mm2". What is the point of giving both data, which are essentially the same? The frame is irrelevant. It would be more interesting to know how many cells there are in this area, as cell packing could be a determinant of rosette formation. "Rosette/mm2/min" is very confusing. It should state "rosette.mm-2.min-1" or "rosette/mm2.min".

      Following this comment, we indeed chose to get rid of the data expressed as “rosette/frame”. Cells were counted in the area of the epiblast in focus to present data as number of rosettes normalized by the number of cells in the region of interest for both lightsheet and confocal microscopy data (described in results section lines 140 and 164). These measurements led to a similar conclusion, confirming that rosettes are more frequent in posterior epiblast. Difference in cell packing was indeed essential to rule out. We estimated cell packing as the ratio of cell number to surface area, and found it to be similar in posterior, anterior, and lateral sides of embryos at a given stage, which indicates that cell packing is not a determinant for difference in rosette frequency in this context. We discussed packing in the Results section (lines 169-173, copied hereunder).

      "The cell number per surface area was similar on all sides, which indicates that the higher number of rosettes was not due to increased cell packing. Rosettes have also been identified in the chick PS (Wagstaff, Bellett, Mogensen, & Münsterberg, 2008), where they were proposed to facilitate ingression during gastrulation."

      We modified the legend to use "rosette/mm2.min”.

      On a conclusive note, I fail to understand how relevant the formation of rosettes would be. The authors should clarify this point.

      Epithelial rosettes have been observed as common intermediates in numerous morphogenesis events. In particular cases, such as Drosophila germ band extension, or zebrafish lateral line development, the mechanisms of formation (planar cell polarity (PCP) and apical constriction, respectively) and resolution have been very well described. In the mouse embryo, anterior visceral endoderm (AVE) migration has been linked to PCP signaling-dependent rosette formation (Trichas 2012). In primitive streak (PS) formation, rosettes with actin-rich centers were described in the chick PS and found to be Nodal dependent (Wagstaff 2008 and Yaganawa 2011). Their mode of formation or resolution is currently unknown. Our observations confirm the findings in chick and highlight the presence of rosettes at an earlier stage, before PS can be identified. Interestingly, rosettes are enriched on the posterior side at the same time when Nodal signaling becomes asymmetric, leading to posterior restriction of basal membrane perforations (Kyprianou 2020). To progress towards understanding rosettes’ significance in the mouse gastrulation context, it would be interesting to study whether the distribution of rosettes is homogenous before anterior-posterior axis specification. Additionally, it would be important to assess whether random epiblast cells delaminate before PS formation, as observed in chick (Voiculescu 2014). We could not attempt those experiments so far, as we perform most experiments by two-photon microscopy, by which only one embryo side can be recorded at a time, and have no way to distinguish embryo orientation before AVE migration. A better understanding of rosette mode of formation and resolution, including the role of Nodal, would also be necessary to assess the importance of our observations. The technical evolution in mouse embryo imaging will probably permit solving those questions in the near future, through prolonged imaging with tracking of every cell fate (McDole 2018). We have tried to improve the discussion (lines 314-326, copied hereunder), and acknowledge the limitations of our findings to a description of a phenomenon without proven significance at this stage.

      "However, since we observed a marked imbalance in rosette frequency as soon as the anterior-posterior axis was specified, it is possible that rosettes reflect increased epithelium fluidity in posterior epiblast, which is exposed to a distinct mechanical context, at the very beginning of PS morphogenesis. Indeed, a posterior shift in the distribution of basement membrane perforations was identified just after AVE migration, due to an asymmetry in Nodal signaling dependent metalloproteinase activity (Kyprianou et al., 2020). To progress towards understanding rosette formation significance in this context, it would be interesting to study whether the distribution of rosettes is homogenous before anterior-posterior axis specification, and to assess whether random epiblast cells delaminate before PS formation, as observed in chick (Voiculescu, Bodenstein, Lau, & Stern, 2014). As Nodal plays a major role in PS initiation, the presence and distribution of rosettes should be studied in models in which Nodal signaling can be tuned (Kumar, Lualdi, Lewandoski, & Kuehn, 2008)."

      Fig2: I have essentially the same issue for bottle cells and delamination counting as for rosettes. In this case, there is nothing in the methods section.

      We have added a paragraph to describe the mosaic analysis in the Methods section (lines 472-488):

      "Mosaic: Embryos were recorded in a lateral position. As the proportion of GFP positive cells varied between mosaic embryos, normalisation was performed by dividing by the number of green cells in a given embryo. Anterior and posterior halves were defined by drawing a line perpendicular to the embryonic/extraembryonic boundary and passing through the distal tip. Bottle-shaped cells were identified as having a thin attachment on the apical surface (less than a third of the larger section), and the majority of the cell body located in the basal side. Quantification was performed both on the 3D rendering, and through navigating through the Z-stack. The same criteria where used on all sides of the embryo, and quantification was verified by two independent investigators. Delamination was defined as retraction of the apical process, and displacement of the cell body in the mesoderm layer, which could be identified because of the ubiquitous membrane Tomato labelling. Cell division was characterized by cell rounding followed by the appearance of daughter cells. Cell dispersion after mitosis was defined as absence of basolateral contact between daughter cells, which implies presence of at least one epiblast cell (more often 2 or 3) between daughter cells. Mitosis was considered “non-apical” when happening at least 10 µm away from the apical pole, hence not in the first pseudo-layer of nuclei lining the apical pole."

      What defines a cell as bottle shape and not bottle shape (apical vs basal width for example)?

      Bottle-shaped cells were visually identified as having a thin attachment on the apical surface (less than a third of the larger section), and the majority of the cell body located in the basal side. Quantification was performed both on the 3D rendering, and by navigating through the Z-stack. Due to the large variation in shape, no systematic measurement was performed. However, the same criteria were used on all sides of the embryo, and quantification was verified by two independent investigators. As proposed by Reviewer 2, those criteria would include scutoids with smaller apical surface, which explains why we observe bottle-shaped cells both on the anterior and posterior sides. In addition to Methods, we included a better description of the methodology in the Results (lines 196-200).

      "The quantification of bottle-shaped cells was performed in 3D and through Z-stack navigation and included all cells with an apical section smaller than a third of the basal section. Some cells had a round basal cell body and a thin apical extension while others resembled the recently described scutoids performing apico-basal transitions (Gómez-Gálvez et al., 2018)."

      Where does a cell need to be to be counted as delaminated (a distance needs to be stated, absolute (better) or relative)?

      Delamination is defined as retraction of the apical process, and displacement of the cell body in the mesoderm layer. Using the ubiquitous membrane tomato marker we could easily distinguish the epiblast, mesoderm and visceral endoderm layers, notably through cell packing, morphology and arrangement. This was described in Results (lines 200-204).

      "Asymmetrical cells were present on both sides, but more frequent on the posterior side, and cell delamination (retraction of the apical process and cell body shift in the mesoderm layer) only took place on the posterior side. Cells maintained an apical attachment until their basally located cell body had begun crossing the PS/mesoderm border, and only fully detached after delamination."

      What defines sister cells as dispersing after division? How far apart do they have to be? After how much time? From the movies provided, the acquisition time seems to short to assess cell dispersal.

      Cell dispersion after mitosis was defined as absence of basolateral contact between daughter cells as they extend towards the basal side, which implies intercalation of at least one epiblast cell (more often 2 or 3) between daughter cells. After cytokinesis was completed, extension and separation of daughter cells was visible in the next time point (after 25 min). The time resolution was thus sufficient to note that daughter cells were not adjacent, which is consistent with other studies (Abe 2018).

      We have modified the Methods (copied above) and the Results section of the revised version of the manuscript (lines 213-217).

      "Upon elongation of daughter cells to reach the basal pole of the epiblast, the majority displayed no basolateral connection between each other and were instead separated by intercalating epiblast cells, which would be expected to result in daughter cells dispersion over time, as described in (Abe, Kutsuna, Kiyonari, Furuta, & Fujimori, 2018)."

      Fig3: Mitotic index calculation is described in the figure legend but not in the methods section. It should also be in the methods section and made explicit that the number of mitotic cells is normalized to green cells only, not the entire cell population. The mitotic index seems higher in this population than in the entire embryo as seen in Fig4.

      The mitotic index (MI) was indeed calculated differently so numbers cannot be directly compared. MI identified for anterior and posterior epiblast is not statistically different from the ones found in Figure 4 for E7 embryos. In mosaic embryos, we do not have a way to delimitate the PS. In Figure 4, measurements of MI in the PS (delimitated by the area where the basement membrane is degraded) include cells that are destined to delaminate as wells as those that won't. In the mosaic embryos, MI is measured in cells that delaminate only, and is indeed higher. This represents a small population, which likely explains why it does not reach statistical significance and manifests as a trend.

      We have fixed the Methods (see above) and Results (lines 222-228) sections.

      "For systematic quantification, epiblast regions were defined as anterior or posterior by tracing a line passing by the distal pole and perpendicular to the embryonic/extraembryonic border, and GFP positive cells undergoing rounding were followed overtime (Fig. 3a-c). Although the frequency of cell division (normalized to the total number of GFP positive cells) was similar in anterior and posterior epiblast, there was a trend towards a higher division rate specifically in cells undergoing delamination to become mesoderm (Fig. 3d)."

      What defines an exiting cell**?

      An exiting cell is characterized by morphological remodelling, apical retraction, as well as the position of the cell body across the mesoderm/epiblast border visualized by the precise membrane Tomato labelling. It is now described in Methods and in Results (lines 201-204: " cell delamination (retraction of the apical process and cell body shift in the mesoderm layer) only took place on the posterior side. Cells maintained an apical attachment until their basally located cell body had begun crossing the PS/mesoderm border, and only fully detached after delamination".

      Regarding the non-apical rounding, why not calling it basal rounding? How far from the apical side does a cell need to be counted as non-apical?

      The reason for that denomination is that these so-called “non-apical mitoses” are not strictly basal either. Indeed, mitosis is considered “non-apical” when happening at least 10µm away from the apical pole, meaning that these mitoses do not occur within the first pseudo-layer of nuclei lining the apical pole. This is described in Methods.

      In the panel h, with the posterior division outcome, is that for all divisions or only for non-apical divisions?

      The panel (Fig. 3g, there was an error in figure labelling in the previous version) has been modified to better precise cell outcomes. It represents all posterior divisions, and quantifies the outcome according to the position of mitosis along the apical-basal axis of the cell. See Results, line 230-232: "Non-apical mitosis in the posterior epiblast was preferentially associated with EMT, as it resulted in formation of one or two mesoderm cells (Fig. 3g)."

      Do basal divisions give rise to more epi?

      No, non-apical divisions mainly give rise to mesoderm cells. Indeed, approximatively 66% of basal divisions give rise to two mesoderm cells, and 33% to an epiblast and a mesenchymal cell (Figure 3g). We never observed a non-apical division resulting in two epiblast cells.

      Is epi or meso fate only determined by location in a different layer or are fate markers used?

      Epiblast or mesenchymal fate was determined by both morphological and localization criteria. Epiblast cells have an apical and a basal pole. Mesoderm cells have no apical process, and display initiation of front-rear polarity often defined by the presence of nascent migration appendix. As stated before, membrane Tomato labelling allows exact distinction of germ layers.

      What happens to the non-apical mitosis on the anterior side?

      On the anterior side, the very few anterior non-apical mitoses only give epiblast cells (not shown).

      Fig4: Methods state "For Phospho-histone H3 quantifications, sections were chosen at least 10 μm apart to ensure that each cell was only counted once, and counting was performed using the Icy software" and legend states "The PS region is defined by the area where the basal membrane (yellow) is degraded, and the posterior region quantification excludes counts from the PS region".

      What about cells at the boundary between PS and non-PS regions? This needs to be extended and brought together in the methods section. **Also, the tissue architecture in the PS is not as well defined as in the rest of the tissue.

      A cell was counted as being part of the PS region if at least 50% of its cell body (visual measuring) was within the area where the basal membrane is non-ambiguously degraded, and if the cell retained its attachment to the apical pole (cell contours were determined by F-actin detection using Phalloidin). The Methods section has been completed in the revised version of the manuscript (lines 490-501).

      "Phospho-histone H3: For Phh3 quantifications, sections were chosen at least 10 mm apart to ensure that each cell was only counted once, and counting was performed using the Icy software (http://icy.bioimageanalysis.org). For sagittal sections, anterior and posterior regions were defined by drawing a line perpendicular to the embryonic/extraembryonic boundary and passing through the distal tip. For transverse sections, anterior-posterior boundary was placed at mid-distance between the anterior and posterior poles. The PS region was defined by the area where the basement membrane was degraded, and the posterior region quantification excluded counts from the PS region. A cell was counted as being part of the PS region if at least 50% of its cell body was within the area where the basement membrane was non-ambiguously degraded, and if the cell retained its attachment to the apical pole (cell contours were defined by F-actin detection using Phalloidin)."

      Is the epithelial polarity clear enough to be determined without AB marker in the PS?

      We considered that a cell retained its AB polarity if the cell extended to both apical and basal pole. Even if the pseudostratified epithelium architecture is complex, most cell contours could be delimited when navigating through the Z-stack.

      Finally, the number of cells counted is missing. This has been fixed in the Figure legend.

      Supp Fig5: based on available images of the Rac1KO embryo, I am not sure that epithelial architecture is established well enough to assess the location of mitosis along the apico-basal axis.

      Indeed, the architecture of the Rac1 KO mutants is vastly altered. As a consequence, only a small number of Rac1 mutants in which we could delimitate the germ layers were analysed, and only the cells we could unambiguously locate were considered. The Rac1 VE-deleted phenotype, on the other hand, was not severe enough as there is only a partial AVE migration defect in most mutants (Migeotte et al., 2010). This is why we confirmed the data on AVE migration defective embryos by using the RhoA VE-deleted mutant, which has a strong AVE migration defect but retains good tissue architecture. We tried to increase figure clarity by annotating the embryo cavity as well as the embryonic/extraembryonic boundary. We also submit a less compressed version of the figures, which we hope will facilitate image analysis.

      Reviewer #1 (Significance (Required)):

      Although I am not as familiar with mouse gastrulation as I would like to be, I am familiar with gastrulation, live imaging and analysis. At this point, I find it difficult to discuss the conclusions of the study since the methodology is so unclear. Nevertheless, any carefully-made description of mammalian gastrulation has the capacity to be eye-opening. This is potentially the case for this report, which provides nice images that most likely required skills and important efforts to obtain.

      We hope the changes we made help better understanding the methodology, and thank Reviewer 1 for positive comments and the help in identifying the points we had failed to properly describe.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      This manuscript, from Mathiah and colleagues, describes an in-depth analysis of differences in cell organization and division within the epiblast of the very early mouse embryo, and in particular, with the onset of gastrulation. Their data indicate a difference in the organization of cells between the anterior/lateral and posterior regions of the epiblast even before gastrulation has commenced, as well as differences in the location of mitoses relative to the apical and basal ends of the cells. The data provide new insight into the early regionalization of the epiblast. However, the authors should include reference to, and discussion of, the paper by Michael Snow on growth and regionalization of the epiblast (Snow MHL (1977) Gastrulation in the mouse: Growth and regionalization of the epiblast. J. Emb. Exp. Morph. 42: 293-303), where he did a much more fine-grained analysis of mitotic index across the entire epiblast, defining a proliferative zone in the anterior part of the primitive streak where the mitotic index was higher between E6.5 and E7.5. He also describes non-apical mitoses specifically in the primitive streak region as compared to all other regions of the epiblast. The results of the present study dovetail nicely with the results presented by Snow.

      This was indeed a major oversight, and we apologize for it. The work of Snow identifies very nicely a proliferative zone in the anterior part of the PS. We did not comment on that as our study focuses on posterior PS. We included the reference in the revised version of the manuscript, and pointed the fact that he first described non-apical mitosis in the PS (lines 233-236).

      "Remarkably, this concurs with the observation by Snow (Snow, 1977) that in the PS of E6.5 and E7 embryos, mitosis could be found at all levels of the tissue, including adjacent to the endoderm, while it was located at the apical surface of the pseudostratified tissue everywhere else."

      Overall, this is a very nice study, but some revisions would help with clarity at certain points. The data on rosette formation are interesting, but it is not clear what an increase in rosettes in the posterior region means. The authors contend (lines 169-170) that this represents a dynamic epithelium primed for EMT, but it is not clear how rosettes facilitate or promote EMT, and especially why that would be seen at E5.75 before EMT has begun. An alternative interpretation might be that the shape of the epithelium may be changing and the packing of the epithelial cells has to change to accommodate this. We do know that the overall shape of the embryo changes from elongate medial-lateral to elongate anterior-posterior just as EMT is initiated (Perea-Gomez et al., (2004) Current Biology 14: 197-207) and it may be that changes in cell packing are required to accommodate this. The authors may want to consider whether the rosettes that they observed represent scutoids (Gomez-Galvez et al., (2018) Scutoids are a geometrical solution to three-dimensional packing of epithelia. Nat. Commun. 9:2960). An analysis of the 3-dimensional organization of the cells within the rosettes (i.e. at all Z levels) may shed some light on this.

      Following on the comments by Reviewer 1 and 2, we quantified cell packing, and found it to be identical on all sides at a given stage. We have added a better description of rosette quantification (lines 169-172 and lines 452-470), a video showing 3D reconstruction of cells in a rosette (Video 2), and an extended discussion (lines 314-326) in the revised version of the manuscript. Some cells within the epiblast are indeed likely to be shaped as scutoids, some with an apical-basal asymmetry (lines 196-202). The reference was added to the manuscript (line 200).

      Figure 2b,c and Figure 3a, a', b, c - Addition of dotted lines to indicate the apical and basal ends of the epiblast would be helpful in orienting the reader**.

      We have added lines to indicate apical and basal ends of the epiblast.

      Figure 2c' - what these graphs represent exactly is somewhat vague, and the figure legend is also very vague. In particular, the third graph on cell dispersion is not clear. Does this mean that the daughter cells are separated from one another following division? Or that they are in different compartments (epiblast/mesoderm) after division? A better description should be included in the figure legend.

      Following on the comments by Reviewer 1 and 2, we have added a better description of cell dispersion in the Results (lines 213-217), Methods (lines 483-486) and figure legend.

      Figure 3g would appear to show the proportion of the total number of posterior divisions that give rise to particular combinations of daughter cells (epi/epi, epi/meso, meso/meso). However, the discussion of this graph in the text (lines 215-219) suggests that it demonstrates that non-apical mitoses always result in meso/meso and epi/meso daughter cells, which it does not. That analysis would be very interesting to add to Figure 3, with the daughter cell types broken down into those coming from apical mitoses and those coming from basal mitoses.

      The analysis was broken down as suggested, and has been added to the revised version of the manuscript (line 352, Figure 3g).

      In Figure 4, it is not clear how anterior and posterior are defined, and what criteria were used to distinguish posterior from primitive streak. This is nicely demonstrated in Supplementary Figure 3 - maybe panels A and B could be included in Figure 4 to improve the clarity of the analysi**s.

      We have better described the quantification methodology in the Methods section (lines 490-501), moved panel a from Supplementary Figure 3 to Figure 4a as suggested, and added an explanatory drawing (Figure 4b) to the revised version of the manuscript.

      The data on mitotic index in Figures 2, 4, and 5 do not appear to be consistent. The mitotic index for E7.25 in Figure 2e is similar between anterior and posterior, even though the posterior includes the primitive streak, while the mitotic index presented for the three stages in Figure 4b would imply that the mitotic index for the entire posterior region should be higher than the anterior at all three stages. Similarly, in Figure 5a' and b', the mitotic index in anterior and posterior regions of E5.75 and E6.25 embryos are not significantly different despite the primitive streak being included in the posterior count, while the data presented in Figure 4 would imply that the entire posterior region including primitive streak should be much higher than the anterior. The authors should clarify this in the Results.

      In Figure 2 and 3, the mitotic index (MI) is calculated as number of cell division among GFP+ cells divided by the total number of GFP+ cells, while in Figure 4 and 5 it is quantified as Phospho-histone H3+ cells per total number of cells (DAPI). We have clarified this in the revised graphs and legends of the novel version of the manuscript. Those numbers cannot be directly compared. Nonetheless, we found no statistical difference between the MI shown in Figure 3d, and the MI shown in Figure 4c third row (E7). In sagittal view, the PS area cannot be delimited, so we compared anterior and posterior regions, with the PS included in the posterior region, and saw no difference in MI. In transverse section, there was no MI difference when comparing anterior and posterior embryo halves. However, when we refined the analysis by defining the PS as the area where the basal membrane was degraded, a higher MI emerged specifically in the PS compared to anterior and posterior (not including PS) regions. This difference was thus lost by dilution when the PS area was included in the posterior region. We have also stated this distinction more clearly in the revised version of the manuscript (lines 269-271).

      The data on non-apical mitoses in the RhoA-VE deleted (Figure 6) and Rac1ko embryos (Supplemental figure5) are not particularly compelling. It is hard to see the basal mitoses in the new AVE-opposed regions in the mutant embryos in the images presented. Perhaps the graphs in these two figures could have the AVE-opposed data broken down into two groups - the region that is posterior and the region that is anterior but not adjacent to AVE. Better images would improve the clarity of these data as well.

      As explained in response to Reviewer 1, we have attempted to clarify the anatomy through annotation, and provide less compressed images. We agree that the embryos are altered. Nonetheless, especially in RhoA-VE deleted, the germ layers could be distinguished and non-apical mitosis identified through combining 3D analysis and navigation through the Z-stack. We honestly admit those are the best images we could get, and we believe that they allow to make the point that non-apical mitosis are only found in the area further away from the AVE.

      Reviewer #2 (Significance (Required)):

      The data on differential proliferation and apical vs. basal mitoses are complementary to data already published, but the present study updates the existing data by the addition of live imaging and 3-dimensional reconstruction of cell shapes, providing a more complete insight into the process. The observation that rosettes are detectable at the basal ends of the epiblast, and more so in the posterior, is novel, but the significance for embryonic development is not well rationalized.

      These data are of interest to those investigating the mechanisms of early morphogenesis, as well as those interested in the cellular correlates of molecular regionalization that results from the well-described signaling pathways regulating axis specification.

      My background is in early mouse embryo morphogenesis, therefore I feel that I have sufficient expertise to evaluate the data presented.

      We thank Reviewer 2 for positive comments, and are grateful for the constructive criticism and important references.

      \*Referees Cross Commenting***

      I agree with Reviewer 1 on the lack of detail about the methods - my comments stemmed from the same confusion about how measurements were made, but Reviewer 1 more articulately addressed the key points. I agree with Reviewer 3 on the quality of the videos. It is very difficult to see how they could follow cells with a 20 minute interval.

      I would like to address the comment by Reviewer 3 on the use of agarose in the imaging experiments. The methods section states that agarose was used to make the culture "chambers" used for light-sheet imaging, which was not the major approach used for imaging in this study. Only the data in Figure 1A came from those experiments, and it was validated by confocal data in 1B,C where the embryos were cultured in Ibidi chambers with culture medium and no agarose present. So I don't think agarose effects on embryo development are a major worry. Also, this same approach was used by Ryan Udan in Mary Dickinson's lab to visualize yolk sac vasculogenesis, and it did not appear to have a deleterious effect on development in that case, although the embryos imaged here were much earlier and are definitely differentially sensitive to culture conditions from those cultured at E8.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This is an essentially descriptive study, looking at primitive streak formation and cell ingression from the epiblast in the mouse from about E5.75 to E7.5 or so using time-lapse microscopy (light sheet and confocal) of cultured embryos. The study also takes advantage of some genetically encoded reporters, some of them inducible by tamoxifen, which allow following cells, closer examination of their shapes, and in some cases unambiguous orientation of the embryos based on expression of the reporter. Overall the study is well designed.

      I have two very major concerns about this paper - first, the culture system used in most experiments uses agarose, which has been found in several labs to affect normal cell movements and other cell behaviours. It is essential to determine that embryos cultured in these conditions develop normally for much longer than the period of imaging to ensure that the findings are relevant to normal development rather than an artefact. This is particularly important because mouse embryos develop rather poorly at peri-implantation stages with any culture method, and this one could make matters even worse.

      Embryos were mounted into an agarose cylinder in which a tunnel had been created with a 150 microns wide copper wire. Embryos were mounted vertically, with the cone oriented on the bottom, to avoid restriction of growth at distal tip of the embryo. As embryos had a smaller diameter than the tunnel, they could comfortably grow without being restricted (Methods, lines 413-414). Although embryos could not been recovered after the long imaging period (12h), embryos similarly mounted in the agarose cylinder but not imaged were kept in culture, and showed normal growth compared to a free-floating embryo (Methods, lines 431-433). In addition, we focused on the first hours of imaging to reduce the risk of phototoxicity-induced anomalies (Methods, lines 453-455). Moreover, although we identified the asymmetry in rosette abundance through lightsheet imaging, we confirmed the finding through confocal imaging of free-floating embryos, and found similar results (Results lines 153-167, Figure 1b and c, Video 3).

      While it has been reported that agarose can affect the development of chick embryos in culture, agarose has been a widely used culture matrix for live imaging particularly for lightsheet imaging in other organisms including drosophila, zebrafish, and mouse. We thank Reviewer 2 (in cross-comments) for highlighting that in Udan et al., (2014), a report from Mary Dickinson’s lab, embryos are cultured in agarose “chambers” for lightsheet. Although some of the experiments in Udan et al., (2014) are performed at E8, this paper also focuses on pre-gastrulation mouse embryos as they culture E6.5 embryos for 24 hours, image from 5 view angles, analyze 572 z-slices representing half of the embryo (Fig5 and Fig6 Udan et al., 2014) and show no adverse effects.

      The second concern is that for a paper that is almost entirely about time-lapse microscopy observations of live embryos, the movies are very poor. Although the images are generally good and the 3-d sequences/images from the light-sheet microscope sequences are quite impressive (and have good spatial resolution), the time resolution is extremely poor and the movies very short. It is largely impossible to follow cell behaviours or movements in these sequences.

      Indeed, the time lapse between time points as well as the total duration of the acquisition is limited, especially when embryos are imaged by confocal microscopy. These measures were taken mainly to preserve the integrity of the embryo and thus ensure that growth conditions were the closest to optimal in vivo conditions. For rosette analysis, the 20 minutes interval was too long to follow rosette resolution, as stated in the manuscript. For mosaically labelled embryos, we quantified only the cells for which the fate and/or progeny could be identified without ambiguity, which was made easier as we chose a 4OH-tamoxifen posology that resulted in a low proportion of labelling. As both cell delamination and mitosis are relatively slow processes, this time resolution proved sufficient. Time resolution for lightsheet was 7 minutes, which is similar compared to other works on mouse gastrulation (such as Williams et al., 2012), and actually higher than most two-photon or confocal studies, including that of our previous reports (Migeotte et al., 2010, Saykali et al., 2019, Trichas et al., 2012) in which cell tracking could be efficiently performed. This high time resolution allowed following individual rosettes overtime (Sup. Fig.2c).

      Reviewer #3 (Significance (Required))

      The study focuses on cell shape changes and various processes that accompany ingression and reports that ingression may occur through a variety of different mechanisms that occur at the same time, including rosette formation, individual ingression of bottle-shaped cells, and larger population ingression events. This is very similar to what has been described in chick embryos (eg. Voiculescu et al. eLife 2014 - surprisingly this is not cited), although in rodents primitive streak formation occurs in the absence of large-scale movements of cell sheets. Basically there are no surprises in the findings either for mouse or in comparison with other species, but the study is OK in terms of contributing useful information about streak formation and function in mouse (if the above problems are fixed).

      We thank Reviewer 3 for helpful comments and references. We respectfully disagree concerning the risk of bias due to agarose cylinder culture, as exposed above. Concerning the videos, we have provided less compressed videos to retain as much image quality as possible. Although it would evidently be better to have a higher time resolution and longer movies, we believe it is not a limitation for the events we study and describe as they can be reliably followed with the time resolution and observation length we provide. The reference to Voiculescu et al., 2014 is indeed important, we have added it to the revised version of the manuscript (line 324) and apologize for the oversight.

      \*Referees Cross Commenting***

      In response to reviewer 2: One issue with this is that one does not know whether there is a "deleterious" effect of the agarose on movements until one is sure that (a) one understands what the movements would look like without agarose and that there are no differences, and (b) (a serious shortcoming here) that embryos need to be shown to develop completely normally in those culture conditions WAY beyond the period of imaging. There are lots of observations by several labs (some unpublished of course, but some are published) suggesting that agar and agarose do interfere with cell movements. In chick for example the Chapman and Schoenwolf method where embryos are placed on agarose, there are always head defects due to impaired movements and the agarose interfering with tissue tensile forces.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This is an essentially descriptive study, looking at primitive streak formation and cell ingression from the epiblast in the mouse from about E5.75 to E7.5 or so using time-lapse microscopy (light sheet and confocal) of cultured embryos. The study also takes advantage of some genetically encoded reporters, some of them inducible by tamoxifen, which allow following cells, closer examination of their shapes, and in some cases unambiguous orientation of the embryos based on expression of the reporter. Overall the study is well designed.

      I have two very major concerns about this paper - first, the culture system used in most experiments uses agarose, which has been found in several labs to affect normal cell movements and other cell behaviours. It is essential to determine that embryos cultured in these conditions develop normally for much longer than the period of imaging to ensure that the findings are relevant to normal development rather than an artefact. This is particularly important because mouse embryos develop rather poorly at peri-implantation stages with any culture method, and this one could make matters even worse.

      The second concern is that for a paper that is almost entirely about time-lapse microscopy observations of live embryos, the movies are very poor. Although the images are generally good and the 3-d sequences/images from the light-sheet microscope sequences are quite impressive (and have good spatial resolution), the time resolution is extremely poor and the movies very short. It is largely impossible to follow cell behaviours or movements in these sequences.

      Significance

      The study focuses on cell shape changes and various processes that accompany ingression and reports that ingression may occur through a variety of different mechanisms that occur at the same time, including rosette formation, individual ingression of bottle-shaped cells, and larger population ingression events. This is very similar to what has been described in chick embryos (eg. Voiculescu et al. eLife 2014 - surprisingly this is not cited), although in rodents primitive streak formation occurs in the absence of large-scale movements of cell sheets. Basically there are no surprises in the findings either for mouse or in comparison with other species, but the study is OK in terms of contributing useful information about streak formation and function in mouse (if the above problems are fixed).

      Referees Cross Commenting

      In response to reviewer 2

      One issue with this is that one does not know whether there is a "deleterious" effect of the agarose on movements until one is sure that (a) one understands what the movements would look like without agarose and that there are no differences, and (b) (a serious shortcoming here) that embryos need to be shown to develop completely normally in those culture conditions WAY beyond the period of imaging.

      There are lots of observations by several labs (some unpublished of course, but some are published) suggesting that agar and agarose do interfere with cell movements. In chick for example the Chapman and Schoenwolf method where embryos are placed on agarose, there are always head defects due to impaired movements and the agarose interfering with tissue tensile forces.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript, from Mathiah and colleagues, describes an in-depth analysis of differences in cell organization and division within the epiblast of the very early mouse embryo, and in particular, with the onset of gastrulation. Their data indicate a difference in the organization of cells between the anterior/lateral and posterior regions of the epiblast even before gastrulation has commenced, as well as differences in the location of mitoses relative to the apical and basal ends of the cells. The data provide new insight into the early regionalization of the epiblast. However, the authors should include reference to, and discussion of, the paper by Michael Snow on growth and regionalization of the epiblast (Snow MHL (1977) Gastrulation in the mouse: Growth and regionalization of the epiblast. J. Emb. Exp. Morph. 42: 293-303), where he did a much more fine-grained analysis of mitotic index across the entire epiblast, defining a proliferative zone in the anterior part of the primitive streak where the mitotic index was higher between E6.5 and E7.5. He also describes non-apical mitoses specifically in the primitive streak region as compared to all other regions of the epiblast. The results of the present study dovetail nicely with the results presented by Snow.

      Overall, this is a very nice study, but some revisions would help with clarity at certain points. The data on rosette formation are interesting, but it is not clear what an increase in rosettes in the posterior region means. The authors contend (lines 169-170) that this represents a dynamic epithelium primed for EMT, but it is not clear how rosettes facilitate or promote EMT, and especially why that would be seen at E5.75 before EMT has begun. An alternative interpretation might be that the shape of the epithelium may be changing and the packing of the epithelial cells has to change to accommodate this. We do know that the overall shape of the embryo changes from elongate medial-lateral to elongate anterior-posterior just as EMT is initiated (Perea-Gomez et al., (2004) Current Biology 14: 197-207) and it may be that changes in cell packing are required to accommodate this. The authors may want to consider whether the rosettes that they observed represent scutoids (Gomez-Galvez et al., (2018) Scutoids are a geometrical solution to three-dimensional packing of epithelia. Nat. Commun. 9:2960). An analysis of the 3-dimensional organization of the cells within the rosettes (i.e. at all Z levels) may shed some light on this.

      Figure 2b,c and Figure 3a, a', b,c - Addition of dotted lines to indicate the apical and basal ends of the epiblast would be helpful in orienting the reader.

      Figure 2c' - what these graphs represent exactly is somewhat vague, and the figure legend is also very vague. In particular, the third graph on cell dispersion is not clear. Does this mean that the daughter cells are separated from one another following division? Or that they are in different compartments (epiblast/mesoderm) after division? A better description should be included in the figure legend.

      Figure 3g would appear to show the proportion of the total number of posterior divisions that give rise to particular combinations of daughter cells (epi/epi, epi/meso, meso/meso). However, the discussion of this graph in the text (lines 215-219) suggests that it demonstrates that non-apical mitoses always result in meso/meso and epi/meso daughter cells, which it does not. That analysis would be very interesting to add to Figure 3, with the daughter cell types broken down into those coming from apical mitoses and those coming from basal mitoses.

      In Figure 4, it is not clear how anterior and posterior are defined, and what criteria were used to distinguish posterior from primitive streak. This is nicely demonstrated in Supplementary Figure 3 - maybe panels A and B could be included in Figure 4 to improve the clarity of the analysis.

      The data on mitotic index in Figures 2, 4, and 5 do not appear to be consistent. The mitotic index for E7.25 in Figure 2e is similar between anterior and posterior, even though the posterior includes the primitive streak, while the mitotic index presented for the three stages in Figure 4b would imply that the mitotic index for the entire posterior region should be higher than the anterior at all three stages. Similarly, in Figure 5a' and b', the mitotic index in anterior and posterior regions of E5.75 and E6.25 embryos are not significantly different despite the primitive streak being included in the posterior count, while the data presented in Figure 4 would imply that the entire posterior region including primitive streak should be much higher than the anterior. The authors should clarify this in the Results.

      The data on non-apical mitoses in the RhoA-VEdeleted (Figure 6) and Rac1ko embryos (Supplemental figure5) are not particularly compelling. It is hard to see the basal mitoses in the new AVE-opposed regions in the mutant embryos in the images presented. Perhaps the graphs in these two figures could have the AVE-opposed data broken down into two groups - the region that is posterior and the region that is anterior but not adjacent to AVE. Better images would improve the clarity of these data as well.

      Significance

      The data on differential proliferation and apical vs. basal mitoses are complementary to data already published, but the present study updates the existing data by the addition of live imaging and 3-dimensional reconstruction of cell shapes, providing a more complete insight into the process. The observation that rosettes are detectable at the basal ends of the epiblast, and moreso in the posterior, is novel, but the significance for embryonic development is not well rationalized.

      These data are of interest to those investigating the mechanisms of early morphogenesis, as well as those interested in the cellular correlates of molecular regionalization that results from the well-described signaling pathways regulating axis specification.

      My background is in early mouse embryo morphogenesis, therefore I feel that I have sufficient expertise to evaluate the data presented.

      Referees Cross Commenting

      I agree with Reviewer 1 on the lack of detail about the methods - my comments stemmed from the same confusion about how measurements were made, but Reviewer 1 more articulately addressed the key points.

      I agree with Reviewer 3 on the quality of the videos. It is very difficult to see how they could follow cells with a 20 minute interval.

      I would like to address the comment by Reviewer 3 on the use of agarose in the imaging experiments. The methods section states that agarose was used to make the culture "chambers" used for light-sheet imaging, which was not the major approach used for imaging in this study. Only the data in Figure 1A came from those experiments, and it was validated by confocal data in 1B,C where the embryos were cultured in Ibidi chambers with culture medium and no agarose present. So I don't think agarose effects on embryo development are a major worry. Also, this same approach was used by Ryan Udan in Mary Dickinson's lab to visualize yolk sac vasculogenesis, and it did not appear to have a deleterious effect on development in that case, although the embryos imaged here were much earlier and are definitely differentially sensitive to culture conditions from those cultured at E8.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In "An asymmetry in the frequency and position of mitosis in the epiblast precedes gastrulation and suggests a role for mitotic rounding in cell delamination during primitive streak epithelial-mesenchymal transition", Mathiah, Despin-Guitard and colleagues study divisions during mouse gastrulation. They perform ex vivo culture, live imaging and immunostaining to observe the frequency and position of mitosis within the embryo as well as the destiny of daughter cells after their divisions. The find that divisions on the posterior side of the embryo tend to be more basally located and could contribute to cell delamination into the mesodermal layer. Authors also affect antero-posterior signaling by genetically preventing the migration of the anterior visceral endoderm, which leads to mitosis away from the apical side of the epithelium on all lateral parts of the embryo.

      This study tackles a key developmental process which is poorly understood in mammals due to its concomitance with the implantation phase. Therefore, any carefully-made description of this process has the capacity to be eye-opening. This is potentially the case for this report, which provides nice images that most likely required skills and important efforts to obtain. The authors have written a clear manuscript with an interesting narrative. However, the quantifications are very poorly described, which makes it impossible for anyone to reproduce these results. I describe below a number of suggestions to clarify the quantifications, which is in my opinion a prerequisite to consider the conclusions from the authors.

      Fig1: Authors describe differences in the formation of rosettes between the anterior and posterior sides of the embryo. The microscopy images and movies provided are overlaid with drawings from the authors but without this visual help, I, and I assume other readers, see more rosettes than highlighted and fail to see some of the rosettes that are marked. To avoid this subjectivity, a clear methodology is required. In the methods, the authors state: "For quantification, rosettes were manually annotated and counted on Z sections located 5 μm from the basal side of the epiblast." And that is all. What defines a rosette? How many cells need to share a vertex to be considered as part of a rosette? How long do they need to persist not to be considered as occurring by chance? What about cells part of multiple rosettes? Does the rosette organization need to be apical, basal or all the way? Having those clearly defined criteria would be essential for anyone else to reproduce this quantification and would also offer a much more comprehensive description of the phenomenon and allow for more powerful conclusions. In addition, the data are given as "rosette/frame" and as "rosette/mm2". What is the point of giving both data, which are essentially the same? The frame is irrelevant. It would be more interesting to know how many cells there are in this area, as cell packing could be a determinant of rosette formation. "Rosette/mm2/min" is very confusing. It should state "rosette.mm-2.min-1" or "rosette/mm2.min". On a conclusive note, I fail to understand how relevant the formation of rosettes would be. The authors should clarify this point.

      Fig2: I have essentially the same issue for bottle cells and delamination counting as for rosettes. In this case, there is nothing in the methods section. What defines a cell as bottle shape and not bottle shape (apical vs basal width for example)? Where does a cell need to be to be counted as delaminated (a distance needs to be stated, absolute (better) or relative)? What defines sister cells as dispersing after division? How far apart do they have to be? After how much time? From the movies provided, the acquisition time seems to short to assess cell dispersal.

      Fig3: Mitotic index calculation is described in the figure legend but not in the methods section. It should also be in the methods section and made explicit that the number of mitotic cells is normalized to green cells only, not the entire cell population. The mitotic index seems higher in this population than in the entire embryo as seen in Fig4. What defines an exiting cell? Regarding the non-apical rounding, why not calling it basal rounding? How far from the apical side does a cell need to be to be counted as non-apical? In the panel h, with the posterior division outcome, is that for all divisions or only for non-apical divisions? Do basal divisions give rise to more epi? Is epi or meso fate only determined by location in a different layer or are fate markers used? What happens to the non-apical mitosis on the anterior side?

      Fig4: Methods state "For Phospho-histone H3 quantifications, sections were chosen at least 10 μm apart to ensure that each cell was only counted once, and counting was performed using the Icy software" and legend states "The PS region is defined by the area where the basal membrane (yellow) is degraded, and the posterior region quantification excludes counts from the PS region". This needs to be extended (what about cells at the boundary between PS and non-PS regions?) and brought together in the methods section. Also the tissue architecture in the PS is not as well defined as in the rest of the tissue. Is the epithelial polarity clear enough to be determined without AB marker in the PS? Finally, the number of cells counted is missing.

      Supp Fig5: based on available images of the Rac1KO embryo, I am not sure that epithelial architecture is established well enough to assess the location of mitosis along the apico-basal axis.

      Significance

      Although I am not as familiar with mouse gastrulation as I would like to be, I am familiar with gastrulation, live imaging and analysis. At this point I find it difficult to discuss the conclusions of the study since the methodology is so unclear. Nevertheless, any carefully-made description of mammalian gastrulation has the capacity to be eye-opening. This is potentially the case for this report, which provides nice images that most likely required skills and important efforts to obtain.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The work reports finding a molecular genetic basis for individual differences in behavior in different strains of outbred mice, even including individual behavioral differences between mice of the same inbred genetically isogenic strain. The authors were able to measure copy numbers for the tandemly repeated intronic snoRNA clusters SNORD115 and SNORD116 and found correlation with measures of anxiety in open-field test and elevated plus maze. Expression data for previously proposed targets of these snoRNAs are also provided.

      Major comments:

      1.The techniques to measure copy numbers are challenging and the authors' conclusion that ddPCR was their method of choice is convincing. They were able to obtain limited optical mapping (Bionano zephyr) data, only for SNORD116 and only in mouse, but these data are useful to corroborate those obtained with ddPCR.

      2.Figure 3 reports single copy numbers for individuals that are presumably heterozygous. Do we have to assume that the numbers reported represent the larger alleles since the ddPCR method does not allow to distinguish two different size alleles, as was shown for optical mapping?

      3.The analyses reported do not take into account the specific parental origin of the alleles used in the regression analyses. Since PWSCR-specific SNORDs are only expressed from the paternal chromosomes, this generates some uncertainty about the whole dataset.

      4.Lines 353-365: The ankrd11 exon-specific RNAseq data are confusing and too preliminary. More work needs to be done to resolve the splice variants in this region and their relationship to SNORD116 copy numbers. Alternatively lines 356-361 should be deleted.

      5.In all tested rodents, higher SNORD copy number was correlated with higher relative anxiety score. In the human samples, however, higher anxiety scores were associated with lower copy numbers. These apparently contradictory results are not mentioned in the abstract, and are not satisfactory explained in the text.

      6.Extension to other species would be desirable but was limited by availability of genomic data: Results are presented for wood mouse only for SNORD115 and for the guinea pig for SNORD116.

      Minor comments:

      1.The authors present skull shape data related to SNORD116 copy numbers, but fail to consider how these data are relevant to the craniofacial abnormalities reported in an ankrd11 mutation. Barbaric et al (2008) reported a dominant ENU- induced mutation caused shortened snouts, wider skull, deformed nasal bones, reduced BMD, reduced osteoblast activity and reduced leptin levels. This phenotype was traced to a heterozygous missense mutation (conserved glutamate to lysine change) in an HDAC binding site. They postulated that the mutation fails to recruit HDACs to a transcription complex and to inhibit hormone-receptor activated gene transcription. What is the postulated link between this mechanism and the here reported skull shape data correlated with SNORD copy number variation?

      2.The observed co-variation of copy numbers between the two SNORD clusters could indicate a duplication involving the entire region, This could be tested by determining the dosage of IPW, UBE3a and Snrpn genes.

      3.Line 129 "the RNA coding region" and Line 148 "snoRNA coding parts" (and elsewhere) does seems correct, as by definition, this is non-coding RNA. The region they are referring to could be called the "processed C/D box snoRNA". The mechanism that generates these C/D box snoRNAs is well established: the "genes" are located in introns of host genes - and after transcription - the spliced out introns are exonucleolytically trimmed to the functional sizes. Both SNORD115 and 116 clusters are within a large transcript that originates from the SNRPN promoter of the paternal allele.

      4.Figure 2 does not show data on skull shape as claimed in the legend.

      5.S1 Figure: Snprn should be Snrpn

      Significance

      This provocative work proposes the regulation of behavioral variance by dosage changes of a regulatory RNA. The dosage changes are apparently caused by dynamic and frequent alteration in copy number. This is a novel concept and worthy of publicizing. Extensive data documentation is provided for others to analyze and possibly replicate. The data potentially throw light on the function of the tandemly repeated imprinted snoRNA clusters in the PWS critical region.

      Novel aspects of this work include the discovery of copy number variation of these snoRNAs; and validation of a target of SNORD116: Ankrd11 is one of many potential targets of SNORD116 that was previously computationally predicted, this paper reports experimental evidence for this interaction.

      The work would be of interest to researchers in behavioral evolution, non-coding RNA function, epigenetics and overall genome evolution.

      Define your field of expertise with a few keyword: Molecular genetic disorders, Prader-Willi syndrome, mouse models

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Dear dr. Monaco,

      We thank you and the reviewers for the positive and encouraging reviews on our manuscript entitled “Protective anti-prion antibodies in human immunoglobulin repertoires” and are glad to address the reviewer’s suggestions

      In the following you will find a point-by-point response to the referees' critique.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Comment 1: Abtract: Although it is clear and direct, the last sentence where it refers to "a link to the low incidence of spontaneous prion diseases in human populations", is not easy to understand without a detailed explanation as given in the Discussion. I suggest a re-wording.

      Response 1: We have reworded the sentence in the abstract and given more explanation in the discussion (see also Reviewer 2, Comment 2).

      Comment 2: Results: It is clear how these Fabs act in preventing prion-induced neurotoxicity as shown in the COCS model. In addition to this effect, they also inhibit prion spreading, although this appears to be a lesser effect than inhibition of neurotoxicity. Thus, it would be interesting to discuss the possible effect of a Fab therapy, which provide a fully inhibition of the neurotoxicity but only partially inhibition of the prion propagation.

      • Response 2: As suggested by the reviewer, we have added appropriate text to the discussion to comment on the option of a potential Fab therapy with a fully inhibition of neurotoxicity and partially inhibition of prion propagation. Comment 3: The therapeutic effect of the Fabs in the cell model was performed by adding the Fabs to the medium 1 h after infection and during splitting. Is there any study that evaluates the effect of Fabs added to the medium before inoculation or at later times?

      Response 3: The goal of these experiments was to investigate whether the antibodies in question would counteract prion infections in principle, rather than performing a precise range-finding of the optimal therapeutic window. We have opted to not add the Fabs before inoculation, because past experience (and many papers) show that the “prophylactic” treatment rarely correlated with post-exposure efficacy. We also have not treated the cells after prion infection at later time points, because the data at later time points may be less pronounced and more variable. As for the treatment of cells with anti-PrP antibodies prior to exposure to prions, a study has been conducted in N2a cells (Pankievicz J et al., 2006, https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1779824/). There, preincubation of N2a cells with mouse monoclonal anti-PrP antibodies (Mabs) before prion infection (22L) and preincubation of the inoculum with Mabs before infection of the cells led to a significant reduction in PrPSc levels as assessed by proteinase-K Western blot. This paper is now discussed in our manuscript.

      Comment 4: Discussion: The authors repeatedly refer to the toxicity that antibodies against GD might have. Related to this, there is currently a therapy (experimental medicine) in humans using an antibody against this region. Perhaps it would be interesting to make a comment on this.

      Response 4: Our findings (Sonati et al, Nature 2013, and several following papers) are fundamentally incompatible with those of the London lab on the toxicity of anti-GD antibodies, and elsewhere I have warned loudly against the use of such antibodies in humans. However, this discussion is peripheral to the findings presented here. We have added some text to the discussion but we would rather not expand on this specific issue.

      Comment 5: Page 15. I have found the speculative comment: "Accordingly, clinically silent prion generation may occasionally occur in healthy individuals. PrPSc aggregates arising de novo may result in exposure of neoepitopes and/or epitopes occluded in cell-borne PrPC." interesting. However, some of the auto-antibodies found in healthy humans are against a region believed to be structurally unaltered in PrPSc, which it doesn't fit with the theory of exposure to neo-epitopes.

      • Response 5: I still believe that my hypothesis is viable, but of course I concede that – thus far – I have no supporting data. We have therefore modified the text to alleviate this comment.

      Reviewer #2:

      Comment 1: This is a technically advanced and carefully executed study that clearly demonstrate the presence of natural autoantibodies to PrP, some of which show protective properties, in an unselected human population. Although this finding is interesting on its own right, its impact on issues such as incidence of sporadic prion diseases is unclear given that apparently only 0.06% of the nearly 38,000 subjects examined carried these antibodies "in high titer".

      Response 1: We agree with the reviewer and have modified the statement as follows: “The frequency of high-titer anti-PrP antibody carriers (0.06%) is much lower than the occurrence of Fab71-like HCDR3 sequences in published human repertoires. This discrepancy could mean that most anti-PrP specificities exist in a dormant state, or are expressed as B-cell receptors, but do not produce circulating antibodies. It will be interesting to discover the triggers that may ignite antibody production and, possibly, afford protection against prions”. The discrepancy between the frequency of anti-PrP antibodies found in the plasma screen and by analysis of the antibody repertoires in the NGS datasets could stem from the fact that most anti-PrP specificities exist in a dormant state, or are expressed as B-cell receptors, but do not produce circulating antibodies (Joseena Iype et al., J Immunol 2019; now also included in the manuscript).

      Comment 2: Furthermore, this reviewer could not locate the base of the pivotal statement made in the Abstract that these autoantibodies lack in carriers of disease-associated PRNP mutations. These two points need to be clarified.

      Response 2: The statement refers to the study by Frontzek et al. (citation #48: Frontzek, K. et al. Autoantibodies against the prion protein in individuals with PRNP mutations Neurologyhttps://n.neurology.org/content/early/2020/02/25/WNL.0000000000009183?rss=1). Although listed in the references, the citation got lost in the discussion. We have inserted the reference again.

      Comment 3: The manuscript suffers for the excessive amount of data that are crammed in the five figures. Combined these figures display a total of 33 panels some of which are quite complicated. The authors should be more selective and roll over some of the nonessential information i.e. that related to methodology, to the Supplement.

      • Response 3: We agree with the reviewer and have moved several panels to the Supplement.

      Comment 4: The use of acronyms is excessive and should be reduced (see for example COCS).

      • Response 4: We have attempted to reduce the number of acronyms. We have however introduced the term COCS in Falsig et al., Nature Neuroscience 2007, and have used it regularly in more than a dozen follow-up papers. Comment 5: The legends need to be carefully checked for clarity, especially figure 4

      • Response 5: We have revised the legends to improve their clairity.

      Reviewer #3:

      Comment 1: On page 10, the authors state that Fab71 (Figure 3e) and Fab100 (Extended data Figure 7) substantially lowered PrPSc levels in prion-infected cells. However, in both cases, only about half of the cultures tested showed less PrPSc than either the control samples or samples treated with other Fabs. This variability undercuts the conclusion that what they are observing is a substantial, reproducible effect. The authors should consider moderating their conclusion somewhat to better fit the data.

      Response 1: We agree with the reviewer. The effect of Fab71 and Fab100 in reducing PrPSc levels in cells as compared to control samples and samples treated with the other Fabs is only partially present and variable among the replicates, but still statistically significant (One-way ANOVA; p

      Comment 2: In figure 2, the legend to panel a does not match the figure. Fab3 and Fab71 are represented by the blue lines, not the red lines as stated in the legend.

      • Response 2: We thank the referee for pointing this out. We have now corrected it.

      Comment 3: In the legend to Extended data figure S4, please give the epitopes to Fab10 and Fab53.

      Response 3: We have included the epitopes of these two Fabs (OR51-91 for Fab10 and CC2-HC92-120 for Fab53) in the Figure legend.

      Comment 4: In Figure 3c, the lines indicating the significant groups are not well-aligned. In the left side of the panel, the lines should connect the dark gray control group squares with the Fab25 pink diamonds. Likewise, in the right side of the panel, the lower set of lines should connect the dark gray control group squares with the Fab83 dark blue triangles.

      • Response 4: We have corrected this issue.

      Comment 5: I agree with the comments of both reviewers. The suggestion of reviewer #2 to move methodology-related panels in the main figures to supplemental data would make it much easier for the reader to focus on the critical experimental data.

      • Response 5: See response to comment 2 of reviewer 2. With all issues addressed, we hope that our revised manuscript will now be found suitable for proceeding to the next steps.

      Best regards,

      Adriano Aguzzi

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Senatore et al. is large scale study looking for natural human antibodies directed against prion protein (PrP). Using a synthetic human Fab phage display library, they found and characterized multiple human anti-PrP Fabs most of which recognized epitopes to a region of PrP from amino acid residues 92-120. Based on this information, they searched for and found low affinity, long-lived anti-PrP antibodies in both a repertoire of human antibodies and in 27 of almost 38,000 human clinical samples. They speculate that anti-PrP antibodies may help to protect against sporadic forms of prion disease and conclude that they may represent a source of potential immunotherapeutics against human prion infection.

      Minor comments:

      1) On page 10, the authors state that Fab71 (Figure 3e) and Fab100 (Extended data Figure 7) substantially lowered PrPSc levels in prion-infected cells. However, in both cases, only about half of the cultures tested showed less PrPSc than either the control samples or samples treated with other Fabs. This variability undercuts the conclusion that what they are observing is a substantial, reproducible effect. The authors should consider moderating their conclusion somewhat to better fit the data.

      2) In figure 2, the legend to panel a does not match the figure. Fab3 and Fab71 are represented by the blue lines, not the red lines as stated in the legend.

      3) In the legend to Extended data figure S4, please give the epitopes to Fab10 and Fab53.

      4) In Figure 3c, the lines indicating the significant groups are not well-aligned. In the left side of the panel, the lines should connect the dark gray control group squares with the Fab25 pink diamonds. Likewise, in the right side of the panel, the lower set of lines should connect the dark gray control group squares with the Fab83 dark blue triangles.

      Significance

      This is an extensive, well-written study which provides significant data suggesting that humans can make anti-PrP antibodies. This is a novel finding that raises important questions about how the body may respond to spontaneous formation of infectious prions. Technically, the study is sound with appropriately interpreted data. Overall the study and the antibodies it characterizes, some of which are novel, will be of interest to prion researchers.

      Referees cross commenting

      I agree with the comments of both reviewers. The suggestion of reviewer #2 to move methodology-related panels in the main figures to supplemental data would make it much easier for the reader to focus on the critical experimental data.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The authors extensively and rigorously characterized a subset of antibodies to PrP identified in a human Fab phage display library. These selected antibodies were compared and found to be similar to repertoires of naturally occurring human antibodies present in circulating B cells. Profiling of antibodies harvested from an unbiased 38,000 patient population uncovered the presence of high titer anti-PrP autoantibodies in 21 individuals sharing no specific pathologies. This finding demonstrates the presence of apparently innocuous immunity to prion in an unselected population. Based also on "the reported lack of such antibodies in carriers of disease-associated PRNP mutations" the authors propose that the low incidence of "spontaneous" prion diseases may be linked to the presence of these protective antibodies in the general population.

      Major comments:

      This is a technically advanced and carefully executed study that clearly demonstrate the presence of natural autoantibodies to PrP, some of which show protective properties, in an unselected human population. Although this finding is interesting on its own right, its impact on issues such as incidence of sporadic prion diseases is unclear given that apparently only 0.06% of the nearly 38,000 subjects examined carried these antibodies "in high titer". Furthermore, this reviewer could not locate the base of the pivotal statement made in the Abstract that these autoantibodies lack in carriers of disease-associated PRNP mutations. These two points need to be clarified. The manuscript suffers for the excessive amount of data that are crammed in the five figures. Combined these figures display a total of 33 panels some of which are quite complicated. The authors should be more selective and roll over some of the nonessential information i.e. that related to methodology, to the Supplement.

      Minor comments:

      The use of acronyms is excessive and should be reduced (see for example COCS). The legends need to be carefully checked for clarity, especially figure 4

      Significance

      Significance

      See above

      Referees Cross Commenting

      I agree with most of the comments by Reviewers 1 and 3. However, my queries remain.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is a very interesting article with important implications in the prion field. It is extremely well detailed and exquisitely well written. The objective of the article is very clear and the results obtained are not only interesting but also have very important implications for understanding prion diseases.

      I have some comments and a few minor concerns.

      Abtract:

      Although it is clear and direct, the last sentence where it refers to "a link to the low incidence of spontaneous prion diseases in human populations", is not easy to understand without a detailed explanation as given in the Discussion. I suggest a re-wording.

      Results:

      It is clear how these Fabs act in preventing prion-induced neurotoxicity as shown in the COCS model. In addition to this effect, they also inhibit prion spreading, although this appears to be a lesser effect than inhibition of neurotoxicity. Thus, it would be interesting to discuss the possible effect of a Fab therapy, which provide a fully inhibition of the neurotoxicity but only partially inhibition of the prion propagation.

      The therapeutic effect of the Fabs in the cell model was performed by adding the Fabs to the medium 1 h after infection and during splitting. Is there any study that evaluates the effect of Fabs added to the medium before inoculation or at later times?

      Discussion:

      The authors repeatedly refer to the toxicity that antibodies against GD might have. Related to this, there is currently a therapy (experimental medicine) in humans using an antibody against this region. Perhaps it would be interesting to make a comment on this.

      Page 15. I have found the speculative comment: "Accordingly, clinically silent prion generation may occasionally occur in healthy individuals. PrPSc aggregates arising de novo may result in exposure of neoepitopes and/or epitopes occluded in cell-borne PrPC." interesting. However, some of the auto-antibodies found in healthy humans are against a region believed to be structurally unaltered in PrPSc, which it doesn't fit with the theory of exposure to neo-epitopes.

      Significance

      The advance is highly significance for two reasons: 1) the tools that the authors have generated are really useful for the community and 2) The fact the healthy humans can generate anti-PrP antibodies is completely new and open new ways to understand the prion diseases mechanisms.

      The audience is principally for those working on prion and prion-like diseases.

      My expertise is in prion and prion-like diseases.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): \*Summary:** Reproducibility of genetic interactions across studies is low. The authors identify reproducible genetic interactions and ask the question of what are properties of robust genetic interactions. They find that 1. oncogene addiction tends to be more robust than synthetic lethality and 2. genetic interactions among physically interacting proteins tend to be more robust. They then use protein-protein interactions (PPIs) to guide the detection of genetic interactions involving passenger gene alterations. **Major comments:** The claims of the manuscript are clear and well supported by computational analyses. My only concern is the influence of (study) bias on the observed enrichment of physical protein interactions among genetic interactions. 1. Due to higher statistical power the here described approach favors genetic interactions involving frequently altered cancer genes (as acknowledged by the authors). 2. Also some of the libraries in the genetic screens might be biased towards better characterized screens. 3. PPI networks are highly biased towards well studied proteins (in which well studied proteins - in particular cancer-related proteins - are more likely to interact). The following tests would help to clarify if and to which extend these biases contribute to the described observations:*

      Our response: We thank the reviewer for the positive assessment of our manuscript and have addressed the issue of study bias in response to the specific queries below.

      * 1 . The authors should demonstrate that the PPI enrichment in reproducible vs non-reproducible genetic interactions is not solely due to the biased nature of PPI networks. One simple way of doing so would be to do the same analysis with a PPI network derived from a single screen (eg PMID: 25416956). I assume that due to the much lower coverage the effect will be largely reduced but it would be reconfirming to see a similar trend in addition to the networks on which the authors are already testing. Another way would be to use a randomized network (with the same degree distribution as the networks the authors are using and then picking degree matched random nodes) in which the observed effect should vanish.

      *

      Our response: We appreciate the reviewer’s point and have now assessed both of the suggested approaches.

      The overlap with unbiased yeast two-hybrid (y2h) screens, even the recent HuRI dataset (Luck et al, Nature 2020), was too small in scale to draw any conclusions. Among the ~140,000 interactions tested for genetic interactions, only 51 overlap with y2h interactions. Two of the discovered genetic interactions were supported by a y2h interaction, while one of the robust genetic interactions was supported by a y2h interaction. While this is actually more than would be expected based on the overlap of interactions in the test space the numbers are not especially convincing.

      We therefore focused on two alternative assessments. We first compared our results with the network derived from the systematic AP-MS mapping of protein interactions in HEK293 cells (BioPlex 3.0, Huttlin et al, Biorxiv 2020). We restricted our analysis of genetic interactions to gene pairs that could conceivably be observed in the BioPlex dataset (i.e. between baits screened and preys expressed in HEK293T). We found that although the numbers were small, the same pattern of enrichment was observed:

      This analysis has now been added to the revised manuscript as Supplementary Table S4 and Figure S3E (shown below):

      We next compared the results we observed with the real STRING protein-protein interaction network to 100 degree-matched randomisations of this network. We observed that the number of discovered and validated genetic interactions observed using the real STRING interaction network was greater than that observed using the randomised networks. With this in mind, we have now revised the manuscript to state:

      ‘Previous work has demonstrated that the protein-protein interaction networks aggregated in databases are subject to significant ascertainment bias – some genes are more widely studied than others and this can result in them having more reported protein-protein interaction partners than other genes(Rolland et al., 2014). As cancer driver genes are studied more widely than most genes, they may be especially subject to this bias. To ensure the observed enrichment of protein-protein interactions among genetically interacting pairs was not simply due to this ascertainment bias, we compared the results observed for the real STRING protein-protein interaction network with 100 degree-matched randomised networks and again found that there was a higher than expected overlap between protein-protein interactions and both discovered and validated genetic interactions (Supplemental Fig. S4).’

      Supplemental Figure S4. Genetic interactions are more enriched in real protein-protein interaction networks than randomised networks. Histograms showing the overlap between 100 degree matched randomisations of the STRING medium confidence protein-protein interaction and discovered (a and b) and validated (c and d) genetic interactions. The observed overlap with the real STRING protein interaction are highlighted with the orange lines.

      * 2 . What's the expected number of robust genetic interactions involving passenger gene alterations? Is it surprising to identify 11 interactions? This question could be addressed with some sort of randomization test: When selecting (multiple times) 47,781 non-interacting random pairs between the 2,972 passenger genes and 2,149 selectively lethal genes, how many of those pairs form robust genetic interactions?

      *

      Our response: We have now addressed this as follows:

      “At an FDR of 20% we found 11 robust genetic interactions involving passenger gene alterations (Supplemental Table S6). To assess whether this is more than would be expected by chance we randomly sampled 47,781 gene pairs from the same search space 100 times. The median number of robust genetic interactions identified amongst these randomly sampled gene pairs was 1 (mean 1.27, min 0, max 6) suggesting that the 11 robust genetic interactions observed among protein-protein interacting pairs was more than would be expected by chance.”

      \*Minor comments:**

      Two additional analyses would add in my opinion value to the manuscript:

      -The authors state that reasons for irreproducibility of genetic interactions are of technical or biological nature. Is it possible to disentangle the contribution of the two factors given the available data? Eg how many genetic interactions are reproducible in two different screening platforms using the same cell line vs how similar are results of screens from two different cell lines in the same study?

      *

      Our response: We are also very interested in this question, but with the available data, we are not confident that we could draw solid conclusions.

      -The authors state that "some of the robust genetic dependencies could be readily interpreted using known pathway structures" and argue that they recover for example MAPK or Rb pathway relationships. Is this a general trend? Do genes forming a robust genetic interactions have a higher tendency to be in the same pathway as opposed to different pathways?

      Our response: We have now systematically tested the robust genetic interactions for each driver gene for enrichment in specific pathways. Relevant text is as follows:

      ‘To test if this enrichment of pathway members among the robust dependencies associated with specific driver genes was a common phenomenon, for each driver gene with at least three dependencies we asked if these dependencies were enriched in specific signalling pathways (see Methods). Of the twelve driver genes tested, we found that five of these were enriched in specific pathways and in all five cases found that the driver gene itself was also annotated as a member of the most enriched pathway (Table SX). As expected RB1 (most enriched pathway ‘G1 Phase’) and BRAF (most enriched pathway ‘Negative feedback regulation of MAPK pathway’) were among the five driver genes, alongside PTEN (‘PI3K/AKT activation’), CDKN2A (‘Cell cycle’), and NRAS (‘Ras signaling pathway’).’

      Details in the methods are as follows:

      ‘Pathway enrichment was assessed using gProfiler (Raudvere et al., 2019) with KEGG (Kanehisa et al., 2017) and Reactome (Jassal et al., 2020) as annotation databases and the selectively lethal genes as the background list.’

      *I think the pathway topic could be in general better exploited: eg does pathway (relative) position play a role?**

      *

      Our response: We agree that pathway position, especially distance from driver gene in an ordered pathway, would be very interesting to tease out but we don’t think that current pathway annotations are reliable enough nor the set of robust genetic interactions large enough to analyse this properly.

      *Reviewer #1 (Significance (Required)):**

      Personalized cancer medicine aims at the identification of patient-specific vulnerabilites which allow to target cancer cells in the context of a specific genotype. Many oncogenic mutations cannot be targeted with drugs directly. The identification of genetic interactions is therefore of crucial importance. Unfortunately, genetic interactions show little reproducibility accross studies. The authors make an important contribution to understanding which factors contribute to this reproducibility and thereby providing means to also identify more reliable genetic interactions with high potential for clinical exploitation or involving passenger gene alterations (which are otherwise harder to detect for statistical reasons).

      REFEREES CROSS COMMENTING

      Reviewer 2 raises a few valid points, which if addressed would certainly increase the clarity of the paper. In particular addressing the first point (the self interactions of tumor suppressors) seems important to me. From what I can see all of reviewer 2's comments can be addressed easily.

      *

      End of Reviewer 1 comments

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      *In this manuscript, Lord et al. describe the analysis of loss-of-function (LOF) screens in cancer cell lines to identify robust (i.e., technically reproducible and shared across cell lines) genetic dependencies. The authors integrate data from 4 large-scale LOF studies (DRIVE, AVANA, DEPMAP and SCORE) to estimate the reproducibility of their individual findings and examine their agreement with other types of functional information, such as physical binding. The main conclusions from the analyses are that: a) oncogene-driven cancer cell lines are more sensitive to the inhibition of the oncogene itself than any other gene in the genome; b) robust genetic interactions (i.e., those observed in multiple datasets and cell lines driven by the same oncogene/tumour suppressor) are enriched for gene pairs encoding physically interacting proteins.

      **Main comments:**

      I think this study is well designed, rigorously conducted and clearly explained. The conclusions are consistent with the results and I don't have any major suggestions for improving their support. I do, however, have a few suggestions for clarifying the message.

      *

      Our response: We thank the reviewer for this positive assessment of our manuscript and have addressed the requests for clarity below.

      -Could the authors provide some intuitive explanation (or speculation) about the 2 observed cases of tumour suppressor "addiction" (TP53 and CDKN2A)? While the oncogene addiction cases are relatively easy to interpret, the same effects on tumour suppressors are less clear. Is it basically an epistatic effect, which looks like a relative disadvantage? For example, if we measure fitness: TP53-wt = 1, TP53-wt + CRISPR-TP53 = 1.5, TP53-mut = 1.5, TP53-mut + CRISPR-TP53 = 1.5. That is, inhibiting TP53 in TP53 mutant cells appears to be disadvantageous (relative to WT) only because inhibiting TP53 in wild-type cells is advantageous?

      Our response: The reviewer is correct – the TP53 / TP53 dependency is similar to an epistatic effect. In a TP53 mutant background targeting TP53 with shRNA or CRISPR has a neutral effect, while in a TP53 wild type background targeting TP53 with shRNA or CRISPR often causes an increase in cell growth. We have clarified this in the text below (new text in bold)

      ‘We also identified two (2/9) examples of ‘self vs. self’ dependencies involving tumour suppressors -TP53 (aka p53) and CDKN2A (aka p16/p14arf) (Supplemental Fig. S2c). This type of relationship has previously been reported for TP53: TP53 inhibition appears to offer a growth advantage to TP53 wild type cells but not to TP53 mutant cells(Giacomelli et al., 2018). Inhibiting TP53 in TP53 mutant cells has a largely neutral effect, while on average inhibiting TP53 in TP53 wild type cells actually increases fitness growth. Consequently, we observed an association between TP53 status and sensitivity to TP53 inhibition. A similar effect was observed for CDKN2A, although the growth increase resulting from inhibiting CDKN2A in wild-type cells is much lower than that seen for TP53 (Supplemental Fig. S2c).;

      *-In the analysis of overlap between genetic and physical interactions, the result should be presented more precisely. Currently, the text reads "when considering the set of all gene pairs tested, gene pairs whose protein products physically interact were more likely to be identified as significant genetic interactors". However, the referenced figure (Fig. 5a) shows an orthogonal perspective: relative to all gene pairs tested, those that have a significant genetic interaction are more likely to have a physical interaction as well. In other words, in the text, we are comparing the relative abundance of genetic interactions in 2 sets: tested and physically interacting. However, in the figure, we are comparing the relative abundance of protein interactions in 2 sets -- tested and genetically interacting. The odds ratio and the p-values stay the same but the result would be more clear if the figure matched the description in the text.

      *

      Our response: Due to the fact that genetic interactions are rare (~1% of all gene pairs tested have a discovered genetic interaction, ~0.1% have a validated genetic interaction) it’s hard to convey the enrichment effectively. This is demonstrated in the below figure – it’s clear that there are more discovered / validated genetic interaction pairs among the protein-protein interaction pairs but the scale is hard to appreciate:

      Focusing only on the discovered/validated genetic interactions makes the picture a little clearer but does not effectively show that the discovered pairs themselves are enriched among protein-protein interaction pairs

      As we feel the original figures convey the main message most effectively, we have altered the text rather than the images as follows:

      “We found that, when considering the set of all gene pairs tested, gene pairs identified as significant genetic interactors in at least one dataset are more likely to encode proteins that physically interact (Fig. 5a)”

      \*Minor comments:**

      There're a few places where the more explicit explanation would improve the readability of the manuscript.

      -Page 5: The multiple regression model used to identify genetic interactions is briefly mentioned in the text (and described more extensively in the methods). I think it would be better to explicitly describe the dependent and independent variables of the model in the text, so that the reader can intuitively understand what is being estimated*.

      Our response: We have added additional information to the main text as follows:

      ‘This model included tissue type, microsatellite instability and driver gene status as independent variables and gene sensitivity score as the dependent variable (Methods). Microsatellite instability was included as a covariate as it has previously been shown to be associated with non-driver gene specific dependencies (Behan et al., 2019), while tissue type was included to avoid confounding by tissue type.’*

      -Page 5: "Using this approach, we tested 142,477 potential genetic dependencies..." -- could the authors provide a better explanation of where that number is coming from? E.g., 142,477 = ... driver genes x 2470 selectively lethal genes?*

      Our response: Because not every selectively lethal gene is tested in every dataset (e.g. DRIVE only screened ~8,000 genes instead of the whole genome) the 142,477 number does not correspond to a simple multiplication of number of driver genes times number of selectively lethal gene. However, we have added additional information in bold as follows:

      ‘Using this approach, we tested 142,477 potential genetic dependencies between 61 driver genes and 2,421 selectively lethal genes. We identified 1,530 dependencies that were significant in at least one discovery screen (Fig. 2a, Supplemental Fig. S1). All 61 driver genes had at least one dependency that was significant in at least one discovery screen while less than half of the selectively lethal genes (1,141 / 2,421) had a significant association with a driver gene. Of the 1,530 dependencies that were significant in at least one discovery screen, only 229 could be validated in a second screen (Supplemental Table S3, Fig. 2a). For example, in the AVANA dataset TP53 mutation was associated with resistance to inhibition of both MDM4 and CENPF, but only the association with MDM4 could be validated in a second dataset (Fig. 2b, 2c). Similarly, in the DEPMAP dataset NRAS mutation was associated with increased sensitivity to the inhibition of both NRAS itself and ERP44, but only the sensitivity to inhibition of NRAS could be validated in a second dataset (Fig. 2b, 2c).

      The 229 reproducible dependencies involved 31 driver genes and 204 selectively lethal genes.’

      -Page 5: Repeating the number of findings of each type would help understanding the landscape of the genetic dependencies (suggested numbers in brackets): "Of the (229?) reproducible genetic dependencies nine were 'self vs self' associations". "The majority (7/9?) of these ... were oncogene addiction effects". "We also identified 2 (2/9?) examples of 'self vs self' dependencies involving tumour suppressors".

      Our response: We have taken the reviewer’s advice and added these figures to the main text for clarity

      * -Page 12: "Three of these interactions involve genes frequently deleted with the tumour suppressor CDKN2A (CDKN2B and MTAP) and mirror known associations with CDKN2A". It is not clear what "mirror" means -- do they recapitulate known interactions?

      *

      Our response: Yes, we meant to indicate that they recapitulate known CDKN2A interactions and have now replaced ‘mirror’ with ‘recapitulate’.

      -Page 15: "Although we have not tested them here, other features predictive of between-species conservation may also be predictive of robustness to genetic heterogeneity" -- could the authors explicitly list the features?

      Our response: We have now explicitly listed these features as follows:

      “Previous work has also shown that genetic interactions between gene pairs involved in the same biological process, as indicated by annotation to the same gene ontology term, are more highly conserved across species (Ryan et al., 2012; Srivas et al., 2016). Similarly, genetic interactions that are stable across experimental conditions (e.g. that can be observed in the presence and absence of different DNA damaging agents) are more likely to be conserved across species (Srivas et al., 2016). Although we have not tested them here, these additional features predictive of between-species conservation may also be predictive of robustness to genetic heterogeneity.”

      *Reviewer #2 (Significance (Required)):

      The identification of a significant overlap between genetic and physical interactions in cancer cell lines is an interesting and promising observation that will help understanding known genetic dependencies and predicting new ones. However, similar observations have been made in other organisms and biological systems. These past studies should be referenced to provide a historical perspective and help define further analyses in the cancer context. In particular, studies in yeast S. cerevisiae have shown that, not only there is a general overlap between genetic interactions (both positive and negative) and physical interactions, but at least 2 additional features are informative about the relationship: a) the relative strength of genetic interactions and b) the relative density of physical interactions (i.e., isolated interaction vs protein complexes). Here's a sample of relevant studies: 1) von Mering et al., Nature, 2002; 2) Kelley & Ideker, Nat Biotechnol, 2005; 3) Bandyopadhyay et al., PLOS Comput Biol, 2008; 4) Ulitsky et al., Mol Syst Biol, 2008; 5) Baryshnikova et al., Nat Methods, 2010; 6) Costanzo et al., Science, 2010; 7) Costanzo et al., Science, 2016.

      Similar observations have also been made in mammalian systems: e.g., in mouse fibroblasts (Roguev et al., Nat Methods, 2013) and K562 leukemia cells (Han et al., Nat Biotech, 2017). I don't think that past observations negate the novelty of this manuscript. The analysis presented here is more focused and more comprehensive as it is based on a large integrated dataset and is driven by a series of specific hypotheses. However, a reference to previous publications should be made.

      As a frame of reference: my expertise is in high-throughput genetics of model organisms, including mapping and analyzing genetic interactions.

      *

      Our response: We thank the reviewer for highlighting this point.

      We have attempted to provide better context for our work in the discussion as follows:

      ‘In budding and fission yeast, multiple studies have shown that genetic interactions are enriched among protein-protein interaction pairs and vice-versa (Costanzo et al., 2010; Kelley and Ideker, 2005; Michaut et al., 2011; Roguev et al., 2008). Pairwise genetic interaction screens in individual mammalian cell lines have also revealed an enrichment of genetic interactions among protein-protein interaction pairs (Han et al., 2017; Roguev et al., 2013). Our observation that discovered genetic interactions are enriched in protein-protein interaction pairs is consistent with these studies. However, these studies have not revealed what factors influence the conservation of genetic interactions across distinct genetic backgrounds, i.e. what predicts the robustness of a genetic interaction. In yeast, the genetic interaction mapping approach relies on mating gene deletion mutants and consequently the vast majority of reported genetic interactions are observed in a single genetic background (Tong et al., 2001). In mammalian cells, pairwise genetic interaction screens across multiple cell lines have revealed differences across cell lines but not identified what factors influence the conservation of genetic interactions across cell lines(Shen et al., 2017). While variation of genetic interactions across different strains or different genetic backgrounds has been poorly studied, previous work has analysed the conservation of genetic interactions across species and shown that genetic interactions between gene pairs whose protein products physically interact are more highly conserved (Roguev et al., 2008; Ryan et al., 2012; Srivas et al., 2016). Our analysis here suggests that the same principles may be used to identify genetic interactions conserved across genetically heterogeneous tumour cell lines.’

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, Lord et al. describe the analysis of loss-of-function (LOF) screens in cancer cell lines to identify robust (i.e., technically reproducible and shared across cell lines) genetic dependencies. The authors integrate data from 4 large-scale LOF studies (DRIVE, AVANA, DEPMAP and SCORE) to estimate the reproducibility of their individual findings and examine their agreement with other types of functional information, such as physical binding. The main conclusions from the analyses are that: a) oncogene-driven cancer cell lines are more sensitive to the inhibition of the oncogene itself than any other gene in the genome; b) robust genetic interactions (i.e., those observed in multiple datasets and cell lines driven by the same oncogene/tumour suppressor) are enriched for gene pairs encoding physically interacting proteins.

      Main comments:

      I think this study is well designed, rigorously conducted and clearly explained. The conclusions are consistent with the results and I don't have any major suggestions for improving their support. I do, however, have a few suggestions for clarifying the message.

      -Could the authors provide some intuitive explanation (or speculation) about the 2 observed cases of tumour suppressor "addiction" (TP53 and CDKN2A)? While the oncogene addiction cases are relatively easy to interpret, the same effects on tumour suppressors are less clear. Is it basically an epistatic effect, which looks like a relative disadvantage? For example, if we measure fitness: TP53-wt = 1, TP53-wt + CRISPR-TP53 = 1.5, TP53-mut = 1.5, TP53-mut + CRISPR-TP53 = 1.5. That is, inhibiting TP53 in TP53 mutant cells appears to be disadvantageous (relative to WT) only because inhibiting TP53 in wild-type cells is advantageous?

      -In the analysis of overlap between genetic and physical interactions, the result should be presented more precisely. Currently, the text reads "when considering the set of all gene pairs tested, gene pairs whose protein products physically interact were more likely to be identified as significant genetic interactors". However, the referenced figure (Fig. 5a) shows an orthogonal perspective: relative to all gene pairs tested, those that have a significant genetic interaction are more likely to have a physical interaction as well. In other words, in the text, we are comparing the relative abundance of genetic interactions in 2 sets: tested and physically interacting. However, in the figure, we are comparing the relative abundance of protein interactions in 2 sets -- tested and genetically interacting. The odds ratio and the p-values stay the same but the result would be more clear if the figure matched the description in the text.

      Minor comments:

      There're a few places where the more explicit explanation would improve the readability of the manuscript.

      -Page 5: The multiple regression model used to identify genetic interactions is briefly mentioned in the text (and described more extensively in the methods). I think it would be better to explicitly describe the dependent and independent variables of the model in the text, so that the reader can intuitively understand what is being estimated.

      -Page 5: "Using this approach, we tested 142,477 potential genetic dependencies..." -- could the authors provide a better explanation of where that number is coming from? E.g., 142,477 = ... driver genes x 2470 selectively lethal genes?

      -Page 5: Repeating the number of findings of each type would help understanding the landscape of the genetic dependencies (suggested numbers in brackets): "Of the (229?) reproducible genetic dependencies nine were 'self vs self' associations". "The majority (7/9?) of these ... were oncogene addiction effects". "We also identified 2 (2/9?) examples of 'self vs self' dependencies involving tumour suppressors".

      -Page 12: "Three of these interactions involve genes frequently deleted with the tumour suppressor CDKN2A (CDKN2B and MTAP) and mirror known associations with CDKN2A". It is not clear what "mirror" means -- do they recapitulate known interactions?

      -Page 15: "Although we have not tested them here, other features predictive of between-species conservation may also be predictive of robustness to genetic heterogeneity" -- could the authors explicitly list the features?

      Significance

      The identification of a significant overlap between genetic and physical interactions in cancer cell lines is an interesting and promising observation that will help understanding known genetic dependencies and predicting new ones. However, similar observations have been made in other organisms and biological systems. These past studies should be referenced to provide a historical perspective and help define further analyses in the cancer context. In particular, studies in yeast S. cerevisiae have shown that, not only there is a general overlap between genetic interactions (both positive and negative) and physical interactions, but at least 2 additional features are informative about the relationship: a) the relative strength of genetic interactions and b) the relative density of physical interactions (i.e., isolated interaction vs protein complexes). Here's a sample of relevant studies: 1) von Mering et al., Nature, 2002; 2) Kelley & Ideker, Nat Biotechnol, 2005; 3) Bandyopadhyay et al., PLOS Comput Biol, 2008; 4) Ulitsky et al., Mol Syst Biol, 2008; 5) Baryshnikova et al., Nat Methods, 2010; 6) Costanzo et al., Science, 2010; 7) Costanzo et al., Science, 2016.

      Similar observations have also been made in mammalian systems: e.g., in mouse fibroblasts (Roguev et al., Nat Methods, 2013) and K562 leukemia cells (Han et al., Nat Biotech, 2017). I don't think that past observations negate the novelty of this manuscript. The analysis presented here is more focused and more comprehensive as it is based on a large integrated dataset and is driven by a series of specific hypotheses. However, a reference to previous publications should be made.

      As a frame of reference: my expertise is in high-throughput genetics of model organisms, including mapping and analyzing genetic interactions.

      REFEREES CROSS COMMENTING

      I agree with the questions raised by reviewer #1. And I think the authors should be able to address them (either through analyses or reasoning) within 1-3 months.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Reproducibility of genetic interactions across studies is low. The authors identify reproducible genetic interactions and ask the question of what are properties of robust genetic interactions. They find that 1. oncogene addiction tends to be more robust than synthetic lethality and 2. genetic interactions among physically interacting proteins tend to be more robust. They then use protein-protein interactions (PPIs) to guide the detection of genetic interactions involving passenger gene alterations.

      Major comments:

      The claims of the manuscript are clear and well supported by computational analyses. My only concern is the influence of (study) bias on the observed enrichment of physical protein interactions among genetic interactions. 1. Due to higher statistical power the here described approach favors genetic interactions involving frequently altered cancer genes (as acknowledged by the authors). 2. Also some of the libraries in the genetic screens might be biased towards better characterized screens. 3. PPI networks are highly biased towards well studied proteins (in which well studied proteins - in particular cancer-related proteins - are more likely to interact). The following tests would help to clarify if and to which extend these biases contribute to the described observations:<br> 1 . The authors should demonstrate that the PPI enrichment in reproducible vs non-reproducible genetic interactions is not solely due to the biased nature of PPI networks. One simple way of doing so would be to do the same analysis with a PPI network derived from a single screen (eg PMID: 25416956). I assume that due to the much lower coverage the effect will be largely reduced but it would be reconfirming to see a similar trend in addition to the networks on which the authors are already testing. Another way would be to use a randomized network (with the same degree distribution as the networks the authors are using and then picking degree matched random nodes) in which the observed effect should vanish.

      2 . What's the expected number of robust genetic interactions involving passenger gene alterations? Is it surprising to identify 11 interactions? This question could be addressed with some sort of randomization test: When selecting (multiple times) 47,781 non-interacting random pairs between the 2,972 passenger genes and 2,149 selectively lethal genes, how many of those pairs form robust genetic interactions?

      Minor comments:

      Two additional analyses would add in my opinion value to the manuscript:

      -The authors state that reasons for irreproducibility of genetic interactions are of technical or biological nature. Is it possible to disentangle the contribution of the two factors given the available data? Eg how many genetic interactions are reproducible in two different screening platforms using the same cell line vs how similar are results of screens from two different cell lines in the same study?

      -The authors state that "some of the robust genetic dependencies could be readily interpreted using known pathway structures" and argue that they recover for example MAPK or Rb pathway relationships. Is this a general trend? Do genes forming a robust genetic interactions have a higher tendency to be in the same pathway as opposed to different pathways? I think the pathway topic could be in general better exploited: eg does pathway (relative) position play a role?

      Significance

      Personalized cancer medicine aims at the identification of patient-specific vulnerabilites which allow to target cancer cells in the context of a specific genotype. Many oncogenic mutations cannot be targeted with drugs directly. The identification of genetic interactions is therefore of crucial importance. Unfortunately, genetic interactions show little reproducibility accross studies. The authors make an important contribution to understanding which factors contribute to this reproducibility and thereby providing means to also identify more reliable genetic interactions with high potential for clinical exploitation or involving passenger gene alterations (which are otherwise harder to detect for statistical reasons).

      REFEREES CROSS COMMENTING

      Reviewer 2 raises a few valid points, which if addressed would certainly increase the clarity of the paper. In particular addressing the first point (the self interactions of tumor suppressors) seems important to me. From what I can see all of reviewer 2's comments can be addressed easily.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      1.The stimulatory effect of LPHC diet on browning of some WATs has been previously reported (Nutrition, 42, 37-45 Oct 2017). Also, the activation of AMPK was observed in this study. However, the detailed mechanism responsible for AMPK activation by LPHC diet remains elusive in the present study, which lowers its scientific importance.

      Response: OK. We will adequately mention previous study in which the AMPK activation was observed upon LPHC diet and more deeply decipher the molecular mechanisms that lead to AMPK activation by analyzing AMP- and ROS/Ca2+-dependent pathways according to the Reviewer’s suggestions (for more details, see point 2-4).

      2.Different with WAT, LPHC diet increases glucose uptake and FA synthesis in BAT (Nutrition. 30 (4), 473-80 Apr 2014). __Is it possible that AMPK activation in WAT due to the lowered glucose uptake, which might increase AMP/ATP ratio? It is recommended to determine the uptake of glucose in WAT.__

      Response: In the article cited by the Reviewer, authors measured glucose uptake only in BAT and found that it was significantly increased. On the contrary, no data are reported regarding glucose metabolism in WAT. Our in vitro data clearly indicate that AMPK activation occurs upon amino acid restriction (AAR) and in the presence of glucose in the culture medium (see Fig. 6C, Suppl. Fig. 5I). Moreover, glucose uptake is increased upon this condition (see Fig. 5G). Hence a decreased glucose uptake by WAT and the activation of AMPK via a decrease of AMP/ATP ratio has to be likely excluded. However, we will test AMP and ATP levels both in vivo and in vitro and this, together with experiments aimed at deciphering the contribution of mitochondrial ROS (mtROS) and CaMKK (see points 3, 4), we will hopefully clarify the mechanisms of AMPK activation upon LPHC diet.

      The present study indicates that the promotional effect of LPHC diet on WAT browning is dependent on mitochondrial ROS generation. However, it is still unknown why the production of ROS increased and why ROS could activate AMPK. The authors should clarify these critical steps.

      Response: Redox unbalance is widely reported to directly or indirectly stimulate AMPK activation (Shao et al., 2015, Cell Metab; Hinchi et al., 2018, J Biol Chem). Moreover, it has been demonstrated that activation of AMPK could depend on mtROS and be independent of an increase in AMP/ATP ratio (Emerlin et al., 2009, Free Radic Biol Med). Based on this evidence and our results, we believe that, upon AAR or LPHC diet, the recorded increase of mtROS concentration could not derive from an enhanced production but rather to a decrease of intracellular availability of the sulfur amino acid cysteine that represents an efficient ROS scavenger. Actually, by replenishing cysteine through N-acetyl cysteine (NAC) treatment we were able to buffer mtROS increase (see Fig. 6A), as assayed by cytofluorimetric analyses through mitoSox staining, and avoid AMPK phosphorylation (see Fig. 6C and 6J) as well as the downstream upregulation of brown fat and muscular genes (see Fig 6B and 6I). In line with this result, treatment with erastin, a cysteine depleting agent, was able to mimic the effects of AAR and LPHC diet by up-regulating the expression of brown-like and muscular genes (see Fig. 6G). Therefore, to more deeply decipher the mechanisms involved in AMPK activation and to further involve cysteine depletion in mtROS increase and AMPK activation, we could assay mtROS and AMPK levels also following erastin treatment. Of course, to involve cysteine decrease in AMPK activation, measuring intracellular cysteine levels upon AAR and LPHC diet is mandatory and will be carried out. Importantly, we have preliminary data, not included in the present manuscript, indicating that cysteine is decreased both upon AAR and LPHC diet; hence, after increasing the sample size, we will include this result in the revised version.

      4.The relationship between cytosolic calcium and AMPK was not clear. In addition to the fact that AMPK regulates SERCA to increase cytosolic Ca depicted in the present study, AMPK could also be activated by increased Ca via CaMKK. A recent study indicates that the activation of AMPK requires TRPV4-mediated Ca release from ER (Cell Metabolism Volume 30, Issue 3, 3 September 2019, Pages 508-524.e12). This issue should also be clarified.

      Response: Regarding the possible involvement of TRPV4-mediated Ca release from ER, through RNAseq we found that TRPV4 mRNA is slightly expressed in subcutaneous white adipose tissue and changes in its expression were not found upon LPHC diet. Moreover, TRPV4 protein was not detected in our samples by proteomic analysis. Notably, by integrating transcriptomic and proteomic data, it emerged that cell membrane intracellular calcium transporters (i.e. CACNG1, CACNA2D1), which are interconnected to the network of sarcoplasmic reticulum calcium cycle, are upregulated upon LPHC diet (see Fig. 5I). Therefore, we will evaluate the effects of a calcium channel blocker (e.g. Verapamil) and/or extracellular calcium chelator (e.g. BAPTA) on AMPK activation and its downstream gene expression cascade. In parallel, to possibly involve CAMKK in the activation of AMPK, treatment with a CAMKK inhibitor (e.g. Sto-609) will be carried out. Importantly, mtROS are upstream inducers of intracellular calcium raise (Mungai et al., 2011, Mol Cell Biol) and therefore an involvement of mtROS-Ca2+ axis could not be ruled out. In line with this hypothesis, by buffering mtROS through NAC treatment, we were able to abrogate intracellular calcium raise elicited by AAR (see Fig. 6F). Therefore, by performing the above described experiments and by evaluating CAMKK following NAC treatment, we will be hopefully able to establish whether AMPK activation is AMP-(in)dependent and/or relies on mtROS/Ca2+/CAMKK pathway.

      Reviewer #2

      o Interesting paper but see comments below.

      Response: OK, thanks

      o The relevance of the described effects for whole-body energy balance regulation is not shown. Indirect calorimetry could be interesting. The only whole-body effect (slightly improved glucose clearance in oGTT) was very small.

      Response: OK. As suggested by this Reviewer we can include indirect calorimetry to give a more comprehensive view of the effects of LPHC diet on the whole-body energy balance (see also the following point).

      o …1) Indirect calorimetry could be very helpful to show effects on energy metabolism. 2) Can the authors discuss why they didn't conduct the experiment also under thermoneutral conditions?

      Response: OK. As stated above, we will add indirect calorimetry experiments and, as suggested by this Reviewer, we will discuss this issue in the revised version. Importantly, we already have indirect calorimetry data that were not included in the present version of the manuscript and that we will add in the revised version.

      o Maybe an additional collaborator is necessary.

      Response: Yes, collaborators who performed indirect calorimetry will be included as co-authors in the revised version.

      o Article numbers of all diets must be added and information if the all diets were purified diets. This could have effects on the gut microbiome.

      Response: OK. We will add the article numbers as well as more detailed information about all the diets.

      o Sample sizes are very low. The authors should explain why only males were used in the experiments. oGTT analysis should also include calculation of area under the curve. No explicit statement if correction for multiple testing is required or other measures to reduce false positive results.

      Response: We have used only male mice to avoid sex bias. We will edit the OGTT analysis graph to include calculation area under the curve. Regarding the sample size, a mistake occurred when the figure legends have been written. Actually, in materials and methods section, we clearly indicated the number of animals used (n=8 mice for WD and n=6 mice for LPHC diet and not n=3). Information regarding the statistical analyses was included in Bioinformatics and Statistical Analysis section. In this section, we described how the correction for multiple testing was carried out (i.e. one-way ANOVA followed by Dunnetts correction). In the revised version, we will dedicate a separate section for statistical analysis to avoid misreading.

      **Minor comments:**

      o Are prior studies referenced appropriately?* Relevant reference: Desjardins, E.M., Steinberg, G.R. Emerging Role of AMPK in Brown and Beige Adipose Tissue (BAT): Implications for Obesity, Insulin Resistance, and Type 2 Diabetes. Curr Diab Rep 18, 80 (2018). __https://doi.org/10.1007/s11892-018-1049-6____ __

      Response: We thank the Reviewer for this suggestion and we will include and appropriately discuss this paper.

      o *Are the text and figures clear and accurate?* YES

      Response: Ok, thanks for this positive evaluation.

      Reviewer #3

      My main critique, coming from the perspective of a dietitian that works in human trials in the US, is that the diet called a "Western" diet is not similar to the diet that humans with metabolic problems typically eat…

      Response: OK. The aim of this work was to study at molecular level the responses of white adipose tissue to changes in protein to carbohydrate ratio. We completely agree with the Reviewer that “Western” diet is not an appropriate term to describe the diet that we have used; hence, we will change “Western diet” in “Control diet” throughout the manuscript. Actually, according to the general guidelines for nutrition studies on mice, when experimental animals are fed a special diet (i.e. LPHC in our study), the control animals should be fed a diet matched in every way to the special diet, except of course for the dietary variable (i.e. P/C ratio in our study) that the researcher is studying (Pellizzon and Ricci, The common use of improper control diets in diet-induced metabolic disease research confounds data interpretation: the fiber factor (2018). Nutrition & Metabolism 15:3).

      **Major comments:**

      -The authors provide strong support their key findings.

      Response: We thank this reviewer for this positive evaluation.

      -The mice were on the LPHC diet for a short period of time (2 weeks). Ongoing amino acid deficiency has potential to promote frailty and other deleterious outcomes. No long-term diet outcomes can be inferred from this study.

      Response: OK. We will discuss this issue, highlighting that this dietary regimen should be recommended on human only for a short period and that further study is needed for understanding the long-term effects of LPHC diet.

      -The authors have provided no evidence that a LPHC diet improves human health, so I think they need to scale back those assertions, particularly as it relates to people shifting to a LPHC from what they currently eat, since people don't typically eat what the authors refer to as a "Western" diet as it's defined in this paper.

      Response: OK. We will reference studies in which LPHC diet has been suggested to improve human health.

      -As far as I can tell, no additional experiments are needed to support their claims identifying how the LPHC affects AMPK activated pathways in mice.

      Response: We thank this reviewer for this positive evaluation.

      -The methods are rigorous and sufficiently described to be reproducible.

      Response: We thank this reviewer for this positive evaluation.

      **Minor comments:**

      -__Minor grammatical issues through e.g. "It is worth to notice" in last paragraph on page 12; there are font differences in the methods section __

      Response: OK. We will correct these minor grammatical/font issues.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The authors use a mouse model to compare molecular responses to a 23% protein/57 carbohydrate/20 fat diet to a 7% protein/73% carbohydrate/20% fat diet. The authors show that the low protein diet enhanced activation of biological pathways related to fatty acid catabolism including FAO, TCA cycle and electron transport chain in sWAT but not BA, similar to cold exposure. The authors use redundant assays and experiments in cell models to validate the genes and molecular pathways involved in the sWAT response to a low protein diet in mice. The authors show that AMPK activation promotes the induction of typical brown fat and muscular genes in sWAT. The authors identify novel non-canonical pathways (Serca1 and Serca2a,) that are upregulated in sWAT browning.

      My main critique, coming from the perspective of a dietitian that works in human trials in the US, is that the diet called a "Western" diet is not similar to the diet that humans with metabolic problems typically eat. The typical US diet is closer to approximately a 17% protein/50% carbohydrate/33% split (https://doi.org/10.1016/j.nut.2015.02.007, https://doi.org/10.1038/s41430-017-0031-8). This level of protein utilized for the experimental "Western" diet here is comparable to levels used for "high protein" diets in some human studies (https://doi.org/10.1111/nure.12111).

      Since the experimental diet differs substantially from what metabolically sick people typically eat, the ability to speculate how the findings from this study may apply to humans with metabolic diseases is very limited. This paper is really well-done, but I think the authors should call the experimental diet a high-protein, moderate carbohydrate diet (HPMC), not a "Western" diet. There are many who argue that such a HPMC is metabolically advantageous and promotes weight loss/improved body composition, so this study lays the groundwork for refuting that guidance. It would be exciting to see a head to head comparison of the two diets in humans in the future!

      Major comments:

      -The authors provide strong support their key findings

      -The mice were on the LPHC diet for a short period of time (2 weeks). Ongoing amino acid deficiency has potential to promote frailty and other deleterious outcomes. No long-term diet outcomes can be inferred from this study.

      -The authors have provided no evidence that a LPHC diet improves human health, so I think they need to scale back those assertions, particularly as it relates to people shifting to a LPHC from what they currently eat, since people don't typically eat what the authors refer to as a "Western" diet as it's defined in this paper.

      -As far as I can tell, no additional experiments are needed to support their claims identifying how the LPHC affects AMPK activated pathways in mice.

      -The methods are rigorous and sufficiently described to be reproducible

      Minor comments:

      -Minor grammatical issues through e.g. "It is worth to notice" in last paragraph on page 12; there are font differences in the methods section

      Significance

      The work is significant as it describes the metabolic effects of a LPHC at the molecular level for the first time. This paper demonstrates how a low protein diet may promote longevity and improve glucose metabolism, which has been shown to some extent in humans, but hasn't had a mechanistic explanation until now.

      If similar findings were supported in longer term animal and human trials, it could lay the groundwork for modifying dietary recommendations to promote metabolic health and longevity.

      This paper is of interest to basic scientists studying diet and energy metabolism. The potential health implications are interesting to people in healthcare and scientists studying human metabolism.

      I am a dietitian who has conducted weight loss trials in humans, emphasizing varying macronutrient ratios. I have also done whole body metabolism work in humans using metabolic chambers. I have experience in urinary proteomics, but I lack sufficient expertise to scrutinize much of the methodology of the basic work you present here.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Aquilano et al. submitted a manuscript investigating the effects of a low-protein/high-carbohydrate diet on AMPK-dependent thermogenic activity in subcutaneous adipose tissue in mice presumably resulting in stimulated energy dissipation. Based on the observation that LPHC diets may promote metabolic benefits the authors aimed to study the underlying molecular functions. They focused mainly on a comparison of molecular markers for thermogenesis and the related metabolic pathways in brown and subcutaneous white adipose tissue in response to feeding mice a LPHC diet for two weeks. Using a proteomics approach first, they identified 75 proteins differentially present in sWAT compared to BAT. These could be linked both to canonical as well as non-canonical (muscular) thermogenic functions as the authors state. Overall, they conclude that feeding a LPHC diet induces a white-to-brown conversion in sWAT. Deep RNA-sequencing identified 416 up and 52 down-regulated gene transcripts in sWAT. GO terms analysis showed enrichment for biological processed related to mitochondrial fatty acid catabolism, response to cold, and muscle contraction genes. Following up this rational, they conducted several experimental approaches to identify regulators in this system. For example, they tried to rule out that changes in gut microbiome composition could mediate metabolic benefits in response to LPHC diet. Finally, they hypothesized that nutrient shortage in particular amino acid lowering is responsible for sWAT browning. Here, AMPK seems to play a central role in the browning of sWAT in response to LPHC diet.

      Major comments:

      o Are the key conclusions convincing? YES interesting paper but see comments below.

      o Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? YES - The relevance of the described effects for whole-body energy balance regulation is not shown. Indirect calorimetry could be interesting. The only whole-body effect (slightly improved glucose clearance in oGTT) was very small.

      o Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary to evaluate the paper as it is, and do not ask authors to open new lines of experimentation. YES - 1) Indirect calorimetry could be very helpful to show effects on energy metabolism. 2) Can the authors discuss why they didn't conduct the experiment also under thermoneutral conditions?

      o Are the suggested experiments realistic for the authors? It would help if you could add an estimated cost and time investment for substantial experiments. Maybe an additional collaborator is necessary.

      o Are the data and the methods presented in such a way that they can be reproduced? YES mostly - but article numbers of all diets must be added and information if the all diets were purified diets. This could have effects on the gut microbiome.

      o Are the experiments adequately replicated and statistical analysis adequate? Sample sizes are very low. The authors should explain why only males were used in the experiments. oGTT analysis should also include calculation of area under the curve. No explicit statement if correction for multiple testing is required or other measures to reduce false positive results.

      Minor comments:

      o Are prior studies referenced appropriately? Relevant reference: Desjardins, E.M., Steinberg, G.R. Emerging Role of AMPK in Brown and Beige Adipose Tissue (BAT): Implications for Obesity, Insulin Resistance, and Type 2 Diabetes. Curr Diab Rep 18, 80 (2018). https://doi.org/10.1007/s11892-018-1049-6

      o Are the text and figures clear and accurate? YES

      Significance

      My expertise: Energy metabolism in rodent models for metabolic disease, body temperature regulation, body mass regulation

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This work found that a LPHC meal activates browning of sWAT by ROS/AMPK pathway, and tried to clarify the detailed mechanism of the beneficial effect of LPHC diet. Although the paper contains scientific novelty and is well-written, most of the results are descriptive and deeper mechanistic study seems lacking. Here listed some comments and questions.

      1.The stimulatory effect of LPHC diet on browning of some WATs has been previously reported (Nutrition, 42, 37-45 Oct 2017). Also, the activation of AMPK was observed in this study. However, the detailed mechanism responsible for AMPK activation by LPHC diet remains elusive in the present study, which lowers its scientific importance.

      2.Different with WAT, LPHC diet increases glucose uptake and FA synthesis in BAT (Nutrition. 30 (4), 473-80 Apr 2014). Is it possible that AMPK activation in WAT due to the lowered glucose uptake, which might increase AMP/ATP ratio? It is recommended to determine the uptake of glucose in WAT.

      1. The present study indicates that the promotional effect of LPHC diet on WAT browning is dependent on mitochondrial ROS generation. However, it is still unknown why the production of ROS increased and why ROS could activate AMPK. The authors should clarify these critical steps.

      4.The relationship between cytosolic calcium and AMPK was not clear. In addition to the fact that AMPK regulates SERCA to increase cytosolic Ca depicted in the present study, AMPK could also be activated by increased Ca via CaMKK. A recent study indicates that the activation of AMPK requires TRPV4-mediated Ca release from ER (Cell Metabolism Volume 30, Issue 3, 3 September 2019, Pages 508-524.e12). This issue should also be clarified.

      Significance

      This work indicates that LPHC diet promotes browing of WAT through activation of AMPK by elevating mitochondrial ROS production. Compared to previous studies, this work firstly found the critical importance of mitochondrial ROS in activation of AMPK through a series of works on omics data. However, they failed to clearly explain the detailed mechanism responsable for either enhanced mitochondrial ROS production by LPHC diet or activation of AMPK by mitochondrial ROS. Therefore, due to most of the conclusions have been presented in some previous published papers, the main novelty of the present work should be greatly improved by further mechanistic stidies.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      I thank the referees for their enthusiasm and time providing critical feedbacks to our manuscript. The novelty of our work is the identification of the importance of Mfn2 in regulating the Rac signaling and neutrophil migration& adhesion, which is significantly relevant to the mitochondrial field and cell biology in general. Below please find our point-to-point response to the comments.

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)):**

      Introduction:

      "Although mitochondria-derived ATP possibly regulates neutrophil chemotaxis in vitro (Bao et al., 2015), removal of extracellular ATP improves neutrophil chemotaxis in vivo (Li et al., 2016). These conflicting reports prompted us to search for mechanisms delineating the role of mitochondria in neutrophil migration outside the realm of ATP or cellular energy (Bi et al., 2014; Schuler et al., 2017; Zanotelli et al., 2018)." This sentence is superficial and misleading: extracellular ATP may interfere with chemotaxis through various energy-independent mechanisms (see for example Zumerle et al. Cell Reports 2019) and this is not conflicting with the role of intracellular ATP in migration.*

      We were not clear in the writing that Bao et al suggest that neutrophils secret ATP at the leading edge and mitochondria at the leading edge is the source of the extracellular ATP. Both studies focused on extracellular ATP. We agree that the reports are not necessarily conflicting since exogenous ATP can induce additional signaling. We rewrote this sentence emphasizing that we are looking for mechanisms in addition to ATP, which is distinct from previous studies.

      Figure 1: The authors didn't show evidence of the genome edition (PCR, RFLP or Sequencing over the sgRNA target) or at least RT-PCR or WB for MFN2. In Fig 1b, 1c the scale bar is missing. "Neutrophils were sorted from both lines and their respective loci targeted by the 4 sgRNAs were deep sequenced." There are no data about sorting strategies for zebrafish neutrophils in the figure. Moreover, only 2 sgRNAs are shown and there are no sequencing data.

      To show evidence of the genome edition, we have deep sequenced this loci of mfn2 and opa1 and the mutation frequencies were stated in the original text. The sorting strategies were described in Methods-Mutational Efficiency Quantification. Each mfn2 KO has 2 individual sgRNAs, and two KO (mfn2 KO and mfn2 KO#2) were shown in Fig 1b, so there are 4 sgRNAs targeting mfn2. Since each embryos have approximately 150 neutrophils, WB is not feasible. Sequencing is the standard method (Ablain et al., 2015; Zhou et al., 2018). We only stated the mutation efficiency in the manuscript because amplifying the genomic DNA from the sorted cells introduces PCR bias and the numbers are not a quantitative reflection of the degree of gene disruption. We will include the sequencing result of the sgRNA target sites in a supplemental Figure.

      We used one scale bar for all the panels in Fig 1b,c. All panels are at the some magnification.

      Figure 2:** In the WB, reconstitution is not obvious. In general, all WBs are not quantified (and they should be quantified). The in vivo experiment does not have proper controls. For example, can the authors exclude that in these mice there is reduced inflammation because neutrophils have defective activation? What about NETs? And cytokines/chemokines? And exocytosis? In the absence of these controls, the experiment cannot be properly interpreted.

      We have quantified all WBs in our study. The results were sometimes stated in the text only. We will add the quantifications to each blot.

      The mice model we chose is used to evaluate in vivo neutrophil migration. We used a neutrophil specific promoter to delete mfn2 in mice and collected data at a very early time point when the tissue inflammatory environment is determined by tissue resident sentinel cells, such as macrophages. Although our results support that mfn2 is required for neutrophil migration in mammals, we agree that we can not fully rule out that other neutrophil functions are also regulated by mfn2.

      To address whether other neutrophil functions are affected by MFN2, we will performed assays to evaluate NETosis and degranulation in MFN2 KD HL-60 cells to evaluate the other neutrophil functions.

      Figure 3: The conclusion of the authors "In summary, Mfn2 modulates the actin cytoskeleton and cell migration in MEFs" should be supported by experiments to distinguish between the specific role of Mfn2 and the role of mitochondrial dynamics (Opa1, Drp1, Mfn1). It is also not clear why the authors decided to use MEFs instead of other cells (more similar to neutrophils which are not adherent cells). The results obtained in MEFs may be irrelevant for neutrophils.

      We agree that MEFs are very different from neutrophils. We chose MEFs since the function of Mfn2 in MEF is well characterized (Chen et al., 2003; de Brito and Scorrano, 2008; Naon et al., 2016). Both Mfn1 and Mfn2 MEF have fragmented mitochondria. Mfn1, which is very similar to Mfn2, serves as the best control. We will confirm the mitochondria structure in the KO cells.

      For specificity, in addition to mfn2, we looked at Mfn1 and opa1 in different systems. We did not select Drp1 since the mitochondrial network in neutrophils is highly fused (Fig 4 and 5)(Maianski et al., 2002; Zhou et al., 2018).

      We have also knocked down Opa 1 in HL-60s. We observed massive cell death in this line and cell migration is affected, possibly due to a depletion of cellular ATP as reported (Amini et al., 2018). We will include the data showing cell death, qRT to show knockdown efficiency and chemotaxis. In zebrafish neutrophils, knocking out Opa1 also reduced cell migration (Fig 1S).

      Figure 4-5: Fig 5a: in ctrl and sh1 the ER seems to be larger than the phalloidin (=cytoskeleton=cell border approximately) in a few regions. Only the sh1+T seems to fit correctly.

      We use the F-actin staining as an indicator of cell front. F-Actin is predominant at cell front, but much less in the cell body and uropod. Here we set the confocal laser power at a certain level to give us a good resolution of brighter signals which may not be strong enough to detect signals in the cell body. That’s why the fluorescence is very dim or even absent in the cell body. However, the majority of ER do fit in the cell border if look closer.

      The TEM image (only 1 in supplementary) is not sufficient to convince that the tethering is lost. Quantification of number of contacts and distance between ER and mitochondria should be included.

      Using EM method, Mfn2 ablation decreases the ER-contacting mitochondrial surface by ∼20–35% (Naon et al., 2017). Using the same cells, different groups reached different conclusions using TEM(Filadi et al., 2017). We reason that ER-mitochondria contact sites are rare events in TEM since the samples are sliced. We will try to take more TEM images to quantify the distance. However, we are not sure that we can come up with a definitive conclusion by TEM. Nevertheless, we observed significant mitochondrial structural changes using IF and observed the changes in cytosolic calcium levels, which is consistent with the known function of Mfn2 as a ER-mitochondrial tether (Naon et al., 2016).

      The title of figure 5 is wrong. However, in these figures, it is clear that cells are beautifully polarized, with mitochondria accumulating at the uropod (and even more in the absence of Mfn2). When comparing these images with those published by Campello et al (JEM 2006), there are 2 observations that can be made: first of all, these data confirm that mitochondrial fission promotes cell polarity; second, they suggest that the defect is not at the level of cell polarity/chemotaxis.

      We have fixed the title of figure 5.

      We agree that mfn2 defective neutrophils does not have a defect in cell polarization. The defects in migration is possibly due to other reasons such as poor adhesion or regulation in the actin cytoskeleton dynamics. However, our data is not sufficient to support that mitochondrial fission promotes cell polarization and chemotaxis.

      Figure 6: Calcium data are, in general, very weak. First of all, controls with ionomycin are missing. Statistical analyses of the curves should be included. As for the use of the MCU inhibitor Ru360, is there any evidence that it is cell-permeant in this context? Is it blocking MCU? Since the authors can show mitochondrial calcium upon FMLP, they should also demonstrate that Ru360 is indeed working and inhibiting mitochondrial calcium uptake. The sentence "The MCU inhibitor Ru360did not cause further reduction of chemotaxis in MFN2 knockdown dHL-60 cells (Supplementary Fig. 6c, d and Supplementary Movie. 12), indicating that MCU and MFN2 lies in the same pathway in terms of regulating chemotaxis in dHL-60 cells" is speculative. In general, there is no solid demonstration that the effect is calcium-mediated.

      We will include the control of ionomycin and include statistics of the results.

      Ru360 is a widely used MCU inhibitor. The fact that Ru360 itself inhibited neutrophil migration supported that the chemical enters cells. We agree that stating “indicating that MCU and MFN2 lies in the same pathway in terms of regulating chemotaxis in dHL-60 cells" is speculative. In addition, we tried to reduce cytosolic calcium levels in mfn2 KD cells either using Ca2+ chelator (BAPTA, in Fig S6) or an IP3 receptor inhibitor. In both cases we observed reduced migration blocking calcium signal alone. The mfn2 KD phenotype was not rescued. This could due to that multiple molecules/pathways are calcium dependent in cell migration. We will include all the negative data. We thus far are still unable to establish a functional link of the calcium with mfn2 regulated signaling.

      We have moved the calcium data to Fig 4. The elevated calcium signal is an indirect evidence to support the loss of ER-mitochondria tether. We have modified our conclusion to leave out calcium as a relevant signal regulated by mfn2 for neutrophil migration.

      As for Rac, it is surprising to see that Rac inhibition has no effect on cell migration. Rac is known to promotes migration in fibroblasts and other cell types and Rac deficiency inhibits migration (see for example Steffen et al, JCS 2013). Two sets of experiments are absolutely required: 1) verify this in fibroblasts since it has been elegantly shown that Rac is essential in these cells for migration; 2) analyse the effect of Rac inhibitors in pPak kinetics.

      Rac is required for cell migration and the growth of branched actin network. The Rac inhibitors we selected here are specific to two rac GEFs, vav and Tiam. Steffen et al, JCS 2013 used Rac1 KO MEF, which is different from ours. Thus the works are not contradictory. MEFs are very different from neutrophils. We chose MEFs since there are knockout cells available and well characterized. The MFN2 KO cells display prominent lamellipodia, which is also consistent with the observation in Steffen et al, JCS 2013. We have used these two inhibitors in MEF wound closure and did not observe a strong phenotype.

      We will analyze the effect of Rac inhibitors in pPak kinetics in the control and Mfn2 deficient dHL-60 cells.

      *Reviewer #1 (Significance (Required)):**

      As presented here, the manuscript has a modest significance. The audience would be specialised: cell migration, cell signalling. My expertise is immunology, cell activation, cell migration, cell signalling.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      **Major comments:**

      Although the results could be very interesting, and could be significantly relevant to the mitochondrial field and the cell biology one in general, major points need to be addressed to fully support conclusions of the authors. Different controls and quantification are missing, Actin dynamics analysis should be improved, effects of the artificial tether is weakly characterized and the demonstration of the specific role of mito-ER contacts via mfn2 in migration should be reinforced.

      -In figure 1, quantification of circulating neutrophils is required in Mfn2 KO embryos. The authors should also show these quantified results for OPA1KO, which are just mentioned in the text. In addition, in figure 1b and d, the neutrophils from the Mfn2KO embryos seem bigger compared to control. Can the authors comment on neutrophils size and potential contribution to the phenotype? Finally, the authors propose a defect in neutrophil migration in Mfn2-KO, however neutrophils are found in the circulation. The authors should explain these results.*

      Since the cells are all in circulation, we can only estimate the percentage. Overall, the phenotype is drastic, shown in movie S1. We will state how many fish embryos we have imaged and how often we observe this phenotype (only 1 or 2 in the tissue (mfn2 KO) or in circulation (control)). The bigger spots are resulted from cells outside the focal plane-zebrafish embryos are thick tissues. We agree that since neutrophils in the KO fish are all circulation, we cannot make a conclusion whether they can migrate in tissue in zebrafish. We conclude that “mfn2 regulates neutrophil tissue retention and extravasation in zebrafish”, but did not comment on chemotaxis.

      -The authors need to reinforce the Mfn2 specificity for their phenotype. In particular in Fig S1, they show that loss of OPA1 significantly decreases neutrophil migration in vivo. However, they then only study the effect of Mfn1 silencing in neutrophil and MFN1 KO MEFs (Sup Fig s3). The authors should perform the same experiments in neutrophil and MEF upon loss of OPA1 (similar to Fig S3). Does loss of OPA1 and Mfn1 decrease neutrophil arrest to activated endothelial cells?

      We knocked down OPA1 in HL-60 cells. The cells appear unhealthy and display a migration defect, consistent with the data in zebrafish. We are not comfortable making conclusions here since secondary affects in dying cells may cause any phenotype not directly attributed to the loss of OPA1. Nevertheless, we will include the data.

      We have decided not to include Opa1 KO MEF since the cell morphology as documented in ATCC is similar to that of WT MEF. Only the MEF2 KO MEF is more circular. MFN1 KO MEF is a better specificity control which we have characterized in depth.

      Since Mfn1 KD HL-60 cells migrate well on surface, they are not expected to have adhesion defects. Nevertheless, we will determine whether loss of Mfn1 decrease neutrophil arrest to activated endothelial cells and include the data.

      -Using their images, the authors should also document on the directionality of the cell during cell migration. Do Mfn2 depleted cells do not migrate because they are arrested or because they are lacking directionality? Environment/chemokine sensing defects?

      We will quantified the directionality of the cells. As pointed out by reviewer 1, mfn2 deficient cells can polarize and not defective in chemokine sensing. We do not expect a significant change in directionality defect.

      -Actin dynamics analysis should be improved. Loss of Mfn1 and Mfn2 lead to cell shape changes. The authors should quantify this phenotype by analysing cell circularity (as well as for Opa1 loss). Stress fibres number or Phalloidin intensity quantification in cell body should also be performed.

      We will quantified the circularity, stress fiber numbers and phalloidin intensity in Mfn1 and Mfn2 KO MEFs.

      -Can the migration defects could be attributed to Focal adhesion protein dynamics defects? The authors shown an hyperactivation of Rac1 and an hyperphosphorylation of PAK, which can control FAP (focal adhesion proteins) dynamics. In addition, immunofluorescence analysis shows a decreased signal and cellular misdistribution of paxillin. The authors should characterize these phenotypes. FAP levels (Paxillin/Phospho-Paxillin and Vinculin) should be analysed by immunoblot, the number of FAP/cell, distribution and size should also be quantified. Their dynamics should also be analysed by live cell imaging. Finally, Paxillin level and distribution seems to be also impacted in Mfn1KO cells. Can the authors comment on that? The different quantifications would help to better understand the effect of different mitofusins in cytoskeleton dynamic.

      We thank the reviewer for the great advices for our follow up work. So far our results supports Rac over activation as a relevant pathway how mfn2 regulates neutrophil migration. Although Rac can regulate focal adhesion dynamics in other cells (Rooney et al., 2010), how Rac activation regulates focal adhesion dynamics in neutrophils is not clear. Mfn2 regulated membrane tether could affect lipid trafficking, cellular metabolism and other signaling molecules. It will take substantial amount of work to make a conclusion and it is more suitable a separate report. This is one of the directions we will pursuit in our future studies.

      -Please perform rescue experiments for cell migration in MFN2KO and MFN1KO MEFs. Immunoblots showing protein levels of these proteins would be appreciated. To really discriminate how Mfn2 regulates cell migration, the authors should also perform rescue experiments using a fusogenic mutant Mfn2 ((K109A). It will help to demonstrate the relevance of mito-ER contacts and not mitochondrial fusion in the phenotype.

      For the reason mentioned above, we do not plan to do additional experiments in MEF cells since this work is focused on neutrophils. It is documented that Mfn2 K109A cannot restore mitochondrial fusion. However, it is not clear whether this construct can restore ER-tether. Result using this construct will be hard to interpret.

      -Figure 4, the authors stipulate that Mfn2 regulates ER-mitochondria tethering. However, the authors present no evidence for this conclusion. The authors should perform manders coefficient in MFN2 KO cells and compared it to control. Also, loss of Mfn2 induces mitochondrial fragmentation, which can lead to problem for mito-er contacts quantification by light microscopy. The authors should use their TEM pictures to quantify mito-ER contacts (Number, length and % of mito perimeter), not only mitochondrial morphology. Mfn1 should be used as negative control. it would be interesting also to determine the status of the mito-ER contact in the different conditions used in the manuscript to stimulate cell migration like fMLP treatment.

      We have performed manders coefficient in the mfn2 KD cells and observed no difference compared with the control. It is possibly due to the prevalent ER structure in the cells-despite the structural change, mitochondria are still mostly on top of ER when examined using IF. Using EM method, Mfn2 ablation decreases the ER-contacting mitochondrial surface by ∼20–35% (Naon et al., 2017). Using the same cells, different groups reached different conclusions using TEM(Filadi et al., 2017). We reason that ER-mitochondria contact sites are rare events in TEM since the samples are sliced. We will try to take more TEM images to quantify the distance. However, we are not sure that we will come up with a definitive conclusion by TEM. Nevertheless, we observed significant mitochondrial structural changes using IF and observed the changes in cytosolic calcium levels, which is consistent with the known function of Mfn2 as a ER-mitochondrial tether (Naon et al., 2016).

      -The authors use an artificial tether to manipulate mito-ER contacts in cellulo. However, no information from its origin, or its design are documented in the manuscript. In addition, the authors should show that this tether efficiently works by analyzing mito-ER contacts upon expression by EM and mitochondrial calcium uptake. Does this tether rescue mito-ER contacts defects induced by loss of Mfn2? How the authors explain that the tether rescues mitochondrial morphology defects in MFN2KO? In these conditions, mitochondria should not be able to fuse anymore as Mfn2 is lost? This is really intriguing results. Does the tether rescue the other parameters? Mitochondrial distribution (with quantification)? Cell shape? Paxillin defects? ROS and membrane potential? These rescue experiments analyses are important to determine which parameters are really involved in cell migration defects due to the decreased tethering. Finally, it would be of great interest to analyse the effect of the tether alone on cell migration, Rac1 activity, cell shape? Gain of function? These results may reinforce the idea that contact sites regulate cell migration.

      The tether is a GFP protein carrying both ER and mitochondrial localization sequences at the ends (Kornmann et al., 2009). The details are now added to the manuscript.

      In HL-60 mfn2 KD cells, tether expression partially rescues mitochondrial distribution (quantified in Fig 5c), cell migration and Rac over activation. Although ROS and membrane potential are slightly affected by Mfn2 deletion in HL-60 cells, it is not clear whether they play any roles in mfn2 regulated cell adhesion or migration. We will attempt to use TEM to determine the mitochondrial structure upon tether rescue.

      Despite multiple attempts, we could not obtain a line over-expressing the tether in wt HL-60 cells. We suspect that further increase in the tether is toxic to the cells.

      -It is well established that a decrease of membrane potential leads to a decrease of mitochondrial calcium uptake. Calcium results obtained by the authors without any information on the roles of the tether could not lead to any conclusion. Does the tether rescues membrane potential and calcium uptake by the mitochondria? So far, the decrease of mitochondrial calcium upon stimulation in Mfn2KO cab be attributed to both mito-ER contact or membrane potential defects. It has been shown that MEFs MFN2 KO can lead to a decrease of MCU provel level leading to a decrease of mitochondrial calcium uptake (PMID: 25870285). The authors should also check MCU protein level.

      We observed that mfn2 deficiency resulted in a minor reduction in membrane potential. Although Mfn2 KO MEF has reduced level of Mcu, Mfn2 silence in MEF does not affect Mcu levels (Filadi et al., 2015). Another group also concluded that Mfn2 deletion does not necessarily affect Mcu levels (Naon et al., 2016). Nevertheless, we will measure the MCU protein level in the Mfn2 knockdown HL-60 cells.

      -Hyperactivation of Rac1 is only based on phosphorylation of PAK, which is quite weak. The authors should better describe the hyperactivation of RAC1 or other RhoGTPases in their Mfn2 KO MEFs. What are the levels of RAC1 and other RhoGTPases? Subcellular distribution in the cell? Kits are also available to determine RhoGTPase activity by pull down assay (Cell biolabs).

      In Mfn2 KO MEFs, Rac overactivation is suggested by the increased lamellipodia formation, classical Rac readouts. Since the current manuscript focuses on neutrophils, we will performed the Rac GFP pull down experiments in HL-60 cells. We will also stain Rac GTP in HL-60 cells.

      *-The references are up-to-date. The text and the figures are clear and accurate.**

      **Minor comments:**

      -The authors should show the efficiency of the KO generated for Mfn2 and Opa1 in zebrafish embryos. Sequencing results to highlight the position of the mutations and their consequences on the coding protein should be shown, as well as immunoblot analysis should be performed to analyse Mfn1, Mfn2 and OPA1 protein levels. The generation of a MFN1-KO transgenic line would have been appreciated to finely compare the roles of the 3 GTPases involved in mitochondrial fusion during neutrophil infiltration and migration in vivo.*

      To show evidence of the genome edition, we have deep sequenced this loci of mfn2 and opa1 and the mutation frequencies were stated in the original text. Since each embryos have approximately 150 neutrophils, WB is not feasible. Sequencing is the standard method (Ablain et al., 2015; Zhou et al., 2018). We only stated the mutation efficiency in the manuscript because amplifying the genomic DNA from the sorted cells introduces PCR bias and the numbers are not a quantitative reflection of the degree of gene disruption. We will include the sequencing result of the sgRNA target sites in a supplemental Figure.

      The mfn1 gene in zebrafish is duplicated. We are not sure whether we can obtain efficient disruption at both loci. We hope the results using Mfn1 KO MEF and MFN1 KD HL-60 cells are enough to show a specific role of Mfn2 in cell migration.

      -MFN1, MFN2, AND OPA1 protein levels should be analysed by immunoblot in the Mfn1 and Mfn2 KO MEFs.

      It is unlikely that mfn1/2 KO will affect OPA1 levels (Saita et al., 2016). Both MFN1 and MFN2 MEF display fragmented mitochondrial network which can be rescued by overexpression of MFN1 or MFN2 (Chen et al., 2003). The level of OPA1 in the cells are not relevant. We will stain mitochondria in the mfn1/2 KO MEFs to make sure that the cells have fragmented mitochondria as expected.

      -In cell spreading assay, it would be great to identify cells during the process, by an asterix for example. "wt MEFs extended transient filopodia and lamellipodia and eventually elongated, whereas Mfn2-null MEFs generated extensive membrane ruffles and retained the circular shape". It would be interesting to quantify these different parameters.

      We will add Asterixes to the cells. We will quantify the percentage of cells that can rearrange their cell shape in the WT and Mfn2 KO MEFs.

      -For all their immunoblot analysis, the authors should use a mitochondrial marker as loading control (VDAC1, TOM20, HSP60...). In figure 5, Vinculin should not be used a loading control, with its role in focal adhesion dynamics.

      Vinculin is stable in HL-60 cells under multiple conditions and selected as a control. The signal intensity correlates well with the amount of protein loaded. Using mitochondrial proteins as loading controls is not common and may be risky as the amount of mitochondria in cells can be variable.

      -Legends for figures 5 and 6 are inverted.

      Thanks, we have changed the heading of figure 5. The legends were correct.

      -Please document in the material and methods section, how confocal images have been acquired: number of z-stacks, reconstitution, 3D analysis...

      We will update in the method the parameters of imaging acquisition.

      -The authors should show their results of blood cell composition quantification in ctrl vs MFn2 depletion.

      We will include the results of blood cell composition in a supplemental figure.

      -The authors should describe all the acronyms used throughout all the manuscript. For example, LTB4, fMLP...

      We have describe all the acronyms in the updated manuscript.

      *Reviewer #2 (Significance (Required)):

      Beyond their role in energy production, mitochondria are involved in numerous cellular functions including cell migration. Mitochondria form a network balanced by fission and fusion events, where membrane contact sites with the endoplasmic reticulum are crucial. These contact sites are also involved in mitochondrial and cellular functions via their capacity to exchange lipids, metabolites and calcium. The role of mitochondria in cell migration has started to emerge where mitochondrial fragmentation and/or mitochondrial calcium homeostasis are acknowledged to drive cancer cell migration and to regulate actin dynamics. In this manuscript, Zhou W and colleagues proposed for the first time the role of mitochondria-ER contacts in cell migration. Mechanistically, this can be associated to the capacity of these contacts to control mitochondrial functions or mitochondrial calcium homeostasis. These findings are physiologically relevant and of particular interest to the mitochondrial and cell migration field but also to general cell biology. It represents a novel function associated to these membrane contact sites and point-out these contacts as signalling platform creating microdomains of metabolites exchanges involved in cell migration.

      Keywords: Mitochondria - Membraned dynamics - calcium homeostasis - Membrane contact sites*

      *Reviewer #3 (Evidence, reproducibility and clarity (Required)):**

      Mitofusin 2 (Mfn2) is a mitochondrial outer membrane protein that is important for mitochondrial fusion and the establishment of mitochondrial ER contacts. It has been published before that these contact sites are important for calcium signaling. Zhou et al. examined the role of Mfn2 in neutrophils. They propose a model in which mitochondrial ER contacts established via Mfn2 are crucial for regulation of Rac, which is a small GTPase driving cell migration by promoting actin polymerization. Loss of Mfn2 results in elevated cytosolic calcium, over-activation of Rac, and defects of chemotactic movements. These defects can be partially rescued by restoration of mitochondrial ER contacts through expression of an artificial tether protein.

      **Major points**

      1.The authors claim on p. 6 that decreased neutrophil retention is not simply due to defects in mitochondrial fusion. However, the experimental setup they used for mfn2 (Fig. 1) is different from that for opa1 (Fig. S1), and therefore the results are not directly comparable. Unfortunately, the authors don't show fragmentation of mitochondria, neither in mfn2 nor in opa1 depleted cells. To support their statement they must show that lack of Mfn2 and Opa1 causes mitochondrial fragmentation to the same extent and then examine neutrophil retention in the same assay. Also, it would make sense to include Mfn1 in this analysis, as the authors later claim that the effects they observed are specific for Mfn2.*

      Since Mfn2 KO neutrophils are not in tissue, the experiment in Fig 1S to look at cell speed in tissue is not feasible. Since the cells are all in circulation, we can only estimate the percentage. Overall, the phenotype is very drastic, see movie S1. We will state how many fish embryos we have imaged and how often we observe this phenotype (only 1 or 2 in the tissue (mfn2 KO) or in circulation (control and opa1 KO)). Opa1 KO neutrophils are not in circulation.

      To show evidence of the genome edition, we have deep sequenced this loci of mfn2 and opa1 and the mutation frequencies were stated in the original text. Since each embryos have approximately 150 neutrophils, WB and other biochemical assays are not feasible. Sequencing is the standard method (Ablain et al., 2015; Zhou et al., 2018). We only stated the mutation efficiency in the manuscript because amplifying the genomic DNA from the sorted cells introduces PCR bias and the numbers are not a quantitative reflection of the degree of gene disruption. We will include the sequencing result of the sgRNA target sites in a supplemental Figure.

      Since Mfn2 KO neutrophils are all in circulating, we cannot observe their mitochondrial morphology. This is the reason why we used HL-60 cells for the mechanistic study. The mfn1 gene in zebrafish is duplicated. We have generated an mfn1b KO line and did not observe any phenotype. However we are not sure whether we can obtain efficient disruption at both loci. We hope the results using Mfn1 KO MEF and MFN1 KD HL-60 cells are enough to show a specific role of Mfn2 in cell migration.

      We will have stained the mitochondria structure in the MEF1/2 MEF cells and the in Mfn1/2 KD dHL-60 cells. Opa1 KD HL-60 cells display extensive cell death and we are not confident interpreting any results from this line.

      2.The authors should examine mitochondrial morphology in MFN2 shRNA treated cells (Fig. 2) and in mfn2-null MEFs (Fig. 3).

      Mitochondrial morphology is examined in MFN2 shRNA treated cells (Fig 4c and 5a). The mitochondrial morphology in mfn2-null MEFs are published (Chen et al., 2003). We will further confirmed their results by staining mitochondrial structure in the KO MEFs.

      3.The authors claim that chemotaxis defects of neutrophils are specific for MFN2 knock down, but not for MFN1. They show a Western blot of mfn1 knock down cells in Figure S3s. There is a band clearly visible, which appears to be much stronger than the MFN2 band in sh1 cells in Fig. 2a. Therefore, this conclusion is not valid.

      The band intensities are dependent on the antibody quality and imaging acquisition and display. We don’t feel comfortable comparing the amount of two different proteins from two separate blots.

      4.The colocalization of MFN2 with mitochondria and ER, shown in Fig. 4a, should be improved. Both mitochondria and ER appear abnormally clumped. The authors should stain mitochondria, ER and Mfn2 in the same cells. Images should be displayed much larger. The same is true for Fig. 5a. The authors claim that an artificial tether restored mitochondrial morphology in mfn2 knock down cells. They should state in the text which tether was used. Furthermore, they should explain their criteria for judgement of mitochondrial morphology. At least in my exes, mitochondria appear highly clumped in the image shown for sh1+T cells. In Fig. 5c it is not indicated how many cells were scored.

      We will replace Fig 4a with a more representative image.

      Neutrophils are blood cells and do not spread as well as adherent cells. We have also overexposed the images to show the smaller mitochondria, which cannot be visualized without saturating the signals. We tried to stain the cells with Mfn2 and Calnexin. However we cannot retain the mitotracker signal in fixed cells and could not do a triple label in dHL-60 cells. For this reason we have done double staining of mitochondria-ER, MFN2-mitochondria and MFN2-ER.

      We have included the citation and the description of the tether. The tether is composed of a GFP protein carrying both ER and mitochondrial localization sequences at the ends, which functions independently of MFN2.

      The criteria for the judgement of the mitochondrial morphology is now included in the methods, clustering analysis.

      \*Minor points**

      5.The Western blot shown in Fig. 5d suggests that expression of the tether construct reduced the amount of MFN2. How can this be explained?*

      This Mfn2 amount is not significantly altered by the tether expression when quantified. We will add the quantification of all WB to the figures.

      6.The paper is sometimes hard to digest for a reader who is not familiar with the authors' experimental systems. The description of the experiments in the main text is highly condensed.

      We will elaborate on the experimental system in the results section.

      7.Page 11: "5 m post stimulation" should read "5 min post stimulation".

      Thank you. We have made this correction.

      8.Some references are incomplete (page numbers are lacking).

      We will reformat our references and checked for page numbers.

      *Reviewer #3 (Significance (Required)):

      Apparently, the manuscript is written for an audience with a special interest in chemotactic movements of neutrophils. I guess that the results reported in this manuscript will be of interest for this field. My background is mitochondrial biology and dynamics and I don't have the expertise to evaluate the aspects specific for neutrophils.*

      It is well established that mfn2 mediates mitochondrial fusion and ER contact. Our novelty is the discovery that mfn2 suppresses Rac activation, which is essential for neutrophil adhesion and migration.

      References:

      Ablain, J., E.M. Durand, S. Yang, Y. Zhou, and L.I. Zon. 2015. A CRISPR/Cas9 vector system for tissue-specific gene disruption in zebrafish. Developmental cell. 32:756-764.

      Amini, P., D. Stojkov, A. Felser, C.B. Jackson, C. Courage, A. Schaller, L. Gelman, M.E. Soriano, J.M. Nuoffer, L. Scorrano, C. Benarafa, S. Yousefi, and H.U. Simon. 2018. Neutrophil extracellular trap formation requires OPA1-dependent glycolytic ATP production. Nature communications. 9:2958.

      Chen, H., S.A. Detmer, A.J. Ewald, E.E. Griffin, S.E. Fraser, and D.C. Chan. 2003. Mitofusins Mfn1 and Mfn2 coordinately regulate mitochondrial fusion and are essential for embryonic development. The Journal of cell biology. 160:189-200.

      de Brito, O.M., and L. Scorrano. 2008. Mitofusin 2 tethers endoplasmic reticulum to mitochondria. Nature. 456:605-610.

      Filadi, R., E. Greotti, G. Turacchio, A. Luini, T. Pozzan, and P. Pizzo. 2015. Mitofusin 2 ablation increases endoplasmic reticulum-mitochondria coupling. Proceedings of the National Academy of Sciences of the United States of America. 112:E2174-2181.

      Filadi, R., E. Greotti, G. Turacchio, A. Luini, T. Pozzan, and P. Pizzo. 2017. On the role of Mitofusin 2 in endoplasmic reticulum-mitochondria tethering. Proceedings of the National Academy of Sciences of the United States of America. 114:E2266-E2267.

      Kornmann, B., E. Currie, S.R. Collins, M. Schuldiner, J. Nunnari, J.S. Weissman, and P. Walter. 2009. An ER-mitochondria tethering complex revealed by a synthetic biology screen. Science. 325:477-481.

      Maianski, N.A., F.P. Mul, J.D. van Buul, D. Roos, and T.W. Kuijpers. 2002. Granulocyte colony-stimulating factor inhibits the mitochondria-dependent activation of caspase-3 in neutrophils. Blood. 99:672-679.

      Naon, D., M. Zaninello, M. Giacomello, T. Varanita, F. Grespi, S. Lakshminaranayan, A. Serafini, M. Semenzato, S. Herkenne, M.I. Hernandez-Alvarez, A. Zorzano, D. De Stefani, G.W. Dorn, 2nd, and L. Scorrano. 2016. Critical reappraisal confirms that Mitofusin 2 is an endoplasmic reticulum-mitochondria tether. Proceedings of the National Academy of Sciences of the United States of America. 113:11249-11254.

      Naon, D., M. Zaninello, M. Giacomello, T. Varanita, F. Grespi, S. Lakshminaranayan, A. Serafini, M. Semenzato, S. Herkenne, M.I. Hernandez-Alvarez, A. Zorzano, D. De Stefani, G.W. Dorn, 2nd, and L. Scorrano. 2017. Reply to Filadi et al.: Does Mitofusin 2 tether or separate endoplasmic reticulum and mitochondria? Proceedings of the National Academy of Sciences of the United States of America. 114:E2268-E2269.

      Rooney, C., G. White, A. Nazgiewicz, S.A. Woodcock, K.I. Anderson, C. Ballestrem, and A. Malliri. 2010. The Rac activator STEF (Tiam2) regulates cell migration by microtubule-mediated focal adhesion disassembly. EMBO reports. 11:292-298.

      Saita, S., T. Ishihara, M. Maeda, S. Iemura, T. Natsume, K. Mihara, and N. Ishihara. 2016. Distinct types of protease systems are involved in homeostasis regulation of mitochondrial morphology via balanced fusion and fission. Genes to cells : devoted to molecular & cellular mechanisms. 21:408-424.

      Zhou, W., L. Cao, J. Jeffries, X. Zhu, C.J. Staiger, and Q. Deng. 2018. Neutrophil-specific knockout demonstrates a role for mitochondria in regulating neutrophil motility in zebrafish. Disease models & mechanisms. 11.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Mitofusin 2 (Mfn2) is a mitochondrial outer membrane protein that is important for mitochondrial fusion and the establishment of mitochondrial ER contacts. It has been published before that these contact sites are important for calcium signaling. Zhou et al. examined the role of Mfn2 in neutrophils. They propose a model in which mitochondrial ER contacts established via Mfn2 are crucial for regulation of Rac, which is a small GTPase driving cell migration by promoting actin polymerization. Loss of Mfn2 results in elevated cytosolic calcium, over-activation of Rac, and defects of chemotactic movements. These defects can be partially rescued by restoration of mitochondrial ER contacts through expression of an artificial tether protein.

      Major points

      1.The authors claim on p. 6 that decreased neutrophil retention is not simply due to defects in mitochondrial fusion. However, the experimental setup they used for mfn2 (Fig. 1) is different from that for opa1 (Fig. S1), and therefore the results are not directly comparable. Unfortunately, the authors don't show fragmentation of mitochondria, neither in mfn2 nor in opa1 depleted cells. To support their statement they must show that lack of Mfn2 and Opa1 causes mitochondrial fragmentation to the same extent and then examine neutrophil retention in the same assay. Also, it would make sense to include Mfn1 in this analysis, as the authors later claim that the effects they observed are specific for Mfn2.

      2.The authors should examine mitochondrial morphology in MFN2 shRNA treated cells (Fig. 2) and in mfn2-null MEFs (Fig. 3).

      3.The authors claim that chemotaxis defects of neutrophils are specific for MFN2 knock down, but not for MFN1. They show a Western blot of mfn1 knock down cells in Figure S3s. There is a band clearly visible, which appears to be much stronger than the MFN2 band in sh1 cells in Fig. 2a. Therefore, this conclusion is not valid.

      4.The colocalization of MFN2 with mitochondria and ER, shown in Fig. 4a, should be improved. Both mitochondria and ER appear abnormally clumped. The authors should stain mitochondria, ER and Mfn2 in the same cells. Images should be displayed much larger. The same is true for Fig. 5a. The authors claim that an artificial tether restored mitochondrial morphology in mfn2 knock down cells. They should state in the text which tether was used. Furthermore, they should explain their criteria for judgement of mitochondrial morphology. At least in my exes, mitochondria appear highly clumped in the image shown for sh1+T cells. In Fig. 5c it is not indicated how many cells were scored.

      Minor points

      5.The Western blot shown in Fig. 5d suggests that expression of the tether construct reduced the amount of MFN2. How can this be explained?

      6.The paper is sometimes hard to digest for a reader who is not familiar with the authors' experimental systems. The description of the experiments in the main text is highly condensed.

      7.Page 11: "5 m post stimulation" should read "5 min post stimulation".

      8.Some references are incomplete (page numbers are lacking).

      Significance

      Apparently, the manuscript is written for an audience with a special interest in chemotactic movements of neutrophils. I guess that the results reported in this manuscript will be of interest for this field. My background is mitochondrial biology and dynamics and I don't have the expertise to evaluate the aspects specific for neutrophils.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The manuscript by Zhou W and colleagues entitled "Mitofusin 2 regulated neurophil adhesive migration and actin cytoskeleton" proposed that mitochondrial outer membrane GTPase Mitofusin 2 controls cell migration via its capacity to regulate mitochondria-endoplasmic reticulum (ER) contacts, independently of its fusogenic activity. Using transgenic Mfn2 zebrafish, they first show that Mfn2 mutant embryos exhibit circulating neutrophils and defects in neutrophil recruitment to generated wound, compared to control. Then, using a combination of in cellulo and in vivo mouse models, they show that loss of Mfn2 decreases neutrophil migration, their adhesion under sheer stress and their infiltration to the peritoneal cavity in vivo. Third, they confirm these results using Mfn2 KO MEFs, where they show migration and actin skeleton defects, in contrast to Mfn1 KO MEFs. Mechanistically, they propose that migration defects induced by Mfn2 loss are associated to a decrease of membrane contact sites between the ER and the mitochondria. Using different in cellulo cell migration assays, they show that migration defects in Mfn2-null was rescued upon an artificial mito-ER tether. Finally, they propose that the loss of Mfn2 leads to cytosolic calcium accumulation, inducing hyper-activation of the RhoGTPase, Rac1, a key regulator of actin dynamics and cell migration. Together, the authors proposed a new function of Mfn2 in regulating cell migration via mito-ER contacts tethering.

      Major comments:

      Although the results could be very interesting, and could be significantly relevant to the mitochondrial field and the cell biology one in general, major points need to be addressed to fully support conclusions of the authors. Different controls and quantification are missing, Actin dynamics analysis should be improved, effects of the artificial tether is weakly characterized and the demonstration of the specific role of mito-ER contacts via mfn2 in migration should be reinforced.

      -In figure 1, quantification of circulating neutrophils is required in Mfn2 KO embryos. The authors should also show these quantified results for OPA1KO, which are just mentioned in the text. In addition, in figure 1b and d, the neutrophils from the Mfn2KO embryos seem bigger compared to control. Can the authors comment on neutrophils size and potential contribution to the phenotype? Finally, the authors propose a defect in neutrophil migration in Mfn2-KO, however neutrophils are found in the circulation. The authors should explain these results.

      -The authors need to reinforce the Mfn2 specificity for their phenotype. In particular in Fig S1, they show that loss of OPA1 significantly decreases neutrophil migration in vivo. However, they then only study the effect of Mfn1 silencing in neutrophil and MFN1 KO MEFs (Sup Fig s3). The authors should perform the same experiments in neutrophil and MEF upon loss of OPA1 (similar to Fig S3). Does loss of OPA1 and Mfn1 decrease neutrophil arrest to activated endothelial cells?

      -Using their images, the authors should also document on the directionality of the cell during cell migration. Do Mfn2 depleted cells do not migrate because they are arrested or because they are lacking directionality? Environment/chemokine sensing defects?

      -Actin dynamics analysis should be improved. Loss of Mfn1 and Mfn2 lead to cell shape changes. The authors should quantify this phenotype by analysing cell circularity (as well as for Opa1 loss). Stress fibres number or Phalloidin intensity quantification in cell body should also be performed.

      -Can the migration defects could be attributed to Focal adhesion protein dynamics defects? The authors shown an hyperactivation of Rac1 and an hyperphosphorylation of PAK, which can control FAP (focal adhesion proteins) dynamics. In addition, immunofluorescence analysis shows a decreased signal and cellular misdistribution of paxillin. The authors should characterize these phenotypes. FAP levels (Paxillin/Phospho-Paxillin and Vinculin) should be analysed by immunoblot, the number of FAP/cell, distribution and size should also be quantified. Their dynamics should also be analysed by live cell imaging. Finally, Paxillin level and distribution seems to be also impacted in Mfn1KO cells. Can the authors comment on that? The different quantifications would help to better understand the effect of different mitofusins in cytoskeleton dynamic.

      -Please perform rescue experiments for cell migration in MFN2KO and MFN1KO MEFs. Immunoblots showing protein levels of these proteins would be appreciated. To really discriminate how Mfn2 regulates cell migration, the authors should also perform rescue experiments using a fusogenic mutant Mfn2 ((K109A). It will help to demonstrate the relevance of mito-ER contacts and not mitochondrial fusion in the phenotype.

      -Figure 4, the authors stipulate that Mfn2 regulates ER-mitochondria tethering. However, the authors present no evidence for this conclusion. The authors should perform manders coefficient in MFN2 KO cells and compared it to control. Also, loss of Mfn2 induces mitochondrial fragmentation, which can lead to problem for mito-er contacts quantification by light microscopy. The authors should use their TEM pictures to quantify mito-ER contacts (Number, length and % of mito perimeter), not only mitochondrial morphology. Mfn1 should be used as negative control. it would be interesting also to determine the status of the mito-ER contact in the different conditions used in the manuscript to stimulate cell migration like fMLP treatment.

      -The authors use an artificial tether to manipulate mito-ER contacts in cellulo. However, no information from its origin, or its design are documented in the manuscript. In addition, the authors should show that this tether efficiently works by analysing mito-ER contacts upon expression by EM and mitochondrial calcium uptake. Does this tether rescue mito-ER contacts defects induced by loss of Mfn2? How the authors explain that the tether rescues mitochondrial morphology defects in MFN2KO? In these conditions, mitochondria should not be able to fuse anymore as Mfn2 is lost? This is really intriguing results. Does the tether rescue the other parameters? Mitochondrial distribution (with quantification)? Cell shape? Paxillin defects? ROS and membrane potential? These rescue experiments analyses are important to determine which parameters are really involved in cell migration defects due to the decreased tethering. Finally, it would be of great interest to analyse the effect of the tether alone on cell migration, Rac1 activity, cell shape? Gain of function? These results may reinforce the idea that contact sites regulate cell migration.

      -It is well established that a decrease of membrane potential leads to a decrease of mitochondrial calcium uptake. Calcium results obtained by the authors without any information on the roles of the tether could not lead to any conclusion. Does the tether rescues membrane potential and calcium uptake by the mitochondria? So far, the decrease of mitochondrial calcium upon stimulation in Mfn2KO cab be attributed to both mito-ER contact or membrane potential defects. It has been shown that MEFs MFN2 KO can lead to a decrease of MCU provel level leading to a decrease of mitochondrial calcium uptake (PMID: 25870285). The authors should also check MCU protein level.

      -Hyperactivation of Rac1 is only based on phosphorylation of PAK, which is quite weak. The authors should better describe the hyperactivation of RAC1 or other RhoGTPases in their Mfn2 KO MEFs. What are the levels of RAC1 and other RhoGTPases? Subcellular distribution in the cell? Kits are also available to determine RhoGTPase activity by pull down assay (Cell biolabs).

      -The references are up-to-date. The text and the figures are clear and accurate.

      Minor comments:

      -The authors should show the efficiency of the KO generated for Mfn2 and Opa1 in zebrafish embryos. Sequencing results to highlight the position of the mutations and their consequences on the coding protein should be shown, as well as immunoblot analysis should be performed to analyse Mfn1, Mfn2 and OPA1 protein levels. The generation of a MFN1-KO transgenic line would have been appreciated to finely compare the roles of the 3 GTPases involved in mitochondrial fusion during neutrophil infiltration and migration in vivo.

      -MFN1, MFN2, AND OPA1 protein levels should be analysed by immunoblot in the Mfn1 and Mfn2 KO MEFs.

      -In cell spreading assay, it would be great to identify cells during the process, by an asterix for example. "wt MEFs extended transient filopodia and lamellipodia and eventually elongated, whereas Mfn2-null MEFs generated extensive membrane ruffles and retained the circular shape". It would be interesting to quantify these different parameters.

      -For all their immunoblot analysis, the authors should use a mitochondrial marker as loading control (VDAC1, TOM20, HSP60...). In figure 5, Vinculin should not be used a loading control, with its role in focal adhesion dynamics.

      -Legends for figures 5 and 6 are inverted.

      -Please document in the material and methods section, how confocal images have been acquired: number of z-stacks, reconstitution, 3D analysis...

      -The authors should show their results of blood cell composition quantification in ctrl vs MFn2 depletion.

      -The authors should describe all the acronyms used throughout all the manuscript. For example, LTB4, fMLP...

      Significance

      Beyond their role in energy production, mitochondria are involved in numerous cellular functions including cell migration. Mitochondria form a network balanced by fission and fusion events, where membrane contact sites with the endoplasmic reticulum are crucial. These contact sites are also involved in mitochondrial and cellular functions via their capacity to exchange lipids, metabolites and calcium. The role of mitochondria in cell migration has started to emerge where mitochondrial fragmentation and/or mitochondrial calcium homeostasis are acknowledged to drive cancer cell migration and to regulate actin dynamics. In this manuscript, Zhou W and colleagues proposed for the first time the role of mitochondria-ER contacts in cell migration. Mechanistically, this can be associated to the capacity of these contacts to control mitochondrial functions or mitochondrial calcium homeostasis. These findings are physiologically relevant and of particular interest to the mitochondrial and cell migration field but also to general cell biology. It represents a novel function associated to these membrane contact sites and point-out these contacts as signalling platform creating microdomains of metabolites exchanges involved in cell migration.

      Keywords: Mitochondria - Membraned dynamics - calcium homeostasis - Membrane contact sites

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Zhou et al. describes the role of mitofusin-2 in neutrophil adhesive migration. The authors suggest that MFN-2 is required to sustain neutrophil migration and link this observation to the role of MFN-2 in maintaining mitochondria-ER contacts and suppressing Rac activation. Although some of the experiments are convincing, the authors come to conclusions that are not entirely supported by their data and a few statements appear the result of inductive reasoning. A major problem is the distinction between adhesion and migration: in several parts of the manuscript, there is confusion between these two events and the experiments are not designed (and not discussed) in order to clarify this point. For example, the fact that in zebrafish embryos lacking Opa1 there is no defect in neutrophil retention but reduced neutrophil migration should suggest that MFN-2 controls adhesion rather than migration. But this is not properly elaborated. The same problem comes with the role of Rac, which has been elegantly shown to be required for cell migration but not for cell spreading or focal adhesion formation (Steffen et al, JCS 2013). Again, it is necessary to distinguish between migration and other functions requiring the actin cytoskeleton.

      Specific comments:

      Introduction:

      "Although mitochondria-derived ATP possibly regulates neutrophil chemotaxis in vitro (Bao et al., 2015), removal of extracellular ATP improves neutrophil chemotaxis in vivo (Li et al., 2016). These conflicting reports prompted us to search for mechanisms delineating the role of mitochondria in neutrophil migration outside the realm of ATP or cellular energy (Bi et al., 2014; Schuler et al., 2017; Zanotelli et al., 2018)." This sentence is superficial and misleading: extracellular ATP may interfere with chemotaxis through various energy-independent mechanisms (see for example Zumerle et al. Cell Reports 2019) and this is not conflicting with the role of intracellular ATP in migration.

      Figure 1: The authors didn't show evidence of the genome edition (PCR, RFLP or Sequencing over the sgRNA target) or at least RT-PCR or WB for MFN2. In Fig 1b, 1c the scale bar is missing. "Neutrophils were sorted from both lines and their respective loci targeted by the 4 sgRNAs were deep sequenced." There are no data about sorting strategies for zebrafish neutrophils in the figure. Moreover, only 2 sgRNAs are shown and there are no sequencing data.

      Figure 2: In the WB, reconstitution is not obvious. In general, all WBs are not quantified (and they should be quantified). The in vivo experiment does not have proper controls. For example, can the authors exclude that in these mice there is reduced inflammation because neutrophils have defective activation? What about NETs? And cytokines/chemokines? And exocytosis? In the absence of these controls, the experiment cannot be properly interpreted.

      Figure 3: The conclusion of the authors "In summary, Mfn2 modulates the actin cytoskeleton and cell migration in MEFs" should be supported by experiments to distinguish between the specific role of Mfn2 and the role of mitochondrial dynamics (Opa1, Drp1, Mfn1). It is also not clear why the authors decided to use MEFs instead of other cells (more similar to neutrophils which are not adherent cells). The results obtained in MEFs may be irrelevant for neutrophils.

      Figure 4-5: Fig 5a: in ctrl and sh1 the ER seems to be larger than the phalloidin (=cytoskeleton=cell border approximately) in a few regions. Only the sh1+T seems to fit correctly. The TEM image (only 1 in supplementary) is not sufficient to convince that the tethering is lost. Quantification of number of contacts and distance between ER and mitochondria should be included. The title of figure 5 is wrong. However, in these figures, it is clear that cells are beautifully polarized, with mitochondria accumulating at the uropod (and even more in the absence of Mfn2). When comparing these images with those published by Campello et al (JEM 2006), there are 2 observations that can be made: first of all, these data confirm that mitochondrial fission promotes cell polarity; second, they suggest that the defect is not at the level of cell polarity/chemotaxis.

      Figure 6: Calcium data are, in general, very weak. First of all, controls with ionomycin are missing. Statistical analyses of the curves should be included. As for the use of the MCU inhibitor Ru360, is there any evidence that it is cell-permeant in this context? Is it blocking MCU? Since the authors can show mitochondrial calcium upon FMLP, they should also demonstrate that Ru360 is indeed working and inhibiting mitochondrial calcium uptake. The sentence "The MCU inhibitor Ru360did not cause further reduction of chemotaxis in MFN2 knockdown dHL-60 cells (Supplementary Fig. 6c, d and Supplementary Movie. 12), indicating that MCU and MFN2 lies in the same pathway in terms of regulating chemotaxis in dHL-60 cells" is speculative. In general, there is no solid demonstration that the effect is calcium-mediated. As for Rac, it is surprising to see that Rac inhibition has no effect on cell migration. Rac is known to promotes migration in fibroblasts and other cell types and Rac deficiency inhibits migration (see for example Steffen et al, JCS 2013). Two sets of experiments are absolutely required: 1) verify this in fibroblasts since it has been elegantly shown that Rac is essential in these cells for migration; 2) analyse the effect of Rac inhibitors in pPak kinetics.

      Significance

      As presented here, the manuscript has a modest significance. The audience would be specialised: cell migration, cell signalling. My expertise is immunology, cell activation, cell migration, cell signalling.

  5. Apr 2020
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers and the editor for the insightful and thorough assessment of our manuscript. In this response to review letter, we have listed the original review (black text) and responded to each critique after it.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Yang et al. submitted a manuscript describing the detection of pseudogenes ("retrocopies") of APOBEC3 (A3) genes in primates. The evolutionary history and relationship to specific A3s was analyzed and speculated that the maintained A3 retrocopies had a functional role at least early in the evolution and some may have still now. Functional data on some of the expressed retrocopies are presented on L1 and HIV.

      The authors claim that "retrocopying expands the functional repertoire of A3 antiviral proteins in primates". While almost of the genetic findings were published recently (Ito et al. 2020), the authors should more clearly describe how their data differ or confirm the data of Ito et. al.

      We thank the reviewer for their helpful comments which have guided revisions to our manuscript. We have taken steps to clarify the dramatic differences between our work and the recent publication from Ito, Gifford, and Sato.

      Foremost, we respectfully disagree with the reviewer that the genetic findings in our work were contained within the Ito, et al manuscript. Using a computational screen of assembled mammalian genome, the Sato group cataloged the gain and loss of APOBEC3 genes during the evolution of mammals. They found a fascinating correlation between the dynamics of A3s and ERVs that formed the precis of the paper. From their genome-wide search for A3s, Ito et al describe several retrocopies of A3s in two New World Monkey species, one of which retains a full-length open reading frame, leading to the statement that this gene may be functional.

      We note that the retrocopies found in the Ito et al paper span only two of the more than 20 species in which we identify A3 retrocopies. Further, as a result of the breadth of our search for A3s, we find additional retrocopies in the same two New World Monkey species that were examined in the Ito et al paper. Finally, our study also examined functional capabilities of these additional A3s. These differences are highlighted by reviewer 3 who writes that relative to Ito et al, our manuscript studies the phenomenon of A3 retrocopies “more deeply both by in silico analyses and cell culture experiments.” Reviewer 3 also summarizes the most important difference in our studies – our work presents a “conceptual advance that the antiviral gene expansion has achieved not only via tandem gene duplication but also via gene retrocopying”.

      Lastly, we want to point out that the findings of our manuscript and Ito et al. 2020 were made concurrently. Indeed, throughout the preparation process of this manuscript, we were both aware of each other’s findings and shared preprints with each other. Most of the participating journals in Review Commons have “scoop protection” mechanisms that typically extend 6 months after the publication of the first article (Ito et al was published Jan 2020), and our article was first submitted to Reviewer Commons on February 14, 2020. Therefore, we feel confident that the ‘no scoop’ policy applies to the minimal overlap between our paper and that of Ito et al.

      Nevertheless, we have modified the text to more clearly acknowledge the parallel finding of some New World monkey retrogenes in the Ito, et al. paper.

      The functional data (Fig. 6) are interesting, but in the current form not complete. The authors have to show protein expression in the transfected cells (A3, L1, HIV) and level of encapsidation into viral particles. In addition, please analyze if the retrocopies express cytidine deaminase active enzymes.

      We thank the reviewer for this comment, and we have added a Western blot of the six long-ORF-containing retrocopies as Figure S5. In this blot (from early in the project), we detected protein production in 293T cells for 3/6 retrocopies. In later optimizations of subsets of this blot, we were able to detect expression of the marmoset A3G and the other two marmoset retrocopies (marmoset-2 and marmoset-4). Despite optimization attempts, we were unable to detect protein for one of the retrocopies that restricts HIV-1ΔVif (capuchin-C1). Unfortunately, at this time the included blot is the only one we have in which all 6 constructs are included on a single blot. Optimally, all 6 constructs would be side-by-side in a single blot with optimized conditions, and we are happy to complete this experiment as soon as we are able to return to our lab after the SARS-2 quarantine is lifted. However, we think the added blot shows that some of the retrocopies produce protein and the absence of detectable protein from capuchin-C1 could suggest that this retrocopy is especially potent in its restriction function or an idiosyncratic problem with detecting this protein using Western blot analyses.

      We have not previously tested our lentiviral particles for levels of encapsidation of protein from each retrocopy. The value we see in this experiment is in explaining why some of the retrocopies that are expressed in producer cells may not restrict in target cells. While we note that precedent in the literature suggests that A3 proteins which restrict HIV-1ΔVif are invariably encapsidated, we would be happy to carry out this experiment when our lab reopens.

      In response to the reviewer’s request to test deaminase activity for each retrocopy, we note that Figure 4 shows the intactness of the deaminase motif in each retrocopy. However, we feel that a description of the mechanisms of restriction of these retrocopies is not a major point of this paper and is beyond the scope of the current investigation.

      Reviewer #1 (Significance (Required)):

      Minor advance compared to Ito et al. 2020.

      We respectfully and rigorously disagree with this assessment. Please refer back to the reviewer’s first comment. We defer, again, to Reviewer 3’s assessment that our work presents a “conceptual advance that the antiviral gene expansion has achieved not only via tandem gene duplication but also via gene retrocopying”. Moreover, we must point out that the Ito et al 2020 paper was entirely computational; indeed, several retrogenes that could computationally be predicted to be ‘dead’ were confirmed by us as having antiviral activity.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      Yang et al. study the expansion of APOBEC3 (A3) cytidine deaminases genes in primates. Authors find A3 retrocopies in several lineages in primates using Blast searches. Some are old and some are species specific. Some have disablements and some have intact ORFs. Authors study their mode of evolution, expression and functionality. Authors have performed detailed analyses including functional analyses. Some A3 retrocopies are broadly expressed and some have retained ability to restrain retroelements. I agree with the authors that their data supports that retrocopying has contributed the turnover in the repertoire of host retroelement restriction factors. Authors show that some retrocopies have remained active for long periods of time and they still show that they can restrict retroelements/retrovirus. This work provides an interesting example of immune system diversification. This study of the A3 family of proteins that are part of the vertebrate innate immune system and the data supporting turnover of these kind of immune system genes is strong. The work underscores that this is a way immunity genes evolve and it has parallels in the evolution of the TRIM gene family of immune genes. I just have a few comments. I think the work can gain from analyzing some aspects of the data in more detail and presenting the big picture in a summary table, even if it is just supplementary.

      **Major comments:**

      A3I is in many species. Does this mean it was preserved (i.e., functional for a while)? For how long have disabling mutations been accumulating? Can we get a sense of that? Even for other retrocopies, do we have a sense of how recent has the pseudogenization been? If it is very recent that means that the gene was active until not long ago.

      Our analyses suggest that A3I was born in the common ancestor of simian primates and pseudogenized before the Catarrhini/Platyrrhini split. It is possible that A3I was functional within this extended period (~12-15 million years), but the presence of a shared truncating stop codon amongst all simian A3Is suggests the gene was no longer full-length at the time of diversification of the simians. Instead, the simian LCA likely encoded an A3I with a predicted ORF of 261 codons; if this truncated ORF were functional, it was then further truncated/pseudogenized with additional frame-breaking mutations which follow the phylogeny of primates.

      We estimated the timeline of pseudogenization of each retrocopy using the species distribution of each syntenic retrocopy. We also note that we find full-length ORFs in three retrocopies which have been retained for a period of time at least as long as the age of the last common ancestor of the four New World monkeys. These old but intact retrocopies motivated our simulations of ORF retention rates (Figure 5).

      In the PAML analyses test could be performed to test if the rate of evolution that are higher or lower than 1 for particular genes are actually significantly higher or lower than 1 for the particular gene comparing the likelihoods of the modes with the given rate with the one with the rate fixed to 1. Is there enough power to do this?

      We thank the reviewer for pointing out this omission in our analysis. We did perform these tests and find a significant p-value for two of the nodes p=0.058 and p=0.025 respectively). We have updated the legend for figure S4 to incorporate these p-values

      Page 9. It seems to me that the synteny data Figure S2 reveals they are derived from independent retroposition events and not duplications of segments because those would include flanking genes. Is this correct? Authors could comment on that.

      Yes, we think that each retrocopy we show in Figure S2 is likely created via an independent retrotransposition event. We have clarified in the text that Figure S2 shows the genes used to establish synteny to support orthology of the retrocopies shared amongst multiple species and that each of these ortholog groups presumably originated via distinct retrotransposition events.

      In figure S4, I am not sure why orthologous genes are not grouped together in the phylogeny and why p is smaller than 0.05. How should that figure and the probability be interpreted?

      We thank the reviewer for their comments on this figure. First, the reviewer identified an error in the tree in which the branch labels for ‘night monkey-C2’ and ‘night monkey-SS1’ were inadvertently switched. The corrected tree now follows the pattern expected by the reviewer. Second, we employed RELAX to “determine whether selective strength was relaxed or intensified in one of these subsets relative to the other” (Wertheim, et al. MBE 2014). In this case, the p-value corresponds to the finding that the retrocopies (test branches) show intensification of selection relative to the intron-containing A3Gs (reference branches).

      We have modified Figure S4 and the associated text to more clearly explain the specific hypothesis test we report.

      It would be good to have a summary table that summarizes what genes have support for past or current functionality (preservation for long time or recent pseudogenization, expression, purifying or positive selection, ability to restrict retroelements) and in what lineages.

      We agree with this reviewer suggestion. We have added the additional information including the number of frame disrupting mutations as a measure of age, intactness, and ability to restrict retroelements to Table S1. Thanks to this suggestion, Table S1 now serves as the master table to summarize the analyses of each retrocopy.

      **Minor comments:**

      1. Page 3. Authors say "...the exons and UTRs..." but UTRs are part of exons. Authors could talk about exons only that include protein-coding regions and UTRs.

      Changed the text to "exons".

      Page 7. I would say disabled instead of "... becoming degraded by mutation."

      Fixed according to the reviewer's suggestion.

      I would say neutral evolution not neutral selection.

      Fixed according to the reviewer's suggestion.

      Reviewer #2 (Significance (Required)):

      This work provides an interesting example of immune system diversification. Authors study the APOBEC3 family of proteins that is part of the vertebrate innate immune system and the data supporting turnover of these kind of immune system genes. The work underscores that this is a way immunity genes evolve and it has parallels in the evolution of the TRIM gene family of immune genes revealing patterns in the mode of evolution of immunity genes. The audience of this work will be people interested in evolution of immunity, arms races and gene diversification and all evolutionary biologists interested in adaptation. I work in the field of comparative genomics and molecular evolution.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      This manuscript by Yang et al. is an well-written, intriguing paper highlighting the evolutionary significance of the gene creation via "retrocopying". The authors investigated the expansion of antiviral A3 genes via retrocopy in Primates, and found that A3G-like retrocopies have been generated repeatedly during primate evolution. A part of A3 retrocopies found in New World monkeys retained full length open reading flames and anti-lentiviral capacities. Interestingly, the spectrum of anti-retroelement activity of A3 retrocopies was different from the original (i.e., intron-containing) A3G gene in these species, suggesting the occurrence of the functional differentiation followed by gene amplification. However, one of the main findings that many A3 retrocopies are present in New World monkey is in-line to a previous report (i.e., Ito et al., 2020, PNAS), and the experimental validations were based on the human (not New World monkey's) retroelements. Nevertheless, this study deeply investigated the possible importance of A3 retrocopies for the host defense system evolution both by in silico analyses and cell culture experiments. This study provides the findings that can potentially expand our knowledge on the evolutionary arms races between retroelements and the hosts.

      **Major:**

      To strengthen the impact of this work, it would be better to increase the numbers of retroviruses in which the anti-retroviral capacities are investigated. I understand that it is difficult to examine retroviruses or L1s that are colonized naturally with New World monkeys, but I suppose it is not so difficult to investigate a variety of representative retroviruses such as murine leukemia virus (MLV) or the reconstructed human endogenous retrovirus K (HERV-Kcon). This additional experiment would be helpful to highlight that the spectrum of anti-retroviral activity of A3 retrocopies is divergent from the original A3G gene in these species and strengthen the concept to be proposed by this study.

      The reviewer raises a fascinating question about whether retrocopies might have different restriction abilities relative to the other A3s in a given species. First, we feel that showing activity against one pathogen is sufficient for our claim that some of the A3 retrocopies have antiviral potential. Second, we discuss in the paper the idea that HIV-1 is not the actual target of these (or any) innate immune genes in New world primates. We argue that any other targets we might test would also be surrogates for the ‘true’ target of these genes.

      **Specific:**

      1, Since the authors found the expansion of "functional" repertoire of A3 retrocopies specifically in New World monkey, it would be better to rephrase the title as "Retrocopying expands the functional repertoire of APOBEC3 antiviral proteins in New World monkeys".

      We thank the reviewer for this comment but point out that a large portion of our manuscript presents our work on primates outside the New World monkeys. The reviewer is correct to note that our finding of restriction activity is limited to New World monkey retrocopies, but we feel that the current title will attract a broader audience and reflects the broader relevance of this work.

      2, It might be better to add a figure summarizing which A3 retrocopies in which species retain nearly full length ORFs. For example, how about making a figure like Fig. 2A for all the four representative New World monkey species?

      We agree. We have added the length of the longest ORF for each retrocopy to Table S1.

      3, Fig. 3

      It would be helpful to clarify that which cell of the heatmap corresponds to the intact A3 retrocopies.

      We have added labels to indicate the intact A3 retrocopies and adjusted the legend accordingly.

      4, Page 4, line 5

      It would be better to replace the word "protected" with "escaped" because this retrocopy subset should include the ones that are intact but not functional.

      Changed as suggested.

      5, Page 4, line 25

      It would be better to rephrase "the common ancestor of mammals" as "the common ancestor of placental mammals" because A3 gene is absent in Marsupial.

      Changed as suggested.

      6, Page 5, line 5

      Please rephrase "ongoing" as "recently-occurred".

      Changed as suggested.

      7, Page 6, line 19

      I checked the multiple sequence alignment in File S1 and suspect that the codon (alignment) position of the shared premature stop codon is 261 (not 264).

      We thank the reviewer for pointing out this discrepancy. We have revised the text to reflect the correct position of the shared stop.

      8, Page 6, line 23

      I could not understand the meaning of the sentence "Intriguingly, one lineage-specific mutation...".

      Please specify the position of mutation which the authors mentioned (in File S1 or Fig. 1B).

      This portion of the text refers to a reversion of a stop codon in the orangutan A3I; specifically, the stop codon shared in all simians acquired a second mutation that created a longer ORF in only this species. We have removed this sentence from the text for the sake of clarity.

      9, Page 12, line 8

      Please refer Fig. S4 here.

      Changed as suggested.

      10, Page 12, line 8

      Please say "Significant relaxed selection was not detected" rather than "Our analysis detected no relaxation...".

      Changed as suggested.

      11, Page 12, line 8

      Fig. S4 indicates "p=0.015", but the authors regard it as "not significant"?

      We thank the reviewer for pointing out this confusing wording. We employ RELAX to “determine whether selective strength was relaxed or intensified in one of these subsets relative to the other” (Wertheim, et al. MBE 2014). In this case, the p-value corresponds to the finding that the retrocopies show intensification of selection.

      We have modified Figure S4 to more clearly explain the specific hypothesis test for this p-value. We have also modified the text to clarify this point.

      12, Page 12, line 9

      Please here refer the data showing the claim "Instead, these A3G retrocopies have evolved more rapidly than...".

      Changed as suggested; see previous point.

      13, Page 12, line 11

      Did the authors perform the statistical test on the dN/dS ratio analysis? If so, please mention the result of the test.

      Yes we did. Please refer to Reviewer 2’s ‘Major Point 3’.

      14, Page 12, line 15

      It would be better to modify the phrase "show evidence of recurrent selection for functional innovation"

      Changed as suggested.

      Reviewer #3 (Significance (Required)):

      This study provides a conceptual advance that the antiviral gene expansion has achieved not only via tandem gene duplication but also via gene retrocopying.

      Compare to existing published knowledge.

      Although one of the main findings that many A3 retrocopies are present in New World monkey is in-line to a previous report (i.e., Ito et al., 2020, PNAS), this study investigated the above finding more deeply both by in silico analyses and cell culture experiments.

      Audience.

      Evolutionary biologists and researchers in the field of viruses (particularly retroviruses including HIV-1) and transposable elements would be interested in this work.

      Your expertise.

      Bioinformatics, genome biology, viruses, and transposable elements

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      This manuscript by Yang et al. is an well-written, intriguing paper highlighting the evolutionary significance of the gene creation via "retrocopying". The authors investigated the expansion of antiviral A3 genes via retrocopy in Primates, and found that A3G-like retrocopies have been generated repeatedly during primate evolution. A part of A3 retrocopies found in New World monkeys retained full length open reading flames and anti-lentiviral capacities. Interestingly, the spectrum of anti-retroelement activity of A3 retrocopies was different from the original (i.e., intron-containing) A3G gene in these species, suggesting the occurrence of the functional differentiation followed by gene amplification. However, one of the main findings that many A3 retrocopies are present in New World monkey is in-line to a previous report (i.e., Ito et al., 2020, PNAS), and the experimental validations were based on the human (not New World monkey's) retroelements. Nevertheless, this study deeply investigated the possible importance of A3 retrocopies for the host defense system evolution both by in silico analyses and cell culture experiments. This study provides the findings that can potentially expand our knowledge on the evolutionary arms races between retroelements and the hosts.

      Major:

      To strengthen the impact of this work, it would be better to increase the numbers of retroviruses in which the anti-retroviral capacities are investigated. I understand that it is difficult to examine retroviruses or L1s that are colonized naturally with New World monkeys, but I suppose it is not so difficult to investigate a variety of representative retroviruses such as murine leukemia virus (MLV) or the reconstructed human endogenous retrovirus K (HERV-Kcon). This additional experiment would be helpful to highlight that the spectrum of anti-retroviral activity of A3 retrocopies is divergent from the original A3G gene in these species and strengthen the concept to be proposed by this study.

      Specific:

      1, Since the authors found the expansion of "functional" repertoire of A3 retrocopies specifically in New World monkey, it would be better to rephrase the title as "Retrocopying expands the functional repertoire of APOBEC3 antiviral proteins in New World monkeys".

      2, It might be better to add a figure summarizing which A3 retrocopies in which species retain nearly full length ORFs. For example, how about making a figure like Fig. 2A for all the four representative New World monkey species?

      3, Fig. 3 It would be helpful to clarify that which cell of the heatmap corresponds to the intact A3 retrocopies.

      4, Page 4, line 5 It would be better to replace the word "protected" with "escaped" because this retrocopy subset should include the ones that are intact but not functional.

      5, Page 4, line 25 It would be better to rephrase "the common ancestor of mammals" as "the common ancestor of placental mammals" because A3 gene is absent in Marsupial.

      6, Page 5, line 5 Please rephrase "ongoing" as "recently-occurred".

      7, Page 6, line 19 I checked the multiple sequence alignment in File S1 and suspect that the codon (alignment) position of the shared premature stop codon is 261 (not 264).

      8, Page 6, line 23 I could not understand the meaning of the sentence "Intriguingly, one lineage-specific mutation...". Please specify the position of mutation which the authors mentioned (in File S1 or Fig. 1B).

      9, Page 12, line 8 Please refer Fig. S4 here.

      10, Page 12, line 8 Please say "Significant relaxed selection was not detected" rather than "Our analysis detected no relaxation...".

      11, Page 12, line 8 Fig. S4 indicates "p=0.015", but the authors regard it as "not significant"?

      12, Page 12, line 9 Please here refer the data showing the claim "Instead, these A3G retrocopies have evolved more rapidly than...".

      13, Page 12, line 11 Did the authors perform the statistical test on the dN/dS ratio analysis? If so, please mention the result of the test.

      14, Page 12, line 15 It would be better to modify the phrase "show evidence of recurrent selection for functional innovation"

      Significance

      This study provides a conceptual advance that the antiviral gene expansion has achieved not only via tandem gene duplication but also via gene retrocopying.

      Compare to existing published knowledge.

      Although one of the main findings that many A3 retrocopies are present in New World monkey is in-line to a previous report (i.e., Ito et al., 2020, PNAS), this study investigated the above finding more deeply both by in silico analyses and cell culture experiments.

      Audience.

      Evolutionary biologists and researchers in the field of viruses (particularly retroviruses including HIV-1) and transposable elements would be interested in this work.

      Your expertise.

      Bioinformatics, genome biology, viruses, and transposable elements

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: Yang et al. study the expansion of APOBEC3 (A3) cytidine deaminases genes in primates. Authors find A3 retrocopies in several lineages in primates using Blast searches. Some are old and some are species specific. Some have disablements and some have intact ORFs. Authors study their mode of evolution, expression and functionality. Authors have performed detailed analyses including functional analyses. Some A3 retrocopies are broadly expressed and some have retained ability to restrain retroelements. I agree with the authors that their data supports that retrocopying has contributed the turnover in the repertoire of host retroelement restriction factors. Authors show that some retrocopies have remained active for long periods of time and they still show that they can restrict retroelements/retrovirus. This work provides an interesting example of immune system diversification. This study of the A3 family of proteins that are part of the vertebrate innate immune system and the data supporting turnover of these kind of immune system genes is strong. The work underscores that this is a way immunity genes evolve and it has parallels in the evolution of the TRIM gene family of immune genes. I just have a few comments. I think the work can gain from analyzing some aspects of the data in more detail and presenting the big picture in a summary table, even if it is just supplementary.

      Major comments:

      1. A3I is in many species. Does this mean it was preserved (i.e., functional for a while)? For how long have disabling mutations been accumulating? Can we get a sense of that? Even for other retrocopies, do we have a sense of how recent has the pseudogenization been? If it is very recent that means that the gene was active until not long ago.
      2. In the PAML analyses test could be performed to test if the rate of evolution that are higher or lower than 1 for particular genes are actually significantly higher or lower than 1 for the particular gene comparing the likelihoods of the modes with the given rate with the one with the rate fixed to 1. Is there enough power to do this?
      3. Page 9. It seems to me that the synteny data Figure S2 reveals they are derived from independent retroposition events and not duplications of segments because those would include flanking genes. Is this correct? Authors could comment on that.
      4. In figure S4, I am not sure why orthologous genes are not grouped together in the phylogeny and why p is smaller than 0.05. How should that figure and the probability be interpreted?
      5. It would be good to have a summary table that summarizes what genes have support for past or current functionality (preservation for long time or recent pseudogenization, expression, purifying or positive selection, ability to restrict retroelements) and in what lineages.

      Minor comments:

      1. Page 3. Authors say "...the exons and UTRs..." but UTRs are part of exons. Authors could talk about exons only that include protein-coding regions and UTRs.
      2. Page 7. I would say disabled instead of "... becoming degraded by mutation."
      3. I would say neutral evolution not neutral selection.

      Significance

      This work provides an interesting example of immune system diversification. Authors study the APOBEC3 family of proteins that is part of the vertebrate innate immune system and the data supporting turnover of these kind of immune system genes. The work underscores that this is a way immunity genes evolve and it has parallels in the evolution of the TRIM gene family of immune genes revealing patterns in the mode of evolution of immunity genes. The audience of this work will be people interested in evolution of immunity, arms races and gene diversification and all evolutionary biologists interested in adaptation. I work in the field of comparative genomics and molecular evolution.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Yang et al. submitted a manuscript describing the detection of pseudogenes ("retrocopies") of APOBEC3 (A3) genes in primates. The evolutionary history and relationship to specific A3s was analyzed and speculated that the maintained A3 retrocopies had a functional role at least early in the evolution and some may have still now. Functional data on some of the expressed retrocopies are presented on L1 and HIV.

      The authors claim that "retrocopying expands the functional repertoire of A3 antiviral proteins in primates". While almost of the genetic findings were published recently (Ito et al. 2020), the authors should more clearly describe how their data differ or confirm the data of Ito et. al.

      The functional data (Fig. 6) are interesting, but in the current form not complete. The authors have to show protein expression in the transfected cells (A3, L1, HIV) and level of encapsidation into viral particles. In addition, please analyze if the retrocopies express cytidine deaminase active enzymes.

      Significance

      Minor advance compared to Ito et al. 2020.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their close reading and constructive comments on our manuscript. We believe that their insight has substantially strengthened our manuscript. Please find our response/revision plan for each comment below (in bold).

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This is a very interesting study that addresses an important topic. In brief, the authors build on their previous data showing that LSD1 seems to be neuroprotective. Here they follow the hypothesis that Tau-dependent sequestration of LSD1 to the cytoplasm leads to loss of nuclear LSD1 function. Crossing Tau mutant mice (PS19) to heterozyous LSD1 knock out mice exacerbates phenotypes in PS19 mice, while viral overexpression of LSD1 rescues part of these phenotypes.

      As said the data is interesting but lack mechanistic explanation that would allow in my view publication in a very high profile journal. Moreover, there are some data such as the RNA-seq that would not be acceptable in the present for by any journal. However, all of these issues could be addressed by the authors in case reviewers would refer to them.

      The sequestration of LSD1 in the cytoplasm by tau, along with the co-localization of LSD1 with tau in human cases (in our previous Nature Communications paper-Christopher et al. 2017) provide a mechanistic explanation (sequestration) for why we are able to exacerbate and rescue tau-mediated neurodegeneration by modulating LSD1. As the reviewer pointed out, we believe that we can address all of the critiques brought up (see responses below). By addressing these critiques we believe that we can further substantiate the mechanism underlying our ability to functionally modulate tau-mediated neurodegeneration in vivo.

      **Here are some specific issues.**

      1 . Especially the proposed link of Tau-mediated sequestration of LSD1 to the cytoplasm is not fully supported by the data. A key finding shows that LSD1 is seen more in cytoplasm in PS19 mice. However, the biological relevance of this observation cannot be fully appreciated at present, since the magnitude of this phenotype is unclear. Approaches to perform a quantitative analysis in addition to the representative IHC images would be helpful.

      The change in localization of LSD1 from nuclear to cytoplasmic that we observe in Tau PS19 mice is dramatic. We tried to convey this magnitude of sequestration in different brain regions by showing a range of representative images. Consistent with this, Reviewer 2 commented that “These data are very strong, the effect is impressive.” Nevertheless, we can attempt to further quantify the change in localization. To accomplish this, we can try two different methods. (1) We can add a nuclear marker and attempt to quantify the level of nuclear versus cytoplasmic LSD1 from the immunofluorescence images. (2) We can also attempt to generate nuclear versus cytoplasmic fractions and quantify LSD1 levels by western blot.

      2 . Point 1 might be of specific importance since the subsequent experiments built on the idea that mice with already recued LSD1 levels should have a more severe phenotype in case of Tau pathology. However, they do not really address the role of Tau-mediated sequestration of LSD1 anymore. The authors employ mice that constitutively lack one allele of LSD1 which generally leads to a more severe phenotype in PS19 mice. This is very interesting, but I wonder if reduced LSD1 levels might generally put the network in a more vulnerable state and that other detrimental stimuli that do not cause intracellular protein aggregation might have a similar effect. The authors realize this and address this question by comparing via RNA-seq the gene-expression changes observed in Lsd1+/+, Lsd1Δ/+, PS19 Tau, and PS19;Lsd1Δ/+ littermates. Comparatively few changes are observed. However, the major issue with this experiment is that an n=2/group is simply no acceptable anymore to be published in any serious journal. Thus, this data is not interpretable as it stands.

      To further address the role of tau-mediated sequestration of LSD1, we can attempt to quantify (see above) nuclear versus cytoplasmic LSD1 in PS19 Tau mice with heterozygous Lsd1, and compare it to the level of sequestration observed in PS19 Tau mice alone.

      To strengthen the RNAseq data, we can perform two additional replicates.__However, because (1) the RNAseq results were only used for genome-wide comparisons, (2) the replicates were very tight, and (3) the results were clear, it is very unlikely that additional replicates are going to alter the result. Thus, alternatively we might be able to alter the language of the manuscript to qualify the result somewhat. In this regard, it should be noted that reviewer 2 commented that “The data are very convincing, and provide a strong molecular base showing a tight overlap in the effected molecular pathways associated with both pathological tau and Lsd1 heterozygosity.” Reviewer 3 also commented that the transcriptomic dataset “__strengthen some of the conclusions.”

      Reviewer #1 (Significance (Required)):

      The data will be interesting to the field and help to further understand the role of LSD1 in neuroegenerative dieases linked to tauopathy.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The manuscript "The inhibition of LSD1 via sequestration contributes to tau-mediated neurodegeneration" by Amanda K. Engstrom, Alicia C. Walker, Rohitha A. Moudgal, Dexter A. Myrick, Stephanie M. Kyle and David J. Katz, is an excellent study that beautifully describes the implication of the epigenetic enzyme LSD1, as downstream mediator of tau pathology in neurodegenerative disease.

      The same authors in a previous paper showed that i) loss of LSD1 in the adult mouse brain, leads to significant neuronal cell death and ii) loss of LSD1 induces genome-wide expression changes that significantly overlap with those observed in the brains of postmortem human AD. In this work, are presented initial evidences that in AD brain, LSD1 nuclear function could be disrupted by mislocalization to pathological tau.

      In the present work, using the PS19 mouse model, the authors provide the first cytological evidence that pathological tau can prevent LSD1 from properly localizing to the nucleus in hippocampal and cortical neurons. These data are very strong, the effect is impressive. Crossing the PS19 mouse model of taupathology with a mouse model of LSD1 brain heterozygosity LSD1Δ /+, the authors provide functional data that the inhibition of LSD1 function contributes to tau induced neurodegeneration. Indeed, several pathological parameters measured in the PS19 mouse model, are exacerbated in a reduced genetic LSD1 background. Survival rate, motor activity measured with a rotarod test. The behavioral analysis is nicely paralleled by the analysis of spinal cord motor neurons, showing abnormal morphology in the double mutant mice, compared to the PS19. Overall morphology of the hippocampus shows decreased brain size and brain weight. The analysis is accompanied by MRI analysis, showing again very impressive results, with the double mutant being the most affected and the LSD1Δ /+ very similar to WT.

      The second part of the work is aimed at demonstrating specificity of the functional interaction between tau pathology and LSD1. The authors provide a very well planned transcriptional profiling of the different mouse models, choosing the most relevant time point (prior the onset of neuronal cell death), very clearly justifying the rational of their choice. The data are very convincing, and provide a strong molecular base showing a tight overlap in the effected molecular pathways associated with both pathological tau and Lsd1 heterozygosity. As final approach, the authors rescue neurodegeneration in the hippocampus of PS19 Tau mice overexpressing LSD1 using a neuron-specific virus. Overall, these data establish LSD1 as a major downstream effector of tau-mediated neurodegeneration indicating that the LSD1 pathway is a potential late stage target for intervention in tauopathies, such as AD.

      **Minor point:**

      In material and Methods is missing a section dedicated to a detailed description of statistical analysis.

      We have added a section to the materials and methods dedicated to a detailed description of statistical analysis (lines 509-518).

      Reviewer #2 (Significance (Required)):

      I believe that this work will be of great interest for the neurodegenerative together with the neuro-epigenetic field (my personal area of expertise). The identification of a clear new pathway implicated in AD and neurodegeneration together with the suggestion of a possible new therapeutic target (disruption of tau-LSD1 interaction) is of high potential impact for future studies.

      We really appreciate this very positive review, which acknowledges the thoroughness of our results, the mechanistic insight that we provided and the “high potential impact” of our work.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The work on LSD1 in this manuscript is based on earlier studies that deleting the histone demethylase Lsd1 in adult mice leads to neuronal cell death and that the neurofibrillary tangles in Alzheimer's disease brains can be stained for LSD1.

      The manuscript first shows that LSD1 is sequestered in PS19 tau transgenic mice, that a reduction of Lsd1 exacerbates the pathology and that overexpression rescues, complemented by a separate transcriptomic dataset used to strengthen some of the conclusions. As it currently stands, in my view, this work is very preliminary, and I am not sharing all conclusions made by the authors.

      **My specific points are following the headers of the Results section:**

      (1) Tau pathology depletes LSD1 from the nucleus in the PS19 Tau mouse model: What is clear from the images is that in the PS19 Tau tg mice LSD1 is depleted from the nucleus. What is not correct is that in WT it is only localized to the nucleus. What should be done, is to quantify the relative localization to the two compartments. In addition, a subcellular fractionation could be performed (see point further below).

      LSD1 is strictly a nuclear protein with a well-defined nuclear localization signal that interacts with the importin __a__complex (Jin et al, J. Biochem 2014). Reference to this has been added to the text in the introduction (lines 53-54) and in the results (line 87-89). Nevertheless, we can also attempt subcellular fractionation and localization of LSD1 in the nuclear versus cytoplasmic fraction (see response to reviewer 1 above).

      (2) Reduction of LSD1 increases the mouse tauopathy phenotype: 2.1. The PS19 Tau Tg mice have been crossed with an Lsd1 heterozygous mutant (LSD1 delta/-). I tried to find the reference as to how this mutant has actually been made (refs 32-34). Ref 32 describes a conditional KO (The position of the gene trap insertion (STOP), downstream of exon 3, truncates the LSD1 open reading frame within the SWIRM domain prior to the amine oxidase domain, which is essential for the catalytic activity of LSD1), which leaves a 210 amino acid truncated protein which is an obvious confound which should be mentioned and discussed. Besides from that, it is not clear to me how the Lsd1 gene was deleted for the current study, i.e. which promoter has been used.

      The Lsd1 allele used in this study was generated in the Rosenfeld lab (Wang et al., Nature 2007). This was stated in the acknowledgements, but has now been added to both the main text (lines 105-109) and the methods (lines 386-391) for clarity. ThisLsd1 allele is a null allele that has also been used previously by both our group, as well as by additional groups (For example: Christopher et al., Nature Communications 2017 and Lyons et al., Cell 2013). In this current study, Lsd1 was deleted with the Vasa**-Cre transgenic line. Once the deletion allele passes through the germline, Lsd1 is heterozygous throughout the mouse. We deeply regret this oversight.

      2.2. Fig S2 shows that LSD1 is reduced in the heterozygote, but increased in PS19 by 20% and then again decreased in the PS19 x LSD1 delta/-. Clearly, a subcellular fractionation or histological quantification is needed to understand what the levels are in the cytoplasm as compared to the nucleus.

      The data referred to in Figure S2 is from bulk brain homogenate showing that there is a reduction of LSD1 in mice carry the deletion allele both in a wild-type and PS19 Tau background. Nevertheless, we can attempt subcellular fractionation and quantification of LSD1 localization in the nucleus versus cytoplasm (see response to reviewer 1 above) to further clarify this result.

      2.3. The rescue in Fig 2A is really modest. Certainly, with tau in PS19 potentially trapping LSD1 in the cytoplasm there should be less of LSD1 in the nucleus when there is only one functioning allele. What is needed is a quantification of nuclear and cytoplasmic LSD1 in the genotypes.

      We can attempt subcellular fractionation and quantification of LSD1 localization in the nucleus versus cytoplasm (see response to reviewer 1 above) to further clarify this result.

      2.4. I don't agree with the statement: 'started only slightly earlier than PS19 Tau mice, but after the appearance of pathological tau in neurons (p. 6)' as tau pathology develops gradually and is present before the age of 6 months in this strain.

      This statement refers specifically to the AT8 positive pathology that was quantified in this manuscript (Figure S6). This quantification shows that AT8 positive pathology is present in the hippocampus and cortex, when PS19 Tau mice with reduced LSD1 begin to decline. The text has been amended to clarify this (line 124-125).

      (3) Tau pathology is not affected by change in LSD1 levels: This is to be expected as Tau is upstream of LSD1 in a pathocascade.

      The quantification of tau pathology was included as a negative control. As we expected, tau pathology is not affected by the change in LSD1 levels. As the reviewer correctly points out, this data is consistent with our model, that tau pathology is upstream of LSD1.

      (4) The functional interaction between tau pathology and LSD1 inhibition is specific: The specificity of the interaction needs to be tested by co-immunoprecipitations or proximity ligation assays and by mapping which domains of LSD1 and Tau have a role in trapping, using the appropriate positive and negative controls, as is being routinely done for these kinds of studies.

      We too are very interested in whether LSD1 interacts directly or indirectly with tau pathology, and what domains of LSD1 are required for LSD1 to co-localize with tau pathology. However, to address these questions, we will need to perform multiple biochemical experiments (such as the ones suggested by the reviewer) on mice of different ages, as well as human cases. We believe that this is significantly beyond the scope of the current study, which is focused on the functional interaction between tau pathology and LSD1 in mice.

      (5) Overexpression of LSD1 rescues neurodegeneration in the hippocampus of PS19 Tau mice: The data in Figure 5 are not convincing.

      It is not clear why the reviewer is not convinced by the rescue data in Figure 5. Reviewer 1 acknowledged that “viral overexpression of LSD1 rescues part of these phenotypes” and reviewer 2 agreed that “the authors rescue neurodegeneration in the hippocampus of PS19 Tau mice overexpressing LSD1 using a neuron-specific virus.”

      **Minor points:**

      Abstract: The statement 'However, the mechanism through which tau contributes to neurodegeneration remains unknown.' is not correct and should be removed. There is a wealth of information on tau-based pathomechanisms available and several studies have identified proteins which become, as seems to be the case for LSD1, trapped by tau in the cytosol.

      This statement in the abstract has been modified (lines 13-15).

      Reviewer #3 (Significance (Required)):

      This form asks me about my expertise. I am working on tau pathomechanisms since more than two decades and the revision experiments I am asking for is what we are doing in our own studies. I find the data on LSD1 interesting, but definitely more work needs to be done to substantiate the claims.

      We thank the reviewer for their careful reading of the manuscript and appreciate that they found that the data on LSD1 are interesting.

      Overall, we feel that the reviews of our manuscript are very positive. We hope that our response/revision plan will be suitable for publication.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The work on LSD1 in this manuscript is based on earlier studies that deleting the histone demethylase Lsd1 in adult mice leads to neuronal cell death and that the neurofibrillary tangles in Alzheimer's disease brains can be stained for LSD1.

      The manuscript first shows that LSD1 is sequestered in PS19 tau transgenic mice, that a reduction of Lsd1 exacerbates the pathology and that overexpression rescues, complemented by a separate transcriptomic dataset used to strengthen some of the conclusions. As it currently stands, in my view, this work is very preliminary, and I am not sharing all conclusions made by the authors.

      My specific points are following the headers of the Results section:

      (1) Tau pathology depletes LSD1 from the nucleus in the PS19 Tau mouse model: What is clear from the images is that in the PS19 Tau tg mice LSD1 is depleted from the nucleus. What is not correct is that in WT it is only localized to the nucleus. What should be done, is to quantify the relative localization to the two compartments. In addition, a subcellular fractionation could be performed (see point further below).

      (2) Reduction of LSD1 increases the mouse tauopathy phenotype: 2.1. The PS19 Tau Tg mice have been crossed with an Lsd1 heterozygous mutant (LSD1 delta/-). I tried to find the reference as to how this mutant has actually been made (refs 32-34). Ref 32 describes a conditional KO (The position of the gene trap insertion (STOP), downstream of exon 3, truncates the LSD1 open reading frame within the SWIRM domain prior to the amine oxidase domain, which is essential for the catalytic activity of LSD1), which leaves a 210 amino acid truncated protein which is an obvious confound which should be mentioned and discussed. Besides from that, it is not clear to me how the Lsd1 gene was deleted for the current study, i.e. which promoter has been used.

      2.2. Fig S2 shows that LSD1 is reduced in the heterozygote, but increased in PS19 by 20% and then again decreased in the PS19 x LSD1 delta/-. Clearly, a subcellular fractionation or histological quantification is needed to understand what the levels are in the cytoplasm as compared to the nucleus.

      2.3. The rescue in Fig 2A is really modest. Certainly, with tau in PS19 potentially trapping LSD1 in the cytoplasm there should be less of LSD1 in the nucleus when there is only one functioning allele. What is needed is a quantification of nuclear and cytoplasmic LSD1 in the genotypes.

      2.4. I don't agree with the statement: 'started only slightly earlier than PS19 Tau mice, but after the appearance of pathological tau in neurons (p. 6)' as tau pathology develops gradually and is present before the age of 6 months in this strain.

      (3) Tau pathology is not affected by change in LSD1 levels: This is to be expected as Tau is upstream of LSD1 in a pathocascade.

      (4) The functional interaction between tau pathology and LSD1 inhibition is specific: The specificity of the interaction needs to be tested by co-immunoprecipitations or proximity ligation assays and by mapping which domains of LSD1 and Tau have a role in trapping, using the appropriate positive and negative controls, as is being routinely done for these kinds of studies.

      (5) Overexpression of LSD1 rescues neurodegeneration in the hippocampus of PS19 Tau mice: The data in Figure 5 are not convincing.

      Minor points:

      Abstract: The statement 'However, the mechanism through which tau contributes to neurodegeneration remains unknown.' is not correct and should be removed. There is a wealth of information on tau-based pathomechanisms available and several studies have identified proteins which become, as seems to be the case for LSD1, trapped by tau in the cytosol.

      Significance

      This form asks me about my expertise. I am working on tau pathomechanisms since more than two decades and the revision experiments I am asking for is what we are doing in our own studies. I find the data on LSD1 interesting, but definitely more work needs to be done to substantiate the claims.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript "The inhibition of LSD1 via sequestration contributes to tau-mediated neurodegeneration" by Amanda K. Engstrom, Alicia C. Walker, Rohitha A. Moudgal, Dexter A. Myrick, Stephanie M. Kyle and David J. Katz, is an excellent study that beautifully describes the implication of the epigenetic enzyme LSD1, as downstream mediator of tau pathology in neurodegenerative disease.

      The same authors in a previous paper showed that i) loss of LSD1 in the adult mouse brain, leads to significant neuronal cell death and ii) loss of LSD1 induces genome-wide expression changes that significantly overlap with those observed in the brains of postmortem human AD. In this work, are presented initial evidences that in AD brain, LSD1 nuclear function could be disrupted by mislocalization to pathological tau.

      In the present work, using the PS19 mouse model, the authors provide the first cytological evidence that pathological tau can prevent LSD1 from properly localizing to the nucleus in hippocampal and cortical neurons. These data are very strong, the effect is impressive. Crossing the PS19 mouse model of taupathology with a mouse model of LSD1 brain heterozygosity LSD1Δ /+, the authors provide functional data that the inhibition of LSD1 function contributes to tau induced neurodegeneration. Indeed, several pathological parameters measured in the PS19 mouse model, are exacerbated in a reduced genetic LSD1 background. Survival rate, motor activity measured with a rotarod test. The behavioral analysis is nicely paralleled by the analysis of spinal cord motor neurons, showing abnormal morphology in the double mutant mice, compared to the PS19. Overall morphology of the hippocampus shows decreased brain size and brain weight. The analysis is accompanied by MRI analysis, showing again very impressive results, with the double mutant being the most affected and the LSD1Δ /+ very similar to WT.

      The second part of the work is aimed at demonstrating specificity of the functional interaction between tau pathology and LSD1. The authors provide a very well planned transcriptional profiling of the different mouse models, choosing the most relevant time point (prior the onset of neuronal cell death), very clearly justifying the rational of their choice. The data are very convincing, and provide a strong molecular base showing a tight overlap in the effected molecular pathways associated with both pathological tau and Lsd1 heterozygosity. As final approach, the authors rescue neurodegeneration in the hippocampus of PS19 Tau mice overexpressing LSD1 using a neuron-specific virus. Overall, these data establish LSD1 as a major downstream effector of tau-mediated neurodegeneration indicating that the LSD1 pathway is a potential late stage target for intervention in tauopathies, such as AD.

      Minor point:

      In material and Methods is missing a section dedicated to a detailed description of statistical analysis.

      Significance

      I believe that this work will be of great interest for the neurodegenerative together with the neuro-epigenetic field (my personal area of expertise). The identification of a clear new pathway implicated in AD and neurodegeneration together with the suggestion of a possible new therapeutic target (disruption of tau-LSD1 interaction) is of high potential impact for future studies.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is a very interesting study that addresses an important topic. In brief, the authors build on their previous data showing that LSD1 seems to be neuroprotective. Here they follow the hypothesis that Tau-dependent sequestration of LSD1 to the cytoplasm leads to loss of nuclear LSD1 function. Crossing Tau mutant mice (PS19) to heterozyous LSD1 knock out mice exacerbates phenotypes in PS19 mice, while viral overexpression of LSD1 rescues part of these phenotypes.

      As said the data is interesting but lack mechanistic explanation that would allow in my view publication in a very high profile journal. Moreover, there are some data such as the RNA-seq that would not be acceptable in the present for by any journal. However, all of these issues could be addressed by the authors in case reviewers would refer to them.

      Here are some specific issues.

      1 . Especially the proposed link of Tau-mediated sequestration of LSD1 to the cytoplasm is not fully supported by the data. A key finding shows that LSD1 is seen more in cytoplasm in PS19 mice. However, the biological relevance of this observation cannot be fully appreciated at present, since the magnitude of this phenotype is unclear. Approaches to perform a quantitative analysis in addition to the representative IHC images would be helpful.

      2 . Point 1 might be of specific importance since the subsequent experiments built on the idea that mice with already recued LSD1 levels should have a more severe phenotype in case of Tau pathology. However, they do not really address the role of Tau-mediated sequestration of LSD1 anymore. The authors employ mice that constitutively lack one allele of LSD1 which generally leads to a more severe phenotype in PS19 mice. This is very interesting, but I wonder if reduced LSD1 levels might generally put the network in a more vulnerable state and that other detrimental stimuli that do not cause intracellular protein aggregation might have a similar effect. The authors realize this and address this question by comparing via RNA-seq the gene-expression changes observed in Lsd1+/+, Lsd1Δ/+, PS19 Tau, and PS19;Lsd1Δ/+ littermates. Comparatively few changes are observed. However, the major issue with this experiment is that an n=2/group is simply no acceptable anymore to be published in any serious journal. Thus, this data is not interpretable as it stands.

      Significance

      The data will be interesting to the field and help to further understand the role of LSD1 in neuroegenerative dieases linked to tauopathy.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the Reviewers for the positive assessment of our work and their insightful remarks. Please find below a point-by-point response to each comment.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      *The authors present a well written article describing distinct transcriptomic profiles generated by RNA sequencing analysis of hippocampus, a distinct anatomical area, at well spaced and defined time points of clinical progression following prion inoculation in an established mouse model. The authors contribute significantly in the detailed transcriptomic definition of changes during disease progression, especially during the early and almost asymptomatic stages.

      The brain region chosen to perform their analysis is logical as the hippocampus shows clear signs of neuronal degeneration in prion disease progression and furthermore provides a well defined area for analysis that is easily accessible experimentally; Although, more information would be needed to strengthen this choice in relation to the hippocampus playing a key role in the initiation stages of the disease. It remains an anatomical subset of the whole brain and the study would benefit if extended to include other affected areas. *

      The hippocampus is one of the most affected and therefore most studied regions during prion disease (Moreno et al., 2012, Nature). We have clarified this in the text (page 3). While the analysis of transcriptional changes in additional brain regions would be of interest, the main conclusions derived by the present analyses on the hippocampus already opens new perspectives to our understanding of this complex disease (e.g. premature pathological changes at 8 weeks occur long before the development of neuropathological and clinical signs). In light of the new findings observed from the first cohort of experimental animals, we designed the rest of the study to prioritize more analyses (e-g- splicing and RNA editing) and validations (e.g. second cohort, aging cohort, plasma administration cohort etc.) in order to provide comprehensive and robust dataset and corroborate our findings. We are currently working on a follow-up study thoroughly describing transcriptional changes during prion disease development in other brain regions. We believe that the inclusion of these data would not be instrumental to support the main conclusions of the present study and may unduly add complexity to the current manuscript.

      The article presents comprehensive bioinformatics analysis of the gene expression profiles, during disease progression and continues focusing on two early stages whose profiles clearly cluster together. The authors elegantly query the transcriptomic data extrapolating clusters representative of different cell types and conclude that at preclinical stages microglial-related DEGs are enriched. Importantly, data trends are replicated in an independent animal cohort supporting the experimental design, reproducibility and bioinformatics analysis. Enriched microglial populations from challenged animals compared to controls, would have added more value to the approach.

      We agree with the reviewer that certain cell types, including microglia should be investigated in more detail. We are currently working on a study investigating prion-induced changes in a cell-type specific manner using ribosomal profiling. While space reasons prevent us from adding these studies to the present manuscript, we are planning to publish a comprehensive searchable database that will include both the transcriptomics and translatomics data.

      The authors proceed to conclude that these transcriptomic enrichment of microglial related DEGs are suggestive of driver events in the initiation of prion disease. Although the statement is gaining a lot of interest in the current literature, it is yet immature to conclude from only RNA sequencing data that microglial neuroinflammation is the causative driver event and not the result of the infection and subsequent neurodegeneration. Taking also into consideration the route of infection (ic) which is expected to initiate an acute immune response in the brain.

      Towards that comment, the immunohistochemistry data should show increased immune reaction from the early time points pi.

      While the simultaneous occurrence of microglia-related changes and motor decline suggests that microglia may be the ultimate drivers of prion disease progression, we agree that correlation does not prove causation, and have toned down our conclusions to this respect.

      Clearly, microglia activation does not play a major role during the early stages of prion disease: we do not see any increased immune reaction at the early time points as the reviewer pointed out, nor do we see any RNA expression changes in microglia-enriched genes at the early time points.

      We also don’t believe that the infection is the source of microglia activation for the following reason: if the inoculation itself would induce microglia activation we would expect a strong microglia response directly after the injection that should progressively decrease. Instead, we see no expression change in microglia-enriched genes until 16 wpi. We have clarified the corresponding sections in the text.

      To address the reviewer’s point that the route of infection may contribute to the observed changes we have added the following datasets as new Supplemental Fig. 4:

      We have analyzed prion induced changes 8wpi and the terminal stage from intraperitoneally inoculated mice (new Supplemental Fig. 4). The prion induced changes between the different routes of administration correlate at the respective timepoint, indicating that the induced changes are independent of the route of prion inoculation. To strengthen the point that the 8 wpi changes are indeed prion-dependent (and thus require in vivo prion replication by incorporation of cellular prion protein PrP), we have additionally included 8 wpi samples from PrP knock-out mice. The knockout mice don’t show any prion-induced changes at 8 wpi (new Supplemental Fig. 4), suggesting that the 8 wpi changes are not the result of the infection and more importantly, are in fact prion-dependent.

      Also, the paper would gain significantly, if there were random as well as targeted (eg microglial specific) molecular targets selected, for independent validation by Real-time QC PCR and immunohistochemistry. This would be especially interesting if it was combined with the targets that showed selective splicing like Ctsa, a microglial related gene.

      We respectfully disagree with the reviewer on this issue. In the early days of RNAseq, most scientists would validate their results with qPCR of select genes. However, by now RNAseq is widely accepted as the state-of-the-art technique to profile whole transcriptomes and is considered to be more reliable, accurate and sensitive compared to orthogonal methods such as RT-qPCR. Also, RNAseq and RT-qPCR data are highly correlated (typically ~85% and well above 90% when genes with a low expression are neglected; PMID: 28484260). The inclusion of an orthogonal technology is thus only needed when a) no biological replicates are available (potentially detrimental intra-group variability); b)definitive conclusions depend on genes with extremely low expression levels (potentially detrimental high dispersion); c) the main findings of an experiment revolve around one or a handful of genes (potentially detrimental false positives). None of the above applies to this study. Moreover, in terms of overall validation, we already include data from a second, independent cohort of mice with the same experimental settings (Supplementary Fig. 3), as well as from aged mice and from mice with plasma/saline treatment. We therefore maintain that qPCR verification is unnecessary in this instance and may potentially even produce confounders.

      *RNA binding deaminase proteins show a similar pattern to a recent report, strengthening the finding that protein levels do not change and/or compensate with other RNA binding and editing enzymes, even though edited targets and editing frequency shift significantly.

      The authors continue with RNA editing analysis concluding that they did not find any (apart from two targets being edited) differential RNA editing sites contradictory to a recent study. We believe that this contradiction is a premature conclusion since, the analysis was based on an older protocol that was published by the same group based on GTAK version 3.4.0 from 2011. The predicted RNA edited sites were only based in previously catalogued samples from hippocampus of young mice by Stilling et al 2014. They did not take into account C-U editing in all genomic locations in the whole brain regardless of aging or region. Also, the depth of sequencing was not taken into account which would increase the novel identification of editing sites instead of being limited to previously identified non-validated RNA editing. The study would significantly benefit from Sanger sequencing validation of random and non random edited targets. How do the identified targets validate? *

      As suggested by the Reviewer, we have reanalyzed RNA editing using the same editing pipeline as Kanata et al. (PMID: 31492812), neither restricting the analysis to a pre-existing list of candidate sites, nor limiting the analysis to A-to-I editing events. Following this approach, a number of editing sites comparable to those reported by Kanata et al. were identified. However, we did not observe a statistical difference between control and PrD samples at the locus level.

      As discussed in the manuscript Kanata et al, analyzed a different brain region using a different infection model. Furthermore, the fact that we assessed triplicates, and the application of strict filters in the selection of putative editing sites might have contributed to us not detecting differentially edited sites. While we used the same parameters linked to quality and depth of coverage, we only considered the intersection of both REDItools and VarScan2, and required that at least 2 out of 3 samples were edited. Regarding the validation of the editing events through Sanger sequencing, we believe it is outside of the scope of the present study because our main goal is not that of exactly pinpointing specific editing sites and hypothesizing on their potential effects. We rather view the editing analysis as an auxiliary layer to the main conclusions of the manuscript, and through the updated analysis and results we believe we have reached such a goal.

      Finally, the study concludes with the administration of young plasma at 8 weeks (early stage of the disease) and the authors support that this intervention improves the phenotype of the affected animals without lifespan changes. In our view, this part of the study should either be omitted, or full transcriptomic and clinicopathological improvement should be demonstrated with clear emphasis on microglial-related molecular targets.

      While plasma administration does not prolong lifespan and terminal prion-induced changes are very similar in plasma vs saline-treated animals (Fig. 6d-e), we did in fact observe a full transcriptomic improvement upon plasma administration at 8 wpi (Fig. 6b). We currently don’t know if prion-induced 8 wpi changes and the plasma-induced improved health span are linked to microglia-related changes (see also response above). We have therefore not put any additional emphasis on microglia-related targets. We therefore feel that the plasma experiments do add to the present paper, but we would be prepared to discuss with the editors whether it may be appropriate to omit this part and publish it separately. \*Minor comment:**

      Other behavioral tests such as T-maze, Morris water-maze, novel object recognition, wouldn't it be better suited for memory assessment? *

      Although we agree with the reviewer that these tests are better suited for memory assessment, the purpose of the rotarod evaluation (together with histological and biochemical tests) was to obtain an objective monitoring of clinical disease development. Rotarod assessment has been instrumental to objectivate the genetic or pharmacological modulation of prion disease development (e.g PMID: 29176838; PMID: 26246168; PMID:25502554). A more sophisticated behavioral assessment would go beyond the scope of this study and would require access to specific infrastructures which are not available in our veterinary bio contained research facility allowing the handling of prion-infected mice.

      *Reviewer #1 (Significance (Required)):

      The authors present a very detailed and informative transcriptomic profiling of a well structured in vivo experiment with a satisfactory number of time points that has provided significant transcriptomic and splicing information at the preclinical stage of the disease. The field would definitely benefit from such a profile oriented approach however the above should be sufficiently addressed. *

      *Reviewer #2 (Evidence, reproducibility and clarity (Required)): **

      In the present paper by Sorce et al. the authors studied mRNA changes, splicing and editing alterations during the progression of prion disease in an experimental animal model (inocculated mice). The main findings are that changes in RNA processing and abundance occured very early at around 8 wpi. Interestingly, changes in microglia-enriched genes appeared early and coincided with the onset of clinical symptoms while neuronal genes were unchanged and played a more prominent role at later stages of the disease suggesting that glial cells might be the driving force and pivotal for the early stages and disease progression. Young plasma restored mRNA alterations and was beneficial for delaying neurological symptoms. This conclusion seems to be supported by the data and overall the study was well performed. The findings are clearly presented, the discussion is insightful and balanced and the figures are in general of high quality but there are some concerns that need to be clarified.

      The authors could tackle the following comments with a straightforward revision:

      1) On Page 3 it is mentioned that RNA sequencing was performed in n=3 samples per time point and for the 20 wpi time point in controls only n=2 samples have been used. Overall this is a very low sample size that needs to be increased. More samples need to be analyzed in order to provide biological relevance. *

      We agree with the reviewer that, like in any other biological study entailing a certain degree of experimental variability, increased sample sizes always increase the statistical power and may allow for the detection of changes that might otherwise go undetected. However, there are opportunity costs that go along with enlarging the study. Also, the Swiss Animal Protection Law requires us to adhere to the 3R principles (Replacement, Reduction and Refinement of animal experimentations). We have aimed at using the minimum number of animals allowing us to identify a subset statistically significant and robust changes. Animal welfare considerations and an attempt to prevent an escalation of cost resulted in the majority of experiments being performed with 3 samples per time point.

      It is accepted in the field that three replicates are sufficient to identify the vast majority of biologically relevant changes in mRNA abundance. Unless major claims are made about individual genes at the lower end of the expression’s dynamic range (which is not the case in this study), three replicates ensure that about 85 % of the relevant changes are accurately captured (PMID: 29767357; PMID: 26813401). This is particularly true when the variability across replicates is low and appropriate analysis tools, such as edgeR, are employed (PMID: 30726870).

      We used age and gender matched inbred C57BL/6J mice in a microbiologically tightly-controlled environment (see Methods) to minimize interindividual variability. This allowed us to identify thousands of statistically significant prion-dependent changes, despite the low sample number.In few exceptions we sequenced two instead of three replicates (eg because a sample got lost, the RNA was degraded, or the sample did not pass quality control after sequencing). In these instances, we ensured that both replicates showed a high correlation and could thus still yield reliable results. Furthermore, we have validated the RNA expression changes in an independent cohort of mice with the same experimental settings (Supplementary Fig. 3), as well as from aged mice and from mice with plasma/saline treatment, indicating that the observed changes are robust.

      2) On page 4 it is written that 'While clusters 2 and 3 consist predominantly of microglia and neuronal genes, cluster 1 and 4 contain genes enriched in multiple cell types'. A few sentences later, the authors write, that 'Neuronal genes almost exclusively belonged to clusters 3 and 4................, whereas microglia genes were essentially contained in clusters 1 and 2. These two statements are contradictory. Please explain and clarify.

      Compared to clusters 2 and 3, the enriched genes in clusters 1 and 4 don’t predominantly fall into one category (eg cluster 4 contains ~30% neuronal-enriched genes, ~25% oligodendrocyte enriched genes, 20% endothelial-enriched genes – see Fig. 1d). However, of 203 neuronal-enriched DEGs, 143 are cluster 3 genes (~70%), 50 are cluster 4 genes (~25%), while only 10 are cluster 1 genes (~5%) and 0 are cluster 2 genes. To better illustrate this point, we have included these numbers as an additional Table in Supplementary Fig. 2.

      3) On page 5 the authors claim that 'astro- and microgliosis became evident at 16 wpi...........' This statement is based solely on histological images and needs to be confirmed by quantification. However in Supplementary figure 5c astrocytes and microglia (GFAP and Iba1 staining) are almost not visible and the overview images too superficial. I recommend high resolution images and additional inserts and a solid quantification.

      We have added high resolution pictures, additional inserts and a quantification of the stainings (new Supplementary Fig. 6).

      4) On Page 6 the authors write 'We observed progressive decline in motor performance starting 18 wpi'. However, the graph in figure 3a clearly shows only a significant difference at 19 wpi'. This needs to be corrected.

      • The decline in motor performance shows a visible trend at 18 wpi but only becomes statistically significant at 19 wpi. We have clarified this in the text.*

      5) Figure 6c: it would make sense to combine both graphs (saline and plasma) for a direct comparison of prion infected mice that received saline or plasma so that potential differences would be easier to recognize ......although they seem to be pretty modest.

      We have combined both graphs from Fig. 6c into one but believe that it becomes very difficult to extract any information from the figure. We shall defer to the reviewer’s judgment but we would prefer to keep the original figure.

      Reviewer #2 (Significance (Required)):

      The findings are of interest to a wide readership and the paper thus seems suited to be published, but there are some concerns that need to be clarified (see specific comments above).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In the present paper by Sorce et al. the authors studied mRNA changes, splicing and editing alterations during the progression of prion disease in an experimental animal model (inocculated mice). The main findings are that changes in RNA processing and abundance occured very early at around 8 wpi. Interestingly, changes in microglia-enriched genes appeared early and coincided with the onset of clinical symptoms while neuronal genes were unchanged and played a more prominent role at later stages of the disease suggesting that glial cells might be the driving force and pivotal for the early stages and disease progression. Young plasma restored mRNA alterations and was beneficial for delaying neurological symptoms. This conclusion seems to be supported by the data and overall the study was well performed. The findings are clearly presented, the discussion is insightful and balanced and the figures are in general of high quality but there are some concerns that need to be clarified.

      The authors could tackle the following comments with a straightforward revision:

      1) On Page 3 it is mentioned that RNA sequencing was performed in n=3 samples per time point and for the 20 wpi time point in controls only n=2 samples have been used. Overall this is a very low sample size that needs to be increased. More samples need to be analyzed in order to provide biological relevance.

      2) On page 4 it is written that 'While clusters 2 and 3 consist predominantly of microglia and neuronal genes, cluster 1 and 4 contain genes enriched in multiple cell types'. A few sentences later, the authors write, that 'Neuronal genes almost exclusively belonged to clusters 3 and 4................, whereas microglia genes were essentially contained in clusters 1 and 2. These two statements are contradictory. Please explain and clarify.

      3) On page 5 the authors claim that 'astro- and microgliosis became evident at 16 wpi...........' This statement is based solely on histological images and needs to be confirmed by quantification. However in Supplementary figure 5c astrocytes and microglia (GFAP and Iba1 staining) are almost not visible and the overview images too superficial. I recommend high resolution images and additional inserts and a solid quantification.

      4) On Page 6 the authors write 'We observed progressive decline in motor performance starting 18 wpi'. However, the graph in figure 3a clearly shows only a significant difference at 19 wpi'. This needs to be corrected.

      5) Figure 6c: it would make sense to combine both graphs (saline and plasma) for a direct comparison of prion infected mice that received saline or plasma so that potential differences would be easier to recognize ......although they seem to be pretty modest.

      Significance

      The findings are of interest to a wide readership and the paper thus seems suited to be published, but there are some concerns that need to be clarified (see specific comments above).

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors present a well written article describing distinct transcriptomic profiles generated by RNA sequencing analysis of hippocampus, a distinct anatomical area, at well spaced and defined time points of clinical progression following prion inoculation in an established mouse model. The authors contribute significantly in the detailed transcriptomic definition of changes during disease progression, especially during the early and almost asymptomatic stages.

      The brain region chosen to perform their analysis is logical as the hippocampus shows clear signs of neuronal degeneration in prion disease progression and furthermore provides a well defined area for analysis that is easily accessible experimentally; Although, more information would be needed to strengthen this choice in relation to the hippocampus playing a key role in the initiation stages of the disease. It remains an anatomical subset of the whole brain and the study would benefit if extended to include other affected areas.

      The article presents comprehensive bioinformatics analysis of the gene expression profiles, during disease progression and continues focusing on two early stages whose profiles clearly cluster together. The authors elegantly query the transcriptomic data extrapolating clusters representative of different cell types and conclude that at preclinical stages microglial-related DEGs are enriched. Importantly, data trends are replicated in an independent animal cohort supporting the experimental design, reproducibility and bioinformatics analysis. Enriched microglial populations from challenged animals compared to controls, would have added more value to the approach.

      The authors proceed to conclude that these transcriptomic enrichment of microglial related DEGs are suggestive of driver events in the initiation of prion disease. Although the statement is gaining a lot of interest in the current literature, it is yet immature to conclude from only RNA sequencing data that microglial neuroinflammation is the causative driver event and not the result of the infection and subsequent neurodegeneration. Taking also into consideration the route of infection (ic) which is expected to initiate an acute immune response in the brain.

      Towards that comment, the immunohistochemistry data should show increased immune reaction from the early time points pi. Also, the paper would gain significantly, if there were random as well as targeted (eg microglial specific) molecular targets selected, for independent validation by Real-time QC PCR and immunohistochemistry. This would be especially interesting if it was combined with the targets that showed selective splicing like Ctsa, a microglial related gene.

      RNA binding deaminase proteins show a similar pattern to a recent report, strengthening the finding that protein levels do not change and/or compensate with other RNA binding and editing enzymes, even though edited targets and editing frequency shift significantly.

      The authors continue with RNA editing analysis concluding that they did not find any (apart from two targets being edited) differential RNA editing sites contradictory to a recent study. We believe that this contradiction is a premature conclusion since, the analysis was based on an older protocol that was published by the same group based on GTAK version 3.4.0 from 2011. The predicted RNA edited sites were only based in previously catalogued samples from hippocampus of young mice by Stilling et al 2014. They did not take into account C-U editing in all genomic locations in the whole brain regardless of aging or region. Also, the depth of sequencing was not taken into account which would increase the novel identification of editing sites instead of being limited to previously identified non-validated RNA editing. The study would significantly benefit from Sanger sequencing validation of random and non random edited targets. How do the identified targets validate?

      Finally, the study concludes with the administration of young plasma at 8 weeks (early stage of the disease) and the authors support that this intervention improves the phenotype of the affected animals without lifespan changes. In our view, this part of the study should either be omitted, or full transcriptomic and clinicopathological improvement should be demonstrated with clear emphasis on microglial-related molecular targets.

      Minor comment:

      Other behavioral tests such as T-maze, Morris water-maze, novel object recognition, wouldn't it be better suited for memory assessment?

      Significance

      The authors present a very detailed and informative transcriptomic profiling of a well structured in vivo experiment with a satisfactory number of time points that has provided significant transcriptomic and splicing information at the preclinical stage of the disease. The field would definitely benefit from such a profile oriented approach however the above should be sufficiently addressed.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1:

      **Summary**

      Jang et al., address the important question of spatially localized or compartmentalized metabolic enzymes with a focus on the glycolytic enzyme PFK1. Using a good strategy of inserting a fluorescent tag at the endogenous PFK1 locus with tissue-specific inducible expression in C. elegans, combined with strong quantitative longitudinal imaging and innovative bioengineered microfluidic-hydrogels to control oxygen availability as well as optogenetic approaches, they show PFK1 condensates, which are not stress granules and not seen in normoxia, assemble with hypoxia. PFK1 condensates are dynamic, reversible, localized at the synapse in neurons, and recruit aldolase, another glycolytic enzyme. Although glycolytic proteins were previously shown to compartmentalize near the plasma membrane, and PFK1 was previously shown to assemble into filaments in vitro and be punctate at the plasma membrane in mammalian cells, evidence for cellular localized PFK1 condensates in animals is highly significant. The work includes strong biophysical characterization of PFK1 phase-separated condensates, but no clear indication of the composition of condensates. More significantly, the findings lack functional significance related to PFK1 activity or glycolytic flux with hypoxia vs normoxia. Despite previous work by this group showing that disrupting subcellular localization of glycolytic enzymes impairs neuronal activity in response with hypoxia, the reader is left with questions on the importance of localized and PFK1 condensates and their make-up .

      **Major comments:**

      Key conclusions are convincing, and most experimental approaches, biophysical characterization including thermodynamic principles, and data analysis are exemplary and well described. However, as indicated above, the work is limited to a descriptive analysis of cellular localization of PFK1 condensates and their biophysical properties without insights on functional significance relative to enzyme activity - or at least glycolytic flux or metabolic reprogramming with hypoxia. At best, only correlations can be drawn from hypoxia-induced localized PFK1 condensates and the authors' previous report (Jang et al., 2016) on hypoxia-regulated neuronal activity. Some insight or at least prediction in the discussion on the differences in spatially localized PFK1 in muscle vs neurons with regard to metabolic or energy distinctions should be included.

      We have added additional discussions on the differences of the spatially localized PFK-1.1 in muscles versus neurons, explaining that in both tissues the cellular enrichment appears to be at sites predicted to have high ATP consumption (lines 128-133; 482-484).

      Despite the strong biophysical analysis of condensates, several important features are not determined. First is at best a rudimentary analysis of the composition of condensates and also how PFK1 is assembled into these structures. For the former, is the core of the condensate predominantly PFK1 with perhaps aldolase only recruited to the periphery or is aldolase an integral component of the structure. Hence, is it a PFK1 condensate or a glycolytic condensate? For the latter question, is there a particular orientation for PFK1 in condensates, i.e a collection of filaments as previously reported, which might provide insight on assembly? Finally, and less critical but also important is the criterion for spherical, which is not well defined, and at least some idea or speculation on determinants for a spherical morphology - compared with filaments that have been reported for other non-glycolytic metabolic enzymes.

      We have now co-expressed PFK-1.1 and ALDO-1 and examined their dynamic formation during hypoxic conditions. We observe PFK-1.1 and ALDO-1 form condensates simultaneously, with gradual enrichment of both molecules. We now include this new data in Figure 7E and Video 8; lines 422-441, 964-989). We also include genetic data demonstrating the ALDO-1 requires pfk-1.1 to form condensates, and that PFK-1.1 requires aldo-1 as well. Therefore, the enzymes are interdependent on each other to form condensates (Figures 7G, 7H, S7B, and S7C).

      The spheroid geometry reflects liquid-like properties, which arises from surface tension of molecules loosely held together via multi-valent interactions. Filamentous arrangements reflect crystalline-like structures resulting from more stable interactions between molecules into solid-like states. While we did not perform high resolution studies, like Cryo-EM, to resolve this question, the spheroid geometry of PFK-1.1 condensates, along with its fluid-like properties, suggest the condensates are liquid-like compartment distinct to filamentous structures. We now add this discussion in lines 467-470.

      The work is an important advance in our understanding on the self-assembly of metabolic enzymes by showing hypoxia-induced PFK1 condensates in vivo, their spatially-restricted subcellular localization in muscle cells and neurons, and their biophysical properties, the latter being distinct from those of stress granules. Taken together, these findings are more extensive than many previous reports on the assembly of metabolic enzymes into filaments or condensates, but fall short for new insights on functional significance.

      We focus this study on the biophysical characterization of the condensates, and how that results in compartmentalized enrichment of glycolytic proteins. Examination of the functional significance of the phase separation to the enzymatic reactions in vivo is not currently possible because we lack probes we can use in vivo to measure the metabolites resulting from the reaction. We have now added discussion acknowledging this and framing its significance in the context of what has been published in the field (lines 484-492). For example, a recent manuscript in ChemRxiv demonstrated, in vitro, that the enzymatic activity of glycolytic proteins, hexokinase and glucose-6 phosphate dehydrogenase, promote these enzymes condensing into liquid droplets. The authors further found that the condensation accelerated the glycolytic reactions (Ura et al., 2020). This raises the question whether glycolytic proteins compartmentalize, and form condensates, in vivo, which we address in this manuscript. We capture this point in (lines 444-464) where we explain that, while it has long been hypothesized that glycolytic proteins like PFK-1 could be compartmentalized, this remained controversial due to lack of dynamic in vivo imaging. In our study, and through a systematic examination of endogenous PFK-1.1 via the use of a hybrid microfluidic-hydrogel device, we conclusively determine that PFK-1.1 indeed displays distinct patterns of subcellular localization in specific tissues in vivo.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      This paper reports on the condensation of the glycolytic enzyme PFK-1 in response to hypoxic conditions in neurons of C. elegans. The authors employ a microfluidic-hydrogel device to dynamically monitor the relocalisation of PFK-1 from a mostly diffuse state to clusters in response to hypoxia and show that PFK-1 can undergo multiple rounds of PFK-1 clustering and dissolution. The authors work through the key features of a liquid-like compartment (sphericity, fusion, fast internal rearrangements) and give evidence that PFK-1 may have all three. Finally, the authors tag PFK-1 with the light-inducible multimerization domain Cry2 and find that even without light PFK-1 will constitutively form clusters that sequestrate endogenous PFK-1 as well as other glycolytic proteins. The strength of this work is that it is characterizing what appears to likely be phase separation in the context of a whole animal experiencing a stress that it could encounter in the natural world. A limitation of the work is that it is unclear what the functional implications are of condensates of PFK-1 at the molecular or cell scale.

      **Major comments:**

      -All experiments were performed using fluorescently tagged PFK-1 expressed from endogenous promoter or from the native genetic locus which is important for excluding overexpression artifacts. However, there is still risk that the GFP tag is driving the assembly process. In order to exclude tag-specific effects that may cause aggregation of the tetrameric PFK-1, ideally a control would be done in which PFK-1 is visualized through immunofluorescence experiments of WT cells. Alternatively, a short tag (e.g HA, His) as epitope for is an alternative .

      We used fluorescent tags to observe the dynamic relocalization in vivo. While in the study we have not performed immunofluorescence, we established the validity of the labeling method by: 1) using monomeric versions of GFP; 2) using different fluorophores to show the same condensation phenomenon; 3) performing CRISPR for single copy insertions; 4) Demonstrating that different glycolytic proteins form condensates; 5) demonstrating the GFP-tagged versions of the protein are capable of rescuing the loss-of-function alleles and 6) Now adding new data demonstrating the observed localization specifically depend on the presence of other glycolytic proteins. This last result supports that GFP tag is not driving the assembly process of glycolytic condensate and that the glycolytic condensate formation requires the presence of specific molecules in the pathway. I add that we routinely use fluorophore markers to over a dozen distinct proteins that label subcellular compartments, and we have never observed the dynamic relocalization reported here, with the exception of other glycolytic proteins that interact with PFK, suggesting this is a property specific to glycolytic proteins, and, based on the genetic studies, dependent on the glycolytic reaction. We add and discuss these findings in Figures 7G, 7H, S7B, and S7C; lines 422-441, 964-989.

      -For the Cry2-section, the complementation of the pfk-1 mutant supports functionality of the synaptic clustering phenotype. Are there other features of function that can be evaluated or could you look at how Cry-2 vs wt worms recover from different durations of stress or frequencies. Could you see if the Cry-2-fusion will rescue function to a partial-loss-of-function allele or a tetramerization deficient allele? A detailed analysis of the effects of constitutive presence of PFK-1-Cry2 clusters would be necessary to bolster claims that this is fully functional construct. Can enzyme activity be somehow monitored?

      We did not observe any difference between wild-type worms and CRY2-expressing worms with regards to their development, survival, locomotive behavior or synaptic phenotype. While we can not discard the possibility that this is not a full rescue, with available tools, we can not distinguish the recue with PFK-1-Cry2 from that of just PFK-1.

      -The analysis of the sphericity of clusters (4A) is limited due to the diffraction limit of light which limits an analysis of a compartment of this size. While this is a limitation of the live organism, this should be more clearly acknowledged.

      We have included in the Methods section our criteria for quantifying condensates and avoiding diffraction limit artifacts. Briefly, “Considering the resolution limit of a spinning disc confocal (approximately 300nm), any structure with a diameter less than 500nm and an area smaller than 0.2 µm2 was excluded from the analyses”. To better clarify this point, we also now add a description of the criteria used in the main text (lines 242-243).

      In addition, we observed that PFK-1.1 condensates are not perfect spheres, but constrained spheroids (which can not be explained by diffraction-limited point spread functions). We can explain the observed spheroid shapes based on liquid-like properties of the condensates, and the constrains of the diameter of the neurite. To better highlight this finding, we have now moved Figure S4E into the main figure (Figure 4B’).

      -Fusion experiments (4C) do not fully exclude that clusters overlap instead of merging. It would be beneficial to show the foci for several subsequent frames. One would expect that upon fusion, the condensate size would increase, but video 3 suggests the opposite. It would be useful to quantify condensate size before and after fusion for several separate fusion events. -an alternative possible experiment would be the tagging of PFK-1 with a photoconvertible fluorophore (e.g. Dendra2) and subsequent analysis of fusion events

      To better show the fusion events in Figure 4C, we now include all xy, yz, and zx plane views of before and after fusion events of Figure 4C (Figure S5B). We also added a quantification of four independent fusion events in which we compare the sum of the areas of the two puncta before fusion and the size of the area of the single punctum after fusion (Figure S5C). These data support that we are observing fusions events.

      -4D). It is unclear if foci are indeed undergoing fission or if two clusters next to each other are moving apart.

      For Figure 4D, in all the frames we had recorded, a single structure maintains a continuous signal until fission occurs and splits into two structures. To better present this event, we now include an unabridged version of figure of 4D in the supplement that shows all the frames captured (Figure S5D).

      -The analysis of side-by-side growth and dissolution kinetics are interesting and a novel view into the non-equilibrium aspects of phase separation in cells.

      -Purification of PFK-1 and in vitro reconstitution of condensates would be supportive of liquid-like characteristics although I don't think it is necessary however it would add a lot to the relevance to show enzyme activity is different +/- condensate state but I am not sure if an easy enzymatic assay exists in vitro.

      We agree. But the significance of this particular paper, specifically in the context of the in vitro enzymatic work on glycolytic proteins, is to examine the dynamic in vivo localization and the biophysical characteristics of the condensates. To better underscore this in the context of the field, we add discussion of a recent in vitro manuscript demonstrating that liquid droplet formation of glycolytic proteins affect their enzymatic activity (Ura et al., 2020) (lines 444-464; 484-492). While we see the value of future studies reconstituting the glycolytic particles, we believe that is beyond the scope of this particular in vivo study.

      **Minor comments:**

      -Stress granules in other organisms (yeast paper) have different composition depending on stress type. To make the claim that the PFK-1 compartments are independent of SGs one would ideally test multiple different SG markers.

      We selected the stress granule protein TIAR-1 because it is one of the most studied stress granule markers in C. elegans and it is reportedly one of the core proteins and universal components of stress granules irrespective of a stress type (Buchan et al., 2011; Gilks et al., 2004; Huelgas-Morales et al., 2016; Kedersha et al., 1999). Although we did not include images in the manuscript, we had tested a total of three stress granule markers: TIAR-1, TDP-43, and G3BP1 with similar results. We now added that as data not shown (lines 193-194).

      -it should be stated in the main text that the microfluidic-hydrogel device was fabricated following previously published protocols

      We have added the reference in the main text (line 170) to supplement what we had written in the Methods section: “A reusable microfluidic PDMS device was fabricated to deliver gases through a channel adjacent to immobilized animals, following protocols as previously described (Lagoy and Albrecht, 2015)”.

      -Figure 4b: Y-axis should be changed from probability to fraction of occurrence

      We have corrected this in both the figure and the figure legends (Figure 4B).

      -The discussion should be less speculative concerning any effects seen in PFK1-Cry2 expressing C. elegans

      We have modified the discussion as suggested.

      -it is perplexing that a protein known to tetramerize with no disordered or RNA-binding domains forms condensates like this. Is there anything known from other systems of additional interacting proteins that may have features that promote liquidity and serve to fluidize these assemblies?

      Condensates can form via multivalent interactions, which include, but is not exclusive, to disordered or RNA-binding domains. Because glycolytic proteins have dihedral symmetries that can facilitate multivalent interactions, we believe these structural properties, in combination with regulated conformational changes, promote multivalent interactions leading to their condensation. We had a statement in the discussion (lines 494-519) now add this more clearly in the results (lines 395-398).

      Reviewer #2 (Significance (Required)):

      Stimulus-induced phase separation has been observed for dozens of metabolic enzymes from various different pathways (reviewed in Prouteau, 2018). Several studies have published the formation of condensates through PFK-1 in diverse organisms (C. elegans, Yeast, human cancer cells) in response to hypoxia or in some cancer lines also without hypoxia (Jin, 2017, Jang, 2016, Kohnhorst 2017, etc.). A yeast study showed that PFK-1 condensates contain various other glycolytic enzymes and that condensate formation enhances glycolytic rates (Jin, 2017).

      This study gives the advance of analyzing the dynamics of PFK-1 condensate formation in vivo in the context of a live animal using a microfluidic-hydrogel device and showing that PFK-1 relocalizes to reversible condensates within minutes of hypoxia. If further appropriate experiments (as mentioned above) are performed, this study would strongly suggest that the underlying process of PFK-1 condensate formation is liquid-liquid phase separation. Ideally, if at all feasible, it would be strengthened if there was some insight into the functional consequences of the condensed assemblies formed in hypoxia. These findings may be interesting to researchers working on glycolysis and metabolism in different cells but particularly in neurons.

      Field of expertise

      -Phase separation, microscopy, in vitro reconstitution

      -no experience with C. elegans biology and do not have a practical handle on ease or difficulties of genetic manipulation of C. elegans or metabolic assays for PFK-1

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      In this manuscript, the authors focus on the subcellular localization of the key glycolytic enzyme PFK-1.1 in C. elegans, initially in whole animals through GFP tagging of the endogenous locus and subsequently in single cells/tissues using a clever genome editing strategy that permitted tissue-specific expression of GFP-tagged PFK-1.1 from its endogenous locus. They observe that PFK-1.1 localization differs from cell-type to cell-type and can be dynamically reorganized in response to exogenous cues. Focusing on hypoxia, they observe that PFK-1.1 forms foci near synapses in neurons under this stress condition. These foci are not stress granules and they are dissolved upon re-oxygenation. These condensates have properties of liquid droplets and can mature (harden) over time. PFK-1.1 fused to the CRY domain can trigger condensate formation under normoxic conditions, which can co-recruit WT PFK-1.1 as well as aldolase.

      **Major comments:**

      The conclusions are convincing but the impact could be increased if the authors were able to demonstrate the physiological role that the observed phase separation plays in this stress response. Would it be possible to assess glycolytic flux under hypoxia vs normoxia?

      It is currently not possible to assess glycolytic flux in vivo in our system, as we lack metabolic sensors (an active area of work we are trying to address, but will take several years to perform correctly). We have added discussion of new in vitro studies examining the consequences of metabolic flux due to glycolytic compartmentation into liquid droplets (Ura et al., 2020), and the significance of those findings in the context of our in vivo studies (lines 444-464; 484-492).

      The authors should comment on viability during the hypoxia time course.

      C. elegans can survive anoxic condition for a day (Powell-Coffman, 2010). Our hypoxic conditions last minutes, and we can rescue live C. elegans upon completion of the assays. We now include a description of this in the Methods (lines 1216-1218).

      The co-clustering of ALDO-1 and PFK-1.1::mCh::CRY2 in Figure 7 should be properly quantified/statistically analyzed

      We quantified the fraction of animals that displays ALDO-1 clustering in PFK-1.1::mCh::CRY2 co-expressing animals, as suggested (Figure S7C).

      A control of mCh::CRY2 + ALDO-1::EGFP is missing from the experiments shown in Figure 7. Is the presence of mCh::CRY2 sufficient to drive ALDO-1::EGFP clustering?

      As a control for the CRY2 tag promoting the formation of glycolytic condensates, we had co-expressed mCh::CRY2 with PFK-1.1::EGFP, which is insufficient to cause the formation of the condensate (Figure 7C). We have now added a new data where we show that in pfk-1.1 deletion mutants, ALDO-1 condensate formation is suppressed, which further demonstrates the dependency between PFK-1.1 and ALDO-1 (Figures 7H and S7C).

      Does hypoxia trigger co-clustering of ALDO-1 and PFK-1.1?

      To answer this question, we examined the dynamic formation of ALDO-1 and PFK-1.1 condensates by co-expressing the two proteins together and observed that hypoxia triggers their co-clustering. We now include this in Figure 7E and Video 8.

      The authors speculate that hypoxia acts via diminished energy (altered ATP AMP ratios). Can this be measured? To support this hypothesis, the authors may wish to test if similar phase separation is triggered by mitochondrial poisons.

      We currently lack sensors that can reliably measure, in vivo, the subcellular changes in energy or metabolic flux in C. elegans neurons. However, we previously did test mitochondrial mutants and observed that in those mutants we observe glycolytic condensates (Jang et al., 2016), supporting the idea that defects in energy production promotes the formation of glycolytic condensates.

      **Minor comments:** Is 21% O2 not hyperoxic for worms?

      While C. elegans are known to prefer lower percentage of oxygen than those in air, in the lab animals are reared in normal air. We therefore used 21% oxygen present in air as our normoxic conditions.

      Can the authors speculate more on how do these condensates exhibit "memory" (how they're able to cluster in the same place repeatedly)? Is there any role for the cytoskeleton in mediating nucleation and/or condensation of PFK and glycolytic enzymes?

      When we were testing the reversibility of PFK-1.1 condensates, we were not expecting the reappearance of PFK-1.1 condensates in the same place repeatedly. Our current speculation is that, because many glycolytic enzymes, such as PFK-1.1, are allosterically regulated by nucleotides, AMP/ATP ratio may play a role on where glycolytic condensates appear. In other words, the specific synaptic areas, where PFK-1.1 condensate repeatedly reappeared, may have different AMP/ATP ratio that may trigger the condensation of the glycolytic proteins in those locationsupon conformational changes in PFK-1. We can’t exclude, currently, the presence of nucleating factors at synapses that facilitate PFK-1 clustering, but we have not yet identified them. We now include a discussion of this (lines 494-519).

      Do the authors think that these clusters are effectively G-bodies from yeast?

      G-bodies from yeast also shows glycolytic proteins changing from its diffuse localization to punctate localization in response to hypoxia (Jin et al., 2017). G-bodies, like C. elegans glycolytic condensates, are forms of subcellular glycolytic organization within eukaryotic cells. Yet, G-bodies take 24 hours to form, while we observe the glycolytic clusters in C. elegans within minutes of hypoxic conditions. We will need to understand the composition and function of both to determine if these forms of glycolytic subcellular organization represent the same structure. We note that glycolytic clusters have also been observed in some human cancer cell lines (Kohnhorst et al., 2017). Observation of glycolytic compartments in multiple different species and cell types suggest that, although the regulation, composition and formation kinetics of the glycolytic condensates may differ, compartmentalization of glycolytic enzymes may be a conserved feature. We now add a sentence discussing this (line 535-537).

      Reviewer #3 (Significance (Required)):

      It is much appreciated that this study tackles the cell biology of signaling and metabolism, which is a fascinating but difficult to study aspect of molecular biology. This work conclusively documents the dynamic reorganization of metabolic enzymes in vivo in response to physiological stimuli. Such reorganization had been proposed previously but was controversial and difficult to study in a controlled way. This work not only confirms previous observations but further demonstrates that the dynamic reorganization is mediated by a liquid-liquid phase separation. What is lacking is a demonstration that this phase separation is physiologically important. Such observations would generate interest from a much broader audience; the present audience presently targeting people specifically interested in non-membrane organelles per se. The reviewer has expertise in cell signalling and its regulation by phase separation.

      As we explain for Reviewer 1, we focus this study on the biophysical characterization of the condensates, and how that results in compartmentalized enrichment of glycolytic proteins. Examination of the functional significance of the phase separation to the enzymatic reactions in vivo is not currently possible because we lack probes we can use in vivo to measure the metabolites resulting from the reaction. We have now added discussion acknowledging this and framing its significance in the context of what has been published in the field (lines 484-492). For example, a recent manuscript in ChemRxiv demonstrated, in vitro, that the enzymatic activity of glycolytic proteins, hexokinase and glucose-6 phosphate dehydrogenase, promote these enzymes condensing into liquid droplets. The authors further found that the condensation accelerated the glycolytic reactions (Ura et al., 2020). This raises the question whether glycolytic proteins compartmentalize, and form condensates, in vivo, which we address in this manuscript. We capture this point in (lines 444-464) where we explain that, while it has long been hypothesized that glycolytic proteins like PFK-1 could be compartmentalized, this remained controversial due to lack of dynamic in vivo imaging. In our study, and through a systematic examination of endogenous PFK-1.1 via the use of a hybrid microfluidic-hydrogel device, we conclusively determine that PFK-1.1 indeed displays distinct patterns of subcellular localization in specific tissues in vivo.

      **REFEREES CROSS-COMMENTING** Globally it seems that all reviewers feel that impact would be increased if the physiological consequence of PFK-1.1 condensates was examined. Other, specific comments seem fair.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, the authors focus on the subcellular localization of the key glycolytic enzyme PFK-1.1 in C. elegans, initially in whole animals through GFP tagging of the endogenous locus and subsequently in single cells/tissues using a clever genome editing strategy that permitted tissue-specific expression of GFP-tagged PFK-1.1 from its endogenous locus. They observe that PFK-1.1 localization differs from cell-type to cell-type and can be dynamically reorganized in response to exogenous cues. Focusing on hypoxia, they observe that PFK-1.1 forms foci near synapses in neurons under this stress condition. These foci are not stress granules and they are dissolved upon re-oxygenation. These condensates have properties of liquid droplets and can mature (harden) over time. PFK-1.1 fused to the CRY domain can trigger condensate formation under normoxic conditions, which can co-recruit WT PFK-1.1 as well as aldolase.

      Major comments:

      The conclusions are convincing but the impact could be increased if the authors were able to demonstrate the physiological role that the observed phase separation plays in this stress response. Would it be possible to assess glycolytic flux under hypoxia vs normoxia?

      The authors should comment on viability during the hypoxia time course.

      The co-clustering of ALDO-1 and PFK-1.1::mCh::CRY2 in Figure 7 should be properly quantified/statistically analyzed

      A control of mCh::CRY2 + ALDO-1::EGFP is missing from the experiments shown in Figure 7. Is the presence of mCh::CRY2 sufficient to drive ALDO-1::EGFP clustering?

      Does hypoxia trigger co-clustering of ALDO-1 and PFK-1.1?

      The authors speculate that hypoxia acts via diminished energy (altered ATP AMP ratios). Can this be measured? To support this hypothesis, the authors may wish to test if similar phase separation is triggered by mitochondrial poisons.

      Minor comments: Is 21% O2 not hyperoxic for worms? Can the authors speculate more on how do these condensates exhibit "memory" (how they're able to cluster in the same place repeatedly)? Is there any role for the cytoskeleton in mediating nucleation and/or condensation of PFK and glycolytic enzymes? Do the authors think that these clusters are effectively G-bodies from yeast?

      Significance

      It is much appreciated that this study tackles the cell biology of signalling and metabolism, which is a fascinating but difficult to study aspect of molecular biology. This work conclusively documents the dynamic reorganization of metabolic enzymes in vivo in response to physiological stimuli. Such reorganization had been proposed previously but was controversial and difficult to study in a controlled way. This work not only confirms previous observations but further demonstrates that the dynamic reorganization is mediated by a liquid-liquid phase separation. What is lacking is a demonstration that this phase separation is physiologically important. Such observations would generate interest from a much broader audience; the present audience presently targeting people specifically interested in non-membrane organelles per se. The reviewer has expertise in cell signalling and its regulation by phase separation.

      REFEREES CROSS-COMMENTING Globally it seems that all reviewers feel that impact would be increased if the physiological consequence of PFK-1.1 condensates was examined. Other, specific comments seem fair.