6,070 Matching Annotations
  1. Last 7 days
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Reply to the Reviewers

      I thank the Referees for their...

      Referee #1

      1. The authors should provide more information when...

      Responses + The typical domed appearance of a hydrocephalus-harboring skull is apparent as early as P4, as shown in a new side-by-side comparison of pups at that age (Fig. 1A). + Though this is not stated in the MS 2. Figure 6: Why has only...

      Response: We expanded the comparison

      Minor comments:

      1. The text contains several...

      Response: We added...

      Referee #2

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Dr. Ancheta et al. designed several parameters to assess different velocity algorithms, including local consistency, method agreement, overlap of derived genes, and robustness to sequencing depth. Generally, this helps scientists understand the performance of each software. However, I don't think this is enough to help scientists judge which one is better. The biggest problem in the manuscript is the lack of a ground truth. I suggest the authors choose tissues with reliable ground truth, such as spermiogenesis, which has a single lineage direction. You would see a clear streamline from pachytene spermatocytes to sperm, or embryonic cells with different culture days, where the direction should go from earlier to later stages. I strongly suggest the authors use clear lineage samples like spermiogenesis. After that, I think it will be a very helpful paper for scientists.

      1. Local consistency is a useful parameter that helps scientists determine which one has more uniform directions.

      2. Method agreement is problematic because I don't know which one is ground truth. If you cannot get ground truth, you could use the average direction or angle as the ground truth to see which one is significantly biased from the averages.

      3. The overlap of the driver genes also lacks ground truth. Fig4C is good; we could use the overlap of all methods' genes as ground truth to see which one is too biased. I suggest you perform a GO term analysis to see the driver gene distribution and count how many genes are related to the expected GO term. That would also provide evidence to support the ground truth.

      4. Robustness to sequencing depth is a good parameter. No comment.

      5. The discussion could include more about other methods, such as nascent RNA single-cell sequencing methods and full-length single-cell sequencing methods, which improve the estimation of alpha, beta, and gamma. This could help delve deeper into enhancing the velocity program.

      Significance

      The biggest problem in the manuscript is the lack of a ground truth. I suggest the authors choose tissues with reliable ground truth, such as spermiogenesis, which has a single lineage direction. You would see a clear streamline from pachytene spermatocytes to sperm, or embryonic cells with different culture days, where the direction should go from earlier to later stages. I strongly suggest the authors use clear lineage samples like spermiogenesis. After that, I think it will be a very helpful paper for scientists.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors present a comparative statistical analysis of five RNA velocity methods using two datasets and a single performance metric. Using the selected statistical metric, they describe the variable performance of RNA velocity methods, their variable robustness across different cell states, and the discrepancy of sets of identified lineage-specific driver genes.

      At this point, the scientific community extensively documented limitations and lack of stability of RNA velocity performance across methods and datasets. In that context, the manuscript lacks clear theoretical and practical conclusions that would be beneficial to the scientific community.

      The choice to focus on only a subset of RNA velocity methods is not discussed. Recent and important extensions, such as VeloVAE, VeloVI, LatentVelo, and Pyro-Velocity, are omitted, which limits the generality of the analysis. The statistical properties of the chosen consistency metric are not explored. The authors do not provide a justification for why this metric is appropriate for comparative analysis. Additionally, the authors do not present how the consistency score can be utilized to evaluate RNA velocity performance on user datasets. Overall, a discussion of the pressing issue of choosing statistical metrics to interpret RNA velocity results is lacking. The pros and cons of different RNA velocity methods, especially in light of the various statistical metrics, are not discussed. The manuscript does not present conclusions from the sampling analysis of sequencing depth. For instance, formalizing these findings with code that users can employ for their datasets would enhance the manuscript's practical utility. Overall, the manuscript would benefit from a thorough benchmark of the methodological approaches in the RNA velocity field, and testing various methods to evaluate RNA veloicty performance.

      Significance

      At this point, the scientific community extensively documented limitations and lack of stability of RNA velocity performance across methods and datasets. In that context, the manuscript lacks clear theoretical and practical conclusions that would be beneficial to the scientific community.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors present a survey of RNA velocity methods, evaluate them on a variety of model datasets, and introduce metrics to determine the local consistency of each individual method. They investigate differences between the methods with a separate metric that identifies consistency across methods, and using this, comment on applicability of each method to novel datasets. The effect of these differences on a downstream driver-gene identification task is also evaluated, and further conclusions are drawn related to this, particularly related to variability as a function of sequencing depth.

      Major comments:

      • There are a few changes that could be made to improve future applicability. One assumption seems to be that consistency between methods will indicate the most likely trajectory, however without a number of ground truth trajectories it seems that this is difficult to justify. In fact, there are probably good reasons why this would not be the case and that some methods might well be expected to underperform in certain cases. A deep comparison of the mechanics of the methods, or validating a robust set of novel ground truth trajectories, is probably beyond the scope of this paper, but it would be good to make some reference to the fact that these methods do differ in ways that might lead one to expect that some outperform others for good reasons.
      • Related to this, it's still tricky to identify a clear path between the underlying approaches of the methods, the empirical observations in Fig 6, and how a reader coming cold to the field could match these aspects to the aims for the analysis of their own dataset. Though, I think that this might well be solved by rephrasing Fig 6 to include statements on the dataset itself - e.g., if you have large transcriptional diversity and a smaller dataset, then probably you would disfavour DeepVelo.

      • One other suggestion relates to the uncertainties in the trajectories. This is touched upon in the case of the 'DeepVelo' method, where, for example, it's mentioned that these have a large number of parameters and could be prone to overfitting. However, the paper doesn't go as far to suggest that this could result in the trajectories having a much higher variance, which is is potentially evident in the sequencing depth study. This is perhaps a confounding issue in the comparisons in, for example, Fig 1a, where it seems that there is much more dynamical structure in the DeepVelo plot, but in reality this may be due to a higher degree of variance in the predictions. In this case, it may be that, in fact, all of the trajectories are perfectly consistent between the methods, within their uncertainties, despite the 'mean' values displayed in the plot looking quite different.

      • Likely a full accounting of the uncertainties on all of the outputs of these models is also beyond the scope of the paper, however some indication of what the variance of these trajectories looks like (or the average value of this across the UMAMP plot, etc), although optional, would also be another valuable point of comparison between the methods. Bootstrap resampling of the data and re-running the methods, for example, would likely give a good indication of the consequences of the behaviour seen in the sequencing depth studies.

      Minor comments on specific sections:

      Intro:

      It would be good to define exactly what you mean by 'state' with respect to the expression (or expression programs, etc) up front here.

      • lineages between states' sounds a little awkward to me, I would suggest something like 'state lineages' instead.
      • During cellular transitions' - suggest removing and starting with 'scRNASeq data...'
      • Also would be good to define concretely what you mean by 'trajectory' with respect to expression
      • inconsistent or incorrect directionalities' - suggest 'inconsistent or incorrect trajectories'?

      Perhaps move 'RNA velocity has been applied...' after the description of RNA velocity

      • Given these limitations...' - it would perhaps be nice to give some vignettes of how the methods differ before discussing their limitations
      • As the mRNA matures...' - perhaps mention the key piece of information is the splicing out of introns, and this permits identification of the mature and immature mRNA (otherwise it seems a bit vague), and/or define splicing in the text
      • The method yields...' - redundant?
      • ...linear differential equations with constant slope...' - unclear what 'constant slope' means here
      • steady state solution' - not clear whether 'steady state' here is synonymous with 'equilibrium', so it would be good to define (e.g., whether this means alpha = gamma, or whether this is a statement on beta, etc).
      • ..the directionality in the cell-cell graph,...' - not clear how this becomes a graph, so it would be good to expand on how this is obtained
      • the results depend heavily on chosen hyperparameters' - not clear that the other methods don't have this issue. I would guess that this is a matter of degree, but it would be good to indicate whether there are specific reasons why some models are expected to be less sensitive by construction.
      • treating them as probabilistic events and resulting in a Markov process' -> 'treating them as probabilistic events in a Markov process
      • a dynamic model (scv-Dyn) to address many of the original issues' - not clear from this description how the methods fixes these issues. Perhaps also talk a bit more about this 'latent time' parameter, if it's useful to contextualise the results later.
      • deriving gene-specific splicing parameters in a single step,' - not clear what doing this in a 'single step' is, or why it might be advantageous.
      • graph convolution' -> graph convolutional
      • ...but also its neighbors...' - neighbours in what space?
      • the mouse pancreas, a well studied lineage' - phrasing sounds a bit weird, maybe 'with a well studied lineage'?
      • RNA velocity streams' -> trajectories? (if defined before)
      • Therefore, a general benchmark that compares RNA velocity methods...' It might be nice here also to explain what exactly one might expect to be the ground truth in this case, as it's not particularly clear that something like VeloCyto that apparently predicts basically nothing is a 'bad' result compared to something like scv-Sto which predicts much more complicated dynamics where there may not be any.
      • Disparate or contradictory results from various RNA velocity methods undermine our confidence in the predicted trajectories.' -

      As mentioned before, I think it is important here to add that these methods were each individually developed for a specific purpose, and likely with an aim to improve upon previous techniques in the literature. So I think this kind of statement would have to be motivated a bit better, if claiming this without specific reference to the designs of each of the techniques on their own merit. For example, one could imagine in future an 'oracle' method that somehow predicts all trajectories with 100% accuracy, but its results are rejected because they don't align with the earlier and more primitive methods in the field. Equivalently, it could be such that one model is designed specifically for a specific type of dataset, or in the low data size regime, so a comparison using other datasets is more unfair (I don't think that is necessarily the case here, but without surveying the rest of the literature, a reader would not know that).

      Results:

      • 30 nearest neighbors' - With a fixed number of neighbours rather than a fixed similarity, it seems like you might end up getting results that are hard to compare if this is calculated in a sparse region (or the datasets have significantly different sizes)?
      • Cell types from well-defined lineages...', '...those with more complex cellular heterogeneity...' - it would be good to indicate how these are considered 'well defined' and what 'more complex cellular heterogeneity' refers to (whether this is just from the UMAP, or whether these are statements that include prior biological assumptions that are used to evaluate the method).
      • ...their differentiation process is more complex..' ... '..lineages with complex diversity...' - is this a statement based purely on the variation in expression? If so, it would be good to be clear here.
      • ...the landscape's smoothness varies depending on cell type.' - it's been a while since you mentioned 'landscape', so it would be good to remind the reader that this is the distribution of expression in your high dimensional space.
      • correlated with cell diversity' -> cell type diversity? Although 'notochord, endoderm and hindbrain' have already been indicated as 'cell types' here, so this is a little confusing
      • ...can indicate overtraining or over smoothing...' - this seems a little contradictory, where I would assume that overtraining would result in higher variance, and oversmoothing would give the opposite effect?
      • Altogether, the variation in agreement...' - it would be nice to go a little further here and make specific recommendations, even if it's something very vague, as there will be cases where there are no clear biological clues.

      Downstream:

      • ..overlap in macrostates...' - even if macrostates is a term defined in CellRank, it would be good to redefine it here for the reader
      • ..scv-Dyn and UniTVelo both utilize a shared latent time variable..' - is it possible to give any indication why this might lead to a difference? Or even which might be more plausible?
      • RNA Velocity' -> RNA velocity

      Robustness to sequencing depth:

      • DeepVelo, scv-Dyn, and UniTVelo maintained low levels of correlation with the magnitudes from the full reads... - this seems like quite a startling effect. It seems like this indicates the models are really quite unstable, if the removal of 2% of the dataset gives such a considerable difference.

      Discussion:

      • Our research emphasizes the importance of implementing a method that best fits the dataset...' - You don't indicate any goodness of fit metrics prior to this, so it's not really clear what this means in practice.

      • Because the pancreas dataset is often used as a benchmark dataset for RNA velocity methods...' - presumably also this could mean that methods are developed to overfit to this dataset?

      • Capturing the full splicing dynamics..' - not clear what 'full splicing dynamics' means here

      Fig 1c. (and elsewhere):

      Often it's hard to see the trajectory arrows in the rasterised plots. If this isn't just an issue with the review PDF, in Matplotlib it's possible to rasterise the points without rasterising the annotations and axes (https://matplotlib.org/stable/gallery/misc/rasterization_demo.html), which makes things a lot easier to read, but also doesn't leave you with giant file sizes.

      Fig 2a.: The equation is a bit confusing, as k seems to be the size of the set of neighbours and the set itself, where in the text k is only ever the number of neighbours. Perhaps it would be better to have something like a set K of neighbours k (where k ∈ K), and then the sum is normalised by |K| and the sum is over k (or alternatively have k = |K| to be consistent with the text).

      Fig 2d. (and others) A label on the z axis would be good here

      Significance

      This is a commendable effort, and will be of use to practitioners navigating the properties of current and future RNA velocity methods, particularly those without a background in the more advanced mathematical formulations of the newer methods. It also is the first time, to my knowledge, that a systematic comparison has been performed using a number of real datasets and with a novel metric.

      However, the paper does not go as far as to link specific approaches or assumptions within the methods directly to their empirical observations, which could limit the applicability of the conclusions to only datasets and methods similar to those studied.

  2. Nov 2024
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2024-02640

      Corresponding author(s): Purusharth I, Rajyaguru; Stephan Vagner

      1. General Statements

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      In the manuscript titled, "RGG motif-containing Scd6/LSM14A proteins regulate the translation of specific mRNAs in response to hydroxyurea-induced genotoxic stress" we elucidate a conserved role of an RNA-binding protein with low-complexity sequences (RGG-motifs) in genotoxic stress response. This work uncovers HU-stress mediated translation regulation of SRS2, Ligase IV and RTEL1 transcripts by Scd6 (yeast)/LSM14 (human). It further identifies RNP condensates and arginine methylation as sites and means of this regulation.

      We heartily thank all three reviewers for their overall encouraging comments about the significance of this manuscript. Specifically, we appreciate their view that the manuscript provides new functional insights into the role of RGG-motif-containing RNA-binding protein in genotoxic stress response. They further agree that such knowledge will impact and interest the general audience of RNA biology and stress biology.

      We have carefully noted all the comments raised by three reviewers. We have addressed almost all the comments, including several by performing new experiments. The new results and their analysis have helped us improve the manuscript, allowing us to provide a stronger mechanistic and functional insight underlying the findings presented in this work. We thank the reviewers for their insightful comments. Below, we provide a point-by-point response to each of the comments.

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      Reviewer 3

      Major Comment 4: Page 7, top: '...indicating that Scd6 regulated the expression of SRS2 in a HU-dependent manner.' In my opinion, the results so far suggest that Scd6 and SRS2 are somehow functionally connected during HU-treatment. To substantiate the statement of the authors, they should provide a Western blot showing that the levels of SRS2 change upon Scd6 KO or OE during HU-treatment. This will also substantiate the results shown in Figs 2G-H.

      Response: We thank the reviewer for this comment. Detecting Srs2 protein has been technically challenging. The SRS2 construct used in this study is untagged. Unfortunately, the commercial SRS2 antibody has been discontinued. We requested several groups who have used SRS2 antibody in their past studies but they have either closed down their labs or are unable to find an aliquot to share. We have tried tagging SRS2 with 6xHis/1XFLAG/3xFLAG tags at N and C-terminal, but unfortunately, the protein was undetectable in the Western blot analysis using either of the tag-specific antibodies. We have also tried western blot analysis using SRS2-GFP strain, but the protein does not get detected by anti-GFP antibody, probably because of very low expression.

      Since we will not be able to provide western blots for Srs2 protein levels due to technical challenges, we shall provide western blots for RTEL1 (human homolog of Srs2) protein levels upon Lsm14A knockdown in the presence and absence of HU. This will validate the polysome data we have of RTEL1 regulation by LSM14A, and would, by extension, substantiate the SRS2 polysome data.

      Major Comment 5: Figs 3: How are the localization of Scd6 protein and SRS2 mRNA to granules, and the levels of Srs2 protein, in cells exposed to HU after deletion of Hmt1? This would substantiate a role of Hmt1 in vivo.

      Response: We will provide the data for Scd6 protein localization and SRS2 mRNA localization in granule enriched fraction upon HU treatment in Δhmt1 background. This experiment is ongoing.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Reviewer 1

      Major Comment 1: Fig. 1 F/G: were the delta RGG and LSM variants expressed at an equivalent level to the WT protein in these experiments?

      Response: We thank the reviewer for this comment. We have quantified the total fluorescence intensity of GFP from the existing microscopy images for WT and domain deletion mutants for both Scd6 and Sbp1 (Now Figure 3A and 3D). This result (added as a new figure panel Fig 3C and 3F) indicates that the levels of Scd6∆RGG mutant is more whereas Scd6∆Lsm protein levels are comparable than WT. Similarly, Sbp1∆RGG mutant expression is comparable to WT in the given experimental conditions.

      Major Comment 2: Fig. 3G: The 6 data points for the delta LSM variant are literally spread evenly up and down the graph, making these data appear highly questionable as to whether one can draw a definitive conclusion from them.

      Response: We agree with the reviewer that the data points are varied. To address the scatter in data, we have performed additional experiments and added those to the existing results. Even though there is a spread in the points, except for one data point, all others show an increase in methylation of LSM domain deletion mutant compared to WT, which is statistically significant. The old blot and graph (Old Figure 3F and 3G) have now been replaced with new ones (Figure 5F and 5G) which look more convincing. The result and conclusion derived from it remain unchanged.

      Minor Comments

      Comment 1: Abstract: the acronym NHEJ likely will need to be defined for the general reader.

      Response: The acronym has been expanded in the abstract and explained in the introduction.

      Comment 2: Introduction, first paragraph: change gene expression to 'transcription' in the phrase 'Even if the contribution of gene expression to GSR..' as I assume this is what is meant here. Gene expression consists of synthesis, processing, translation and decay.

      Response: The required change has been made.

      Comment 3: Pg. 3 Introduction: Since they are liquid-liquid phase condensates and ribonucleoproteins (RNPs) refer to any protein-RNA interaction, I think that referring to PBs and SGs as mRNPs is a bit misleading (especially the 'major mRNPs').

      Response: The statement has been rewritten.

      Comment 4: Introduction: are PBs truly 'sites' of mRNA decay as stated? There are papers in the literature that would argue otherwise.

      Response: The statement has been modified with more citations.

      Comment 5: Pg. 3, three lines from bottom. Change LSM14 to LSM14A

      Response: The addition has been done.

      Comment 6: Pg. 4 top - What is an 'LCS' - containing protein? The acronym has not been defined

      Response: The acronym has been defined now. We have also defined acronyms wherever they were missing.

      Comment 7: Fig. S1 - there are a lot of important data in this figure that demonstrate the coordinated movement of Scd6 and Sbp1 to granules. They should be moved into the main body of the manuscript in my opinion. Likewise, a whole section of the Results is dedicated to Fig. S2 - thus I would suggest moving these data into the main body of the manuscript to assist the reader.

      Response: We thank the reviewer for pointing this out. Figure S1 has now been added to the main body of the manuscript as Figure 2. Figure S2 has now been added to Figure 1 and new Figure 3. This rearrangement has improved the flow of the manuscript.

      Comment 8: Fig. 1F should be flipped in the figure with panel G since G is discussed in the results section before F

      Response: Figure 1F and 1G are now Figure 3A and 3D and in the same order as mentioned in the text.

      Comment 9: Be sure to define all acronyms for the reader.

      Response: All acronyms in the manuscript have been defined wherever applicable.

      Comment 11: Fig. 3H/I: It might be optimal to calculate and compare Kd's for the methylated and unmethylated variants. Also, the labels at the top of 3H do not line up with the wells of the EMSA gel.

      Response: We have calculated the Kd’s for the EMSA, and it has been added to the results section. We have also aligned the labels at the top of the EMSA gel (now Figure 5I) to match with the wells.

      Reviewer 2

      Major Comment 1: Fig. 2A, B. While there seems to be an effect on the lag phase, it could be revealing if the authors pls. calculate the doubling times for the strains and treatments (taking through the exponential growth phase). Furthermore, it would be good if the authors can show the rescue of phenotypes for deletion strains (ie. reintroduction of respective gene on ARS-CEN based plasmids or (if not available) with the OE plasmids.

      Response: We thank the reviewer for this remark. We have calculated the doubling times for the strains in the tested conditions and added in the text. We have analyzed the effect of complementing the deletion strains with the respective genes on the CEN plasmid. We observe that Δscd6 shows tolerance to HU stress as previously seen, which gets rescued almost completely upon complementation with WT SCD6. This result has been included in the manuscript as a new figure panel (Figure S1A) . Δsbp1 also shows marginal tolerance to HU stress, but complementation with WT SBP1 only slightly rescues the phenotype, which is not statistically significant (Figure S1B). This result highlights a more important role of Scd6 as compared to Sbp1 in genotoxic stress response.

      Major Comment 2 (part 1): Fig. 3H. The authors tested the 5'UTR of SRS2 for interaction with recombinant Scd6. Firstly, it is unclear why the authors have chosen the 5'UTR for investigation? Can the authors explain.

      Response: We thank the reviewer for this important comment. During experimentation and analysis, we assayed Scd6 binding to two different fragments of SRS2 mRNA: 5’ and 3’UTR of same lengths (200 bases). We used the UTR fragments because there are numerous reports indicating the role of UTRs in the regulation by RNA binding proteins (https://doi.org/10.1093/bfgp/els056, https://doi.org/10.1126/science.aad9868, https://doi.org/10.1093/jxb/erae073). RNA EMSAs with purified Scd6 and in vitro transcribed UTR RNA fragments revealed a significantly better binding of Scd6 with the 5’ UTR fragment of SRS2 mRNA compared to the 3’ UTR. Therefore, we proceeded with the 5’ UTR fragment for further analysis. We have now added this as a supplementary figure panel and explanation in the manuscript text (Figure S2B).

      Major Comment 2 (part 2): Secondly, the affinities are relatively low (µM), and the gel shift assay lacks a negative control. The authors should test an unrelated RNA fragment of approximately the same size to control for specificity (negative control). It is unclear whether the protein could interact with any RNA fragment through a charged RNA backbone.

      Response: Our in vivo data suggests that the binding of Scd6 with SRS2 mRNA is condition and RNA-specific and is regulated by methylation (now Figure 5C, S2A and 5E). As the reviewer mentioned, Scd6, in principle, could bind to any RNA molecule given the affinity of an RNA-binding protein (with positively charged amino acids such as arginine) to RNA molecule. Nevertheless, the significant difference in the binding of Scd6 to the 5’UTR and 3’UTR fragments itself acts as a relative control for EMSA. The aim of the in vitro experiment (EMSA) was to establish the difference, if any, in the binding affinities of unmethylated vs methylated Scd6, like the in vivo data, where we observe significantly increased binding to SRS2 mRNA upon decreased Scd6 methylation.

      Major Comment 2 (part 3): Thirdly, it would be good if the authors could show a Coomassie gel for the recombinant protein used in those assays.

      Response: The Coomassie gel which was provided as part the supplementary data (now Figure S2C), have now been added as another gel image to the main figure (Figure 5H), next to the EMSA, for better clarity.

      Major Comment 3: Methods and Materials: The Materials and Methods section lacks important information and requires further details to evaluate the study (see below 11 – 17)

      Response: The comment has been duly noted.

      Minor Comments

      Results:

      Comment 4: The numbering of Figure S1, S2 is confused in the first part of the results section. The authors should check numbering. In general, numbering should follow in the order of the text - pls. check.

      Response: Based on the comment#7 by Reviewer 1, Figure S1 and S2 have now been added to the main figure, and the changes in the text have been made accordingly.

      Comment 5: Pg. 5. CHX treatment leads to a decrease in Scd6-GFP and SBP-1 GFP granules. Essentially, CHX blocks translation elongation so the result indicates that puncta depend on active translation. The authors may want to add this liaising point towards the claim that mRNAs could be present in those puncta. How this results integrates with data shown in Fig. S5B*.

      *

      Response: We thank the reviewer for this comment. Since granules are dynamic structures that depend on active translation, CHX treatment leads to the dissociation of Scd6 and Sbp1 granules. This indicate that most of the mRNAs present in these granules could be recycled for translation in polysomes. This strategy has been used in multiple research articles for similar deductions (10.1091/mbc.E08-05-0499, https://doi.org/10.1083/jcb.151.6.1257, https://doi.org/10.1093/nar/gku582). We have now modified the text in the manuscript to accommodate this point. It has been previously reported that core components of stress granules, once formed are stable and resistant to RNase, EDTA and NaCl treatment ex vivo (https://doi.org/10.1016/j.cell.2015.12.038), even when these structures have RNA. Figure S5B (now S3C) indicates that the granule enriched fraction derived from untreated and treated cells indeed behaves like stress granule cores and not protein aggregates allowing us to proceed with downstream experiments.

      Comment 6: Fig. 2H. It would be helpful to the reader, if the authors could mark the respective fraction in the polysomes taken for analysis of relative enrichments. How was this relative enrichment was calculated needs further description.

      Response: The modification has been made (now Figure 4G) and added to the methods and materials.

      Comment 7: Fig. S5B. 1% SDS treatment cause absence for Scd6 signal from the pellet fraction. Based on this result, I am not clear how based on this result they can claim for presence of higher order mRNA-protein complexes? Why does it exclude the possibility for Scd6 aggregates accumulating in the pellet? The authors need to explain/ modify this statement. Related to earlier findings that showed dependency of puncta upon CHX treatment, one wonders how this result matches to this earlier observation (ie.EDTA should dissassemble ribosomes)? Can the authors explain?

      Response: The very stable β-zipper interactions present in prion like domains, which leads to aggregation, is resistant to 1-2% SDS treatment (https://doi.org/10.1016/j.cell.2015.12.038). Hence, we think that solubilization upon 1% SDS treatment indicates that these are not aggregates. EDTA and NaCl are capable of disrupting interactions, which are stabilized mainly by electrostatic forces. Our observations (now Figure S3C) indicate that Scd6 could be part of the more stable mRNP condensate core structure and are therefore resistant to these treatments. Such observations have been previously reported, for example, stress granules in yeast are not affected by EDTA and NaCl treatments (https://doi.org/10.1016/j.cell.2015.12.038).

      Comment 8 (part 1): Fig. 5E, F. For the RNA-seq, the authors compared polysomes with free RNAs (up to 80S) and found enrichment of LIG4 and RTEL1. However, the polysomal profiling mainly shows a slight shift of those mRNAs in higher polysomes; while there is no difference compared to free fractions. How can this be explained?

      Response: We observed a shift from lower polysome fractions (11-12-13) (not from free fractions) to higher polysome fractions (14-15) indicating an increased number of ribosomes translating the RTEL1 mRNA.

      Comment 8 (part 2): On the line, the authors should indicate clearly what fractions were pooled for RNA seq analysis. It is also not clear how the authors quantified percentage of RNA in individual fractions (have they spiked-in an RNA?) - this needs to be stated in the M&M section.

      Response: We have now added the requested information in the Materials and Methods section. Fractions 13 to 17 were pooled for RNAseq analysis. The % of RNA in each fraction was calculated as described in Panda AC et al. Bio Protoc . 2017 Feb 5;7(3):e2126. doi: 10.21769/BioProtoc.2126

      Comment 9: At the end, if may be beneficial to the reader if the authors could provide a simple scheme depicting the model develop during this study.

      Response: We thank the reviewer for this comment. We have included a model derived from our study as a new figure (Figure 8).

      Comment 10: Supplemental Data set (.xls) The adjusted p-values are clustered and >0.05. Can the authors check and describe how those were calculated. How does it match with Volcano plots.

      Response: The adjusted p-values are indeed >0.05. The p-values (and not the adjusted p-values) are plotted in the Volcano plot (now Fig. 7E)

      Materials and Methods:

      Comment 11: A list of primers should be given with specification of their use.

      Response: The list has been added in the supplementary files (Table S3)

      Comment 12: The plasmids constructed for (over)expression of proteins/ production of recombinant proteins should be added. If published, references should be added accordingly.

      Response: The list has been added in the supplementary files (Table S4)

      Comment 13: RIP: the media for growing yeast cells should be added. Check also other section if defined.

      Response: The information has been added wherever required.

      Comment 14: RT-qPCR is not sufficiently described. RT kit needs specification, PCR reaction cycles should be given.

      Response: The information has been added

      Comment 15: Quantification of mRNA levels in polysomes is unclear. How was the distribution of mRNA profiles determined? Have the authors added some RNA spikes to fractions?

      See above.

      Response: The % of RNA in each fraction was calculated as described in Panda AC et al. Bio Protoc . 2017 Feb 5;7(3):e2126. doi: 10.21769/BioProtoc.2126. Details have now been added in the Mat and Meth section.

      Comment 16: The calculation for the enrichments in IPs is not described conclusively and should be added.

      Response: The calculation has now been elaborated and added to the methods and materials section.

      Comment 17: Polysomes fractionation (mammalian). It is indicated that the resultant supernatant was adjusted to 5M NaCl and 1 M MgCl2. This seems to be very high - is this a typo? OR why such high concentrations have been chosen?

      Response: The sentence has been removed. There is no need for such adjustment.

      Review 3

      Major Comment 2: Fig 2A-F: The effects of Scd6 and Sbp1 deletion upon HU-treatment are very small. A more convincing effect is observed upon over-expression of both SRS2 and SCD6. What is the effect of over-expression of SCD6 and SBP1 alone (i.e. without SRS2 over-expression)?

      Response: We thank the reviewer for this comment. The effects are indeed small but consistent and reproducible with two different kinds of assays (growth curve and plating assay, now Figure 4A-C). Overexpression of Scd6 or Sbp1 alone when expressed from a CEN/2u plasmid does not have any phenotype in the presence of HU (Figure S1A and S1B). Although, it has been previously reported that galactose-inducible Scd6 causes a severe growth defect (https://doi.org/10.1093/nar/gkw762), we performed spot assays with galactose inducible Scd6 and Sbp1 on control and HU plates, but did not see any difference in the extent of growth upon HU treatment. This data has now been presented as Figure S1C.

      Major Comment 3: Fig 2E: Why is there an opposite effect of deletion of Scd6 and Sbp1in the SRS2 over-expression background?

      Response: We thank the reviewer for this comment; however, we respectfully disagree with the idea that overexpression of SRS2 yields opposite phenotypes in SCD6 and SBP1 deletion backgrounds. Figure 2E (now Figure 4E) gives the impression that SRS2 overexpression in SBP1 deletion grows significantly more for two reasons. There was an increased spotting of Dsbp1 cells overexpressing SRS2 (row#6) as compared to Dscd6 cells overexpressing SRS2 (row#4), which is evident in the plate without HU (left panel). Additionally, there is also reduced spotting of wild-type cells overexpressing SRS2 (row#2) as compared to Dscd6 cells overexpressing SRS2 (row#4). We have now replaced these panels with another image with better loadings. Quantitation of five experiments (Figure S1F) indicates that Dsbp1 grows slightly better in both EV and SRS2 over-expression background, but the increase is not statistically significant. We interpret this data to suggest that SRS2 is not a direct target of Sbp1. Another protein perhaps performs the specific role of Sbp1 in assisting Scd6 in genotoxic stress response in Dsbp1 background.

      Major Comment 6: Fig 3C: Is the increased interaction of SRS2 mRNA with Scd6 due to increased levels of SRS2 mRNA upon HU treatment? See also comment below.

      Response: Based on RT-qPCR of total RNA, SRS2 mRNA levels do not seem to increase, which has now been added as a Supplementary figure (Figure S3D, left panel). Moreover, quantification of SRS2 mRNA from the FISH data also does not support an increase in mRNA levels (Figure 6D, left panel).

      Major Comment 7: Fig 4A: There seems to be an enrichment of SRS2 mRNA both in the granule-enriched pellet and in the supernatant upon HU treatment in the Scd6-GFP context, suggesting increased SRS2 mRNA levels altogether. The enrichment in granules upon HU is difficult to see, as one should measure the distribution of the mRNA in the pellet relative to the supernatant. Can the authors represent the ratio pellet/supernatant normalized to a control transcript? A similar calculation can be done for the protein normalized to a control protein.

      Response: As mentioned earlier, RT-qPCR data with SRS2 mRNA levels in total lysate has been added to supplementary data (Figure S3D, left panel). Based on RT-qPCR of total RNA, SRS2 mRNA levels do not seem to increase.

      The quantification of SRS2 mRNA and Scd6 protein enrichment is done such that the supernatant and pellet fractions are separately normalized to their respective controls (Scd6GFP, untreated sample) and therefore do not represent the mRNA distribution but relative mRNA enrichment. However, as per the recommendation by the reviewer, the data has been replotted as a ratio of supernatant and pellet with the addition of two more data points and has been added in the main figure (Figure 6E). The data concludes increased enrichment of SRS2 mRNA in granules upon HU treatment. The previous data has been included in the supplementary data as Supplementary figure (Figure S3D, right panel).

      Major Comment 8: Fig 4B: Increased juxtaposition of SRS2 mRNA and Scd6 granules upon HU treatment does not really mean increased colocalization. Granules are likely significantly apart such that increased interactions between the two partners are not explained by increased juxtaposition. Please, comment, tune-down and provide examples where increased granule juxtaposition is associated with increased interaction.

      Response: We believe that the usage of term ‘juxtaposition’ is leading to misinterpretation of the data. Therefore, we have replaced it with ‘percentage area overlap’ analysis to demonstrate that the SRS2 mRNA foci indeed overlap/localize with Scd6GFP foci up to an average of 43.5% in HU stress. This analysis has been added as an additional panel (Figure 6C), indicating that the SRS2 mRNA interacts with Scd6 in the granules. Even though the granules do not overlap/localize completely, the observed area of granule overlap (43.5%) is functionally effective as it leads to the physical interaction of Scd6 and SRS2 (Figure 6E & 5C) and, consequently, repression (Figure 4H). The FISH data, granule enrichment, and RNA immunoprecipitation data demonstrate Scd6 protein and SRS2 mRNA interaction in granules.

      Major Comment 9: Fig 4D: These results are in direct contradiction with those shown in Fig 1C.

      Response: We thank the reviewer for this comment. Figure 1C (now Figure 1B and 1C) demonstrates that Scd6 localization to puncta, when expressed from a CEN plasmid, significantly increases upon HU stress. The same trend is visible in Figure 4D (now Figure 6D) where Scd6 is expressed from a 2μ plasmid; however, it is not significant. The data in 1C and 4D (now 1C and 6D respectively) are rather inconsistent with each other than being contradictory. Nevertheless, we understand this reviewer’s concern and address it below.

      The initial localization experiments were performed using Scd6 expressed from CEN plasmid or genomically tagged Scd6. Since both these versions of Scd6 are not detectable using western blotting, we used Scd6 expressed from 2μ plasmid. Localization to condensates by liquid-liquid phase separation is a concentration-driven phenomenon. Therefore, when Scd6 is expressed from a 2μ plasmid amounting to increased protein levels, its localization to puncta increases even in the absence of stress, which is visible in the quantitation provided in the figure (Figure 6D) as compared to Figure 1C. We have now analyzed the percentage granular localization (granule intensity) of Scd6 (2µ), which significantly increases upon HU stress (Figure S3A). Thus although number of Scd6 granules does not increase upon HU stress when expressed from a 2µ plasmid, there is significant increase in localization of Scd6 to granule upon HU stress (Figure S3A).

      Major comment 10: Fig 5E: Can the authors provide a GO analysis of the up- and down- regulated transcripts?

      Response: We have now provided a GO analysis (Table S2). However, due to the low number of regulated genes, only a few GO terms with weak scores appeared in the analysis.

      Minor comments:

      Comment 11: Figures S1 and S2 seem to be swapped. Please make sure that Figures and panels are arranged in the order they are mentioned in the main text.

      Response: We thank the reviewer for pointing it out. Based on the comment#7 by Reviewer 1, Figure S1 and S2 have now been added to the main figure, and the changes in the text have been made accordingly. We have ensured that the order of figures matches the text.

      Comment 12: Page 5, sentence: 'our results argue for the role of Scd6 and Sbp1 in HU-mediated stress response'. I do not agree, as no functional assays showing that these proteins affect HU-mediated stress response have been provided at this point of the story. Please, delete.

      Response: We have removed the sentence from the existing paragraph.

      Comment 13: Page 6: The authors state 'Since Dscd6 and Dsbp1 showed tolerance to chronic HU exposure...'. Where is this shown?

      Response: The growth curve in Figure 2A and 2B (now Figure 4A and 4B) and the plating assay in Figure 2C (now Figure 4C) was done with hydroxyurea in the media/plate. Hence, we state that deletion of either SCD6 or SBP1 shows tolerance to chronic (or continuous) HU stress.

      Comment 14: Fig 2F: The rescue by SCD6 OE is not complete, as mentioned in the main text.

      Response: We have now included the quantification of the spot assay in 2F (now Figure 4F) to show that the rescue by SCD6 overexpression is complete (Fig S1G).

      Comment 15: Figure 2G-H: Please, indicate in the figure what the authors consider 'translated' and 'untranslated’ fractions.

      Response: The fractions have now been labelled to indicate the missing information in Figure 2G (now Figure 4G).

      4. Description of analyses that authors prefer not to carry out



      Review 1


      Minor Comment 10: Pg. 8/Fig. S3D/4A: It would be interesting to complete the story and determine the functional relationship of Scd6 to the DNL4 mRNA

      Response: It is indeed an interesting observation and is currently being pursued as part of another story. We believe it is beyond the scope of the current manuscript.


      Review 3

      Major Comment 1: Page 5 and Fig S2E-F: The CLHX experiment to conclude that mRNA is present in Scd6 and Sbp1 puncta is rather indirect. The fact that RNase treatment of a granule-enriched pellet has no effect (Fig S5B) does not help. The authors should perform RNase treatment of intact cells and see that the puncta disappear.

      Response: We thank the reviewer for this comment. Cycloheximide treatment is a well-accepted assay to detect the presence of mRNA in granules. Since granules are dynamic structures, and these depend on active translation, CHX treatment leads to the dissociation of Scd6 and Sbp1 granules. This indicates that granule assembly depends on the availability of mRNA derived from translating ribosomes. The observation that Scd6 puncta are sensitive to cycloheximide but not to RNase A treatment is not surprising. It indeed is consistent with the properties of some of the condensates reported in the literature. For example, stress granule cores that are sensitive to cycloheximide, like Scd6 puncta, are resistant to RNase treatment in lysate, indicating that once formed, these structures are quite stable (https://doi.org/10.1016/j.cell.2015.12.038). It is interpreted to suggest that the RNAs in these condensates are protected by the RNA-binding proteins. Also, subsequently, in the study, we do RNA immunoprecipitation and granule enrichment experiments and show specific RNA enrichment with Scd6 (Figure 5C, 6A).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this manuscript, Rajyaguru, Vagner and colleagues address the role of yeast Scd6 in translation regulation during hydroxyurea (HU) stress response. They first show that Scd6 associates to puncta upon HU-treatment, in a manner that depends on the RGG and LSm domains of the protein. They then show that deletion of Scd6 has a mild effect on cell survival under HU-treatment, and over-expression of SCD6 restores the defects observed upon over-expression of SRS2 (a known regulator of the DNA damage response-DDR), suggesting that SRS2 is an SCD6 target, and that SCD6 negatively regulates SRS2. The authors further show that SCD6 binds to SRS2 mRNA and inhibits its translation. Binding to mRNA is somewhat regulated by methylation of the RGG domain, and methylation seems to be controlled by the LSm domain. Finally, the role of SCD6 as a translation regulator during HU-response is conserved, because its mammalian homolog, LSM14A, stimulates the translation of DDR factors under similar conditions. Altogether, this is a nice report showing a function of Scd6/LSM14A in regulation of translation upon HU treatment. However, there are some contradictions that need to be resolved, and the role of puncta in the whole picture is not clear.

      Major comments:

      1. Page 5 and Fig S2E-F: The CLHX experiment to conclude that mRNA is present in Sdc6 and Sbp1 puncta is rather indirect. The fact that RNase treatment of a granule-enriched pellet has no effect (Fig S5B) does not help. The authors should perform RNase treatment of intact cells and see that the puncta disappear.
      2. Fig 2A-F: The effects of Scd6 and Sbp1 deletion upon HU-treatment are very small. A more convincing effect is observed upon over-expression of both SRS2 and SCD6. What is the effect of over-expression of SDC6 and SBP1 alone (i.e. without SRS2 over-expression)?
      3. Fig 2E: Why is there an opposite effect of deletion of Scd6 and Sbp1in the SRS2 over-expression background?
      4. Page 7, top: '...indicating that Scd6 regulated the expression of SRS2 in a HU-dependent manner.' In my opinion, the results so far suggest that Scd6 and SRS2 are somehow functionally connected during HU-treatment. To substantiate the statement of the authors, they should provide a Western blot showing that the levels of SRS2 change upon Scd6 KO or OE during HU-treatment. This will also substantiate the results shown in Figs 2G-H.
      5. Figs 3: How are the localization of Scd6 protein and SRS2 mRNA to granules, and the levels of SRS2 protein, in cells exposed to HU after deletion of Hmt1? This would substantiate a role of Hmt1 in vivo.
      6. Fig 3C: Is the increased interaction of SRS2 mRNA with Scd6 due to increased levels of SRS2 mRNA upon HU treatment? See also comment below.
      7. Fig 4A: There seems to be an enrichment of SRS2 mRNA both in the granule-enriched pellet and in the supernatant upon HU treatment in the Scd6-GFP context, suggesting increased SRS2 mRNA levels altogether. The enrichment in granules upon HU is difficult to see, as one should measure the distribution of the mRNA in the pellet relative to the supernatant. Can the authors represent the ratio pellet/supernatant normalized to a control transcript? A similar calculation can be done for the protein normalized to a control protein.
      8. Fig 4B: Increased juxtaposition of SRS2 mRNA and Scd6 granules upon HU treatment does not really mean increased colocalization. Granules are likely significantly apart such that increased interactions between the two partners are not explained by increased juxtaposition. Please, comment, tune-down and provide examples where increased granule juxtaposition is associated with increased interaction.
      9. Fig 4D: These results are in direct contradiction with those shown in Fig 1C.
      10. Fig 5E: Can the authors provide a GO analysis of the up- and down- regulated transcripts?

      Minor comments:

      1. Figures S1 and S2 seem to be swapped. Please make sure that Figures and panels are arranged in the order they are mentioned in the main text.
      2. Page 5, sentence: 'our results argue for the role of Scd6 and Sbp1 in HU-mediated stress response'. I do not agree, as no functional assays showing that these proteins affect HU-mediated stress response have been provided at this point of the story. Please, delete.
      3. Page 6: The authors state 'Since scd6 and sbp1 showed tolerance to chronic HU exposure...'. Where is this shown?
      4. Fig 2F: The rescue by SCD6 OE is not complete, as mentioned in the main text.
      5. Figure 2G-H: Please, indicate in the figure what the authors consider 'translated' and 'untranslated´fractions.

      Significance

      The findings provide a new function for Scd6/LSM14A in regulation of translation upon HU treatment. There are limitations regarding the strength of effects in some cases, and the integration of the role of granule formation. These findings are useful for scientists working on genotoxic stress, RNA-binding proteins and/or translation.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors describe a novel role for the RGG-box containing RNA-binding protein Scd6 and its human ortholog LSM14A in the genotoxic stress to HU treatment. The proteins accumulate in granules upon HU treatment and may be involved in the translational repression of mRNAs coding for relevant factors mediating the HU response. For instance, they show that yeast Scd6 interact with SRS2 mRNA and represses translation during HU response. Interestingly, the also show that interaction with RNAs is dependent on arginine methylation in the LSM domain, which greatly affects interaction with the RNA though specificity needs to be shown. At the end, the authors performed initial analysis with the human ortholog LSM14A, indicating that the cellular function of the protein is evolutionary conserved.

      The study is interesting and adds new aspects for understanding RGG-containing proteins and their molecular functions. While it has been previously known that Scd6 accumulates in granules and acts as a translational repressor via eIF4G, this study reveals the regulation of a particular target (SRS2) in context of a relevant stress response. The experiments are usually well performed, though some controls and descriptions need to be added.

      Major points:

      1. Fig. 2A, B. While there seems to be an effect on the lag phase, it could be revealing if the authors pls. calculate the doubling times for the strains and treatments (taking through the exponential growth phase). Furthermore, it would be good if the authors can show the rescue of phenotypes for deletion strains (ie. reintroduction of respective gene on ARS-CEN based plasmids or (if not available) with the OE plasmids.
      2. Fig. 3H. The authors tested the 5'UTR of SRS2 for interaction with recombinant Scd6. Firstly, it is unclear why the authors have chosen the 5'UTR for investigation? Can the authors explain. Secondly, the affinities are relatively low (µM) and the gel shift assay lacks a negative control. The authors should test an unrelated RNA fragment of approximately the same size to control for specificity (negative control). It is unclear whether the protein could interact with any RNA-fragment through charged RNA backbone. Thirdly, it would be good if the authors could show a Coomassie gel for the recombinant protein used in those assays.
      3. The Materials and Methods section lacks important information and requires further details to evaluate the study (see below 10 - 17).

      Minor points:

      Results:

      1. The numbering of Figure S1, S2 is confused in the first part of the results section. The authors should check numbering. In general, numbering should follow in the order of the text - pls. check.
      2. Pg. 5. CHX treatment leads to a decrease in Scd6-GFP and SBP-1 GFP granules. Essentially, CHX blocks translation elongation so the result indicates that puncta depend on active translation. The authors may want to add this liaising point towards the claim that mRNAs could be present in those puncta. How this results integrates with data shown in Fig. 5S5.
      3. Fig. 2H. It would be helpful to the reader, if the authors could mark the respective fraction in the polysomes taken for analysis of relative enrichments. How was this relative enrichment was calculated needs further description.
      4. Fig. S5B. 1% SDS treatment cause absence for Scd6 signal from the pellet fraction. Based on this result, I am not clear how based on this result they can claim for presence of higher order mRNA-protein complexes? Why does it exclude the possibility for Scd6 aggregates accumulating in the pellet? The authors need to explain/ modify this statement. Related to earlier findings that showed dependency of puncta upon CHX treatment, one wonders how this result matches to this earlier observation (ie.EDTA should dissassemble ribosomes)? Can the authors explain?
      5. Fig. 5E, F. For the RNA-seq, the authors compared polysomes with free RNAs (up to 80S) and found enrichement of LIG4 and RTEL1. However, the polysomal profiling mainly shows a slight shift of those mRNAs in higher polysomes; while there is no difference compared to free fractions. How can this be explained? On the line, the authors should indicate clearly what fractions were pooled for RNA seq analysis. It is also not clear how the authors quantified percentage of RNA in individual fractions (have they spiked-in an RNA?)
        • this needs to be stated in the M&M section.
      6. At the end, if may be beneficial to the reader if the authors could provide a simple scheme depicting the model develop during this study.
      7. Supplemental Data set (.xls) The adjusted p-values are clustered and >0.05. Can the authors check and describe how those were calculated. How does it match with Volcano plots.

      Materials and Methods:

      1. A list of primers should be given with specification of their use.
      2. The plasmids constructed for (over)expression of proteins/ production of recombinant proteins should be added. If published, references should be added accordingly.
      3. RIP: the media for growing yeast cells should be added. Check also other section if defined.
      4. RT-qPCR is not sufficiently described. RT kit needs specification, PCR reaction cycles should be given.
      5. Quantification of mRNA levels in polysomes is unclear. How was the distribution of mRNA profiles determined? Have the authors added some RNA spikes to fractions?
      6. The calculation for the enrichments in IPs is not described conclusively and should be added.
      7. Polysomes fractionation (mammalian). It is indicated that the resultant supernatant was adjusted to 5M NaCl and 1 M MgCl2. This seems to be very high - is this a typo? OR why such high concentrations have been chosen?

      Significance

      The study appears solid and well done, except some weaknesses that need to be addressed (see above section). Overall, the interplay with translation could be better investigated and the involvement for interactions with eIF4G/ ribosomes could be better investigated but possibly beyond this study.

      Essentially, the study adds a nice mix of experimental approaches to manifeste the linkage between granules formation, translation and the physiological implications for genotoxic stress response. However, it is not clear how the chosen conditions reflect any natural conditions (yeast may never be exposed to 100-200 mM HU) and hence, it is unclear howfar the observations reflect nature and occur like that. It is certainly a limitiation that only one particual stress condition was investigated and it is unclear whether it is also seen with other genotoxic stress inducing agents.

      Audience: The study will attract the interest of researchers working with cytoplasmic RBPs, translation and stress granules. The latter topic is currently on a high wave as many researchers jumped on this topic. The study may though remain within this circle as the analysis is rather specialised and contrained on one stress (genotoxic) and wider implications of the findings in a physiological context are unclear.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This manuscript demonstrates that RGG containing proteins (Sbp1, Scd6/LSM14A) localize to granules upon treatment with hydroxyurea, Scd6 binds and regulates SRS2 in a manner that is regulated by arginine methylation of the RGG motif, that the RGG and LSM motifs are crucial for the process, and that NHEJ is influenced by LSM14A, suggesting the functional significance of this regulation. Overall the data generally support the conclusions that are drawn. In terms of overall presentation, the manuscript is a bit of a dense read and some of the supplementary figures should be moved to the main body of the manuscript to reflect their importance to the story. I do believe the manuscript will have impact to field, particularly to the specialized RGG protein and stress response niche. I do have several suggestions to polish the manuscript/study:

      Major Points:

      1. Fig. 1 F/G: were the delta RGG and LSM variants expressed at an equivalent level to the WT protein in these experiments?
      2. Fig. 3G: The 6 data points for the delta LSM variant are literally spread evenly up and down the graph, making these data appear highly questionable as to whether one can draw a definitive conclusion from them.

      Minor Points:

      1. Abstract: the acronym NHEJ likely will need to be defined for the general reader.
      2. Introduction, first paragraph: change gene expression to 'transcription' in the phrase 'Even if the contribution of gene expression to GSR..' as I assume this is what is meant here. Gene expression consists of synthesis, processing, translation and decay.
      3. Pg. 3 Introduction: Since they are liquid-liquid phase condensates and ribonucleoproteins (RNPs) refers to any protein-RNA interaction, I think that referring to PBs and SGs as mRNPs is a bit misleading (especially the 'major mRNPs').
      4. Pg. 3 Introduction: are PBs truly 'sites' of mRNA decay as stated? There are papers in the literature that would argue otherwise.
      5. Pg. 3, three lines from bottom. Change LSM14 to LSM14A
      6. Pg. 4 top - What is an 'LCS' - containing protein? The acronym has not been defined
      7. Fig. S1 - there are a lot of important data in this figure that demonstrate the coordinated movement of Scd6 and Sbp1 to granules. They should be moved into the main body of the manuscript in my opinion. Likewise, a whole section of the Results is dedicated to Fig. S2 - thus I would suggest moving these data into the main body of the manuscript to assist the reader.
      8. Fig. 1F should be flipped in the figure with panel G since G is discussed in the results section before F
      9. Be sure to define all acronyms for the reader.
      10. Pg. 8/Fig. S3D/4A: It would be interesting to complete the story and determine the functional relationship of Scd6 to the DNL4 mRNA
      11. Fig. 3H/I: It might be optimal to calculate and compare Kd's for the methylated and unmethylated variants. Also the labels at the top of 3H do not line up with the wells of the EMSA gel.

      Significance

      Overall the data generally support the conclusions that are drawn. In terms of overall presentation, the manuscript is a bit of a dense read and some of the supplementary figures should be moved to the main body of the manuscript to reflect their importance to the story. I do believe the manuscript will have impact to field, particularly to the specialized RGG protein and stress response niche

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      We are grateful to the reviewers for their many valuable suggestions for improving this paper. In particular, we fully understand the points raised by Reviewers #1 and #2 regarding the insufficient data analysis and the points raised by Reviewers #2 and #3 regarding the insufficient analysis of the mechanism. In future revisions, we will perform sufficient analysis of our datasets and we will also conduct an analysis focusing on Dmrt3 to investigate the mechanisms for chromatin accessibility and changes in gene expression during neuronal differentiation. We will also make revisions to address other minor points.

      2. Description of the planned revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The authors have developed a method for labeling a specific stage of differentiating neurons. Using this approach, they tracked the four-day differentiation process of deep-layer excitatory neurons in the mouse embryonic cortex. They investigated genome-wide changes in transcription patterns and chromatin accessibility using RNA-seq and DNase-seq. Additionally, they provided H3K4me3 and H3K27me3 ChIP-seq data from E12.0 NPCs. This resulting omics data would be a valuable resource for the field. While initial data analyses show potentially interesting findings, only part of the analyses are presented in the figures, lacking sufficient detail. Before publishing the manuscript, the authors should include more comprehensive analyses of their datasets. Specific suggestions are below.

      We appreciate this reviewer's positive comments describing our study as 'a valuable resource for the field.' We plan to revise the paper, as noted below, to address this reviewer's concerns.

      Figure 4 focuses on promoter-specific chromatin accessibility analysis. The author can process the data similarly to the transcription data. They should identify differentially accessible promoter regions across E13.0 to E16.0 and generate a heatmap with clustering. Additionally, the author should provide matched gene expression data, either in the form of a heatmap or box plot, corresponding to those differentially accessible promoter regions. Currently, Figure 4 only presents E16.0 data compared to E12.0, which is not comprehensive.

      We thank the reviewer for the useful suggestions. In the following submission, we will determine gene sets for all chromatin accessibility change patterns, not just open/closed gene sets from E12 to E16. We will then illustrate the changes in gene expression for each gene set.

      Reviewer #1 (Significance (Required)):

      Multi-omics data from the differentiation process of deep-layer excitatory neurons would be a valuable resource for the field.

      Once again, we would like to thank the reviewers for their positive comments.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: The manuscript from Sakai et al. examines changes in chromatin accessibility during the differentiation of deep-layer excitatory neurons in the neocortex. The authors establish a novel genetic labelling method that tracks differentiating neurons based on their birthdates allowing following neuronal differentiation in vivo. By combining RNA-seq and DNase-seq they provide a comprehensive dataset of gene expression and chromatin accessibility changes during neuronal differentiation of deep-layer neurons and reveal that key genes linked to mature neuronal functions and bivalent genes in neural precursor cells become accessible during early differentiation. These findings underscore the crucial role of chromatin regulation in preparing neurons for maturation and unravel novel key insights into the regulatory mechanisms governing deep-layer neuronal differentiation.

      Overall, this manuscript presents a novel technique for tracking neuron development from NPCs with specific birthdates. However, in its current form, it is largely descriptive and relies on correlative observations rather than elucidating a clear mechanism underlying chromatin and transcriptional changes. The provided data could be further leveraged to gain deeper insights into the molecular mechanisms governing deep-layer neuron development.

      We would like to thank the reviewer for recognizing the methods used in this paper as 'a novel technique for tracking neuron development from NPCs with specific birthdates'. As the reviewer commented, this paper was descriptive, and we plan to prepare a revised version that includes results that approach 'the molecular mechanisms governing deep-layer neuron development' by analyzing the role of Dmrt3 in neuronal differentiation, as shown in the response below, especially for point 9.

      Major comments:

      The authors have generated extensive RNA- and DNAse-seq datasets across different developmental time points following birthdate labelling. However, the bioinformatics analyses and interpretations are limited and need further clarification and refinement:

      The violin plots used to demonstrate expression and accessibility changes across developmental time points and the conclusions drawn from them are not convincing. The authors used a rank test to assess significant changes in expression, which only indicates the enrichment of genes with increased or decreased expression in each group. This cannot be directly interpreted as "significant upregulation." For instance, in Figures 4a and 4b, similar violin plots yield different statistical outcomes. The mean values on both graphs are comparable, yet Figure 4a suggests significant changes, while Figure 4b does not conclude significant downregulation of closing DHS genes. This is unconvincing. A more robust approach would be identifying DEGs between time points and analysing functional terms associated with these genes. The current plots do not support interpretations of gene upregulation, as each dot represents a gene, and the violin plot serves more as a population representation. The authors should either revisit their explanations and conclusions or include additional analyses and appropriate plots that support their claims of significant upregulation and downregulation of specific genes during development. We would like to thank the reviewer for their helpful suggestions on presenting the data in Figure 4 more effectively. In future reanalysis, we will add an analysis focusing on DEGs, as suggested by the reviewer. Specifically, we will examine the overlap between DEGs identified by RNA-seq and genes with altered chromatin accessibility and test this using Fisher's exact test and other methods. This will allow us to verify the conclusions of this paper from multiple perspectives.

      Figure 6b lacks clarity regarding the cutoff value used to categorise genes as K4me3 and K27me3 negative or positive from the heatmap. Even the "K4me3 negative" cluster displays a detectable signal of the mark, albeit at lower levels. Since only one plot of the entire gene body is provided, it is unclear what levels of enrichment are present, particularly at the promoter region. The authors are encouraged to provide additional informative plots and analyses of this ChIP-seq experiment, as this is a critical point where they draw conclusions about bivalent genes. This would not only strengthen their claims but could also uncover additional findings with more detailed analyses. A heatmap of clustered ChIP-seq signals of K4me3 and K27me3 alongside expression levels of the same genes (similar to Figure 2c) and differential accessibility (e.g., between NPC and E16) would better visualise and correlate histone modifications with chromatin and gene expression states.

      We would also like to thank this reviewer for their useful suggestions regarding Figure 6. In the next submission, we will try different methods to quantify H3K4me3 and H3K27me3 signals. Specifically, we plan to try methods using peak calling and methods that quantify signals in promoter regions.

      We also plan to show new figures for changes in gene expression and chromatin accessibility in gene sets categorized by H3K4me3 and H3K27me3 signals.

      The DNase-seq dataset can be better utilised to investigate differentially accessible motifs through development. Is this something the authors already looked into? This could strengthen mechanism investigation together with the ChIP-atlas results in Fig.6a

      In the revised version, we will perform motif analysis and ChIP-atlas analysis for all genomic region sets showing differential accessibility. We will then use the results obtained to discuss the mechanisms of chromatin accessibility changes during the neuronal differentiation process in more depth.

      The two distinct modes of H3K4me3 enrichment observed are not addressed and should be explained. Which genes belong to these two clusters? Is there a difference in DHS and gene expression between them?

      In relation to point 2 of this reviewer, we will also re-analyze the differences in H3K4me3 patterns and changes in gene expression and chromatin accessibility. We believe that we can answer this reviewer's questions through the analyses using peak calling and signal quantification, as described in point 2.

      The same concern regarding the use of violin plots to correlate gene expression with bivalent genes through development (Figure 6c) as mentioned earlier. It would be better to use DEGs and intersect them. This is particularly important given the wide range of gene expression levels in the already poised state.

      In relation to this reviewer's point 1, we will also perform a reanalysis focusing on DEGs in Figure 6.

      The authors limited their analyses to promoter/gene body regions. A survey of the bivalent marks and accessibility at enhancer regions would be also beneficial for understanding the changes at the chromatin landscape through development.

      The results of Figure 3 showed that chromatin accessibility in the promoter region changes significantly during neuronal differentiation, and this paper has focused on the promoter region. However, as this reviewer has commented, we have realized that analysis of enhancers is also useful. We plan to re-analyze the changes in chromatin accessibility in the enhancer region for the revised version.

      The mechanisms driving the activation and expression of poised neuronal genes through the development of deep-layer neurons is not uncovered. The authors suggest certain histone modifiers and the DNA methyltransferase Dnmt3 as potential drivers of chromatin landscape and transcriptional regulation changes; however, this remains speculative, as there is no direct evidence or validation of these factors binding to the identified target regions or changes in DNA methylation states. The authors should provide validation of their candidate factors' presence at potential targets, as well as changes in DNA methylation if they want to conclude these as the mechanisms driving deep-layer neuron development.

      We thank the reviewer for pointing out the critical issue of the mechanism for the activation of poised genes. We agree that investigating the mechanism in more depth would improve our paper.

      To this end, we will analyze the role of Dmrt3, not Dnmt3, in activating poised genes. Dmrt3 is a transcription factor mainly involved in transcriptional repression, and our RNA-seq results indicate that it is highly expressed in NPCs, and its expression decreases during neuronal differentiation. Therefore, Dmrt3 may suppress poised genes in NPCs. Indeed, our preliminary results using public data have shown that knocking out Dmrt3 increased the expression of poised genes.

      In future analyses, we plan to analyze the role of Dmrt3 using RNA-seq data from Dmrt3 knockout NPCs and Dmrt3 ChIP-seq data from NPCs.

      Minor comments:

      The motif analysis can be included in the main figures.

      We appreciate the reviewer's positive suggestions. Regarding point 9, we will move the results of the motif analysis to the main figure after reanalysis about Dmrt3.

      Reviewer #2 (Significance (Required)):

      By introducing a novel genetic labelling method that tracks neurons based on their birthdates, the study provides a precise way to examine differentiation in vivo, adding valuable insights beyond traditional in vitro approaches. The combination of RNA-seq and DNase-seq analyses reveals how chromatin accessibility changes, particularly in bivalent genes, play a crucial role in neuronal maturation. This work highlights the importance of chromatin dynamics in establishing neuronal identity. The techniques and findings provide a useful framework for future studies, offering a path for deeper exploration of chromatin regulation across different neuronal types, stages of development, or disease contexts, making it a valuable contribution to the field of developmental neurobiology.

      While the manuscript suggests the involvement of chromatin regulators such as Trithorax and Polycomb proteins, as well as Dnmt3 and DNA methylation, it lacks direct mechanistic evidence, such as ChIP-seq, bisulfite-seq, or loss-of-function experiments, to substantiate these claims.

      The bioinformatics analyses and interpretations are limited and require further clarification and refinement.

      The proposed mechanisms are not fully explored, leaving the manuscript largely descriptive rather than providing a detailed mechanistic understanding.

      We would like to thank the reviewer again for their various suggestions for improving our manuscript. By performing the experimental plan described above, we try to resolve the reviewer's concerns and improve this paper.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this manuscript the authors use in utero electroporation of tamoxifen inducible reporters to permanently mark cortical neurons with a common birthdate. They then FACS harvest these cells for bulk DNAse seq and RNA seq to see changes in chromatin regulation and gene expression as these newborn immature cortical neurons become deep layer neurons. As has been shown in prior studies that have addressed other neuronal types or used different methods to isolate developmental cell stages in the CNS, the authors find correlated changes between the opening or closing of chromatin with changes in gene expression. They use this information to localize chromatin marks that are associated with the differential expression of genes and conclude that many of the differential genes are bivalent for active and repressive chromatin marks. Finally the authors cross this dataset with a microarray they did of BDNF-inducible genes in cortical culture and suggest enrichment of this program in the differentially regulated gene set from in vivo.

      Reviewer #3 (Significance (Required)):

      The idea that chromatin regulation coordinates developmental changes in gene expression in neurons has been addressed with several different strategies over the past decade including prior strategies that allow for isolation of neurons with common birth dates. Many current strategies (well cited by the authors) use single cell sequencing and computational algorithms to deconvolve differentiation state from complex mixtures. This study takes an alternative approach to experimentally label these developmental stages which is nice to see for the validation of ground truth. However the study does not go far beyond current knowledge to use this method to add new concepts to the field. The main point of innovation seems to be the observation that the newborn neurons are primed at the chromatin level to express deep layer markers at the time they are born during embryonic life. This is useful to see but not unexpected on the basis of large scale single cell datasets. They also show that bivalent promoters prime developmental stage specific gene expression (in addition to the well-established function of this form of regulation in fate determination), however this too has been shown already in other neuron types.

      We are very pleased that the reviewer evaluated our method as 'nice to see for the validation of ground truth' and distinguished it from the current mainstream method to trace the differentiation process computationally using single-cell analysis that tracks. On the other hand, we also agree with the reviewer's assessment that our results do not exceed previous knowledge. Therefore, as mentioned in our response to Reviewer #2, we plan to analyze the role of Dmrt3 in gene expression and chromatin structure during the neuronal differentiation process. This will allow us to clarify the novel insight into the neuronal differentiation process.

      In addition to these conceptual limitations, there are some poorly supported comments in the text. For example, the fact that their microarray shows some genes in a category called "apoptosis" that are BDNF-sensitive does not meaningful suggest that BDNF induces excitotoxicity in embryonic cortical culture. BDNF has been well established as a survival factor for many kinds of neurons and is a common additive to serum-free media supplements (like B27). The appearance of "apoptosis" terms in the upregulated genes on the microarray more likely suggests either that the microarray is a poor detector of differential gene expression or that the genes in question are inaccurately categorized as "apoptotic" (GO terms are not terribly specific indicators of gene function). If the authors really wanted to test if BDNF was inducing apoptosis their cultures they could test this. However to use only the GO term data in such a strong statement about the biology of their system caused me to question the rigor of either their data or their analysis.

      We are grateful to the reviewers for their important comments. We also agree that BDNF is an important neurotrophic factor and do not believe that it induces cell death. Therefore, we checked the following 40 genes, which showed chromatin closing from E12 to E16, upregulation upon BDNF stimulation, and the GO term 'programmed cell death'.

      Cdip1, Diablo, Pla2g6, Braf, Tnfrsf25, Pa2g4, Mcl1, Hpn, Cebpb, Epha2, Plk3, Herpud1, Crip1, Dusp1, Sphk1, Irf5, Bag3, Stil, Fosl1, Cadm1, Lhx3, Hip1r, Relt, Irs2, Bmp8a, Ptcra, Mef2d, Prkcz, Rnf41, Pcid2

      As a result, we found that there were no genes involved in the main pathway of apoptosis. From this, we understand that the GO terms related to cell death are listed in Figure 5f because 'the genes in question are inaccurately categorized as "apoptotic" ', as this reviewer pointed out.

      We apologize for the misleading discussion in the previous manuscript and would like to thank the reviewer again for realizing this important point. We have corrected this in the new manuscript (page 9, line 263).

      In addition, we will perform a reanalysis to confirm this conclusion of chromatin opening at neuronal activity-associated gene loci using public gene expression analysis data of neuronal stimulation.

      A second example is the section about promoters being the focus of their discussion for DHS sites. Sure figure 3c shows promoters are more likely to be open compared with their contribution to the genome overall, but this is entirely expected since they are major gene TF binding sites, which is what DNAse detects. However promoters do not look to be more likely to be differentially regulated over time (3c vs 3e), and the statement that promoters are more enriched in opening compared with closing sites would require a statistical statement. Distal DHS sites appear equally more abundant in opening sites too.

      We thank the reviewer for their thoughtful comments on our results. As the reviewer points out, the proportion of promoter regions in the opening DHS in Figure 3e is not so high compared to that in Figure 3c. However, as described in the Abstract and Introduction sections, we are interested in how neurons acquire their function during the differentiation process, and our main focus was on comparing neuron-specific and NPC-specific DHS here. In the comparison within Figure 3e, it is clear that the opening DHS has a higher proportion of promoter regions than the closing DHS. We made the necessary revisions to avoid any misunderstanding on this point (page 7, line 192).

      On the other hand, as noted in the discussion, we are also interested in the role of the alteration in distal DHS. As in our response to Reviewer #2, we also plan to analyze changes in DHS in enhancer regions.

      • *

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In Figure 1c, the actual values of the differentially expressed genes are unclear. Is this a Z-score? Please provide the log2 expression values and specify the scale used for the heatmap and clustering.

      We apologize for the unclear expression value of Figure 2c. As this reviewer pointed out, the heatmap shows the Z-score, and we provided the actual scale in the new figure.

      • *

      Figure 5: It is somewhat unusual that the authors used microarray instead of RNA-seq for the BDNA stimulation of in vitro cortical neurons. Please provide a justification for this choice.

      Gene expression analysis using microarrays is a well-established technique, though it is currently unfamiliar. Compared to RNA-seq, microarrays have the disadvantage that they can analyze only RNAs with probes and have a lower dynamic range. However, on the other hand, they have the advantages of reasonable cost and a simpler analysis method. In this paper, we performed microarray analysis for BDNF experiment, considering these advantages.

      Figure 6: again, the data analyses are not comprehensively presented. What are the gene expression profiles of the other clusters (H3K27me3+, H3K4me3-/H3K27me3-, H3K4me3+)? Additionally, the sequencing data is inaccessible, and it is unclear how many samples (e.g., replicates) were used in this study for RNA-seq, DNase-seq, and ChIP-seq.

      We apologize for the lack of gene expression patterns of other clusters in Figure 6c. We provided them in the new figure and confirmed that only bivalent genes (H3K4me3+, H3K27me3+) showed increased gene expression levels during neuronal differentiation and other clusters slight reduction (new Figure 6c). This result again suggests that the bivalent state in NPCs contributes to their activation during neuronal differentiation.

                We described these data in the revised manuscript (page 10, line 296).
      

      Raw sequence datasets (fastq files) and processed data were deposited in the DNA Data Bank of Japan (DDBJ) Sequence Read Archive, a partner of International Nucleotide Sequence Database Collaboration (INSDC), as already described in the Data Availability section. Although DDBJ does not provide a reviewer access system for raw sequence datasets,

      the reviewer's access to the processed data is as follows.


      To review GEA accession E-GEAD-803, E-GEAD-859, E-GEAD-860:

      Please see the instructions below.

      https://www.ddbj.nig.ac.jp/gea/reviewer-access-e.html


      We will provide the access tokens in the final revised manuscript.

      For replicate numbers, we apologize for forgetting to describe them for the BDNF microarray experiment, though those for RNA-seq, DNase-seq, and ChIP-seq were already described in the Methods section. The replicates numbers are as follows:

      RNA-seq: two replicates

      DNase-seq: two replicates

      Microarray: three replicates

      ChIP-seq: two replicates

      We provided the replicate number of the microarray experiment in the revised manuscript (page 17, line 543).

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Major comments:

      The authors begin by examining TFs enriched at E16 DHS regions and suggest that TrxG and PcG factors are highly enriched in neurons, initiating their investigation of bivalent marks. However, they later conclude that bivalent marks are present in the NPC state and later become accessible. It is unclear why PRC factors would be enriched at the neuronal stage when the authors conclude that the chromatin becomes more open (potentially by removal of K27me3). The authors should refine this section of the manuscript to better rationalise their methodology and results.

      We are grateful to the reviewers for pointing out our poor explanation in Figure 6.

      This section aimed to investigate the mechanism by which open genomic regions in E16 were established. We used ChIP-atlas to investigate the transcription factors enriched in the E16 DHS and found many of the components of TrxG and PcG in the previous experiments using ES cells, which are the stem cells as NPCs. Therefore, we hypothesized that binding both TrxG and PcG, meaning a bivalent state, in NPCs may be important for chromatin opening until E16.Therefore, we analyzed bivalent genes in NPCs rather than E16 neurons in Figure 6b-d.

      We explained the rationale in detail in the revised version (page 9-10, line 269-288).

      Do the authors find any expressional changes of the suggested candidate proteins at the RNA or protein levels through development?

      We thank this reviewer for the useful suggestions. We agree that changes in the expression of TrxG and PcG components during neuronal differentiation are important information for considering the mechanism of chromatin structural changes in bivalent genes. Therefore, we checked the expression levels of genes encoding components of PcG or TrxG, determined by Schuettengruber et al., Cell, 2017, in our RNA-seq dataset (new Supplementary Data 5). More than half of them showed significant alteration, suggesting the possible contribution of alteration in the activity of PcG or TrxG or both on chromatin opening.

                We described this point in the revised manuscript (page 12, line 370).
      

      Minor comments:

      1. The manuscript would improve with proofreading by a native English speaker.

      We have already had proofreading by a native English speaker performed. We will also do it when submitting the revised version.

      4. Description of analyses that authors prefer not to carry out

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      One additional point, which may be beyond the scope of this paper, is that to demonstrate the temporal resolution of this birthdate tracking method robustly, the authors should also apply the technique to upper-layer neuron development and compare developmental differences that were previously challenging to capture due to lower resolution.

      Reviewer #2 (Significance (Required)):

      The study focuses exclusively on deep-layer excitatory neurons, without comparisons to other neuronal subtypes or non-neuronal cells. Including such comparisons would help determine whether the observed chromatin changes are unique to this specific population or part of a broader developmental process.

      We are grateful for the reviewer's meaningful suggestions. We also think that by comparing with upper-layer neurons and non-neuronal cells, we can more comprehensively understand the development of the cerebral cortex . However, this paper primarily focuses on deep-layer neurons, and analysis of upper-layer neurons and non-neuronal cells will be future work.

      We described this point in the revised manuscript (page 13, line 384).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this manuscript the authors use in utero electroporation of tamoxifen inducible reporters to permanently mark cortical neurons with a common birthdate. They then FACS harvest these cells for bulk DNAse seq and RNA seq to see changes in chromatin regulation and gene expression as these newborn immature cortical neurons become deep layer neurons. As has been shown in prior studies that have addressed other neuronal types or used different methods to isolate developmental cell stages in the CNS, the authors find correlated changes between the opening or closing of chromatin with changes in gene expression. They use this information to localize chromatin marks that are associated with the differential expression of genes and conclude that many of the differential genes are bivalent for active and repressive chromatin marks. Finally the authors cross this dataset with a microarray they did of BDNF-inducible genes in cortical culture and suggest enrichment of this program in the differentially regulated gene set from in vivo.

      Significance

      The idea that chromatin regulation coordinates developmental changes in gene expression in neurons has been addressed with several different strategies over the past decade including prior strategies that allow for isolation of neurons with common birth dates. Many current strategies (well cited by the authors) use single cell sequencing and computational algorithms to deconvolve differentiation state from complex mixtures. This study takes an alternative approach to experimentally label these developmental stages which is nice to see for the validation of ground truth. However the study does not go far beyond current knowledge to use this method to add new concepts to the field. The main point of innovation seems to be the observation that the newborn neurons are primed at the chromatin level to express deep layer markers at the time they are born during embryonic life. This is useful to see but not unexpected on the basis of large scale single cell datasets. They also show that bivalent promoters prime developmental stage specific gene expression (in addition to the well-established function of this form of regulation in fate determination), however this too has been shown already in other neuron types.

      In addition to these conceptual limitations, there are some poorly supported comments in the text. For example, the fact that their microarray shows some genes in a category called "apoptosis" that are BDNF-sensitive does not meaningful suggest that BDNF induces excitotoxicity in embryonic cortical culture. BDNF has been well established as a survival factor for many kinds of neurons and is a common additive to serum-free media supplements (like B27). The appearance of "apoptosis" terms in the upregulated genes on the microarray more likely suggests either that the microarray is a poor detector of differential gene expression or that the genes in question are inaccurately categorized as "apoptotic" (GO terms are not terribly specific indicators of gene function). If the authors really wanted to test if BDNF was inducing apoptosis their cultures they could test this. However to use only the GO term data in such a strong statement about the biology of their system caused me to question the rigor of either their data or their analysis.

      A second example is the section about promoters being the focus of their discussion for DHS sites. Sure figure 3c shows promoters are more likely to be open compared with their contribution to the genome overall, but this is entirely expected since they are major gene TF binding sites, which is what DNAse detects. However promoters do not look to be more likely to be differentially regulated over time (3c vs 3e), and the statement that promoters are more enriched in opening compared with closing sites would require a statistical statement. Distal DHS sites appear equally more abundant in opening sites too.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The manuscript from Sakai et al. examines changes in chromatin accessibility during the differentiation of deep-layer excitatory neurons in the neocortex. The authors establish a novel genetic labelling method that tracks differentiating neurons based on their birthdates allowing following neuronal differentiation in vivo. By combining RNA-seq and DNase-seq they provide a comprehensive dataset of gene expression and chromatin accessibility changes during neuronal differentiation of deep-layer neurons and reveal that key genes linked to mature neuronal functions and bivalent genes in neural precursor cells become accessible during early differentiation. These findings underscore the crucial role of chromatin regulation in preparing neurons for maturation and unravel novel key insights into the regulatory mechanisms governing deep-layer neuronal differentiation.

      Overall, this manuscript presents a novel technique for tracking neuron development from NPCs with specific birthdates. However, in its current form, it is largely descriptive and relies on correlative observations rather than elucidating a clear mechanism underlying chromatin and transcriptional changes. The provided data could be further leveraged to gain deeper insights into the molecular mechanisms governing deep-layer neuron development.

      One additional point, which may be beyond the scope of this paper, is that to demonstrate the temporal resolution of this birthdate tracking method robustly, the authors should also apply the technique to upper-layer neuron development and compare developmental differences that were previously challenging to capture due to lower resolution.

      Major comments:

      The authors have generated extensive RNA- and DNAse-seq datasets across different developmental time points following birthdate labelling. However, the bioinformatics analyses and interpretations are limited and need further clarification and refinement:

      1. The violin plots used to demonstrate expression and accessibility changes across developmental time points and the conclusions drawn from them are not convincing. The authors used a rank test to assess significant changes in expression, which only indicates the enrichment of genes with increased or decreased expression in each group. This cannot be directly interpreted as "significant upregulation." For instance, in Figures 4a and 4b, similar violin plots yield different statistical outcomes. The mean values on both graphs are comparable, yet Figure 4a suggests significant changes, while Figure 4b does not conclude significant downregulation of closing DHS genes. This is unconvincing. A more robust approach would be identifying DEGs between time points and analysing functional terms associated with these genes. The current plots do not support interpretations of gene upregulation, as each dot represents a gene, and the violin plot serves more as a population representation. The authors should either revisit their explanations and conclusions or include additional analyses and appropriate plots that support their claims of significant upregulation and downregulation of specific genes during development.
      2. Figure 6b lacks clarity regarding the cutoff value used to categorise genes as K4me3 and K27me3 negative or positive from the heatmap. Even the "K4me3 negative" cluster displays a detectable signal of the mark, albeit at lower levels. Since only one plot of the entire gene body is provided, it is unclear what levels of enrichment are present, particularly at the promoter region. The authors are encouraged to provide additional informative plots and analyses of this ChIP-seq experiment, as this is a critical point where they draw conclusions about bivalent genes. This would not only strengthen their claims but could also uncover additional findings with more detailed analyses. A heatmap of clustered ChIP-seq signals of K4me3 and K27me3 alongside expression levels of the same genes (similar to Figure 2c) and differential accessibility (e.g., between NPC and E16) would better visualise and correlate histone modifications with chromatin and gene expression states.
      3. The DNase-seq dataset can be better utilised to investigate differentially accessible motifs through development. Is this something the authors already looked into? This could strengthen mechanism investigation together with the ChIP-atlas results in Fig.6a
      4. The two distinct modes of H3K4me3 enrichment observed are not addressed and should be explained. Which genes belong to these two clusters? Is there a difference in DHS and gene expression between them?
      5. The same concern regarding the use of violin plots to correlate gene expression with bivalent genes through development (Figure 6c) as mentioned earlier. It would be better to use DEGs and intersect them. This is particularly important given the wide range of gene expression levels in the already poised state.
      6. The authors limited their analyses to promoter/gene body regions. A survey of the bivalent marks and accessibility at enhancer regions would be also beneficial for understanding the changes at the chromatin landscape through development.
      7. The authors begin by examining TFs enriched at E16 DHS regions and suggest that TrxG and PcG factors are highly enriched in neurons, initiating their investigation of bivalent marks. However, they later conclude that bivalent marks are present in the NPC state and later become accessible. It is unclear why PRC factors would be enriched at the neuronal stage when the authors conclude that the chromatin becomes more open (potentially by removal of K27me3). The authors should refine this section of the manuscript to better rationalise their methodology and results.
      8. Do the authors find any expressional changes of the suggested candidate proteins at the RNA or protein levels through development?
      9. The mechanisms driving the activation and expression of poised neuronal genes through the development of deep-layer neurons is not uncovered. The authors suggest certain histone modifiers and the DNA methyltransferase Dnmt3 as potential drivers of chromatin landscape and transcriptional regulation changes; however, this remains speculative, as there is no direct evidence or validation of these factors binding to the identified target regions or changes in DNA methylation states. The authors should provide validation of their candidate factors' presence at potential targets, as well as changes in DNA methylation if they want to conclude these as the mechanisms driving deep-layer neuron development.

      Minor comments:

      1. The manuscript would improve with proofreading by a native English speaker.
      2. The motif analysis can be included in the main figures.

      Significance

      By introducing a novel genetic labelling method that tracks neurons based on their birthdates, the study provides a precise way to examine differentiation in vivo, adding valuable insights beyond traditional in vitro approaches. The combination of RNA-seq and DNase-seq analyses reveals how chromatin accessibility changes, particularly in bivalent genes, play a crucial role in neuronal maturation. This work highlights the importance of chromatin dynamics in establishing neuronal identity. The techniques and findings provide a useful framework for future studies, offering a path for deeper exploration of chromatin regulation across different neuronal types, stages of development, or disease contexts, making it a valuable contribution to the field of developmental neurobiology.

      While the manuscript suggests the involvement of chromatin regulators such as Trithorax and Polycomb proteins, as well as Dnmt3 and DNA methylation, it lacks direct mechanistic evidence, such as ChIP-seq, bisulfite-seq, or loss-of-function experiments, to substantiate these claims.

      The study focuses exclusively on deep-layer excitatory neurons, without comparisons to other neuronal subtypes or non-neuronal cells. Including such comparisons would help determine whether the observed chromatin changes are unique to this specific population or part of a broader developmental process. The bioinformatics analyses and interpretations are limited and require further clarification and refinement. The proposed mechanisms are not fully explored, leaving the manuscript largely descriptive rather than providing a detailed mechanistic understanding.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors have developed a method for labeling a specific stage of differentiating neurons. Using this approach, they tracked the four-day differentiation process of deep-layer excitatory neurons in the mouse embryonic cortex. They investigated genome-wide changes in transcription patterns and chromatin accessibility using RNA-seq and DNase-seq. Additionally, they provided H3K4me3 and H3K27me3 ChIP-seq data from E12.0 NPCs. This resulting omics data would be a valuable resource for the field. While initial data analyses show potentially interesting findings, only part of the analyses are presented in the figures, lacking sufficient detail. Before publishing the manuscript, the authors should include more comprehensive analyses of their datasets. Specific suggestions are below.

      In Figure 1c, the actual values of the differentially expressed genes are unclear. Is this a Z-score? Please provide the log2 expression values and specify the scale used for the heatmap and clustering.

      Figure 4 focuses on promoter-specific chromatin accessibility analysis. The author can process the data similarly to the transcription data. They should identify differentially accessible promoter regions across E13.0 to E16.0 and generate a heatmap with clustering. Additionally, the author should provide matched gene expression data, either in the form of a heatmap or box plot, corresponding to those differentially accessible promoter regions. Currently, Figure 4 only presents E16.0 data compared to E12.0, which is not comprehensive.

      Figure 5: It is somewhat unusual that the authors used microarray instead of RNA-seq for the BDNA stimulation of in vitro cortical neurons. Please provide a justification for this choice.

      Figure 6: again, the data analyses are not comprehensively presented. What are the gene expression profiles of the other clusters (H3K27me3+, H3K4me3-/H3K27me3-, H3K4me3+)? Additionally, the sequencing data is inaccessible, and it is unclear how many samples (e.g., replicates) were used in this study for RNA-seq, DNase-seq, and ChIP-seq.

      Significance

      Multi-omics data from the differentiation process of deep-layer excitatory neurons would be a valuable resource for the field.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      In this manuscript by Torres Esteban et. al., the authors investigate how Pellino1/2 are recruited to chromatin surrounding DSBs. These two proteins contain the well-established FHA phospho-peptide binding domain that are present in other DDR chromatin binding such as RNF8. These two proteins were recently implicated as being involved in the DNA damage response, however, the exact phosphorylation-dependent recruitment method Pellino1/2 is not well defined.

      The authors first attempt to identify proteins that bind to the multiple MDC1 TQXF motif, specifically when phosphorylated on the Threonine residue. In two independent experiments both Pellino1 and Pellino2 were pulled down with a phosphorylated MDC1 pTQXF peptide along with RNF8, as expected. They further show that this interaction between Pellino1/2 and the MDC1 is not specific to just a single MDC1 TQXF motif and that it requires a phosphorylated Threonine residue both from nuclear extracts and an in-vitro binding assay. The authors also show that Pellino1/2 cannot be pulled down from nuclear extracts using a pS139 H2AX peptide, as had been suggested by immunoprecipitation methods previously. Next, in Figure 2 the authors investigate if any of the 4 MDC1 TQXF motifs are preferentially bound by Pellino1/2 in comparison to RNF8. The authors further provide evidence for this phosphor-dependent interaction with a crystal structure of Pellino2-FHA along with a pTQXF MDC1 peptide. They then show that while this Pel1/2-MDC1 interaction can be observed using immunoprecipitation of HA/Flag-MDC1, it appears to be IR-induced DNA damage independent with this method. Using PLA as a second approach they validate that Pellino1 and MDC1 can be found in close proximity within the nucleus in cells and that this interaction is enhanced upon IR-induced DNA damage. As a control they see no PLA foci in MDC1 KO U2OS cells. The authors further this notion that MDC1 is important for Pellino1/2 recruitment to DSBs with laser micro-irradiation and see that GFP-Pellino1 is recruited to DNA damage in an MDC1-dependent manner. They further show that this MDC1-dependent recruitment of Pellino1 requires the TQXF motif of MDC1 as mutation of all 4 TQXF residues to AQXF abolished Pellino1 recruitment to laser induced DNA damage. The authors then show that this same AQXF mutant is also devoid of Pellino1 recruitment ability using IR.

      Overall, the manuscript is very clear and concise and presents a reasonable model that MDC1 pTQXF motifs are required for recruitment of Pellino1/2 to DSBs. The explanations of data and figures, including the rationale behind experiments are easily understood and well presented. There are a few points that should be addressed.

      Major Comments:

      1. What were the other proteins that came down with the phospho-MDC1 pTQXF peptide pulldown, and what were the statistics of the pulldowns from the mass spectrometry experiments (i.e. number of distinct peptides, coverage, etc, for all of the interacting proteins that were identified. I was not able find a table showing all of the other interactors (Table S1? Not included in the files). These data are critical and need to be presented in the manuscript.

      2. Given that the FHA domain structure primarily recognizes the three amino acids C-terminal to the pThr residue, why do the authors think that the measured KD for the pT765 site (pTQPF) is 5-fold lower than the measured KD for the pT752 site (pTQPF)? Please provide some rationale for this in the text. (See point 4 below as well)

      3. In Figure 2, the pT699 peptide binds strongest to RNF8 and PELI1/2. Is this a consequence of other factors (conformational or allosteric changes, or other modifications on these proteins from the nuclear extracts) in the full length proteins compared to the isolated FHA domains? If the authors repeat this pulldown experiment using only the isolated FLAG- or GST-tagged FHA domains, do they also get the same result? The data in Table T1 suggests this will not be the case. Please confirm this, and if the data and discordant, please comment on the difference in full-length RNF8 and PELI1/2 binding from the HeLa nuclear extracts versus the isolated FHA domains in the text.

      4. In Figure 3, I am confused by the reference to Asp+5. The sequence around pT765 is pTQPFDT, so isn't the critical Asp in the +4 position? Is this interaction sufficient to account for the 5-fold tighter binding of this site than to the pT752 site? Or is this higher affinity a consequence of the pT-1 Glu residue, or both? Perhaps showing a full side chain interaction map would help here.

      5. It would be helpful if the authors looked at RNF8 levels in the FLAG Ips shown in Figure 4A. It looks like DNA damage slightly reduced PELI1/2 binding which might be accounted for by RNF8 competition.

      6. Do the authors have any data, or can they speculate, on whether the entire MDC1 699-770 region can simultaneously bind to both RNF8 and PELI1/2 though different sites at the same time on a single MDC1 molecule or is the RNF8-PELI1/2 co-localization from adjacent MDC1 molecules? While clearly not required, it would be interesting to ask whether transfecting MDC1-knockout cells with variations of MDC1 containing 1, 2, or 3 Thr-Ala mutations at these sites compromises co-localization and DNA damage responses? Can co-transfection of two constructs containing distinct single TQXF sites - one optimized for RNF8 binding and one optimized for PELI1/2 binding, can rescue the co-localization phenotype.

      7. What effect does the combined knock-down of PELI1/2 have on some aspect of DNA damage signaling or cell cycle progression? Is the repair/loss of gH2AX or 53BP1 foci delayed if PELI1/2 are absent?

      8. Figure 4A shows that the interaction between MDC1 and Pellino1/2 is independent of IR-induced DNA damage as observed using immunoprecipitation. 4B shows that PLA foci between MDC1 and Strep-HA-GFP-Pellino1 can be observed in undamaged cells and PLA foci are increased when observed 3h post 3Gy IR damage. Is this discrepancy explainable by differences between PELI1/2 recruitment given that the antibody in 4A used recognizes both isoforms but the PLA in 4B is specific to Pellino1? It would be nice to see if there are any differences between Pellino1/2 by performing IP +/- DNA damage using exogenously expressed Pelliono1 or 2 variants. Additionally, repeating PLA in 4B with a Pellino2 construct would help address this question.

      9. Similar to the comment above, a further investigation into if there are any differences in recruitment between Pellino1/2 using the experimental systems used in Figure 5 would greatly support the model presented, particularly since the crystal structure presented in Figure 3 is with Pellino2. Is there any rationale for this based upon amino acid sequence differences between Pellino1/2?

      10. Using the experimental approaches in Figures 4 & 5 and the insight from the crystal structure in Figure 3, making point mutations in Pellino1/2 that would be predicted to disrupt the phospho-specific interaction would provide further confidence in the model proposed.

      Minor Comments:

      1. Could the authors please comment on if Pellino3a/b came down in the initial IP experiment and if not, is there potentially an explanation based upon sequence similarities/differences as inferred by the crystal structure of Pellino2?

      2. Is there a rationale for why only single TQXF motifs were tested in isolation for binding in 2A? Given the proximity of the TQXF sites one could imagine that 2 or 3 TQXF sites being simultaneously phosphorylated could provide an even better peptide for Pellino1/2 FHA binding.

      3. In Figure 2B, how is the data "Adjusted"?

      4. Table T2 is provided twice, but the Table of co-precipitating proteins from the mass spectrometry experiments is not shown.

      REFEREE CROSS-COMMENTS

      It seems that most of us hit on the same set of core issues - whether the MDC stretch of 4 pTQxF motifs simutaneously recruits RNF8 and Pellino1/2 to the same MDC1 molecules or not, a few more mutational studies, and whether the Pellino recruitment is important in events downstream of MDC1 binding. This latter issue could entail significantly more work, though its inclusion would likely influence which of the journals that use Review Commons was interested in pursuing the paper. A purely biochemical study is fine, at least in my opinion, depending on the type of journal, but adding some functional DNA damage context and relevant cell biology would bring the work to a deeper level. Just my thoughts, of course....

      Significance

      The general significance of the data presented by Torres Esteban et. al., provide a mechanistic understanding of how Pellino1/2 are recruited to DNA damage, a role that has previously not been characterized. While this manuscript does not delve into the role of Pellino1/2 downstream of its recruitment, much of this has been investigated by a paper by Ha et. al., 2019 as cited in this manuscript. In fact, the Ha et. al., propose that Pellino1/2 directly interacts with pS139 H2AX. Torres Esteban et. al., convincingly show that this is likely not the case and provide robust biochemical and cellular evidence to support their mechanistic model, very nicely contributing to the body of knowledge regarding Pellino1/2 in the DDR. The audience that this paper would be important for would be those in the DNA damage field or structural biologists interested in phospho-peptide binding domains.

      Our expertise is in DNA damage signaling, FHA and other protein modular domains, and determination of protein structure by X-ray crystallography.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The authors in this work provide compelling evidence to support a critical role of MDC1 as a molecular scaffold that dock Pellino1/2 at DSBs. Using an unbiased approach Pellino1/2, alongside RNF8, were identified as FHA-containing DNA damage responsive factors that bear affinity for an MDC1 TQXF phospho-peptide. Results from a series of interaction studies suggest that Pellino1/2 highly likely interact with MDC1 in a phosphorylation-dependent manner. Consistently, Pellino1/2 assembly at DSBs required MDC1 and its TQXF phosphorylation. Overall the work is of high quality and observations that highlight the MDC1-Pellino1/2 interaction are solid.

      Significance

      The work offers an alternative perspective that details MDC1 as an important molecular scaffold that allows DSB targeting of Pellino1/2. I have only two minor comments -

      1) As opposed to examining interaction between Pellino1/2 and the MDC1 fragment (Figure 4A), to unequivocally demonstrate that Pellino1/2 interacts with MDC1 in response to genotoxic stress the authors should show that Pellino1/2 interacts with full-length MDC1 but not its AQXF mutant. In this setting IR should further enhance the Pellino1/2-MDC1 interaction.

      2) For visualisation purpose the authors should provide density map that shows the MDC1-pTQXF and the key interacting residues on the PELI2 FHA.

      OPTIONAL:

      Given the plausibility that Pellino1/2 may co-occupy MDC1 with NBS1 and RNF8 it would be of interest to examine if that is indeed the case. & Does Pellino1/2 expression affect RNF8-dependent ubiquitylation?

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors present a series of experiments demonstrating a physical interaction between the phosphorylated TQXF motifs of MDC1 and Pellino1 / Pellino 2 proteins (PELI1/PELI2). Both proteins are enriched in a peptide pull-down mass spectrometry experiment using HeLa nuclear extracts, and shown to bind preferentially to MDC1-pThr699 and -pThr765. These data are nicely supported by iso-thermal titration calorimetry experiments with a dissociation constant of 230nM determined for interaction with a synthetic pT765 phospho-peptide. The authors go on to determine an X-ray crystal structure of the FHA domain from PELI2 in complex with the MDC1-pThr765, to reveal the molecular determinants underpinning the interaction. Additional experiments, including immunoprecipitation, proximity ligation assay, and laser micro-irradiation support a phospho-specific interaction, driven by DNA damage, between MDC1 and PELI1/2 - and discount a direct interaction of PELI1/2 with the phosphorylated tail of histone H2AX as proposed previously in Ha et al (2019).

      Major comments:

      a) In all cases, RNF8 binds more tightly to the MDC1-pTQXF motifs, with the notable exception of pThr765; however, the affinity of RNF8 for the MDC1-pThr765 peptide has not been determined by ITC, which would be a useful comparator. Esp., given the question of how the authors envisage the interchange between binding to RNF8 vs PELI1/2?

      b) Does Wing-II of PELI1/2 provide a secondary interaction pocket, capable of interacting with an extended bis-phosphorylated peptide? Is it highly conserved in terms of amino acid sequence?

      c) Do the authors findings support the idea of PELI1 acting as part of a positive feedback mechanism that functions via PELI1-dependent ubiquitylation of NBS1?

      d) The manuscript appears a little truncated, and does not explore the downstream (cellular) effects of disrupting the PELI1/2 - MDC1 interaction.

      Significance

      Overall, the manuscript contains a robust and well-controlled set of experiments that confirm a direct interaction between the PELI1/2 protein and MDC1, dependent on prior phosphorylation of MDC1 (at its TQXF ) sites in response to DNA damage. With that said, it is somewhat limited in scope, and does not explore the cellular consequences of breaking/interfering with the ability of PELI1/2 to interact with MDC1 and how this might be integrated into our understanding of the mammalian DNA damage response.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The manuscript by Esteban et al. provides compelling evidence that MDC1, in addition to its well-known interaction with the E3 ligase RNF8, also directly binds to the E3 ubiquitin ligases Pellino1 and Pellino2, both of which have recently emerged as important players in the DNA damage response. The authors convincingly show that MDC1 mediates the recruitment of Pellino1/2 to sites of DNA double-strand breaks (DSBs) via a phosphorylation-dependent interaction.

      Major Comments:

      1. Competitive Binding Dynamics: The manuscript should address whether RNF8 and Pellino1/2 compete for binding to phosphorylated MDC1. Understanding whether these interactions are competitive, cooperative, or mutually exclusive is crucial for deciphering the functional implications of MDC1's ability to interact with multiple E3 ligases.

      2. DNA Damage-Independent Interaction: The observation that MDC1 and Pellino1/2 interact in the absence of DNA damage (as depicted in Figure 4A) is unexpected. While the authors note that MDC1 is phosphorylated to some degree in non-irradiated cells, this explanation does not fully clarify the mechanism or significance of this interaction in the absence of DNA damage. Since DNA damage typically enhances MDC1 phosphorylation, one would anticipate a corresponding increase in the interaction between MDC1 and Pellino1/2. Further investigation into this aspect is needed to better understand the context in which these interactions occur.

      Minor Comments:

      Figure 1E Labeling: There is a labeling error in Figure 1E; the labels should be corrected to "H2AX" and "pS139-H2AX."

      Significance

      While the discovery of this interaction is intriguing, the manuscript would benefit from a more thorough exploration of its physiological significance.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      miRNAs are important for the control of many cellular processes, with the miR-29 family of miRNAs implicated in the regulation of cell growth in different cell types in both the epidermis and dermis of the skin. However, the roles of miRNAs in specific cell types in general, and of the miR-29 family in the skin, are currently unknown. Here, the authors use a range of cellular and molecular techniques, including miRNA cross-linking and immunoprecipitation (miRNA-CLIP) and antisense oligonucleotides (ASO), as well as RNA-Seq, qPCR, Western blotting, in situ hybridization, adhesion and ECM assays, ELISA and immunofluorescence, to interrogate the roles of the miR-29 family of miRNAs in controlling cell growth in epidermal keratinocytes and dermal fibroblasts, using 2D and 3D ex vivo models. The coupling of miR-CLIP with functional assays allowed the authors to identify both miRNA-mRNA complexes, and the biological pathways that these ultimately manipulate.

      The authors report the identification of unbiased, tangible miR-29/mRNA pairs, together with functional roles in cell adhesion, ECM regulation and fibroblast proliferation, that are distinct between keratinocytes and fibroblasts. miR-29 is identified as a valuable target for interventions that seek to promote healthy skin regeneration, including applications for wound healing. Many of the pathways identified here have previously been described, but the novelty of this manuscript lies in the innovative combination of miR-CLIP with functional assays, the application of these in combination to specific cell types, the identification of miR-29 as a novel master regulator of epidermal keratinocyte adhesion via a range of different pathways, and the demonstration that miR-29 inhibition in fibroblasts can influence keratinocyte adhesion via paracrine signalling.

      The experiments are well designed and reported. The interpretations are sound and appropriate for the data presented (though see the comment on potential normalisation of ECM data to cell numbers in cultures for the miR-29 mimic/inhibitor data for fibroblasts and the query about the number of direct miR-29 targets in fibroblasts that are ECM-related).

      Major Comments: I have no major concerns to raise over this manuscript. The claims and conclusions are supported by the data and no additional experiments are required (though please note the comment on normalisation mentioned above and detailed below). The methods are clearly reported and statistical reporting is adequate.

      Minor Comments: Pg3, 7th line from the bottom: "processed into three functional miRNA..." - minor edit needed here, it looks like there's a word missing somewhere. Pg3, last line on the page: "results supported..." - is there a missing 'are' here? Pg5, 15th line of the main text: "of miRNA-29-mediate repression..." - is there a missing 'd' here ('-mediated...')? There is lots on minor presentation errors like this throughout the manuscript - I won't point them out exhaustively, but the manuscript needs a good thorough proof-read, maybe from a fresh pair of eyes? - We fully agree with the reviewer. The manuscript has been proofread and corrected throughout. Fig. 1C: Can the figure be edited to better highlight the basal layer with lack of (nsm image) and expression of (abm image) K10? Maybe a box around that layer, rather than the current arrows only on the abm image (which are not particularly closely indicating the basal layer)? We thank the reviewer for this suggestion. The arrows on the Fig.1C point to the areas where keratin K10 filaments are reaching the basal membrane (indicated by collagen IV staining). It was difficult to box out the basal level without covering the K10 signal. We decided to explain this in the legend to clarify how the data shows this pre-mature expression of keratin K10 in the miR-29ab mimic sample. ____The basal layer of the control (nsm) sample thus remains K10-free and only shows nuclear DAPI staining. Fig. 2 legend should include definitions of abbreviations shown on the figure. - Added Pg8/Fig. 4A: Can the reporting of shared transcript targets of miR-29 in IFK/HFK/DF cells be better communicated? Maybe just adding the actual percentage overlap in transcriptomes for IFK/HFL and keratinocytes/fibroblasts to the main text would help . – Actual percentages of the overlaps added in the text. Similarly, I think a direct report somewhere (in the main text?) of total number for relevant groups shown in Fig. 4E would also be useful - e.g. there are 45 transcripts that are direct targets of miR-29 in keratinocytes and also associated with ECM, and 190 that are direct targets of miR-29 in keratinocytes and also associated with cell adhesion, but these number are difficult to come by quickly at the moment. It would be nice to be able to quickly compare these numbers for keratinocytes to their equivalents for fibroblasts__. – This is a very helpful suggestion with a good example. We incorporated the suggestion into the text and made changes to the figure to make it easier to compare pro-adhesive and miR-29-regulated functions in keratinocytes and fibroblasts. Fig. 4B: It's interesting that ~15% of miR-29 binding targets identified using miR-CLIP are not predicted targets based on TargetScan/microT-CDS. I'd like to see a little more information on this added to the manuscript - perhaps listing some of these or including a table of them? And perhaps some discussion of this could be added also. - Indeed, almost 170 mRNAs are in this category and are now listed in a table in Suppl. File 1. Non-canonical binding is briefly discussed in the text. Fig. 4E: I would be nice to see the Venn numbers for keratinocyte proliferation (either is a supp figure, or addition to the main text?), to help illustrate the lack of a role for miR-29 in the regulation of keratinocyte proliferation. – It is an interesting point; the cell proliferation seems to be a function of miR-29 in fibroblasts but not in keratinocytes. We did not detect cell proliferation as a significantly enriched function among keratinocyte mRNAs directly regulated by miR-29. It is consistent with the lack of change in BrdU incorporation in keratinocytes grown in 3D (Figure 2). We also never noticed any change in keratinocyte proliferation while expanding them in 2D after miR-29 transfection or inhibition. This has been further highlighted in the text. Fig. 4E: Is the reported number of direct miR-29 targets in fibroblasts that are ECM-related correct? This number is reported as 10 in the main text (pg10, 3rd paragraph), but it looks like 10 is only for direct miR-29 targets in fibroblasts that are ECM-related AND related to proliferation. Should this number be 58? The 10 that are direct miR-29 targets in fibroblasts that are ECM-related AND related to proliferation can be reported in the next sentence, where this group is specifically referred to. – This has now been amended in the text according to the reviewer’s suggestion. Fig. 7 (and related main text): Did you take any steps to normalise ECM measurements to cell numbers present in cultures in the miR-29 mimic/inhibition experiments in fibroblasts? This should really be included as it would provide an answer to the speculation of whether the effects of manipulating miR-29 on ECM are due to proliferation or classical pro-fibrotic pathways - it is probably based on proliferation not pro-fibrosis because TGFb is one of the most pro-fibrotic cytokine known and it’s response is abrogated by miR-29KD. Need to check the original excel for Fig. 7D. – Yes, the concentration of the ECM was measured in ng/ml and normalized per number of cells. We calculated the concentration of oligonucleotides per cell by dividing the amount of transfected oligo per number of transfected cells counterstained with nuclear DAPI signal. We could do so because every cell showed a similar transfection rate by calculating fluorescence of Cy3 conjugated to the miR oligos. Then, we divided the ECM concentration by the number of transfected cells per well, thus normalizing the ECM deposition to the cell number. The reviewer is correct, both the increase in ECM after miRNA-29 KD and the decrease in ECM after miRNA-29 overexpression is consistent with increased and decreased cell numbers, correspondingly. As suggested, we later confirm that the increased deposition of the ECM was not a result of activated pro-fibrotic pathway (Figure 7).__

      Fig. 8E: The upper and lower image need to have nsa/abc labels added to them. – This has been done, thank you for noticing! Pg12, 1st sub-heading: typo (cell-specific). -corrected.

      **Referees cross-commenting**

      All reviews appear to be fair and balanced to me. I agree that in places wording could be amended to temper the strengths of some claims, and it would also be nice to see some additional functional assays included, to complement the adhesion and ECM deposition assays that are currently presented, though I do not think this should necessarily be a requirement for publication and could be included in subsequent follow-up work from the group. I did not spot the reuse of images between Fig. 1 and 2, but clearly this should be addressed - either by replacing one set of images, or by removing the relevant panels from Fig. 1 and changing in-text reference to guide the reader to Fig. 2A. I also agree that it would be nice to see miR-29 staining of mouse dermal fibroblasts during wound healing, to complement the images already shown for keratinocytes, and to see miR-29 staining in human skin__. – We thank Reviewer 1 for cross-checking other reviews, and we address these comments in response to Reviewers 2 and 3. __

      Reviewer #1 (Significance (Required)):

      miR-CLIP is a powerful, recently developed technique, with enormous promise for the identification of true miRNA-mRNA pairs, that has not yet been widely adopted by the research community. As such, its application here is itself relatively novel, adding enormously to our existing knowledge of likely miR-29 targets, providing tangible information in miR-29/mRNA pairs in specific cell types in different layers of the skin, but also further adding novel functional information to this, with demonstrations of the regulation of specific relevant biological pathways through manipulation of targets identified using miR-CLIP. The methods are sound (and impressive), results are reported well and not over-interpreted. There is the potential for better characterisation of the relative importance of canonical pro-fibrotic pathways vs proliferation-related effects on ECM production, and this should not be difficult to address. This paper will be on interest to a wide readership, including those engaged in fundamental research and clinicians.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary:

      The article entitled, "miRNA-29-CLIP uncovers new targets and functions to improve skin repair", by Thiagarajan et al. describes the characterization of the functions of miRNA-29 in keratinocytes and fibroblasts, its RNA interactors and potential mechanisms of action. Using candidate interactors and 2D cell culture and 3D skin equivalents combined with loss-of-function (inhibitor) and gain-of-function (mimic), and changes in expression analyses, the authors conclude that the major function of miRNA-29 is to regulate cell-substrate adhesion.

      Major comments:

      • While the interactors and expression changes are useful resources, the claims and the conclusions that are based on them are exaggerated. The treatments are associated with changes in expression, but no functional data support the conclusions. Additional functional experiments are required to assertively make the claims. The title is misleading when stating "to improve skin repair" and the abstract also makes some bold general claims, which are tangentially supported by the findings. For example, "protein folding" only appears in the abstract and "RNA processing" is in the abstract and figures but not referred to in the text__. – We thank the reviewer for valid criticism. While this manuscript was in preparation, we were publishing our other study showing the function of miRNA-29 in wound healing in cutaneous mouse-based model. This study demonstrated an improved re-epithelialization and wound closure in Mir29ab1 KO mice (Robinson et al, Am. J. of Pathology 2024). It was difficult not to think about the role of miR-29 in a wider context of skin repair, which was the goal of the in vivo part of the project. We could not cite the other manuscript at that time as a reference and should have toned down our claims to improved skin repair in this manuscript.__

      • The authors may want to tune their language that their data suggest the conclusions as opposed to being definitive and assertive. This should be done in the Discussion, while the Results should represent the direct conclusions__. – This has now been amended accordingly (highlighted in green).__

      • A couple of examples to the above, in the conclusion to section 1 of the Results, how was the "loss of basal adhesion" assessed? Is it by beta1-integrin localization changes? – We have not performed assays specific to activated integrins, but this is planned studies where we will address the molecular details of the miRNA-29-controlled cell-to-cell and cell-to-matrix adhesion mechanism. Also, how is "growth" defined"? proliferation is not changed and a more accurate way to describe the result is to refer to thickness__. – Indeed, our results clearly demonstrate no change in keratinocyte proliferation in response to a change in miRNA-29 levels either way. We therefore speculate that the reason for differences in 3D cultures of keratinocytes (the SEs) is pre-mature differentiation, induced by miRNA-29. While we do not have a mechanistic answer to this observation (e.g., keratin K14 is not a direct target of miRNA-29), premature expression of K10 in the basal layer may be a consequence of altered adhesion mechanisms in the basal layer. As noted earlier, we are currently investigating the mechanism of miRNA-29-regulated adhesion of mouse and human keratinocytes, but this was beyond the scope of presented study, which has identified the phenomenon at the first instance using organismal and tissue-level approach.__

      • The images in Fig 1C are reused in Fig 2A, where new examples should be shown instead. – We had erroneously inserted the same panel as in Figure 2. The correct day 6 panel is now inserted instead in Figure 1C, along with an additional control of normal human skin.

      • Fig 1C and Fig 2A are not quantified to make the claims about premature differentiation and integrin expression changes. – We struggled to find an accurate method of quantifying the fluorescent signal coming from varied cell shapes and the basal lamina of human SEs. We however see certain consistency in deposition of integrin beta 1 and alpha 6 (ITGB1and ITGA6) in our SEs. The signal for ITGB1 completely disappears in miRNA-29 treated SEs while ITGA6 goes down. Conversely, increased ITGB1 after inhibition of miR-29 coincides with a higher signal of ITGA6 (Figure 2A). ITGB1 and ITGA6 are co-expressed in basal layer of ____human skin____ and ____SEs____(____Solé-Boldo et al, Comm. Biology 2020, ____Fig. 1c____; Stabel et al, Cell Rep. 2023, Fig. 3E) and can heterodimerize to form integrin α6β1 in various tissues (____reviewed by Zhou et al. Stem Cell Res Ther. 2018____). We have changed the way we discuss the results in the text.

      • Fig 3: It is not clear from the figure legends what statistical methods were used for which experiment or how many times the experiment was performed (not just biological replicates), especially given the variability among experiments in Fig 3C. - Adhesion assay in Fig. 3A was performed in four biological replicates with one batch of primary human keratinocytes (pooled neonatal), and in 3C, as two independent experiments (exp) with two different batches of keratinocytes (exp 1 and exp 2). Lower numbers of cells in exp 1 as compared to expt 2 are due to an unfortunate but usual variability between batches of primary cells. The variability noted by the reviewer is most likely coming from lower numbers of cells in exp 1 as compared to exp 2. We have now clarified this in the figure legend.

      Minor comments:

      • The Introduction is focused on methodology and should include elements that pave the way to the Results. Some information that belongs in the introduction are present in the Results section. In this respect, please define the miRNA processing Dicer pathway and its components in the introduction so that the reader can follow the nomenclature (AGO2, RISC, etc.). Also, introduce human skin equivalents or organotypic culture as a model system in the Introduction.

      • Some information in the Results belongs in the Introduction, for example, the first seven lines of the Results section. - We have changed the introduction accordingly

      • The authors might want to consider including quantifications in the main figures, so they are immediately apparent to the reader, for example, Fig S1C. Also, Fig S2B is an important measure for the immediate outcome of the treatment on miRNA-29__. – We have included the quantification of the SE epidermal thickness in Fig. 1D and emphasized the KD effect of miR-29 anti-sense oligos in the text.__

      • Please change "imidiate" to "immediate", "sculp" to "scalp", "has to be releaved of miRNA-29-mediate repression" to "has to be relieved of miRNA-29-mediated repression" - Done.

      **Referees cross-commenting**

      I agree with my colleagues' assessments and suggestions. The miRNA-CLIP data in keratinocytes and fibroblasts are important resources. The figures and text require reconsideration to more accurately represent the data as detailed in our collective reviews

      Reviewer #2 (Significance (Required)):

      The study utilizes 2D and 3D cultures and presents an important resource for miRNA-29 interactors in keratinocytes and fibroblasts, as well as the expression changes associated with its inhibition and overexpression. However, the conclusions are exaggerated and based on expression changes. If the conclusions are rephrased, the findings would be of interest to a broad audience interested in miRNA, cell adhesion and epithelial and mesenchymal biology.

      My expertise is in skin development and maintenance, genetics and cell biology. I have limited knowledge in RNA biology.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      Thiagarajan et al. report on the functions and molecular targets of miR-29 in human primary skin cells. They first focus on the potential role of miR-29 in wound healing and in the adhesion of keratinocytes to the basement membrane using both in vivo wounding assays in the mouse and human cultures/skin equivalents. The authors report that miR-29 negatively affects adhesion in vivo and in vitro and characterise the transcriptome of fast and slow-adhering cells with or without miR-29inhibition. They proceed to identify miR-29 targets in three primary skin cell types (follicular keratinocytes, interfollicular keratinocytes and fibroblasts) by performing miRNA-clip. By comparing these targets to genes altered in keratinocytes with high adhesion capacity after miR-29 inhibition or fibroblasts after miR-29 inhibition, the authors describe a model in which miR-29 inhibits multiple adhesion-associated pathways in keratinocytes and negatively regulates proliferation and ECM deposition by dermal fibroblasts.

      Major comments:

      Overall, the paper is interesting, and the experiments performed are generally sensible for the questions being investigated. However, I thought the data was presented in a very confusing and unclear way, both in the main text and in the figures. I found the paper quite difficult to navigate, with contradictory statements between text and figures, cryptic or confounding graphs or arrangement of the figures and, in at least one instance, re-use of the same image with inconsistent labelling. The paper will thus greatly benefit from extensive tidying up and review of both text and figures to improve clarity. I highlight several points below, with many being related to this overarching issue, and I try to offer suggestions to the authors improve the quality of the manuscript.

      • The stainings in Figure 1A should be repeated in intact sections as it is difficult to understand the exact distribution of miR-29 when the whole epidermis appears to be falling apart in the section. It is possible to see the pattern the authors are describing based on the current images, but it is not convincing. – We fully agree with the reviewers that an intact section would inform the reader on the distribution of miRNA-29 inside the wound much better when the wound morphology is preserved. We have tried repeating the staining (fluorescent in situ hybridization coupled with the antibody staining). The protocol involves multiple washing steps performed at high temperature (for the FISH) and detergent (for the immunodetection step) to ensure specific miRNA probe binding and a low background for the antibody binding. As a result, we could not get a more intact section at the end unfortunately. We have however published a miRNA-29 FISH only stained mouse wounds in ____Robinson et al, Am Journal of Pathology 2024, Figure 1C and Suppl. Fig. 1B____ showing more intact sections with miRNA-29 signal against DAPI. There, one can see the same pattern of miRNA-29 expression as in Figure 1 of this manuscript, with less miRNA in the basal layer of wound keratinocytes vs more miRNA-29 in the skin peripheral to the wound.

      The authors should comment on the fact that miR-29 signal in the inset (at the edge of the wound) appears more basal than in the wound epidermis or in the unwounded__. – We have now inserted this suggestion and discussed it where appropriate (highlighted in cyan)__

      Quantifications and statistical analysis of the intensity and distribution of miR-29 for panels A and B and K10 for panel C will need to be included to help get a better sense of the data in its entirety and strengthen the observations. – We agree with the reviewer that such quantifications would be extremely helpful. The nature of the miRNA FISH protocol relies on signal amplification, allowing detection of mature miRNA specifically despite their short length. We could not therefore rely on conventional methods to quantify the fluorescence reliably as it can only be interpreted relatively to other areas/sections stained at the same time. We have attempted to do the miRNA FISH without amplifying the signal by attaching the FITC probe directly to the miRNA-29 probe but the signal was too weak to reliably detect and quantify miRNA-29 expression in wounds. Importantly, Figure 1C is described as staining after 6 days of skin equivalent cultures, but the same images are used in Figure 2A, where they are described as stainings after 11 days of culture. The authors should try to harmonise the data presentation so that the same data is not presented multiple times if possible. If repeated data presentation is necessary, it should be clearly stated and justified, and the authors should be careful to correctly indicate what the images represent. – This has been corrected.

      • ITGB1 stainings in Figure 2 do not convincingly match the statements in the main text ("miRNA-29 mimic-transfected SE struggled to attach through the integrin beta1 (ITGB1)-mediated adhesion__"). – This should have been phrased rather as a suggestion. We detected virtually no integrin beta 1 in miRNA-29 overexpressing cells, which strongly suggested that high levels of miRNA-29 prevent ITGB1-mediated adhesion of keratinocytes to the basal membrane. __

      All stainings, or at least the most important ones, like ITGB1, should have quantifications and statistical analyses of their intensity and distribution to support any observations. – We thank the reviewer for this comment and fully agree it would be ideal to have quantifications of all staining. We have tried to do so but were able to reliably quantify only BrdU, ITGB1, and ITGA6. The data has now been added to results and discussion.

      Staining of basement membrane proteins at 6 days could help better visualize if indeed there are any attachment defects in the mimic-overexpressing cells – We stained 6 day section for basement proteins collagen IV and laminin 5 but could not detect any differences in attachment (data added below). Since both keratinocytes and fibroblasts contribute to the epidermal-dermal adhesion on the BM, a more sophisticated method of detecting adhesion in human skin equivalents may be needed following miRNA-29 manipulation (e.g., electron microscopy of keratinocyte-BM contacts like hemidesmosomes).

      Since the authors use transient transfections, the significance ant interpretation of the stainings performed at 11 days will be reliant on the transfection strategy employed, the rate of proliferation of the cells, and the half-life of the proteins stained.

      The transfection strategy is not clearly explained (this is a more general problem, see below) and staining for miR-29 in these sections is necessary to ensure that the treatments are still in effect after this prolonged time in culture__. – We have now clarified the transfection protocol and added the quantification of miRNA-29 levels in skin equivalents at day 6 and day 11 (Figure S2D). The overexpression and the inhibition of miRNA-29 is still evident at day 6 and day 11, probably because of the high levels of miRNA mimics and the stabilizing chemistry of miRNA-29 anti-sense oligos (MOE-PS modifications). - The mimic/inhibitor transfection strategy employed by the authors throughout the paper is not clearly explained and this is a very important detail to understand the results of many of the assays they perform. The methods and Figures S2/S3 describe a 'double transfection' transfected twice on D2 and D4 strategy for the inhibitors, but it is unclear if the same approach was used for the mimics (which is important since some of the experiments where they are employed have functional assays that can last longer than a week). Additionally, the strategy used for the inhibitors described in the methods section seems different than the one described in Figure S3. In the methods, the cells are transfected at day 1 and day 3 and collected for functional assays at day 5. Figure S3 instead shows two transfections at 'day 0' and an additional one at 'day 4' with miRNA levels measured at day 0 and day 8 (this bar plot should be modified to better reflect that measurements were only taken on specific days). The legend for Figure S3 reads "keratinocytes (P3/4) were transfected twice on subsequent days" and mentions "representative images of the cells from each treatment after the third transfection". This is all extremely confusing. The authors should make sure they explain what they did clearly and univocally, for both mimics and inhibitors, and they should add a time course with miR-29 levels following transfections of mimics and inhibitors covering the span of their longest assay. – We thank the reviewer for carefully checking the flow and apologize for the confusion. The successful transfection of primary keratinocytes with miRNA mimics is more straightforward than with the anti-sense oligos as the chemistry quite differ. Mimics go in as a ‘stem loop’ RNA structures _and require only one transfection round. Anti-sense ‘inhibitors’ oligos (ASOs) are 15-16 nt single-stranded, _phosphorothioate (PS)-methoxyethyl (MOE)-modified ASO_ require a double-transfection. This way, ASO remain in ‘fast’ cells for days and during adhesion assay as shown here._ The additional experiment for the cell viability and proliferation was following the 2nd transfection, which is now clarified in the text and in the Suppl. Figure S3.__

      • Figure 3 includes reference to morphological parameters that would be predictive of a keratinocyte ability to form a holoclone (red arrows). While the larger size and low nucleus-to-cytoplasm ratio of differentiated cells is well-established, to my knowledge there is no accepted consensus about strong predictive capacity of simple morphological parameters when it comes to holoclone formation. The consensus regarding keratinocyte clonogenicity is generally missing in the field, relying primarily on early passage, low cytoplasm/nucleus ratio, and colony boundaries. Another important characteristic is the number of passages that the cells can undergo before they growth arrest or die. We are currently performing follow up experiments to characterize the miRNA-29 KD (abc) clones and consistently observe higher growth capacity (longevity) of the miRNA-29 depleted keratinocytes. This is also consistent with the data shown in Figure 3A and S3A.

      • The inhibition of miR-29 in experiment 1 of the growth factor depletion assay seems to have failed according to Figure S2C, so the results of experiment 1 (-GF) in Figure 3 should be disregarded and the experiment repeated. We have disregarded the failed experiment and repeated adhesion assays under -GF conditions with more controls. While the improved adhesion upon depletion of miRNA-29 was reproducible, we also found that the growth factor depletion using a specific inhibitor of epidermal growth factor receptor (EGFR) AG-1478 abrogated the fast ____adhesion effect of miRNA-29 inhibition. It possibly means that miRNA-regulated adhesion requires EGF (but not other GF) signaling; however, more experiments would be needed to uncouple the role of GF in miRNA-29 adhesion.

      • The authors report reduced keratinocyte differentiation in the miR-29 inhibited cells. This statement is mostly supported by the cell number time course shown in Figure S3B, but this experiment is not mentioned in the main text, which instead focuses on (less reliable) morphological parameters alone. Moreover, Figure S3 only shows the morphology of cells at day 4 and does not provide any information about the cell morphology at day 6 or day 8 as suggested by the main text. Assessing differentiation based on morphology alone is prone to inaccuracy and while the cell number experiment is good support for the stated decrease in differentiation in the miR-29 inhibited cells, it should be complemented with differentiation marker staining and/or clonogenicity assays. - We agreed with the reviewer and made the appropriate changes in the text. Figure S3 has been updated as well, and we also ran a side analysis of differentiation markers (keratin K10 and loricrin). We found that miRNA-29 does not change significantly during keratinocyte differentiation in 2D (please, see the Support Figure A below).

      • The authors' claim that their results "revealed the direct in vivo targetome and functions of miRNA-29 in three types of cells isolated from human skin" is not accurate. While their experiments are indeed compelling, they are performed in cultured primary cells grown for at least 3 passages, which are akin, but not the same as cells in vivo and may behave differently. – We agree and have changed this now in the text. On a similar note, while there is some evidence from mouse that miR-29 may intervene in the regulation of the wound healing response in keratinocytes in vivo (Figure 1A), no analogous in vivo data is presented for fibroblasts. The authors should consider showing miR-29 stainings of mouse dermal fibroblasts and the potential variation in its level during wound healing. - While this manuscript was in preparation, we were in the process of publishing our study showing the function of miRNA-29 in wound healing in cutaneous mouse-based model. This study shows the staining for miRNA-29 in mouse wounds during healing and includes the staining in dermal fibroblasts (____Robinson et al, Am. J. of Pathology 2024, Figure S1B____). We have isolated total RNA from mouse wounds at different points of healing and checked miRNA-29a/b levels using TaqMan assays. While we detected a change in miRNA-29 expression (Support Figure C, D), this possibly included miRNA-29 in the normal surrounding skin, inevitably present in a wound biopsy. __They should also show miR-29 staining of normal human skin to confirm that its expression pattern mimics the mouse. - We could not cite the other manuscript at that time, but it shows lower levels of miRNA-29 in dermal fibroblasts compared to keratinocytes in the epidermis by FISH (_Robinson et al, Am. J. of Pathology 2024, Figure S1B_). We also quantified levels of miRNA-29a/b in primary mouse keratinocytes and fibroblasts using TaqMan assays, and consistently with FISH, detected more miRNA-29 in keratinocytes (Support Figure B). The FISH for miRNA-29 in human skin was published earlier, also showing much lower signal of miRNA-29 in the dermis (Kurinna, S. Nuc. Acid Res. 2021, Supplementary Figure S3A). If possible, they could also 'wound' human skin explants and check what happens during re-epithelialisation to miR-29 expression and to the key targets they identified (explants may be challenging to obtain, though). These experiments could provide some more compelling (though inevitably correlative) suggestion that miR-29 could intervene in the wound healing response in vivo in humans. – This is a very good experiment suggested by the reviewer. The human skin explants were indeed challenging to obtain. We could only get a few sections of paraffin-embedded samples, which were suboptimal for miRNA-29 FISH. We included the data as Figure S1A. __

      Minor comments:

      • I would encourage the authors to avoid, when possible, the use of red/green colour palettes both in stainings and in graphs, as it makes the paper less accessible to colourblind individuals. – We sincerely apologise for the use of these colours in many stainings. We substituted red and green everywhere we could, but our technical capabilities did not permit changing colours on all Figures.

      • I would suggest avoiding the use of "stacked" bar plots to show data as they might lend themselves to misinterpretation. It would likely increase clarity if the bars for different conditions were plotted next to rather than on top of one another. - We replaced the stacked plots as suggested on Figures 3, 6, and Figure 8. We kept one stacked plot in Figure 6D to show variability in the nsa-treated samples for some mRNAs. The control samples on these plots were set to one (nsa) and the stacked part on top reflected the fold increase in mRNA levels after knock-down of miRNA-29 (abc).

      • The first inset in Figure 1B does not appear to match the box in the lower magnification image. – We moved the inset to the correct location.

      • The title of the section "Rescue of miRNA-29 mRNA targets improves basal adhesion of human keratinocytes" should be changed, as no rescue experiments are performed. The term is used again in the text when referring to targets upregulated (or "de-repressed") after miR-29 inhibition, but it is not accurate and should be changed__. – We followed the suggestion and highlighted changes throughout the text.__

      • The authors should specify the most important details of the adhesion assay in the Results section (for example the fact that the assay is carried out on fibronectin). – We added this to the Results.

      • The main text is imprecise when describing the RNAseq of fast/slow attaching keratinocytes, because it does not mention that the assay also includes miR-29 inhibition. - We have amended this and highlighted the changes in the text.

      • The insets in the middle of Figure 3 are not described in the figure legend and it is unclear what they are meant to be highlighting. The Authors should also double-check the accuracy of the scale bars across Figure 3A. - We described the insets in the legend and double-checked the scale bars in Figure 3A.

      • The pattern in the "abc" bars in Figure 3C makes it difficult to see the symbols – We increased the font and adjusted the label.

      • The area overlaps in the Venn diagram in Figure 4A should reflect the numbers. Since the diagram is comparing only three sets, accurate overlaps should improve the representation of the data. – We have re-created the Venn diagram to reflect the representation of the data on Figure 4A.

      • The colour scheme of the label borders in Figure 4E does not match the colour of set for the right-most sets in both keratinocyte and fibroblast Venn diagrams, leading to confusion. – We adjusted the colours to match the diagram in Figure 4E.

      • The figure legend for Figure 6E reads "Ingenuity Pathway Analysis (IPA) generated heat map of diseases and functions from the fast keratinocytes (abc) versus control (nsa)", but this is not what is displayed in the figure panel at all. - We apologise for the mistake; we corrected the legend.

      • The methods section for the miRNA-CLIP should include information about the number of cells used in each experiment. – The change is highlighted in the Methods.

      • The authors should carefully review the text for typos and misspellings and try to improve the readability of the manuscript__. – The manuscript has been carefully reviewed for these.__

      **Referees cross-commenting**

      I generally agree with the comments of the other reviewers: I think the paper is interesting and a valuable contribution to the field, particularly with regard to the role of miRNAs in the skin and the application of miRNA-CLIP to primary skin cells. While I did not remark on any gross overstatements, I agree that the data needs some strengthening to more adequately support some of the author's claims (I have tried to offer some realistic suggestions). There seems to be some difference of opinion regarding the data presentation, but all Reviewers thought it needed improvement in some capacity. While the way in which the paper is laid out and the results are displayed will be perceived subjectively by different readers, I believe it is in the best interest of the authors to try to reach the widest readership and thus I would maintain that the manuscript requires adjustments to increase clarity. I have tried to indicate specific sources of confusion and offer appropriate suggestions in my review.

      Reviewer #3 (Significance (Required)):

      This paper complements previous work that highlighted the role of miR-29 in desmosome formation in keratinocytes (Kurinna et al., 2014) and in skin repair in the mouse (Robinson et al., 2024), adding depth to these findings by understanding the molecular details of the key genes regulated by miR-29 in primary human skin cells. While the influence of miRNA on skin biology is well known, the details of which miRNAs and molecular mechanisms are involved are somewhat understudied. For this, I believe this paper, adequately amended, could be an interesting and useful contribution to the field and help highlight the role of miRNAs in the skin. This is also, to my knowledge, the first use of miRNA-CLIP in primary keratinocytes or fibroblasts and can provide a useful precedent for other studies looking to investigate miRNA interactomes in these cells.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Thiagarajan et al. report on the functions and molecular targets of miR-29 in human primary skin cells. They first focus on the potential role of miR-29 in wound healing and in the adhesion of keratinocytes to the basement membrane using both in vivo wounding assays in the mouse and human cultures/skin equivalents. The authors report that miR-29 negatively affects adhesion in vivo and in vitro and characterise the transcriptome of fast and slow-adhering cells with or without miR-29inhibition. They proceed to identify miR-29 targets in three primary skin cell types (follicular keratinocytes, interfollicular keratinocytes and fibroblasts) by performing miRNA-clip. By comparing these targets to genes altered in keratinocytes with high adhesion capacity after miR-29 inhibition or fibroblasts after miR-29 inhibition, the authors describe a model in which miR-29 inhibits multiple adhesion-associated pathways in keratinocytes and negatively regulates proliferation and ECM deposition by dermal fibroblasts.

      Major comments:

      Overall, the paper is interesting, and the experiments performed are generally sensible for the questions being investigated. However, I thought the data was presented in a very confusing and unclear way, both in the main text and in the figures. I found the paper quite difficult to navigate, with contradictory statements between text and figures, cryptic or confounding graphs or arrangement of the figures and, in at least one instance, re-use of the same image with inconsistent labelling. The paper will thus greatly benefit from extensive tidying up and review of both text and figures to improve clarity. I highlight several points below, with many being related to this overarching issue, and I try to offer suggestions to the authors improve the quality of the manuscript.

      • The stainings in Figure 1A should be repeated in intact sections as it is difficult to understand the exact distribution of miR-29 when the whole epidermis appears to be falling apart in the section. It is possible to see the pattern the authors are describing based on the current images, but it is not convincing. The authors should comment on the fact that miR-29 signal in the inset (at the edge of the wound) appears more basal than in the wound epidermis or in the unwounded. Quantifications and statistical analysis of the intensity and distribution of miR-29 for panels A and B and K10 for panel C will need to be included to help get a better sense of the data in its entirety and strengthen the observations. Importantly, Figure 1C is described as stainings after 6 days of skin equivalent cultures, but the same images are used in Figure 2A, where they are described as stainings after 11 days of culture. The authors should try to harmonise the data presentation so that the same data is not presented multiple times if possible. If repeated data presentation is necessary, it should be clearly stated and justified, and the authors should be careful to correctly indicate what the images represent.
      • ITGB1 stainings in Figure 2 do not convincingly match the statements in the main text ("miRNA-29 mimic-transfected SE struggled to attach through the integrin beta1 (ITGB1)-mediated adhesion"). All stainings, or at least the most important ones, like ITGB1, should have quantifications and statistical analyses of their intensity and distribution to support any observations. Staining of basement membrane proteins at 6 days could help better visualise if indeed there are any attachment defects in the mimic-overexpressing cells. Since the authors use transient transfections, the significance ant interpretation of the stainings performed at 11 days will be reliant on the transfection strategy employed, the rate of proliferation of the cells, and the half-life of the proteins stained. The transfection strategy is not clearly explained (this is a more general problem, see below) and staining for miR-29 in these sections is necessary to ensure that the treatments are still in effect after this prolonged time in culture.
      • The mimic/inhibitor transfection strategy employed by the authors throughout the paper is not clearly explained and this is a very important detail to understand the results of many of the assays they perform. The methods and Figures S2/S3 describe a 'double transfection' strategy for the inhibitors, but it is unclear if the same approach was used for the mimics (which is important since some of the experiments where they are employed have functional assays that can last longer than a week). Additionally, the strategy used for the inhibitors described in the methods section seems different than the one described in Figure S3. In the methods, the cells are transfected at day 1 and day 3 and collected for functional assays at day 5. Figure S3 instead shows two transfections at 'day 0' and an additional one at 'day 4' with miRNA levels measured at day 0 and day 8 (this bar plot should be modified to better reflect that measurements were only taken on specific days). The legend for Figure S3 reads "keratinocytes (P3/4) were transfected twice on subsequent days" and mentions "representative images of the cells from each treatment after the third transfection". This is all extremely confusing. The authors should make sure they explain what they did clearly and univocally, for both mimics and inhibitors, and they should add a time course with miR-29 levels following transfections of mimics and inhibitors covering the span of their longest assay.
      • Figure 3 includes reference to morphological parameters that would be predictive of a keratinocyte ability to form a holoclone (red arrows). While the larger size and low nucleus-to-cytoplasm ratio of differentiated cells is well-established, to my knowledge there is no accepted consensus about strong predictive capacity of simple morphological parameters when it comes to holoclone formation.
      • The inhibition of miR-29 in experiment 1 of the growth factor depletion assay seems to have failed according to Figure S2C, so the results of experiment 1 (-GF) in Figure 3 should be disregarded and the experiment repeated.
      • The authors report reduced keratinocyte differentiation in the miR-29 inhibited cells. This statement is mostly supported by the cell number time course shown in Figure S3B, but this experiment is not mentioned in the main text, which instead focuses on (less reliable) morphological parameters alone. Moreover, Figure S3 only shows the morphology of cells at day 4 and does not provide any information about the cell morphology at day 6 or day 8 as suggested by the main text. Assessing differentiation based on morphology alone is prone to inaccuracy and while the cell number experiment is good support for the stated decrease in differentiation in the miR-29 inhibited cells, it should be complemented with differentiation marker staining and/or clonogenicity assays.
      • The authors' claim that their results "revealed the direct in vivo targetome and functions of miRNA-29 in three types of cells isolated from human skin" is not accurate. While their experiments are indeed compelling, they are performed in cultured primary cells grown for at least 3 passages, which are akin, but not the same as cells in vivo and may behave differently. On a similar note, while there is some evidence from mouse that miR-29 may intervene in the regulation of the wound healing response in keratinocytes in vivo (Figure 1A), no analogous in vivo data is presented for fibroblasts. The authors should consider showing miR-29 stainings of mouse dermal fibroblasts and the potential variation in its level during wound healing. They should also show miR-29 stainings of normal human skin to confirm that its expression pattern mimics the mouse. If possible, they could also 'wound' human skin explants and check what happens during re-epithelielisation to miR-29 expression and to the key targets they identified (explants may be challenging to obtain, though). These experiments could provide some more compelling (though inevitably correlative) suggestion that miR-29 could intervene in the wound healing response in vivo in humans.

      Minor comments:

      • I would encourage the authors to avoid, when possible, the use of red/green colour palettes both in stainings and in graphs, as it makes the paper less accessible to colourblind individuals.
      • I would suggest avoiding the use of "stacked" bar plots to show data as they might lend themselves to misinterpretation. It would likely increase clarity if the bars for different conditions were plotted next to rather than on top of one another.
      • The first inset in Figure 1B does not appear to match the box in the lower magnification image.
      • The title of the section "Rescue of miRNA-29 mRNA targets improves basal adhesion of human keratinocytes" should be changed, as no rescue experiments are performed. The term is used again in the text when referring to targets upregulated (or "de-repressed") after miR-29 inhibition, but it is not accurate and should be changed.
      • The authors should specify the most important details of the adhesion assay in the Results section (for example the fact that the assay is carried out on fibronectin).
      • The main text is imprecise when describing the RNAseq of fast/slow attaching keratinocytes, because it does not mention that the assay also includes miR-29 inhibition.
      • The insets in the middle of Figure 3 are not described in the figure legend and it is unclear what they are meant to be highlighting. The Authors should also double-check the accuracy of the scale bars across Figure 3A.
      • The pattern in the "abc" bars in Figure 3C makes it difficult to see the symbols.
      • The area overlaps in the Venn diagram in Figure 4A should reflect the numbers. Since the diagram is comparing only three sets, accurate overlaps should improve the representation of the data.
      • The colour scheme of the label borders in Figure 4E does not match the colour of set for the right-most sets in both keratinocyte and fibroblast Venn diagrams, leading to confusion.
      • The figure legend for Figure 6E reads "Ingenuity Pathway Analysis (IPA) generated heat map of diseases and functions from the fast keratinocytes (abc) versus control (nsa)", but this is not what is displayed in the figure panel at all.
      • The methods section for the miRNA-CLIP should include information about the number of cells used in each experiment.
      • The authors should carefully review the text for typos and misspellings and try to improve the readability of the manuscript.

      Referees cross-commenting

      I generally agree with the comments of the other reviewers: I think the paper is interesting and a valuable contribution to the field, particularly with regard to the role of miRNAs in the skin and the application of miRNA-CLIP to primary skin cells.

      While I did not remark on any gross overstatements, I agree that the data needs some strengthening to more adequately support some of the author's claims (I have tried to offer some realistic suggestions). There seems to be some difference of opinion regarding the data presentation, but all Reviewers thought it needed improvement in some capacity. While the way in which the paper is laid out and the results are displayed will be perceived subjectively by different readers, I believe it is in the best interest of the authors to try to reach the widest readership and thus I would maintain that the manuscript requires adjustments to increase clarity. I have tried to indicate specific sources of confusion and offer appropriate suggestions in my review.

      Significance

      This paper complements previous work that highlighted the role of miR-29 in desmosome formation in keratinocytes (Kurinna et al., 2014) and in skin repair in the mouse (Robinson et al., 2024), adding depth to these findings by understanding the molecular details of the key genes regulated by miR-29 in primary human skin cells. While the influence of miRNA on skin biology is well known, the details of which miRNAs and molecular mechanisms are involved are somewhat understudied. For this, I believe this paper, adequately amended, could be an interesting and useful contribution to the field and help highlight the role of miRNAs in the skin. This is also, to my knowledge, the first use of miRNA-CLIP in primary keratinocytes or fibroblasts and can provide a useful precedent for other studies looking to investigate miRNA interactomes in these cells.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The article entitled, "miRNA-29-CLIP uncovers new targets and functions to improve skin repair", by Thiagarajan et al. describes the characterization of the functions of miRNA-29 in keratinocytes and fibroblasts, its RNA interactors and potential mechanisms of action. Using candidate interactors and 2D cell culture and 3D skin equivalents combined with loss-of-function (inhibitor) and gain-of-function (mimic), and changes in expression analyses, the authors conclude that the major function of miRNA-29 is to regulate cell-substrate adhesion.

      Major comments:

      • While the interactors and expression changes are useful resources, the claims and the conclusions that are based on them are exaggerated. The treatments are associated with changes in expression, but no functional data support the conclusions. Additional functional experiments are required to assertively make the claims. The title is misleading when stating "to improve skin repair" and the abstract also makes some bold general claims, which are tangentially supported by the findings. For example, "protein folding" only appears in the abstract and "RNA processing" is in the abstract and figures but not referred to in the text.
      • The authors may want to tune their language that their data suggest the conclusions as opposed to being definitive and assertive. This should be done in the Discussion, while the Results should represent the direct conclusions.
      • A couple of examples to the above, in the conclusion to section 1 of the Results, how was the "loss of basal adhesion" assessed? Is it by beta1-integrin localization changes? Also, how is "growth" defined"? proliferation is not changed and a more accurate way to describe the result is to refer to thickness.
      • The images in Fig 1C are reused in Fig 2A, where new examples should be shown instead.
      • Fig 1C and Fig 2A are not quantified to make the claims about premature differentiation and integrin expression changes.
      • Fig 3: It is not clear from the figure legends what statistical methods were used for which experiment or how many times the experiment was performed (not just biological replicates), especially given the variability among experiments in Fig 3C.

      Minor comments:

      • The Introduction is focused on methodology and should include elements that pave the way to the Results. Some information that belongs in the introduction are present in the Results section. In this respect, please define the miRNA processing Dicer pathway and its components in the introduction so that the reader can follow the nomenclature (AGO2, RISC, etc.). Also, introduce human skin equivalents or organotypic culture as a model system in the Introduction.
      • Some information in the Results belongs in the Introduction, for example, the first seven lines of the Results section.
      • The authors might want to consider including quantifications in the main figures, so they are immediately apparent to the reader, for example, Fig S1C. Also, Fig S2B is an important measure for the immediate outcome of the treatment on miRNA-29.
      • Please change "imidiate" to "immediate", "sculp" to "scalp", "has to be releaved of miRNA-29-mediate repression" to "has to be relieved of miRNA-29-mediated repression"

      Referees cross-commenting

      I agree with my colleagues' assessments and suggestions. The miRNA-CLIP data in keratinocytes and fibroblasts are important resources. The figures and text require reconsideration to more accurately represent the data as detailed in our collective reviews

      Significance

      The study utilizes 2D and 3D cultures and presents an important resource for miRNA-29 interactors in keratinocytes and fibroblasts, as well as the expression changes associated with its inhibition and overexpression. However, the conclusions are exaggerated and based on expression changes. If the conclusions are rephrased, the findings would be of interest to a broad audience interested in miRNA, cell adhesion and epithelial and mesenchymal biology.

      My expertise is in skin development and maintenance, genetics and cell biology. I have limited knowledge in RNA biology.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      miRNAs are important for the control of many cellular processes, with the miR-29 family of miRNAs implicated in the regulation of cell growth in different cell types in both the epidermis and dermis of the skin. However, the roles of miRNAs in specific cell types in general, and of the miR-29 family in the skin in particular, are currently unknown. Here, the authors use a range of cellular and molecular techniques, including miRNA cross-linking and immunoprecipitation (miRNA-CLIP) and antisense oligonucleotides (ASO), as well as RNASeq, qPCR, Western blotting, in situ hybridization, adhesion and ECM assays, ELISA and immunofluorescence, to interrogate the roles of the miR-29 family of miRNAs in controlling cell growth in epidermal keratinocytes and dermal fibroblasts, using 2D and 3D ex vivo models. The coupling of miR-CLIP with functional assays allowed the authors to identify both miRNA-mRNA complexes, and the biological pathways that these ultimately manipulate.

      The authors report the identification of unbiased, tangible miR-29/mRNA pairs, together with functional roles in cell adhesion, ECM regulation and fibroblast proliferation, that are distinct between keratinocytes and fibroblasts. miR-29 is identified as a valuable target for interventions that seek to promote healthy skin regeneration, including applications for wound healing. Many of the pathways identified here have previously been described, but the novelty of this manuscript lies in the innovative combination of miR-CLIP with functional assays, the application of these in combination to specific cell types, the identification of miR-29 as a novel master regulator of epidermal keratinocyte adhesion via a range of different pathways, and the demonstration that miR-29 inhibition in fibroblasts can influence keratinocyte adhesion via paracrine signalling.

      The experiments are well designed and reported. The interpretations are sound and appropriate for the data presented (though see the comment on potential normalisation of ECM data to cell numbers in cultures for the miR-29 mimic/inhibitor data for fibroblasts and the query about the number of direct miR-29 targets in fibroblasts that are ECM-related).

      Major Comments:

      I have no major concerns to raise over this manuscript. The claims and conclusions are supported by the data and no additional experiments are required (though please note the comment on normalisation mentioned above and detailed below). The methods are clearly reported and statistical reporting is adequate.

      Minor Comments:

      Pg3, 7th line from the bottom: "processed into three functional miRNA..." - minor edit needed here, it looks like there's a word missing somewhere.

      Pg3, last line on the page: "results supported..." - is there a missing 'are' here?

      Pg5, 15th line of the main text: "of miRNA-29-mediate repression..." - is there a missing 'd' here ('-mediated...')? There is lots on minor presentation errors like this throughout the manuscript - I won't point them out exhaustively, but the manuscript needs a good thorough proof-read, maybe from a fresh pair of eyes?

      Fig. 1C: Can the figure be edited to better highlight the basal layer with lack of (nsm image) and expression of (abm image) K10? Maybe a box around that layer, rather than the current arrows only on the abm image (which are not particularly closely indicating the basal layer)?

      Fig. 2 legend should include definitions of abbreviations shown on the figure.

      Pg8/Fig. 4A: Can the reporting of shared transcript targets of miR-29 in IFK/HFK/DF cells be better communicated? Maybe just adding the actual percentage overlap in transcriptomes for IFK/HFL and keratinocytes/fibroblasts to the main text would help. Similarly, I think a direct report somewhere (in the main text?) of total number for relevant groups shown in Fig. 4E would also be useful - e.g. there are 45 transcripts that are direct targets of miR-29 in keratinocytes and also associated with ECM, and 190 that are direct targets of miR-29 in keratinocytes and also associated with cell adhesion, but these number are difficult to come by quickly at the moment. It would be nice to be able to quickly compare these numbers for keratinocytes to their equivalents for fibroblasts.

      Fig. 4B: It's interesting that ~15% of miR-29 binding targets identified using miR-CLIP are not predicted targets based on TargetScan/microT-CDS. I'd like to see a little more information on this added to the manuscript - perhaps listing some of these or including a table of them? And perhaps some discussion of this could be added also.

      Fig. 4E: I would be nice to see the Venn numbers for keratinocyte proliferation (either is a supp figure, or addition to the main text?), to help illustrate the lack of a role for miR-29 in the regulation of keratinocyte proliferation.

      Fig. 4E: Is the reported number of direct miR-29 targets in fibroblasts that are ECM-related correct? This number is reported as 10 in the main text (pg10, 3rd paragraph), but it looks like 10 is only for direct miR-29 targets in fibroblasts that are ECM-related AND related to proliferation. Should this number be 58? The 10 that are direct miR-29 targets in fibroblasts that are ECM-related AND related to proliferation can be reported in the next sentence, where this group is specifically referred to.

      Fig. 7 (and related main text): Did you take any steps to normalise ECM measurements to cell numbers present in cultures in the miR-29 mimic/inhibition experiments in fibroblasts? This should really be included as it would provide an answer to the speculation of whether the effects of manipulating miR-29 on ECM are due to proliferation or classical pro-fibrotic pathways.

      Fig. 8E: The upper and lower image need to have nsa/abc labels added to them.

      Pg12, 1st sub-heading: typo (cell-specifcic).

      Referees cross-commenting

      All reviews appear to be fair and balanced to me. I agree that in places wording could be amended to temper the strengths of some claims, and it would also be nice to see some additional functional assays included, to complement the adhesion and ECM deposition assays that are currently presented, though I do not think this should necessarily be a requirement for publication and could be included in subsequent follow-up work from the group. I did not spot the reuse of images between Fig. 1 and 2, but clearly this should be addressed - either by replacing one set of images, or by removing the relevant panels from Fig. 1 and changing in-text reference to guide the reader to Fig. 2A. I also agree that it would be nice to see miR-29 staining of mouse dermal fibroblasts during wound healing, to complement the images already shown for keratinocytes, and to see miR-29 staining in human skin.

      Significance

      miR-CLIP is a powerful, recently developed technique, with enormous promise for the identification of true miRNA-mRNA pairs, that has not yet been widely adopted by the research community. As such, its application here is itself relatively novel, adding enormously to our existing knowledge of likely miR-29 targets, providing tangible information in miR-29/mRNA pairs in specific cell types in different layers of the skin, but also further adding novel functional information to this, with demonstrations of the regulation of specific relevant biological pathways through manipulation of targets identified using miR-CLIP. The methods are sound (and impressive), results are reported well and not over-interpreted. There is the potential for better characterisation of the relative importance of canonical pro-fibrotic pathways vs proliferation-related effects on ECM production, and this should not be difficult to address. This paper will be on interest to a wide readership, including those engaged in fundamental research and clinicians.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We are very grateful for the three reviewers’ positive and considered comments. We copy specific comments below in bold font, and address them in turn including quotations from the revised manuscript in italics.

      2. Description of the planned revisions

      One Supplementary Figure could be copied from an earlier manuscript to show the model circuits, please see Reviewer 3 comment 1, below.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Reviewer 1

      Major 1. In this work, the authors observed discrepancies between the estimated and actual values of DNA-binding dissociation constant (Kd). Such inconsistencies may arise if the model used to simulate Kd omits certain regulatory mechanisms. For example, previous studies (_https://doi.org/10.1371/journal.pcbi.1008340, ____https://doi.org/10.1098/rsfs.2021.0084_) showed that when specific gene regulatory mechanisms are missing, the dissociation constant required for generating oscilla-tions can be underestimated. Thus, the inconsistency between the estimated and ac-tual Kd values might result from missing mechanisms in the model. It would be benefi-cial if the authors could discuss this possibility. __


      We thank the reviewer for these examples, which illustrate the relevant point that multiple biochemical processes can contribute to the non-linearity required to simulate a particular clock behaviour within biochemically-reasonable parameter values. We now cite both, along with the original Buchler & Louis paper on the titration mechanism in general, as follows:

      “It is possible that the measured, bulk clock protein levels might over-estimate the protein available for promoter binding due to mechanisms absent from the model, such as protein partitioning outside the nucleus (Yakir et al, 2009), protein titration (Buchler & Louis, 2008), clustering of proteins within the nucleus, a processing step akin to the formation of a smaller EC pool from a fraction of the bulk LUX protein or any combination of these mechanisms (Jeong et al, 2022b; Yao et al, 2022).”

      __ Minor 1. The order in which the figures are mentioned does not match the order of the figures. Please adjust the sequence of the figures accordingly. __

      • The amalgamated PDF for review presented the Figures and Supplementary Figures in the order of citation, which we intended for the reviewers’ convenience. As this didn’t help two of the reviewers, they are now in numerical sequence.

      __ In the last paragraph of Introduction, the authors state, 'The absolute numbers of proteins directly constrain their possible biochemical activities (Kim & Forger, 2012)'. It would be helpful to also cite Jeong et al. (_https://doi.org/10.1073/pnas.2113403119_), who showed that two circadian neuronal groups in Drosophila, containing different amount of clock proteins, exhibit distinct molecular properties within the circadian clock. __

      • Added a citation to this relevant example later in that sentence.

      __ In the title of the first section of Results, 'Predicting clock proteins levels' should be revised to 'Predicting clock protein levels.' __

      • Done

      __ The description of the results in Figure 4 is unclear. Please, refer to the color and shape of the points when explaining the results for better clarity.__

      • The legend of Figure 4 now refers to both characteristics. Reviewer 2

      __2.1. Limited Generalizability of Some Assumptions: __

      __The translation and degradation rate assumptions are based on data from specific temperature and light conditions, which may not generalize well to other growth conditions. The authors could address this point by explicitly stating this limitation and explaining how variable conditions would affect the model's underlying assumptions. __

      • To address this point, we now note in Discussion: “For example, the reporter assays could quickly test protein numbers under different conditions from those reported here, to understand the biochemical mechanisms for the canonical ‘temperature compensation’ of circadian period in constant conditions (as modelled in Gould et al, 2013) and/or the adaptation to fluctuating, natural conditions (see Future perspectives: models informed by genome sequence, below).”

      __2.5. Simplified Model for Translational Efficiency: __

      __The model simplifies translational efficiency, which may not fully capture the complexity of clock protein synthesis. A more comprehensive approach, considering factors like ribosome density variation across transcripts, would add depth to the protein quantification model. Experiments are not required here. But it'd be nice to explain how a more complex model involving differential ribosomal density per transcript could affect the overall conclusions. __

      • The ‘simple model’ is intentionally simple. We note in the Results that it even ignores the published light/dark-regulation of translation rate, both generally in Arabidopsis and specifically for LHY. To address this point, we have added in Discussion, “In other words, the bulk levels of these clock proteins might be rather simply regulated. The simple model’s approach could justifiably be repeated to estimate the levels of other proteins, and extended to test where more complex biochemical mechanisms, such as translational regulation, are functionally significant.”

      __ Minor Points __


      __- Figure Legends: Several figures lack sufficient detail in legends, particularly Figures 3 and 5, where the methodology for generating the predicted protein levels and Kd values could be described a bit more, without majorly elongating the caption length. __

      • Both legends have been updated. However, the methods are described in detail in the Supplementary Information, which provides much more space. - Unclear Units in Supplementary Table 1: Some units in Supplementary Table 1 for translation and degradation rates are not clearly specified.

      • Units seem clear in the column headings, as follows:

      s (proteins per mRNA per hour)

      k or klight (per hour)

      kdark (per hour)

      The reviewer might be highlighting that some values of kdark were given as “-“ because that protein did not have light-regulated degradation in the simple model, so degradation rate was just k rather than klight and kdark. This notation has been updated to NA, with an explanatory note in the supplementary table legend.


      Reviewer 3

      1. __ I would have appreciated inclusion of an additional figure describing the U2019 and U2020 that were previously published by this group in 2021, along with a brief clarification of the differences between the simple and full versions of these models (Beyond the small summary presented in Figure 8). __ Supplementary Figure 3 of Urquiza and Millar 2021 shows simplified circuit diagrams of the two models and could be added as a new Supplementary Figure in this manuscript, as the editor prefers. To address this point, we added an introduction to the detailed models in the Results: “Our detailed models of the clock gene circuit (see Introduction) are not driven by rhythmic data input like the simple model, but rather use ordinary differential equations to recapitulate the dynamics of each RNA and protein component in the clock circuit, along with their interconnected feedback loops and their regulation by light signals. The models autonomously generate rhythmic patterns of RNA and protein expression that match the rhythmic data. The gene circuits in models U2019 and U2020 differ only in the regulation of daytime processes, involving LHY/CCA1 and the PRR genes (Urquiza-García & Millar, 2021). The circuit of U2019 is closer to its antecedent model P2011 (Pokhilko et al, 2011), using gene activation, whereas U2020 uses repression (for circuit diagrams, see Supplementary Figure 3 of (Urquiza-García & Millar, 2021). Repression is better supported by molecular data but U2020 simulations fit the data no better than, or slightly worse than, the activation-based model U2019 (consistent with Fogelmark & Troein, 2014), so we use both circuits here.”

      __Minor comments __

      __Supplemental figure 2 is partially cut off __

      • Corrected. Only the edge of a label was affected. It would be useful if figures/supplemental figures were provided in order.

      • Corrected, see reviewer 1 above.

      4. Description of analyses that authors prefer not to carry out

      Reviewer 1

      N/A

      Reviewer 2

      __2.2. Discrepancies in CCA1 Binding Affinity: __

      __The model's simulated Kd values for CCA1 largely deviate from empirical measurements, suggesting missing mechanisms that may impact protein binding affinity. This might be due to some structural or environmental factors influencing CCA1 binding. No experiments are needed here but some plausible explanations would enhance the manuscript. __

      • This point is addressed in the section ‘Future perspectives: investigating Kd in vivo’, now enhanced by our response to reviewer 1’s major comment, as noted above. __2.3. Limited Data for Evening Complex Proteins: __

      __The model assumptions for ELF3, ELF4, and LUX rely on estimated degradation rates, which could introduce inaccuracies in the predictions. Empirical quantification of these rates would strengthen the reliability of these protein dynamics in the model. If these experiments are too resource and time intensive, then include some explanation of how changing these estimated degradation rates would affect the overall result. __

      • We assume that the reviewer is referring to the k parameters of the simple model driven by RNA data, as in Figure 2, which gives the predicted protein levels that we compare to measured protein levels in Table 1. Both the prior degradation rate data and our NanoLUC reporter measurements are strongest for LHY, CCA1, PRR7 and TOC1, so these are our focus. New measurements of degradation rate for ELF3, ELF4 and LUX are indeed beyond our scope. Our reporter methods might facilitate future measurement of such parameters by other researchers. In the Supplementary Information, we outline the estimation of degradation rates k, which returned biologically plausible protein half-lives for LUX = 3.7 h, ELF4 = 1.3 h and ELF3 8.7 h. We have no data for ELF4 and we highlight the specific limitations of our ELF3 and LUX data in the Results and Discussion text, Table 1, Methods and Supplementary Information. Taking the in vivo data for ELF3 for example (Table 1), the protein levels predicted by the simple model using that degradation rate were within 6% to 7% of the measured levels at the peak and at the trough under LD. This independent result supports the estimated degradation rate: varying the estimated degradation rate for ELF3 would not maintain a prediction so remarkably close to the data.

      __2.4. Unexplained Variability in Reporter Data: __

      __The NanoLUC reporter assays show some variability that the model does not account for, possibly due to factors like differential protein stability or folding in vivo. Further tests across different expression contexts, or including protein stability measurements, would clarify these inconsistencies. If these experiments are too resource and time intensive, then include some explanation of how changing these estimated degradation rates would affect the overall result. __

      • This comment cannot be addressed without knowing which of the many possible data series the reviewer is referring to, and whether the variability of interest is in level, timing or a higher-order dynamic behaviour. For example, we discuss three specific instances where reporter and model behaviour differs, in detail, in Discussion section ‘Refining the modelled protein profiles’ (726 words).

      Reviewer 3

      __ It would have been useful to quantify the mRNA expression levels in the rescued transgenic lines to enable direct comparison to WT. The use of multiple independent transgenic lines supports the authors' conclusions but this characterisation would aid future research. __

      The proposed study most directly addresses whether RNA from the reporter transgene is functionally equivalent to wild-type RNA, whereas the focus of this article is at the protein level. As the reviewer notes, the proposed study would principally be an aid to future research, so we prefer not to start that additional experimental work.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      This is a valuable manuscript that begins to factorise the plant circadian model according to estimated protein translation and degradation rates. These models are iterative by nature, but the latest models provide a significant advance in our understanding and highlight avenues for future exploration. The use of Nanoluc protein:reporter fusions enables estimation of protein abundance, providing additional support for the modelled biological process.

      The latest models highlight discrepancies between protein abundance predicted in the model compared to the experimental evidence, and sensible suggestions are discussed to prioritise experiments to enable greater biological understanding.

      Major comments

      1. I would have appreciated inclusion of an additional figure describing the U2019 and U2020 that were previously published by this group in 2021, along with a brief clarification of the differences between the simple and full versions of these models (Beyond the small summary presented in Figure 8).
      2. It would have been useful to quantify the mRNA expression levels in the rescued transgenic lines to enable direct comparison to WT. The use of multiple independent transgenic lines supports the authors' conclusions but this characterisation would aid future research.

      Minor comments

      Supplemental figure 2 is partially cut off

      It would be useful if figures/supplemental figures were provided in order.

      Significance

      This work extends models of the plant circadian system to assess absolute protein numbers. This enables the biological 'plausibility' of the model to be assessed and also highlights where the model diverges from experimental data, indicating where additional understanding is required.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      1. Summary of this work

      This manuscript delves into the Arabidopsis circadian clock by addressing a major gap in plant gene circuit models-particularly, the lack of absolute units for protein quantification, which has historically hindered comparisons to biochemical data. By recalibrating two mathematical models of Arabidopsis thaliana's circadian clock from relative to absolute units, the authors introduce a framework that allows protein levels to be quantified in terms of copies per cell. This approach facilitates more direct comparisons between model predictions and empirical measurements.

      The core of the study involves both analytical predictions and experimental validation to quantify protein levels for key clock proteins (such as LHY, CCA1, PRR7, and TOC1) using RNA data and luciferase reporter protein fusions. The simple model predicts the abundance of clock proteins, suggesting that the protein concentration may reach up to 100,000 copies per cell, which was then verified through experiments involving NanoLUC reporters. The recalibration of detailed mathematical models enabled the calculation of DNA-binding dissociation constants (Kd) based on empirical data, establishing a bridge between theoretical and experimental insights for understanding plant circadian rhythms.

      The authors further explore how recalibrated models align with data by focusing on Kd values associated with DNA binding, particularly with LUX and CCA1 proteins. In vitro binding assays validated these predictions, while certain discrepancies emerged for CCA1 binding, implying other mechanisms could be influencing the observed results. Importantly, the recalibrated models provide a more realistic representation of protein-DNA interactions within the circadian clock, and the framework can be applied to understand other plant gene regulatory networks.

      Overall, the authors demonstrate how absolute protein quantification can advance our understanding of circadian rhythm dynamics in Arabidopsis. This study highlights the broader applicability of their methods, suggesting potential adaptations for investigating gene regulation across plant species and evolutionary contexts. The integration of empirical data with mathematical models introduces a new standard for rigor in plant gene circuit modeling and opens up avenues for exploring gene regulation in crop species and adaptive evolution in plants. 2. Major points

      2.1. Limited Generalizability of Some Assumptions:

      The translation and degradation rate assumptions are based on data from specific temperature and light conditions, which may not generalize well to other growth conditions. The authors could address this point by explicitly stating this limitation and explaining how variable conditions would affect the model's underlying assumptions.

      2.2. Discrepancies in CCA1 Binding Affinity:

      The model's simulated Kd values for CCA1 largely deviate from empirical measurements, suggesting missing mechanisms that may impact protein binding affinity. This might be due to some structural or environmental factors influencing CCA1 binding. No experiments are needed here but some plausible explanations would enhance the manuscript.

      2.3. Limited Data for Evening Complex Proteins:

      The model assumptions for ELF3, ELF4, and LUX rely on estimated degradation rates, which could introduce inaccuracies in the predictions. Empirical quantification of these rates would strengthen the reliability of these protein dynamics in the model. If these experiments are too resource and time intensive, then include some explanation of how changing these estimated degradation rates would affect the overall result.

      2.4. Unexplained Variability in Reporter Data:

      The NanoLUC reporter assays show some variability that the model does not account for, possibly due to factors like differential protein stability or folding in vivo. Further tests across different expression contexts, or including protein stability measurements, would clarify these inconsistencies. If these experiments are too resource and time intensive, then include some explanation of how changing these estimated degradation rates would affect the overall result.

      2.5. Simplified Model for Translational Efficiency:

      The model simplifies translational efficiency, which may not fully capture the complexity of clock protein synthesis. A more comprehensive approach, considering factors like ribosome density variation across transcripts, would add depth to the protein quantification model. Experiments are not required here. But it'd be nice to explain how a more complex model involving differential ribosomal density per transcript could affect the overall conclusions. 3. Minor Points - Figure Legends: Several figures lack sufficient detail in legends, particularly Figures 3 and 5, where the methodology for generating the predicted protein levels and Kd values could be described a bit more, without majorly elongating the caption length. - Unclear Units in Supplementary Table 1: Some units in Supplementary Table 1 for translation and degradation rates are not clearly specified.

      Significance

      1. Overall Evaluation

      The recalibration of models to absolute units for protein quantification is a novel advancement in the field, allowing more direct comparison to experimental data. The study's combination of modeling and empirical validation is robust, and the use of quantitative NanoLUC reporters adds rigor to the experimental design. The study offers a clear protocol for estimating Kd values by integrating Protein-Binding Microarray (PBM) data and Surface Plasmon Resonance (SPR) data, which is a significant methodological contribution. These approaches could become a new standard for studies aiming to link molecular and phenotypic traits in plants. Moreover, through NanoLUC reporter assays, the study provides empirical data for protein levels that align closely with the model's predictions, enhancing the validity of their model. Additionally, by creating a framework that can be adapted to other plant gene regulatory networks, the authors extend the impact of their work beyond the Arabidopsis circadian clock, hinting at potential applications in agriculture and crop science.

      I recommend this study with a minor revision, addressing the points below.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Major

      1. In this work, the authors observed discrepancies between the estimated and actual values of DNA-binding dissociation constant (Kd). Such inconsistencies may arise if the model used to simulate Kd omits certain regulatory mechanisms. For example, previous studies (https://doi.org/10.1371/journal.pcbi.1008340, https://doi.org/10.1098/rsfs.2021.0084) showed that when specific gene regulatory mechanisms are missing, the dissociation constant required for generating oscilla-tions can be underestimated. Thus, the inconsistency between the estimated and ac-tual Kd values might result from missing mechanisms in the model. It would be benefi-cial if the authors could discuss this possibility.

      Minor

      1. The order in which the figures are mentioned does not match the order of the figures. Please adjust the sequence of the figures accordingly.
      2. In the last paragraph of Introduction, the authors state, 'The absolute numbers of proteins directly constrain their possible biochemical activities (Kim & Forger, 2012)'. It would be helpful to also cite Jeong et al. (https://doi.org/10.1073/pnas.2113403119), who showed that two circadian neuronal groups in Drosophila, containing different amount of clock proteins, exhibit distinct molecular properties within the circadian clock.
      3. In the title of the first section of Results, 'Predicting clock proteins levels' should be revised to 'Predicting clock protein levels.'
      4. The description of the results in Figure 4 is unclear. Please, refer to the color and shape of the points when explaining the results for better clarity.

      Significance

      In the manuscript by U. Urquiza-Garcia et al., entitled "Abundant clock proteins point to missing molecular regulation in the plant circadian clock," the authors refactored two models of the plant circadian clock to use absolute units, allowing direct comparison with biochemical data. This study significantly advances the integration of experimental data into computational models.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      'The authors do not wish to provide a response at this time.'

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this manuscript, Takenaka et al revisits the role of the maternal-effect gene abnormal oocyte (ao) in the regulation of canonical histone gene expression in Drosophila. The authors generated both a knockout allele of ao and a V5-tagged ao allele using CRISPR/Cas9. They show that (i) loss of ao results in maternal-effect lethality; (ii) Ao protein is localised to histone locus bodies; (iii) an increased dosage of AO heterochromatin partially rescues maternal-effect lethality. These results are consistent with previous reports. However, this study convincingly demonstrates that ao is not required to suppress histone expression, which is in contrast to previous suggestions.

      The manuscript is well structured and written. The conclusions are supported by the data. However, I have a few comments that may help to improve the manuscript:

      1. Localisation of the Ao-V5 protein: Ao-V5 co-localises with Mxc, a marker for the histone locus body. The authors state that some Ao-V5 puncta are without Mxc colocalisation. As far as I can see, it looks more like the other way around - Ao-V5 co-localises with Mxc, but there are some Mxc-marked HLBs without Ao-V5, which would imply that Ao-V5 does not have any other additional genomic loci as suggested. Therefore, I think a quantification of the existing images would be beneficial.
      2. qPCR analysis of histone gene expression in ao mutant ovaries shows that histone H1 RNA is slightly upregulated, while all other canonical histones remain unchanged compared to control. However, Western blot analysis shows a significant decrease in histone H3 protein levels (~50%), which the authors describe as 'slightly decreased'. This result needs to be considered in more detail, as it would suggest that Ao is required to maintain (at least) histone H3 protein levels?
      3. Some of the supplementary figures are not referenced in the main text (only in the figure legends).
      4. Ao deletion (Figure S1): I suggest including the Sanger sequencing results (as mentioned in the text) to demonstrate that neighbouring gene sequences remain unaffected.

      Referees cross-commenting

      I also think that all reviews are consistent with only a few suggestions for improving the manuscript.

      Significance

      Strength: This paper shows that ao is not a histone gene repressor, as previously thought, which is an important finding. It also describes new alleles of ao, which are valuable tools for the Drosophila research community.

      Limitations: The study lacks insight into the actual molecular role of ao. The finding that H3 protein levels are considerably reduced in ao ovaries could be further investigated.

      Advances and audience: As described above, this paper extends our knowledge of the role of ao by showing that it is not a histone gene repressor, which is of interest to the scientific community working in the fields of chromatin and development.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This study by Takenaka et al. explores the role of the abnormal oocyte (ao) gene in Drosophila melanogaster. Historically, ao was thought to regulate histone gene expression, causing maternal-effect lethality when disrupted. However, the lack of critical reagents including some alleles limited further testing of this hypothesis. In this study, the authors created new CRISPR/Cas9-generated ao knockouts and an epitope-tagged allele to clarify its role. Findings confirmed ao's maternal-effect lethality, which can be rescued by reducing histone copy number or increasing Y-chromosome heterochromatin. Contrary to previous studies, they found ao does not repress histone transcript levels, leaving its lethality mechanism unresolved. This work challenges the previously assumed role of ao in histone regulation at the level of transcription.

      Major comments: Overall, this manuscript was a joy to read, and the experiments are well controlled and done and presented in a clear and precise manner.

      Minor comments: My only two suggestions are if there is a way to include that ao is a conserved gene in the introduction that may broaden the readership of this manuscript. Also, there are many supplementary data panels showing Ao expression that could be condensed.

      Referees cross-commenting

      All the comments sound good to me.

      Significance

      The significance of the study lies in its challenge to the previously accepted role of ao gene in regulating histone expression. While earlier research suggested ao acted as a repressor of histone expression, this study, using new and well-controlled tools, did not find evidence supporting that role. The findings not only question the previous assumptions but also emphasize the complexity of ao's function, highlighting the need for further exploration of the mechanisms underlying maternal-effect lethality and its genetic interactions with heterochromatin and histone genes. This will contribute to a better understanding of gene regulation during early development.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary: In this manuscript Takenaka et al address the function of the gene abnormal oocyte (ao, formerly abo) in Drosophila oogenesis. Previous reports implicated ao as a negative regulator of histone biogenesis based on its localization to the histone locus body (HLB), rescue by reduction of histone copy number, and reports that histone production is increased in ao mutants. Only one of the two original ao mutants are still available and the other has been lost. The authors sought to confirm ao's mechanism of action by generating a CRISPR null allele for ao. They also generate a tagged rescue construct which allows them to visualize ao localization. First, they confirm that a clean deletion of ao does indeed confer maternal lethality and that their tagged construct does localize to the HLB. Next they test histone expression levels in ao null ovaries and find that histones are not, in fact, overexpressed at the RNA or protein level. They also test the RNA levels in unfertilized embryos and find that H2B is decreased rather than increased. Finally, they confirm the previous report that histone deficiency or reduced copy number reduces ao's maternal effect lethality indicating that there is a relationship between ao and the histone locus.

      Major comments: I have only one suggestion and a possible extension of the work.

      Suggestion: Since ao localizes to the HLB it presumably only affects the replication coupled histones. It would be very interesting to know how replication independent histone RNAs change (or do not) in ao null ovaries.

      Extension: (This may be beyond the scope of this work). One wonders if the partial rescue of ao mutants by the histone deficiency/reduced histone copy number is due to a change in the localization of other HLB components in the absence of ao. To test this the authors could image Mxc in the ao null.

      Minor comments: None

      Significance

      General assessment: Overall, I think that this is an important story that sets the record straight about the molecular function of ao, particularly because it was recently suggested as a tool to manipulate histone levels. The experiments appear to be carefully done and provide good support for the claims. The manuscript is clearly written and easy to interpret.

      Advance: The main takehome of this story is that the previously described and long-standing mechanism of ao on histone biogenesis and oogenesis is flawed. This story will be the starting point for future characterization of this mutant.

      Audience: Those interested in histone biogenesis, especially in flies, will be interested in this story. It does not provide an alternative molecular mechanism (nor does it claim to) which will somewhat limit the general interest. However, this story provides a vital correction to the record.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 The manuscript by Consorte and coworkers focusses on the role of the tudor-doman containing proteins, Tdrd6a and Tdrd6c in Germplasm stability in zebrafish. Single mutants for each protein do not affect germ plasm stability or germ cell fates, Through the use of double mutants lacking the function of both proteins, the authors find that germ plasm complexes form and the Balbiani body of mutant oocytes are unaffected. However, the germ plasm complexes disperse during early development, leading to loss of primordial germ cells and eventually sterility of adult double mutant fish. Domain analysis of Tdrd6c showed that the Tudor domains are not required for interactions with the germ plasm organiser Bucky ball (Buc), but function in germ plasm dynamics. The prion-like domains of Tdrd6c were found to be required for interactions with Buc. Tdrd6c protein localizes to perinuclear granules in germ cells, but not in the Bb, unlike Tdrd6a. The manuscript is generally well done, and the findings are of interest to researchers interested in germline development, RNA-protein complexes and intrinsically disordered /prion-like proteins. Some further work would bolster the findings and support the main conclusions better. Major comments:

      • Regarding the 6a6c double mutants, figure 3 and S4 show preliminary evidence that the gonads are severely underdeveloped. However it is unclear when/what stage the gonads are arrested and whether there is a loss of germline stem cells. This can be shown.

      Reply:

      As the PCGs are already missing at 1 day post fertilization, there will be no germ cells in the gonads, leading to the rudimentary gonad structures we show in Figure S4. This phenotype has been described before by us and others (PMID: 17418787; PMID: 12932328; PMID: 15728735). Hence, a tissue analysis would not yield any further information.

      • The authors show that germplasm forms in single mutants for 6a and 6c and Buc-eGFP reporter transgene localization does not show overt germpalsm defects in the single mutant embryos. But PGC numbers are reduced by larval stages. Are germplasm RNAs destabilised to some extent in the single mutants? This should be examined.

      Reply:

      Thanks for bringing up this interesting point. In Roovers et al. (PMID: 30086300) we did an extensive analysis in tdrd6a mutants in this regard, showing that indeed germ plasm transcripts were generally reduced in PGCs. We do not plan to repeat such analysis for tdrd6c mutants. However, we propose to address this by smFISH experiments on known germ plasm transcripts, like vasa and dazl. This would not only reveal potential abundance issues, but also localization issues.

      • Relevant to the PGC defects shown in Fig 3, is there is more male bias or earlier defects in the 6c single mutants ? What is the tissue shown in Fig S4 B in the double mutant? Some sections and markers would be useful.

      Reply:

      In figure 3D that no male bias was observed in the offspring of single mutant females. While we cannot exclude earlier defects, these will be minor as no fertility defects have been noted. Hence, we do not plan to look at gonad development in offspring of single mutants.

      • Regarding expressing of the Tdrd6c constructs in BmN4 cells: the expression levels do not appear uniform and the background fluorescence is very high in some images, making comparisons and differences in expression levels/distribution difficult to see.eg Fig S6. These images (eg S6 6c and 6a6c double mutant images) should be assessed carefully and replaced with better representative images.

      Reply:

      Thank you for pointing this out. We fully agree, and we plan to quantify the images we have on these experiments to provide a more complete and possibly less biased results.

      Minor comments:

      • Fig 1 a: spelling error in the schematic "Antibody Binging site" should be changed to "Antibody binding site".

      Reply:

      This will be fixed.

      Reviewer #1 (Significance (Required)): How germ plasm stability is controlled is not well understood. In this manuscript, the role of the related Tudor-domain proteins, Tdrd6a and 6c proteins are compared. The proteins have redundant roles in germplasm stability and germ cells in early zebrafish embryos, and the combined loss of the proteins leads to germplasm destabilisation, germ cell loss and sterility. The manuscript is generally well done, and the findings are of interest to researchers interested in germline development, RNA-protein complexes and intrinsically disordered /prion-like proteins. Some further work would bolster the findings and support the main conclusions better (as detailed in major and minor comments above).

      Reviewer #2

      In this report, the authors utilize the zebrafish model to examine two multi-Tudor proteins, Tdrd6a and Tdrd6c, demonstrating that both are essential for the stability of germplasm during primordial germ cell (PGC) formation. They reveal that the Prion-like domain of Tdrd6c is key to Tdrd6c's self-interaction and its interaction with Bucky ball, a key organizer of germplasm in zebrafish, and that these interactions are regulated by the Tudor domains of Tdrd6c. These findings provide new insights into the mechanisms governing this phase-separated structure during development. Overall, the results are interesting, and the manuscript is generally well-written. However, additional experimental evidence is required to substantiate these findings.

      Major Points 1. Compared to single mutations in tdrd6a or tdrd6c, the tdrd6a/tdrd6c double mutations result in more severe PGC defects. Is there evidence for genetic compensation in single tdrd6 mutations? This needs to be clarified.

      Reply:

      This is an interesting point. We plan to do RT-qPCR on tdrd6a and tdrd6c in the single mutants to test this idea.

      In Figure 3, can injecting another tdrd6 mRNA into single mutant embryos for tdrd6a or tdrd6c rescue the PGC defect?

      Reply:

      Thank you for pointing out this idea. We had contemplated the idea, but reasoned that most likely any injected mRNA would be expressed too late to make a difference. However, we should just try it, because if it works it opens up possibilities (as also brought up by other reviewers). Hence, we plan to test this by injecting mRNAs for tdrd6a and/or tdrd6c in embryos derived from double mutant females. We believe that this approach would be more sensitive than a potential rescue on single mutants as the phenotype of the double is simply much stronger and consistent.

      Given the distinct subcellular localization of Tdrd6a and Tdrd6c during oocyte stages, it is suggested that Tdrd6a, Tdrd6c, and Buc may interact differently. This variation might contribute to differences in germplasm distribution in early embryonic development. It would be useful to assess germplasm levels and distribution in the different mutants using single-molecule fluorescence in situ hybridization (smFISH).

      Reply:

      This is a good idea, and we will test this as suggested, with smFISH.

      In Figure 5, co-immunoprecipitation (Co-IP) experiments are recommended to further confirm the interaction between Buc and Tdrd6a.

      Reply:

      Most likely the reviewer refers to Tdrd6c, and not Tdrd6a. For Tdrd6a we have shown before that it co-IPs with Buc (Roovers et al.(2018) Figure 5). Also Tdrd6c comes down in these IPs. In panel 5H we furthermore show that the coIP between Tdrd6a and Tdrd6c is disrupted in absence of Buc, implying that Tdrd6a and Tdrd6c interact with each other via Buc. Hence, we will not perform further coIP experiments from the artificial setting of BmN4 cells.

      The functional role of zebrafish Tdrd6c may not be fully elucidated through cellular experiments alone. Would injecting mutant variants of tdrd6c into tdrd6a mutant embryos rescue the PGC defects?

      Reply:

      Thank you for the good suggestion. We plan to try such rescue experiments by injection of mRNAs

      Line 368, improper writing style. "I selected, cloned and expressed...". The sentence should not use "I" as the subject.

      Reply:

      This will be fixed.

      Minor Points 1. The fonts in Figures 3C, 3D, 5B, 6B, etc., are too small and difficult to read. 2. Figure 3C and other charts are somewhat rough in appearance; optimization is recommended. 3. In line 171, an inappropriate reference is cited and should be revised.

      Reply:

      These will be addressed in the revision.

      Reviewer #2 (Significance (Required)): Strength and limitation: Strength: showing that Tdrd6a and Tdrd6c contribute to the stability of germplasm is novel. Limitation: the direct interaction between Tdrd6c and Buc is not fully supported by the experiments and results.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The manuscript "Germplasm stability in zebrafish requires maternal Tdrd6a and Tdrd6c" by Consorte and colleagues explores the poorly understood process of how the formation of the germ plasm, a collection of phase-separated RNA and protein components that segregate asymmetrically in the embryo to the future germ cells in many vertebrates, is regulated. In this study, the authors show that Tdrd6a and Tdrd6c are necessary to stabilize the germplasm in zebrafish embryos, while they are not required for the formation of a related structure during oogenesis, the Balbiani body. Interestingly, Tdrd6a and Tdrd6c are not required for the initial formation of the germ plasm in the embryo, but rather for stabilizing the germ plasm after its initial segregation from the rest of the cytoplasm: the absence of both of these proteins together in the oocyte causes a dispersal of the germ plasm during the first hours of embryogenesis, and consequently an absence of primordial germ cells in the larvae as well as sterility of the adult fish (fish looking like males were sterile, and no adult female fish in line with severely diminished gonad formation). The authors further imply a role of the prion-like domain of Tdrd6c in mediating self-interaction (clustering in the cytoplasm) as well as interaction with Bucky ball, and that these dynamics are modulated by Tdrd6c Tudor domains1-3 and lead, again in cells, to an immobilization of the Buc-Tdrd6c complex. The main new finding in this study is that Tdrd6a and Tdrd6c act redundantly and are together required for germ plasm stabilization in zebrafish. The mutant phenotype of Tdrd6a had already been previously published by the lab (and the authors introduce their prior work in the introduction). In prior work, the authors had shown that absence of Tdrd6a caused a mild phenotype in germ plasm assembly and loss of PGCs in the embryo, similar as they show now for the single Tdrd6c mutant. Moreover, Tdrd6a was also shown to interact with Buc, albeit via its Tudor domain, which is in contrast to the new finding that Tdrd6c interacts with Buc not with its Tudor but instead with its prion-like domain, which is absent in Tdrd6a. Together with the new findings presented here, this identifies Tdrd6a and Tdrd6c as redundantly acting factors that can both interact with Buckyball and can stabilize the germ plasm in the embryo.

      Major comments: The authors provide a careful analysis of the mutants, and most of the claims are fully supported by data. The data presented is very clear and the paper is well written. There is one aspect that I think would require further in vivo evidence, and that is the analysis of the interaction between Tdrd6c and Buc, which is currently performed only in vitro in the Bombyx cell line, which has clear limitations regarding conclusion that can be drawn for the in vivo situation. The observation that Tdrd6c-PrLD-TDR123 and Buc condensates localize adjacently/colocalize and that Buc condensates are immobilized on Tdrd6c granules via its PrLD domain do in my opinion suggest that Bb interacts with Tdrd6c via its PrLD domain, but this could still be indirect or an overexpression effect. To really show this, the authors should consider performing at some experiment in this regard in zebrafish embryos. I realize this is tricky given that the double mutants do not give you oocytes/embryos to work with, but maybe also here the overexpression in a single mutant would at least have the in vivo normal environment and endogenous (or transgenically labelled) Buc there. This could be either via imaging, or IPs (e.g. using the tagged line or AB). Potential AlphaFold modeling could also help though this might not result in anything given the unstructured nature of both proteins. Another alternative to show direct interaction could be a peptide-Spot-assay that might be able to detect direct interaction between those two proteins (and/or protein domains)?

      Reply:

      We believe the main point of the reviewer is that the interaction between Tdrd6c and Buc may be indirect. This is a valid point, but hard to address. As indicated in our replies to reviewer 2, we did already publish IP-MS data suggesting that Tdrd6a and Tdrd6c interact likely directly with Buc (Roovers et al.(2018)). First, a pull-down with a Buc-peptide pulled down Tdrd6a. Second, Tdrd6a and Tdrd6c interact with each other via Buc. There is no experiment that does not include artificial setting that would help us further here. However, we did recently manage to make full length Buc and Tdrd6c, and plan to use these in in vitro Buc phase-separation assays (which are working) to test if Tdrd6c may participate in Buc granules under our experimental conditions.

      Suggestion for additional experiments:

      • The authors show that ziwi-driven transgenic Tdrd6c is expressed during oogenesis but does not localize to the Balbiani body, which is rather surprising given that Tdrd6a localizes there (also confirmed again in this manuscript). Is (endogenous) Tdrd6c present already during oogenesis, and does it localize there to the Balbiani body? The authors should check this with AB staining for Tdrd6c in ovaries.

      Reply:

      This is an excellent point. We will put renewed effort in getting our Tdrd6c antibody to work on ovary samples.

      • It is currently unclear whether (endogenous) Tdrd6c is indeed already present and required in the ovary/oocyte, or whether very early expression in the embryo could be sufficient for rescuing the mutant phenotype, particularly since the initial germ plasm forms rather normally in the embryo in the double mutant. Can the authors attempt to rescue the double mutant phenotype by zygotic expression of either Tdrd6a and Tdrd6c (e.g. mRNA injection)?

      Reply:

      The phenotype we observed is strictly maternal. Zygotic, wild-type tdrd6a/c cannot not rescue the phenotype. Nevertheless, as also requested by the other reviewers, attempting rescue by mRNA injection is worthwhile, and we plan to do this.

      Minor comments: - The videos were not labelled with the respective numbers (only Movie 3 was assigned as Movie 3) - please assign them the corresponding numbers.

      Reply:

      This will be fixed.

      • In Fig 2B, DAPI would be nice to show to see directly where the nuclei are.

      Reply:

      DAPI does not stain the DNA in oocytes because the nuclei are so large. Nevertheless, we will use a Lamin antibody, or other suitable antibody, to indicate the nuclei.

      • In Fig 2C, indicate with a box the area of the zoom in D; plus make the contrast particularly for red brighter in 2C since the red is almost invisible

      Reply:

      This will be fixed.

      • Fig 4B, I would suggest still showing the 'no volume measured' data (=0) for the double mutant for the 3h timepoint (or at least indicate in the right blot as 'no data'), otherwise it's easy to miss if one just looks at the figure

      Reply:

      This will be fixed.

      • Fig 5d/E: the phenotype is visible, but it's unclear from the figure whether these images are cherry-picked and how penetrant it is; thus some quantification would be helpful (e.g. clustering amount? Relative percentage of area of the cytoplasm of a cell pink? Or granularity of the cytoplasm?)

      Reply:

      This comment was also raised by other reviewers. We will quantify the imaging we have performed.

      • Fig 6A: any speculation what is different in the few cells that have the colocalization of Buc and Tdrd6c (full-length) vs those that don't? could it be the level of the protein, or something else? In addition, I was missing to see just the Buc as a control on its own (without the co-transfection of Tdrd6c); and same comment as before, also here some quantification of changes to the Buc localization could be helpful (and changes/quantification of the Tdrd6c localization)

      Reply:

      We apologies we leaving out our Buc-only control. We have done that experiment, showing Buc alone yields nice round foci in these cells. Will include that in the revision.

      The variability in co-localization we believe indeed stems from expression levels.

      • This is more of a comment: I find it surprising that the two similar proteins would use different motifs/domains for interacting with Bb. Can it be ruled out that the previously found interaction between Tdrd6a and Bb could be mediated by Tdrd6c (via an interaction of Tdrd6a and Tdrd6c via their Tudor domains)? I assume Tdrd6c was not present in those cells during the previous assay, but could there have been another Tdrd6-like (endogenous) protein in the cells that could take 'Tdrd6c's' spot', making the interaction with Tdrd6a and Bb potentially indirect? Given this difference in domains and the in vitro overexpression cell-based assay as main evidence for this point, I do think this will require some experimental work to confirm the present model.

      Reply:

      Please see our reply to the general comments: in Roovers et al. (2018) we showed that Tdrd6a and Tdrd6c coIP with each other via Buc. Hence, Tdrd6a seems not to need Tdrd6c for Buc binding.

      *Reviewer #3 (Significance (Required)): Overall, this manuscript identifies and provides an initial characterization of two factors that are required for germ plasm stabilization and thus reproductive ability in zebrafish. The paper is solid in what it shows. It's main limitation is that the conceptual insights it provides in its current stage are rather limited. However, it does provide a useful and important foundation for future work, that will need to address how these factors regulate germ plasm condensation, and why there is a specific requirement in the embryo (but not during oogenesis). *

      Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      This is an excellent manuscript from the Ketting lab describing generation of a double mutant of tdrd6a and tdrd6c and showing that PGCs fail to form in their absence, whereas PGCs are present and functional in each single maternal-zygotic mutant, although PGCs are reduced in number. The Ketting lab previously published the tdrd6a mutant and here they describe the tdrd6c mutant and the double mutant. They find that Buc-GFP aggregation occurs normally in the double mutant but fails to persist to 3 hpf presumably due to a role of Tdrd6a/c in stabilizing the germplasm granules that have formed. The Balbiani while mildly affected in tdrd6a mutants is little or not affected in the double mutant. They perform co-localization and aggregation analysis in a cell culture system, which suggests that the Tdrd6c prion-like domain (PrLD) can self-aggregate, although not in the context of the full-length Tdrd6c. Further, the Tdrd6c PrLD with the Tudor domains 1,2, 3 co-localizes fully with Buc-GFP in granules in the cell system, while the Tdrd6c PrLD domain alone only leads to Buc-GFP docking on the Tdr6c-PrLD large aggregate. Interestingly, Tdrd6a and Tdrd6c appear to associate via distinct mechanisms to Buc, since Tdrd6a does not contain a PrLD. The points below would strengthen the manuscript.

      1. The authors should examine Tdrd6c localization in oocytes using their antibody to ensure that the Tdrd6c-mKate fusion is accurately reflecting endogenous Tdrd6c localization.

      Reply:

      This is an excellent point. We plan to do these experiments. This antibody thus far failed to work on ovary samples, but we will give it some more effort.

      The authors should test if the Tdrd6c-mKate transgene can rescue the tdrd6c mutant to ensure the mKate fusion is not altering its function, which could lead to mis-localization.

      Reply:

      This is an excellent point. We plan to do these experiments. The crossing schemes will, however, take significant time. Nevertheless, this is an important suggestion and we will try it.

      Please describe in fig 3 legend or methods the exact locations of the sequences deleted in the crispr allele generated in tdrd6c.

      Reply:

      This will be addressed.

      Line 152-153, is it not indicative of maternal expression of both tdrda and c being important, since each one alone is sufficient?

      Reply:

      Exactly, and therefore it follows that '*maternal inheritance of at least one of the Tdrd6 proteins is crucial for the specification of PGCs.' When embryo lack only one, they do relatively fine. We will look at this passage, however, to phrase it in an easier manner. *

      Lines 202-204, what percent of cells showed colocalization of Tdrdc with Buc-GFP and include the number of cells examined in a particular area. Quantitation would make more clear what is meant by 'occasional'.

      Reply:

      We will quantify the imaging experiments on the BmN4 cells.

      1. The authors previously published a balbiani body defect in the tdrda mutant in Roovers et al, 2018. The authors state in lines 235-236 that there is no Balbiani body defect in the double mutant? Is there not the same balbiani defect in the double mutant as found in the tdrd6a mutant? The authors should show their data for the normal Balbiani body and comment on this point.

      Reply:

      Thank you for pointing this out. The balbiani body defect in tdrd6a mutants is not an easy one, and we have not analysed the balbiani body in as much detail in this study as we did before for the tdrd6a mutant, as the major defect was observed in the germ plasm. However, we agree we should also addres the balbiani body in more detail. We plan to address this by looking at balbiani body morphology using smFISH markers in the various mutants.

      The authors previously published that Tdrd6a localizes around Buc droplets, at the periphery of the Buc aggregate. Tdrd6c localization in the embryo germplasm appears different and to be fully within the Buc aggregate. The authors should discuss this point, if it still holds.

      Reply:

      We will repeat the stainings at higher resolution to address this.

      Minor points:

      1. End of Introduction lines 65-67, 'demonstrate' is too strong here, since the work was done in a heterologous cell system, not the embryo, and their correct association requires both Tdrd domains 1-3 and the PrLD.

      2. Figure 1A has a typo in 'binding' site.

      3. How were the fish lines genotyped? The exact method should be included and if by PCR, the primer sequences used.

      4. Only one of the five supplementary movies is labelled, rest are all identically named, so this reviewer could not be sure of what video corresponded to what data. Also the two AVI videos did not run on the website, so could not be viewed by this reviewer.

      Reply:

      These minor issues will be resolved in the revision.

      **Referees cross-commenting** Reviewer 1: the PGCs/germline stem cells were shown to be absent at 1 dpf, re comment 1. Comment 4, Fig S6 is Zili IF in oocytes, not BmN4, although it does see a lot of background without a control of a zili mutant. Reviewer 2: I agree with point 5. For a higher impact paper, this would be required in my view. Data in cells is not necessarily reflective of in vivo. The authors are generally cautious in their interpretation though. Reviewer 3 also raises this point, although incorrectly states that there are not embryos to work with from the double mutant--they could indeed inject Tdrdc FL and the fragments as mRNA into the early embryo and test for colocalization with Buc in the germplasm at the cleavage furrows to provide in vivo evidence and increase the impact of the manuscript and then it could be appropriate for a higher impact journal. REviewer 3, I agree with point on Fig 5d/E, some measure and quantification would be helpful. I agree with comment on Fig 6A too, I thought the same. Reviewer 3 refers to the Bb multiple times, when I believe they mean the embryo germ plasm, including their last comment before Signifance. This is a good point too that Tdrd6a and c may interact with each other and only one interacts with Buc. I agree with their Significance statements.

      Reviewer #4 (Significance (Required)): This manuscript will be of interest to those studying germ cells, as well as the Piwi pathway and phase separation. The advance is an important first step to understanding how Tdrd6 proteins function in germ plasm persistence or stability in the early embryo. Interesting self-aggregation and interaction with Bucky ball studies are shown in a cell culture system that suggests the Prion-like domain of Tdrdc is important for its co-localization with Buc in droplet-like puncta, a mechanism distinct from Tdrd6a which does not contain a PrLD.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      This is an excellent manuscript from the Ketting lab describing generation of a double mutant of tdrd6a and tdrd6c and showing that PGCs fail to form in their absence, whereas PGCs are present and functional in each single maternal-zygotic mutant, although PGCs are reduced in number. The Ketting lab previously published the tdrd6a mutant and here they describe the tdrd6c mutant and the double mutant. They find that Buc-GFP aggregation occurs normally in the double mutant but fails to persist to 3 hpf presumably due to a role of Tdrd6a/c in stabilizing the germplasm granules that have formed. The Balbiani while mildly affected in tdrd6a mutants is little or not affected in the double mutant. They perform co-localization and aggregation analysis in a cell culture system, which suggests that the Tdrd6c prion-like domain (PrLD) can self-aggregate, although not in the context of the full-length Tdrd6c. Further, the Tdrd6c PrLD with the Tudor domains 1,2, 3 co-localizes fully with Buc-GFP in granules in the cell system, while the Tdrd6c PrLD domain alone only leads to Buc-GFP docking on the Tdr6c-PrLD large aggregate. Interestingly, Tdrd6a and Tdrd6c appear to associate via distinct mechanisms to Buc, since Tdrd6a does not contain a PrLD. The points below would strengthen the manuscript.

      1. The authors should examine Tdrd6c localization in oocytes using their antibody to ensure that the Tdrd6c-mKate fusion is accurately reflecting endogenous Tdrd6c localization.
      2. The authors should test if the Tdrd6c-mKate transgene can rescue the tdrd6c mutant to ensure the mKate fusion is not altering its function, which could lead to mis-localization.
      3. Please describe in fig 3 legend or methods the exact locations of the sequences deleted in the crispr allele generated in tdrd6c.
      4. Line 152-153, is it not indicative of maternal expression of both tdrda and c being important, since each one alone is sufficient?
      5. Lines 202-204, what percent of cells showed colocalization of Tdrdc with Buc-GFP and include the number of cells examined in a particular area. Quantitation would make more clear what is meant by 'occasional'.
      6. The authors previously published a balbiani body defect in the tdrda mutant in Roovers et al, 2018. The authors state in lines 235-236 that there is no Balbiani body defect in the double mutant? Is there not the same balbiani defect in the double mutant as found in the tdrd6a mutant? The authors should show their data for the normal Balbiani body and comment on this point.
      7. The authors previously published that Tdrd6a localizes around Buc droplets, at the periphery of the Buc aggregate. Tdrd6c localization in the embryo germplasm appears different and to be fully within the Buc aggregate. The authors should discuss this point, if it still holds.

      Minor points:

      1. End of Introduction lines 65-67, 'demonstrate' is too strong here, since the work was done in a heterologous cell system, not the embryo, and their correct association requires both Tdrd domains 1-3 and the PrLD.
      2. Figure 1A has a typo in 'binding' site.
      3. How were the fish lines genotyped? The exact method should be included and if by PCR, the primer sequences used.
      4. Only one of the five supplementary movies is labelled, rest are all identically named, so this reviewer could not be sure of what video corresponded to what data. Also the two AVI videos did not run on the website, so could not be viewed by this reviewer.

      Referees cross-commenting

      Reviewer 1: the PGCs/germline stem cells were shown to be absent at 1 dpf, re comment 1. Comment 4, Fig S6 is Zili IF in oocytes, not BmN4, although it does see a lot of background without a control of a zili mutant.

      Reviewer 2: I agree with point 5. For a higher impact paper, this would be required in my view. Data in cells is not necessarily reflective of in vivo. The authors are generally cautious in their interpretation though. Reviewer 3 also raises this point, although incorrectly states that there are not embryos to work with from the double mutant--they could indeed inject Tdrdc FL and the fragments as mRNA into the early embryo and test for colocalization with Buc in the germplasm at the cleavage furrows to provide in vivo evidence and increase the impact of the manuscript and then it could be appropriate for a higher impact journal.

      REviewer 3, I agree with point on Fig 5d/E, some measure and quantification would be helpful. I agree with comment on Fig 6A too, I thought the same. Reviewer 3 refers to the Bb multiple times, when I believe they mean the embryo germ plasm, including their last comment before Signifance. This is a good point too that Tdrd6a and c may interact with each other and only one interacts with Buc. I agree with their Significance statements.

      Significance

      This manuscript will be of interest to those studying germ cells, as well as the Piwi pathway and phase separation. The advance is an important first step to understanding how Tdrd6 proteins function in germ plasm persistence or stability in the early embryo. Interesting self-aggregation and interaction with Bucky ball studies are shown in a cell culture system that suggests the Prion-like domain of Tdrdc is important for its co-localization with Buc in droplet-like puncta, a mechanism distinct from Tdrd6a which does not contain a PrLD.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript "Germplasm stability in zebrafish requires maternal Tdrd6a and Tdrd6c" by Consorte and colleagues explores the poorly understood process of how the formation of the germ plasm, a collection of phase-separated RNA and protein components that segregate asymmetrically in the embryo to the future germ cells in many vertebrates, is regulated. In this study, the authors show that Tdrd6a and Tdrd6c are necessary to stabilize the germplasm in zebrafish embryos, while they are not required for the formation of a related structure during oogenesis, the Balbiani body. Interestingly, Tdrd6a and Tdrd6c are not required for the initial formation of the germ plasm in the embryo, but rather for stabilizing the germ plasm after its initial segregation from the rest of the cytoplasm: the absence of both of these proteins together in the oocyte causes a dispersal of the germ plasm during the first hours of embryogenesis, and consequently an absence of primordial germ cells in the larvae as well as sterility of the adult fish (fish looking like males were sterile, and no adult female fish in line with severely diminished gonad formation). The authors further imply a role of the prion-like domain of Tdrd6c in mediating self-interaction (clustering in the cytoplasm) as well as interaction with Bucky ball, and that these dynamics are modulated by Tdrd6c Tudor domains1-3 and lead, again in cells, to an immobilization of the Buc-Tdrd6c complex.

      The main new finding in this study is that Tdrd6a and Tdrd6c act redundantly and are together required for germ plasm stabilization in zebrafish. The mutant phenotype of Tdrd6a had already been previously published by the lab (and the authors introduce their prior work in the introduction). In prior work, the authors had shown that absence of Tdrd6a caused a mild phenotype in germ plasm assembly and loss of PGCs in the embryo, similar as they show now for the single Tdrd6c mutant. Moreover, Tdrd6a was also shown to interact with Buc, albeit via its Tudor domain, which is in contrast to the new finding that Tdrd6c interacts with Buc not with its Tudor but instead with its prion-like domain, which is absent in Tdrd6a. Together with the new findings presented here, this identifies Tdrd6a and Tdrd6c as redundantly acting factors that can both interact with Buckyball and can stabilize the germ plasm in the embryo.

      Major comments:

      The authors provide a careful analysis of the mutants, and most of the claims are fully supported by data. The data presented is very clear and the paper is well written. There is one aspect that I think would require further in vivo evidence, and that is the analysis of the interaction between Tdrd6c and Buc, which is currently performed only in vitro in the Bombyx cell line, which has clear limitations regarding conclusion that can be drawn for the in vivo situation. The observation that Tdrd6c-PrLD-TDR123 and Buc condensates localize adjacently/colocalize and that Buc condensates are immobilized on Tdrd6c granules via its PrLD domain do in my opinion suggest that Bb interacts with Tdrd6c via its PrLD domain, but this could still be indirect or an overexpression effect. To really show this, the authors should consider performing at some experiment in this regard in zebrafish embryos. I realize this is tricky given that the double mutants do not give you oocytes/embryos to work with, but maybe also here the overexpression in a single mutant would at least have the in vivo normal environment and endogenous (or transgenically labelled) Buc there. This could be either via imaging, or IPs (e.g. using the tagged line or AB). Potential AlphaFold modeling could also help though this might not result in anything given the unstructured nature of both proteins. Another alternative to show direct interaction could be a peptide-Spot-assay that might be able to detect direct interaction between those two proteins (and/or protein domains)?

      Suggestion for additional experiments:

      • The authors show that ziwi-driven transgenic Tdrd6c is expressed during oogenesis but does not localize to the Balbiani body, which is rather surprising given that Tdrd6a localizes there (also confirmed again in this manuscript). Is (endogenous) Tdrd6c present already during oogenesis, and does it localize there to the Balbiani body? The authors should check this with AB staining for Tdrd6c in ovaries.
      • It is currently unclear whether (endogenous) Tdrd6c is indeed already present and required in the ovary/oocyte, or whether very early expression in the embryo could be sufficient for rescuing the mutant phenotype, particularly since the initial germ plasm forms rather normally in the embryo in the double mutant. Can the authors attempt to rescue the double mutant phenotype by zygotic expression of either Tdrd6a and Tdrd6c (e.g. mRNA injection)?

      Minor comments:

      • The videos were not labelled with the respective numbers (only Movie 3 was assigned as Movie 3) - please assign them the corresponding numbers.
      • In Fig 2B, DAPI would be nice to show to see directly where the nuclei are.
      • In Fig 2C, indicate with a box the area of the zoom in D; plus make the contrast particularly for red brighter in 2C since the red is almost invisible
      • Fig 4B, I would suggest still showing the 'no volume measured' data (=0) for the double mutant for the 3h timepoint (or at least indicate in the right blot as 'no data'), otherwise it's easy to miss if one just looks at the figure
      • Fig 5d/E: the phenotype is visible, but it's unclear from the figure whether these images are cherry-picked and how penetrant it is; thus some quantification would be helpful (e.g. clustering amount? Relative percentage of area of the cytoplasm of a cell pink? Or granularity of the cytoplasm?)
      • Fig 6A: any speculation what is different in the few cells that have the colocalization of Buc and Tdrd6c (full-length) vs those that don't? could it be the level of the protein, or something else? In addition, I was missing to see just the Buc as a control on its own (without the co-transfection of Tdrd6c); and same comment as before, also here some quantification of changes to the Buc localization could be helpful (and changes/quantification of the Tdrd6c localization)
      • This is more of a comment: I find it surprising that the two similar proteins would use different motifs/domains for interacting with Bb. Can it be ruled out that the previously found interaction between Tdrd6a and Bb could be mediated by Tdrd6c (via an interaction of Tdrd6a and Tdrd6c via their Tudor domains)? I assume Tdrd6c was not present in those cells during the previous assay, but could there have been another Tdrd6-like (endogenous) protein in the cells that could take 'Tdrd6c's' spot', making the interaction with Tdrd6a and Bb potentially indirect? Given this difference in domains and the in vitro overexpression cell-based assay as main evidence for this point, I do think this will require some experimental work to confirm the present model.

      Significance

      Overall, this manuscript identifies and provides an initial characterization of two factors that are required for germ plasm stabilization and thus reproductive ability in zebrafish. The paper is solid in what it shows. It's main limitation is that the conceptual insights it provides in its current stage are rather limited. However, it does provide a useful and important foundation for future work, that will need to address how these factors regulate germ plasm condensation, and why there is a specific requirement in the embryo (but not during oogenesis).

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this report, the authors utilize the zebrafish model to examine two multi-Tudor proteins, Tdrd6a and Tdrd6c, demonstrating that both are essential for the stability of germplasm during primordial germ cell (PGC) formation. They reveal that the Prion-like domain of Tdrd6c is key to Tdrd6c's self-interaction and its interaction with Bucky ball, a key organizer of germplasm in zebrafish, and that these interactions are regulated by the Tudor domains of Tdrd6c. These findings provide new insights into the mechanisms governing this phase-separated structure during development. Overall, the results are interesting, and the manuscript is generally well-written. However, additional experimental evidence is required to substantiate these findings.

      Major Points

      1. Compared to single mutations in tdrd6a or tdrd6c, the tdrd6a/tdrd6c double mutations result in more severe PGC defects. Is there evidence for genetic compensation in single tdrd6 mutations? This needs to be clarified.
      2. In Figure 3, can injecting another tdrd6 mRNA into single mutant embryos for tdrd6a or tdrd6c rescue the PGC defect?
      3. Given the distinct subcellular localization of Tdrd6a and Tdrd6c during oocyte stages, it is suggested that Tdrd6a, Tdrd6c, and Buc may interact differently. This variation might contribute to differences in germplasm distribution in early embryonic development. It would be useful to assess germplasm levels and distribution in the different mutants using single-molecule fluorescence in situ hybridization (smFISH).
      4. In Figure 5, co-immunoprecipitation (Co-IP) experiments are recommended to further confirm the interaction between Buc and Tdrd6a.
      5. The functional role of zebrafish Tdrd6c may not be fully elucidated through cellular experiments alone. Would injecting mutant variants of tdrd6c into tdrd6a mutant embryos rescue the PGC defects?
      6. Line 368, improper writing style. "I selected, cloned and expressed...". The sentence should not use "I" as the subject.

      Minor Points

      1. The fonts in Figures 3C, 3D, 5B, 6B, etc., are too small and difficult to read.
      2. Figure 3C and other charts are somewhat rough in appearance; optimization is recommended.
      3. In line 171, an inappropriate reference is cited and should be revised.

      Significance

      Strength and limitation:

      *Strength: showing that Tdrd6a and Tdrd6c contribute to the stability of germplasm is novel.

      Limitation: the direct interaction between Tdrd6c and Buc is not fully supported by the experiments and results.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Consorte and coworkers focusses on the role of the tudor-doman containing proteins, Tdrd6a and Tdrd6c in Germplasm stability in zebrafish. Single mutants for each protein do not affect germ plasm stability or germ cell fates, Through the use of double mutants lacking the function of both proteins, the authors find that germ plasm complexes form and the Balbiani body of mutant oocytes are unaffected. However, the germ plasm complexes disperse during early development, leading to loss of primordial germ cells and eventually sterility of adult double mutant fish. Domain analysis of Tdrd6c showed that the Tudor domains are not required for interactions with the germ plasm organiser Bucky ball (Buc), but function in germ plasm dynamics. The prion-like domains of Tdrd6c were found to be required for interactions with Buc. Tdrd6c protein localizes to perinuclear granules in germ cells, but not in the Bb, unlike Tdrd6a. The manuscript is generally well done, and the findings are of interest to researchers interested in germline development, RNA-protein complexes and intrinsically disordered /prion-like proteins. Some further work would bolster the findings and support the main conclusions better.

      Major comments:

      • Regarding the 6a6c double mutants, figure 3 and S4 show preliminary evidence that the gonads are severely underdeveloped. However it is unclear when/what stage the gonads are arrested and whether there is a loss of germline stem cells. This can be shown.
      • The authors show that germplasm forms in single mutants for 6a and 6c and Buc-eGFP reporter transgene localization does not show overt germpalsm defects in the single mutant embryos. But PGC numbers are reduced by larval stages. Are germplasm RNAs destabilised to some extent in the single mutants? This should be examined.
      • Relevant to the PGC defects shown in Fig 3, is there is more male bias or earlier defects in the 6c single mutants ? What is the tissue shown in Fig S4 B in the double mutant? Some sections and markers would be useful.
      • Regarding expressing of the Tdrd6c constructs in BmN4 cells: the expression levels do not appear uniform and the background fluorescence is very high in some images, making comparisons and differences in expression levels/distribution difficult to see.eg Fig S6. These images (eg S6 6c and 6a6c double mutant images) should be assessed carefully and replaced with better representative images.

      Minor comments:

      • Fig 1 a: spelling error in the schematic "Antibody Binging site" should be changed to "Antibody binding site".

      Significance

      How germ plasm stability is controlled is not well understood. In this manuscript, the role of the related Tudor-domain proteins, Tdrd6a and 6c proteins are compared. The proteins have redundant roles in germplasm stability and germ cells in early zebrafish embryos, and the combined loss of the proteins leads to germplasm destabilisation, germ cell loss and sterility. The manuscript is generally well done, and the findings are of interest to researchers interested in germline development, RNA-protein complexes and intrinsically disordered /prion-like proteins. Some further work would bolster the findings and support the main conclusions better (as detailed in major and minor comments above).

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their thorough review of our manuscript and believe it has been much improved based on their comments.

      A detailed response to each comment is itemized below.


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This is an interesting manuscript where the authors systematically measure rG4 levels in brain samples at different ages of patients affected by AD. To the best of my knowledge this is the first time that BG4 staining is used in this context and the authors provide compelling evidence to show an association with BG4 staining and age or AD progression, which interestingly indicates that such RNA structure might play a role in regulating protein homeostasis as previously speculated. The methods used and the results reported seem robust and reproducible.

      In terms of the conclusions, however, I think that there are 2 main things that need addressing prior to publication:

      1) Usually in BG4 staining experiments to ensure that the signal detected is genuinely due to rG4 an RNase treatment experiment is performed. This does not have to be extended to all the samples presented but having a couple of controls where the authors observe loss of staining upon RNase treatment will be key to ensure with confidence that rG4s are detected under the experimental conditions. This is particularly relevant for this brain tissue samples where BG4 staining has never been performed before.

      Response____: With what is now known about RNA rG4s and the recent reconciliation of the controversy on rG4 formation (Kharel, Nature Communications 2023), this experiment is no longer strictly required for demonstration of rG4 formation. Despite this change, we did attempt this experiment at the reviewer's suggestion, but the controls were not successful, suggesting it may not be feasible with our fixing and staining conditions. That said, we agree that despite the G4 staining appearing primarily outside the nucleus, it would be helpful to have some direct indication of whether we were observing primarily RNA or DNA G4s, and so we performed an alternate experiment to determine this.

      In our previous submission, we had performed ribosomal RNA staining (Figure S7), and the staining patterns were similar to that of BG4, especially the punctate pattern near the nuclei. Therefore, we directly asked whether the BG4 was largely binding to rRNA and have now shown the resulting co-stain in Figure 3b. These results show that at least a large amount of the BG4 staining does arise from rG4s in ribosomes. At high magnification, we observe that the BG4 stains a subset of the ribosomes, consistent with previous observations of high rG4 levels in ribosomes both in vitro and in cells (Mestre-Fos, 2019 J Mol Biol, Mestre-Fos 2019 PLoS One, Mestre-Fos 2020 J Biol Chem), but this had never been demonstrated in tissue. This experiment has therefore both answered the primary question of whether we are primarily observing rG4s, as well as provided more detailed information on the cellular sublocalization of rG4 formation, and provided the first evidence of rG4 formation on ribosomes in tissue.

      2) The authors have an association between rG4-formation and age/disease progression. They also observe distribution dependency of this, which is great. However, this is still an association which does not allow the model to be supported. This is not something that can be fixed with an easy experiment and it is what it is, but my point is that the narrative of the manuscript should be more fair and reflect the fact that, although interesting, what the authors are observing is a simple correlation. They should still go ahead and propose a model for it, but they should be more balanced in the conclusion and do not imply that this evidence is sufficient to demonstrate the proposed model. It is absolutely fine to refer to the literature and comment on the fact that similar observations have been reported and this is in line with those, but still this is not an ultimate demonstration.

      Response: ____We agree that these are correlative studies (of necessity when studying human tissue), but recent experiments have shown that rG4s affect the aggregation of Tau in vitro - and we have now better clarified this in the text itself. We have now also been more careful in drawing causative conclusions as shown in the revised text (see yellow highlighted portions of the text).

      Minor point:

      3) rG4s themselves have been shown to generate aggregates in ALS models in the absence of any protein (Ragueso et al. Nat Commun 2023). I think this is also important in the light of my comment on the model, could well be that these rG4s are causing aggregates themselves that act as nucleation point for the proteins as reported in the paper I mentioned. Providing a broader and more unbiased view of the current literature on the topic would be fair, rather than focusing on reports more in line with the model proposed.

      __Response: ____ We agree and have modified the discussion and added a broader context, including the Ragueso report described above. __

      __Reviewer #1 (Significance (Required)): __ This is a significant novel study, as per my comments above. I believe that such a study will be of impact in the G4 and neurodegenerative fields. Providing that the authors can address the criticisms above, I strongly believe that this manuscript would be of value to the scientific community. The main strength is the novelty of the study (never done before) the main weakness is the lack of the RNase control at the moment and the slightly over interpretation of the findings (see comments above).

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      RNA guanine-rich G-quadruplexes (rG4s) are non-canonical higher order nucleic acid structures that can form under physiological conditions. Interestingly, cellular stress is positively correlated with rG4 induction. In this study, the authors examined human hippocampal postmortem tissue for the formation ofrG4s in aging and Alzheimer Disease (AD). rG4 immunostaining strongly increased in the hippocampus with both age and with AD severity. 21 cases were used in this study (age range 30-92). This immunostaining co-localized with hyper-phosphorylated tau immunostaining in neurons. The BG4 staining levels were also impacted by APOE status. rG4 structure was previously found to drive tau aggregation. Based on these observations, the authors propose a model of neurodegeneration in which chronic rG4 formation drives proteostasis collapse. This model is interesting, and would explain different observations (e.g., RNA is present in AD aggregates and rG4s can enhance protein oligomerization and tau aggregation).

      Main issue: There is indeed a positive correlation between Braak stage severity and BG4 staining, but this correlation is relatively weak and borderline significant ((R = 0.52, p value = 0.028). This is probably the main limitation of this study, which should be clearly acknowledged (together with a reminder that "correlation is not causality".

      __Response: _ We believe that we had not explained this clearly enough in the text (based on the reviewer's comment), as the correlation mentioned by the Reviewer was for the CA4 region only, and not the OML, which was substantially more correlated and statistically significant (_Spearman R= 0.72, p = 0.00086). As a result, we believe this was a miscommunication that is rectified by the revised text: __

      "In the OML, plotting BG4 percent area versus Braak stage demonstrated a strong correlation (Spearman R= 0.72) with highly significantly increased BG4 staining with higher Braak stages (p = 0.00086) (Fig. 2b)."

      Related to this, here is no clear justification to exclude the four individuals in Fig 1d (without them R increases to 0.78). Please remove this statement. On the other hand, the difference based on APOE status is more striking.

      Response: We did not mean to imply that deleting these outliers was correct, but merely were demonstrating that they were in fact outliers. To avoid this misinterpretation, we have now deleted the sentence in the Figure 1d caption mentioning the outliers.

      Minor suggestions - "BG4 immunostaining was in many cases localized in the cytoplasm near the nucleus in a punctate pattern". Define "many"

      Response: This is seen in nearly every cells and this is now altered in the text and is now identified as ribosomes containing rG4s using the rRNA antibody (Fig. 3b).

      • Specify that MABE917 corresponds to the specific single-chain version of the BG4 antibody

      __Response:____ Yes, this is correct, and this clarification has been added to the manuscript __

      • Define PMI, Braak, CERAD (add a list of acronyms or insert these definitions in Fig 1b legend)

      Response: ____These definitions have all been added when they first appear.

      • Fig 3: scale bar legend missing (50 micrometers?)

      Response:____ This has been added, and the reviewer was correct that it was 50 micrometers.

      • Supplementary data Table 1: indicate target for all antibodies

      Response: ____The target for each antibody has been added to supplementary Table 1.

      • Supplementary data Table 2: why give ages with different levels of precision? (e.g. 90.15 vs 63)

      Response:____ We apologize for this oversight and have altered the ages to the same (whole years) in the figure.

      • Supplementary data Fig 1 X-axis legend: add "(nm)" after wavelength. Sequence can also be added in the legend. Why this one? Max/Min Wavelengths in the figure do not match indications in the experimental part. Not sure if that part is actually relevant for this study.

      Response: The CD spectrum in Sup Fig 1 is the sequence that had previously been shown to aid in tau aggregation seeding, but had not been suspected by those authors to be a quadruplex. So we tested that here and showed it is a quadruplex, as described at the end of the introduction. We have added wording to the figure legend to clarify where its corresponding description in the main text can be found. We have also checked and corrected the wavelength and units.

      • Supplementary data Fig 7: Which ribosomal antibody was used?

      Response: The details of this antibody have now been added to Supplementary Table 2 which lists all the antibodies used.

      Reviewer #2 (Significance (Required)):

      Provide a link between Alzheimer disease and RNA G-quadruplexes.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This study investigated the formation of RNA G quadruplexes (rG4) in aging and AD in human hippocampal postmortem tissue. The rG4 immunostaining in the hippocampus increases strongly with age and with the severity of AD. Furthermore, rG4 is present in neurons with an accumulation of phosphorylated tau immunostaining.

      Major comments 1.The method used in this study is primarily immunostaining of BG4, and the results cannot be considered correct without additional data from more multifaceted analyses (biochemical analysis, RNA expression analysis, etc.).

      __Response: ____We respectfully disagree with the Reviewer's assessment of the value of these experiments. The most relevant biochemical experiments at the cellular and molecular level showing the role of G4s in aggregation in general and Tau in particular have been done and are referenced in the text. The results here stand on their own and are highly novel and significant, as evaluated by both of the other reviewers. There has been no previous work demonstrating the presence of rG4s in human brain - either in controls or in patients with AD. AD is a complex condition that only occurs spontaneously in the human brain and no other species; because of this complexity, novel aspects are best first studied in human brain tissue using the methods employed here. __

      Overall, the quality of the stained images is poor, and detailed quantitative analysis using further high quality data is essential to conclude the authors' conclusions.

      Response:____ We have again looked at our images and they are not poor quality -they are confocal images taken at recommended resolution of the confocal microscope. It is possible the poor quality came from pdf compression by the manuscript submission portal, which is beyond our control as they were uploaded at high resolution. These data were quantified by scientists who were blinded to the diagnosis of each case.____ The level of description on the detailed quantification is higher than we have observed in similar studies. We therefore disagree with the reviewer's conclusion.

      Reviewer #3 (Significance (Required)):

      Overall, this study is not a deeply analyzed study. In addition, the authors of this study need further understanding regarding G4.

      __Response____: It is also unclear why the reviewer believes that we do not have sufficient understanding of G4s, and would request that the reviewer instead provides specific comments regarding what is lacking in terms of knowledge on G4s, as we respectfully disagree with this judgement of our knowledge-base (see other G4 papers from the Horowitz lab, Begeman, 2020, Litberg 2023, Son, 2023 referenced below). __

      __ ____Litberg TJ, Sannapureddi RKR, Huang Z, Son A, Sathyamoorthy B, Horowitz S. Why are G-quadruplexes good at preventing protein aggregation? Jan;20(1):495-509. doi: 10.1080/15476286.2023.2228572. RNA Biol. (2023)​__

      __ ____Son A*, Huizar Cabral V*, Huang Z, Litberg TJ, Horowitz S. G-quadruplexes rescuing protein folding. May 16;120(20):e2216308120. doi: 10.1073/pnas.2216308120. Proc Natl Acad Sci U S A (2023)__

      ​____Guzman BB*, Son A*, Litberg TJ*, Huang Z*, Dominguez ‡, Horowitz S. Emerging Roles for G-Quadruplexes in Proteostasis FEBS J​.doi: 10.1111/febs.16608. (2022)

      __ ____Begeman A*, Son A*, Litberg TJ, Wroblewski TH, Gehring T, Huizar Cabral V, Bourne J, Xuan Z, Horowitz S‡. G-Quadruplexes Act as Sequence Dependent Protein Chaperones. EMBO Reports Sep 18;e49735. doi: 10.15252/embr.201949735. (2020)__

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This study investigated the formation of RNA G quadruplexes (rG4) in aging and AD in human hippocampal postmortem tissue. The rG4 immunostaining in the hippocampus increases strongly with age and with the severity of AD. Furthermore, rG4 is present in neurons with an accumulation of phosphorylated tau immunostaining.

      Major comments

      1.The method used in this study is primarily immunostaining of BG4, and the results cannot be considered correct without additional data from more multifaceted analyses (biochemical analysis, RNA expression analysis, etc.). 2. Overall, the quality of the stained images is poor, and detailed quantitative analysis using further high quality data is essential to conclude the authors' conclusions.

      Significance

      Overall, this study is not a deeply analyzed study. In addition, the authors of this study need further understanding regarding G4.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      RNA guanine-rich G-quadruplexes (rG4s) are non-canonical higher order nucleic acid structures that can form under physiological conditions. Interestingly, cellular stress is positively correlated with rG4 induction. In this study, the authors examined human hippocampal postmortem tissue for the formation ofrG4s in aging and Alzheimer Disease (AD). rG4 immunostaining strongly increased in the hippocampus with both age and with AD severity. 21 cases were used in this study (age range 30-92).

      This immunostaining co-localized with hyper-phosphorylated tau immunostaining in neurons. The BG4 staining levels were also impacted by APOE status. rG4 structure was previously found to drive tau aggregation. Based on these observations, the authors propose a model of neurodegeneration in which chronic rG4 formation drives proteostasis collapse. This model is interesting, and would explain different observations (e.g., RNA is present in AD aggregates and rG4s can enhance protein oligomerization and tau aggregation).

      Main issue

      There is indeed a positive correlation between Braak stage severity and BG4 staining, but this correlation is relatively weak and borderline significant ((R = 0.52, p value = 0.028). This is probably the main limitation of this study, which should be clearly acknowledged (together with a reminder that "correlation is not causality"). Related to this, here is no clear justification to exclude the four individuals in Fig 1d (without them R increases to 0.78). Please remove this statement. On the other hand, the difference based on APOE status is more striking.

      Minor suggestions

      • "BG4 immunostaining was in many cases localized in the cytoplasm near the nucleus in a punctate pattern". Define "many"
      • Specify that MABE917 corresponds to the specific single-chain version of the BG4 antibody
      • Define PMI, Braak, CERAD (add a list of acronyms or insert these definitions in Fig 1b legend)
      • Fig 3: scale bar legend missing (50 micrometers?)
      • Supplementary data Table 1: indicate target for all antibodies
      • Supplementary data Table 2: why give ages with different levels of precision? (e.g. 90.15 vs 63)
      • Supplementary data Fig 1 X-axis legend: add "(nm)" after wavelength. Sequence can also be added in the legend. Why this one? Max/Min Wavelengths in the figure do not match indications in the experimental part. Not sure if that part is actually relevant for this study.
      • Supplementary data Fig 7: Which ribosomal antibody was used?

      Significance

      Provide a link between Alzheimer disease and RNA G-quadruplexes.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is an interesting manuscript where the authors systematically measure rG4 levels in brain samples at different ages of patients affected by AD. To the best of my knowledge this is the first time that BG4 staining is used in this context and the authors provide compelling evidence to show an association with BG4 staining and age or AD progression, which interestingly indicates that such RNA structure might play a role in regulating protein homeostasis as previously speculated. The methods used and the results reported seems robust and reproducible. In terms of the conclusions,. however, I think that there are 2 main things that need addressing prior publication:

      1. Usually in BG4 staining experiments to ensure that the signal detected is genuinely due to rG4 an RNase treatment experiment is performed. This does not have to be extended to all the samples presented but having a couple of controls where the authors observe loss of staining upon RNase treatment will be key to ensure with confidence that rG4s are detected under the experimental conditions. This is particularly relevant for this brain tissue samples where BG4 staining has never been performed before.
      2. The authors have an association between rG4-formation and age/disease progression. They also observe distribution dependency of this, which is great. However, this is still an association which does not allow the model to be supported. This is not something that can be fixed with an easy experiment and it is what it is, but my point is that the narrative of the manuscript should be more fair and reflect the fact that, although interesting, what the authors are observing is a simple correlation. They should still go ahead and propose a model for it, but they should be more balanced in the conclusion and do not imply that this evidence is sufficient to demonstrate the proposed model. It is absolutely fine to refer to the literature and comment on the fact that similar observation have been reported and this is in line with those, but still this is not an ultimate demonstration.

      Minor point:

      1. rG4s themselves have been shown to generate aggregates in ALS models in the absence of any protein (Ragueseo et al. Nat Commun 2023). I think this is also important in the light of my comment on the model, could well be that these rG4s are causing aggregates themselves that act as nucleation point for the proteins as reported in the paper I mentioned. Providing a broader and more unbiased view of the current literature on the topic would be fair, rather than focusing on reports more in line with the model proposed.

      Significance

      This is a significant novel study, as per my comments above. I believe that such a study will be of impact in the G4 and neurodegenerative fields. Providing that the authors can address the criticisms above, I strongly believe that this manuscript would be of value to the scientific community. The main strength is the novelty of the study (never done before) the main weakness is the lack of the RNase control at the moment and the slightly over interpretation of the findings (see comments above).

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      __Dear Reviewers, __

      We would like to thank you for the time and attention dedicated to reviewing our manuscript. We have taken all the questions and comments into consideration, which we believe have helped us to improve the paper.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The study by Aguirre-Botero et al. shows the dynamics of 3D11 anti-CSP monoclonal antibody (mAb) mediated elimination of rodent malaria Plasmodium berghei (Pb) parasites in the liver. The authors show that the anti-CSP mAb could protect against intravenous (i.v.) Pb sporozoite challenge along with the cutaneous challenge, but requires higher concentration of antibody. Importantly, the study shows that the anti-CSP mAb not only affects sporozoite motility, sinusoidal extravasation, and cell invasion but also partially impairs the intracellular development inside the liver parenchyma, indicating a late effect of this antibody during liver stage development. While the study is interesting and conducted well, the only novel yet very important observation made in this manuscript is the effect of the anti-CSP mAb on liver stage development.

      Major This observation is highlighted in the manuscript title but is supported by only limited data. A such it needs to be substantiated and a mechanism should be investigated. The phenomenon of intracellular effects of the anti-CSP mAb should be analyzed in much more detail. For example, can the authors demonstrate uptake of the Ab together with the parasite during hepatocyte invasion? What cellular mechanism leads to elimination?

      Lines 234 - 243; 308 - 325: These results are the gist of the entire study and also defined the title of the manuscript. Thus, it would be pre-mature to claim the substantial effect of 3D11 antibody in late killing of the parasite in the infected hepatocytes just by looking at the decreased GFP fluorescence. The authors need to at least verify the fitness of the liver stages by measuring the size of the developing parasites as well as using different parasite specific markers (UIS4, MSP1, HSP70 etc.) in immunofluorescence assays on the infected liver sections and in vitro infections.

      Response

      We greatly appreciate the comments. We have taken the suggestions into consideration and deepened the characterization of 3D11's late killing of parasites. We first analyzed the presence of 3D11 in the intracellular parasite after the invasion and compared it with the CSP expression on the surface of control parasites (new Fig. 4F). Next, we tested a potential action of 3D11 added in the cell culture after the invasion (new Fig. 4G). The two new panels and the text accompanying them are shown below.

      *“Post-invasion labeling of 3D11 bound to the membrane of intracellular parasites revealed a strong staining surrounding the parasite at 2 and 15h, but only punctual traces of 3D11 at 44h (Figure 4F, 3D11, 3D11). Of note, CSP was detected surrounding the control parasites at all time-points indicating that the lack of staining at 44h is not due to a decrease in the CSP amount on the parasite surface (Figure 4F, CSP, Control). To evaluate the potential post-invasion entry of 3D11 into the PV of infected cells and posterior neutralization of intracellular parasites, we incubated invaded cells from 2 to 44 h with 3D11, but no effect on the parasite intracellular development was observed (Figure 4G, 2h p.i.). 3D11 incubated for 2 h with sporozoites and cells elicited, as expected, a dose-dependent inhibition of parasite development. Altogether, our results indicate that the late inhibition of parasite development is already achieved at 15h and likely caused by antibodies dragged inside cells bound to sporozoites before or during the invasion.” *

      Finally, we better characterized the parasite loss of fitness caused by 3D11 in infected cells by quantifying the parasite size, GFP intensity and the presence and intensity of UIS4, a parasitophorous vacuole membrane developmental marker at 2, 4 and 44h as described below in the new figure 5 and accompanying text.

      “*To further characterize the killing of intracellular parasites by 3D11 in HepG2 cells, we next evaluated the expression of the parasitophorous vacuole membrane (PVM) marker, UIS4 37, to infer the parasite intracellular development at 2, 4 and 44h. HepG2 cells were incubated with Pb-GFP expressing sporozoites in the absence (Control, Figure 5) or presence of 1.25 µg/mL of 3D11 during the first two hours of incubation (3D11, Figure 5). The chosen 3D11 concentration led to ~50% decrease in cell invasion (Figure 4C, 2h) and ~30% decrease in the post-invasion number of EEFs (Figure 4D), leaving enough parasites to be analyzed by microscopy. To distinguish between extracellular and intracellular parasites at 2h, washed and fixed samples were incubated with mouse 3D11 mAb (1µg/mL) and revealed with a fluorescent anti-mouse secondary antibody (Figure 5A, 3D11 in blue). Samples were then permeabilized and incubated with a goat anti-UIS4 polyclonal antibody revealed with a fluorescent anti-goat secondary antibody (Figure 5A, UIS4 in red). DNA was stained with Hoechst (Figure 5A, DNA in white). *

      *Extracellular GFP+ sporozoites were identified by their 3D11+UIS4- phenotype (Figure 5A, 2h, extracellular). Conversely, intracellular parasites were identified by their 3D11- phenotype and stained positive or negative for UIS4 (Figure 5A, 2h and 44h, intracellular). UIS4+ PVM is normally associated with a productive cell infection 37. However, a small number of EEFs can develop in the absence of UIS4 37 , likely inside the host cell nucleus (Figure 5A, 44h, intranuclear). *

      *In the control and 3D11-treated groups, the percentage of intracellular UIS4- parasites decreased 2 to 3-fold from 2 to 44h, as expected of a parasite population negative for a marker of productive infection (Figure 5B). However, while at 2h in the control group, this population represented 14% of intracellular parasites, in the 3D11-treated group, it reached 48% (Figure 5B). This ~3-fold increase in the UIS4 negative population could explain the late killing of intracellular sporozoites by 3D11. Whether this population is constituted by intracellular transmigratory sporozoites lacking a PVM or parasites surrounded by a PVM, but incapable of secreting UIS4 still needs to be determined. At 44h, surviving EEFs in the 3D11-treated samples presented a similar area and UIS4 staining intensity than control parasites (Figure 5C, D). However, as observed by flow cytometry (Figure 4D), the GFP intensity of 3D11-treated parasites was significantly lower than control EEFs, indicating that 3D11 can somehow affect protein expression with undetermined effects in the genesis of red blood cell infecting stages.” *

      ____Reviewer #1__ Minor comments __

      • Line 44 - 43: The statement is applicable only to the rodent infecting Plasmodium parasites. The authors need to clarify that.

      Responses

      This is an important clarification. We have modified the text that now reads:

      "The sporozoite surface is covered by a dense coat of the immunodominant circumsporozoite protein (CSP), shown to be an immunodominant protective antigen using a rodent malaria model".

      • Line 68: Replace the second 'against' after the CSP with 'of'.

      It is done.

      • Line 141 - 143: The 3D11 mAb does affect the homing and killing in the blood of cutaneous injected sporozoites. The authors need to clearly state that the statement is true only for i.v. injected sporozoites.

      Thank you for the comment. Now the text reads "Altogether, these data indicate that 3D11 rather than having an early effect on i.v. inoculated sporozoites, e.g. in the homing or killing in the blood, requires more than 4 h to eliminate most parasites in the liver."

      • Figure 3B: The numbers of sporozoites detected in the experiment varies from 0 h (line 172) to 2 h (line 184). Therefore, the numbers need to be mentioned on all the bars of each timepoint.

      This information was missing but now we have added the numbers to Figure 3B.

      • Figure 3C: If the authors have used flk1-GFP mice, then how well they were able to detect the Pb-PfCSP GFP parasites in the vessel vs. parenchyma in the intravital imaging? The representative images for Pb-PfCSP GFP should also be included.

      Since 3D11 does not target PbPf parasites most of them are motile in the movies, making them easily distinguishable from the endothelial cells. In addition, the stronger GFP intensity of sporozoites make them easily detectable in the sinusoids. Representative images were added in the supplementary figures (now Figure S3).

      • It is not mentioned anywhere how the viability of the sporozoites was determined. This has to be described especially in the methods section.

      • Also, the flow acquisition and data analysis of the sporozoites and infected HepG2 cells must be described in the method section.

      __We briefly mentioned it in the results (line 228- 230): “____In addition, by comparing the total number of recovered GFP+ sporozoites at 2 h in the two studied conditions, we measured the early lethality (%viable sporozoites, Figure 4B) of the anti-CSP Ab on the extracellular forms of the parasite (Figure 4A).” __

      A more detailed description has been added in the methods section that now reads:

      After 2 h, the supernatant was collected, and the culture was washed 2x with 0.5 volume of PBS. The cells were subsequently trypsinized. The supernatant plus the washing steps and the trypsinized cells were analyzed by flow cytometry to quantify the amount of GFP+ events inside and outside cells (Figure 3A and Figure S4). Viability was then quantified by the sum of the total number of sporozoites (GPF+ events) in the supernatant, inside and outside the cells. We calculated the percentage of parasite viability by dividing the average of the total number of sporozoites in the treated samples by the average in controls using three technical replicates for each condition. Additionally, we quantified the percentage of infected cells using the total number of GFP+ events in the HepG2 gate (Figure S4). To compare the biological replicates, we further normalized to the control of each experiment. For the samples used to analyze parasite development, the cells were incubated for 15 or 44 h after sporozoite addition, and the medium was changed after 2 and 24 h. The cells were trypsinized and the percentage of intracellular parasites was determined by flow cytometry as described above (Figure S4). The prolonged effect between 2 h and 15/44 h was calculated by normalizing the percentage of infected cells at 15/44 h to that of 2 h. For all flow cytometry measurements, the same volume was acquired.”

      • Figure 4: The flow layouts should be included for at least comparing the 0 vs. 5 μg/ml of 3D11 mAb concentrations.

      Flow layouts were added in the supplementary figures.

      • Line 651 (Figure S1 legend): Typographical error '14'.

      Thank you for noticing. We corrected it.

      Reviewer #2 (Evidence, reproducibility and clarity (Required____)):

      Aguirre-Botero and collaborators report on the dynamics of Plasmodium parasite elimination in the liver using the 3D11 anti-CSP monoclonal antibody (mAb). By using microscopy and bioluminescence imaging in the P. berghei rodent malaria model, the authors first demonstrate that higher antibody concentrations are required for protection against intravenous sporozoite challenge, when compared to cutaneous challenge, which is not surprising. The study also shows that the 3D11 mAb reduces sporozoite motility, impairs hepatic sinusoidal barrier crossing, and more relevantly inhibits intracellular development of liver stages through its cytotoxic activity. These findings highlight the role of this specific monoclonal antibody, 3D11 mAb against CSP, in targeting sporozoites in the liver.


      Major Comments

      The study provides valuable insights into the mechanisms of protection conferred by the 3D11 anti-CSP monoclonal antibody against P. berghei sporozoites and this finding allow the field to speculate that other monoclonal antibodies against CSP of P. Falciparum may act similarly. However, an important experiment is missing that would significantly strengthen the conclusions. Specifically, the authors should perform experiments where the monoclonal antibody is added immediately after the sporozoites have completed invasion. This should be done both in vitro and in vivo to show whether the antibody has any effect on intracellular development of liver stages when added after invasion.

      While the claims are generally supported by the data presented, to comprehensively conclude the late cytotoxic effects of 3D11, the additional experiment of post-invasion antibody application is relevant. This would help determine if the observed effects are due to the antibody's action during invasion or its continued action post-invasion.

      The data and methods are presented in a manner that allows for reproducibility. The use of microscopy and bioluminescence imaging is well-documented. The experiments appear adequately replicated, and statistical analyses are appropriate.

      Response

      We thank reviewer 2 for these important suggestions. To be sure that the effect might not come from the internalization of the antibodies after sporozoite invasion, we tested the amount of 3D11 bound to the parasite following invasion (new Fig. 4F) and the potential post-invasion neutralizing effect of 3D11 ____in vitro____. The results obtained are presented below.

      *“Post-invasion labeling of 3D11 bound to the membrane of intracellular parasites revealed a strong staining surrounding the parasite at 2 and 15h, but only punctual traces of 3D11 at 44h (Figure 4F, 3D11, 3D11). Of note, CSP was detected surrounding the control parasites at all time-points indicating that the lack of staining at 44h is not due to a decrease in the CSP amount on the parasite surface (Figure 4F, CSP, Control). To evaluate the potential post-invasion entry of 3D11 into the PV of infected cells and posterior neutralization of intracellular parasites, we incubated invaded cells from 2 to 44 h with 3D11, but no effect on the parasite intracellular development was observed (Figure 4G, 2h p.i.). 3D11 incubated for 2 h with sporozoites and cells elicited, as expected, a dose-dependent inhibition of parasite development. Altogether, our results indicate that the late inhibition of parasite development is already achieved at 15h and likely caused by antibodies dragged inside cells bound to sporozoites before or during the invasion.” *

      __Reviewer #2 ____Minor Comments __

      • The text and figures are clear and accurate. Some minor typographical errors should be corrected.

      Thank you for the remark, we have worked on that and hope that the text reads better now.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Aguirre-Botero et al have studied the effect of a potent monoclonal antibody against the circumsporozoite protein, the major surface protein of the malaria sporozoite. This is an elegantly designed, performed, and analyzed study. They have efficiently delineated the mode of action of anti-CSP repeat mAb and confirmed previous in vitro work (not cited) that demonstrated the same intracellular effect. Specific comments :

      • Line 51: The authors claim a correlation between high antibody levels and protection. However, they did not provide direct proof that these antibodies were responsible for protection, nor did they establish a cut-off level of anti-CSP antibodies that would distinguish between protected and unprotected individuals.

      We would first like to thank reviewer 3 for the comments. Indeed, we agree with reviewer 3, these are correlative studies where the causality cannot be established. We modified the ensuing sentence to specify the causality between anti-CSP mAbs and in vivo protection against sporozoite infection. Now the text reads: "Extensive research has demonstrated a positive correlation between high levels of anti-CSP antibodies (Abs) induced by the RTS,S/AS01 vaccine and its efficacy against malaria 11-13. Remarkably, anti-CSP monoclonal Abs (mAbs) have been proven to protect in vivo against malaria in various experimental settings, including, mice 14-21, monkeys 23, and humans 24-26"

      • Line 326: The late intrahepatic effect of mAb against the CSP repeat has been previously reported (see Figure 2, Nudelman et al, J Immunol, 1989). The effect was shown to affect the transition from liver trophozoites to liver schizonts. This study should be cited and discussed.

      Thank you for the remark. Now the text reads: Notably, a similar effect has been previously reported using sera from mice immunized with PfCSP or mAb against P. yoelii (Py) CSP. Incubation of Pf or Py sporozoites with the immune sera or mAbs not only affected sporozoite invasion in vitro but continued to affect intracellular forms for several days after invasion38,39. Additionally, using anti-PfCSP sera, it was also observed that late EEFs from sera-treated sporozoites had abnormal morphology38. Altogether, it was thus concluded that the anti-CSP Abs present in the sera had a long-term effect on the parasites38,39.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Aguirre-Botero et al have studied the effect of a potent monoclonal antibody against the circumsporozoite protein, the major surface protein of the malaria sporozoite. This is an elegantly designed, performed, and analyzed study. They have efficiently delineated the mode of action of anti-CSP repeat mAb and confirmed previous in vitro work (not cited) that demonstrated the same intracellular effect.

      Specific comments

      Line 51: The authors claim a correlation between high antibody levels and protection. However, they did not provide direct proof that these antibodies were responsible for protection, nor did they establish a cut-off level of anti-CSP antibodies that would distinguish between protected and unprotected individuals.

      Lone 326: The late intrahepatic effect of mAb against the CSP repeat has been previously reported (see Figure 2, Nudelman et al, J Immunol, 1989). The effect was shown to affect the transition from liver trophozoites to liver schizonts. This study should be cited and discussed.

      Significance

      A well-done study that elucidates the mechanisms of a protective monoclonal antibody against malaria sporozoites. These data are important and will interest a large audience of researchers working in infectious diseases and immunology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Aguirre-Botero and collaborators report on the dynamics of Plasmodium parasite elimination in the liver using the 3D11 anti-CSP monoclonal antibody (mAb). By using microscopy and bioluminescence imaging in the P. berghei rodent malaria model, the authors first demonstrate that higher antibody concentrations are required for protection against intravenous sporozoite challenge, when compared to cutaneous challenge, which is not surprising. The study also shows that the 3D11 mAb reduces sporozoite motility, impairs hepatic sinusoidal barrier crossing, and more relevantly inhibits intracellular development of liver stages through its cytotoxic activity. These findings highlight the role of this specific monoclonal antibody, 3D11 mAb against CSP, in targeting sporozoites in the liver.


      Major Comments

      The study provides valuable insights into the mechanisms of protection conferred by the 3D11 anti-CSP monoclonal antibody against P. berghei sporozoites and this finding allow the field to speculate that other monoclonal antibodies against CSP of P. Falciparum may act similarly. However, an important experiment is missing that would significantly strengthen the conclusions. Specifically, the authors should perform experiments where the monoclonal antibody is added immediately after the sporozoites have completed invasion. This should be done both in vitro and in vivo to show whether the antibody has any effect on intracellular development of liver stages when added after invasion.

      While the claims are generally supported by the data presented, to comprehensively conclude the late cytotoxic effects of 3D11, the additional experiment of post-invasion antibody application is relevant. This would help determine if the observed effects are due to the antibody's action during invasion or its continued action post-invasion.

      The data and methods are presented in a manner that allows for reproducibility. The use of microscopy and bioluminescence imaging is well-documented. The experiments appear adequately replicated, and statistical analyses are appropriate.

      Minor Comments

      The text and figures are clear and accurate. Some minor typographical errors should be corrected.

      Significance

      The study's strengths lie in its detailed analysis of the 3D11 mAb's effect on sporozoite motility and liver stage development. The use of advanced imaging techniques adds robustness to the findings. The main limitation is the lack of data on the antibody's effect post-invasion. Additionally, the study's conclusions are based on a single monoclonal antibody and its target region, which may not be representative of other anti-CSP antibodies. Still, the findings offer insights into the cytotoxic action of anti-CSP antibodies, which could inform the development of more effective malaria vaccines and therapeutic antibodies.

      This research will primarily interest a specialized audience in malaria research, particularly those focused on vaccine development and antibody therapeutics. It also holds value for broader audiences in immunology and infectious disease research.

      My expertise: Malaria research and liver invasion by Plasmodium sporozoites

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The study by Aguirre-Botero et al. shows the dynamics of 3D11 anti-CSP monoclonal antibody (mAb) mediated elimination of rodent malaria Plasmodium berghei (Pb) parasites in the liver. The authors show that the anti-CSP mAb could protect against intravenous (i.v.) Pb sporozoite challenge along with the cutaneous challenge, but requires higher concentration of antibody. Importantly, the study shows that the anti-CSP mAb not only affects sporozoite motility, sinusoidal extravasation, and cell invasion but also partially impairs the intracellular development inside the liver parenchyma, indicating a late effect of this antibody during liver stage development. While the study is interesting and conducted well, the only novel yet very important observation made in this manuscript is the effect of the anti-CSP mAb on liver stage development.

      Major

      This observation is highlighted in the manuscript title but is supported by only limited data. A such it needs to be substantiated and a mechanism should be investigated.

      • The phenomenon of intracellular effects of the anti-CSP mAb should be analyzed in much more detail. For example, can the authors demonstrate uptake of the Ab together with the parasite during hepatocyte invasion? What cellular mechanism leads to elimination?

      Minor

      • Line 44 - 43: The statement is applicable only to the rodent infecting Plasmodium parasites. The authors need to clarify that.
      • Line 68: Replace the second 'against' after the CSP with 'of'.
      • Line 141 - 143: The 3D11 mAb does affect the homing and killing in the blood of cutaneous injected sporozoites. The authors need to clearly state that the statement is true only for i.v. injected sporozoites.
      • Figure 3B: The numbers of sporozoites detected in the experiment varies from 0 h (line 172) to 2 h (line 184). Therefore, the numbers need to be mentioned on all the bars of each timepoint.
      • Figure 3C: If the authors have used flk1-GFP mice, then how well they were able to detect the Pb-PfCSP GFP parasites in the vessel vs. parenchyma in the intravital imaging? The representative images for Pb-PfCSP GFP should also be included.
      • It is not mentioned anywhere how the viability of the sporozoites was determined. This has to be described especially in the methods section.
      • Also, the flow acquisition and data analysis of the sporozoites and infected HepG2 cells must be described in the method section.
      • Figure 4: The flow layouts should be included for at least comparing the 0 vs. 5 μg/ml of 3D11 mAb concentrations.
      • Lines 234 - 243; 308 - 325: These results are the gist of the entire study and also defined the title of the manuscript. Thus, it would be pre-mature to claim the substantial effect of 3D11 antibody in late killing of the parasite in the infected hepatocytes just by looking at the decreased GFP fluorescence. The authors need to at least verify the fitness of the liver stages by measuring the size of the developing parasites as well as using different parasite specific markers (UIS4, MSP1, HSP70 etc.) in immunofluorescence assays on the infected liver sections and in vitro infections.
      • Line 651 (Figure S1 legend): Typographical error '14'.

      Significance

      The phenomenon of intracellular effects of the anti-CSP Ab is the only novel observation and as such, should be analyzed in much more detail. For example, can the authors demonstrate uptake of the Ab together with the parasite during hepatocyte invasion? What cellular mechanism leads to elimination?

  3. Oct 2024
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to the Reviewers

      I would like to thank the reviewers for their comments and interest in the manuscript and the study.

      Referee #1

      1. I would assume that there are RNA-seq and/or ChIP-seq data out there produced after knockdown of one or more of these DBPs that show directional positioning.

      Response: As the reviewer pointed out, a wet experimental validation of the results of this study would give an opportunity for more biological researchers to have an interest in the study. I plan to promote the wet experimental analysis in collaboration with biological experimental researchers as a next step of this study. The same analysis would be performed in immortalized cells for CRISPR experiment (e.g. Guo Y et al. Cell 2015).

      1. Figure 6 should be expanded to incorporate analysis of DBPs not overlapping CTCF/cohesin in chromatin interaction data that is important and potentially more interesting than the simple DBPs enrichment reported in the present form of the figure.

      Response: Following the reviewer's advice, I performed the same analysis with the DNA-binding sites that do no overlap with the DNA-binding sites of CTCF and cohesin (RAD21 and SMC3) (Fig. 6 and Supplementary Fig. 3). The result showed the same tendency in the distribution of DNA-binding sites. The height of a peak on the graph became lower for some DNA-binding proteins after removing the DNA-binding sites that overlapped with those of CTCF and cohesin. I have added the following sentence on lines 359 and 705: For the insulator-associated DBPs other than CTCF, RAD21, and SMC3, the DNA-binding sites that do not overlap with those of CTCF, RND21, and SMC3 were used to examine their distribution around interaction sites.

      1. Critically, I would like to see use of Micro-C/Hi-C data and ChIP-seq from these factors, where insulation scores around their directionally-bound sites show some sort of an effect like that presumed by the authors - and many such datasets are publicly-available and can be put to good use here.

      Response: As suggested by the reviewer, I have added the insulator scores and boundary sites from the 4D nucleome data portal as tracks in the UCSC genome browser. The insulator scores seem to correspond to some extent to the H3K27me3 histone marks from ChIP-seq (Fig. 4a and Supplementary Fig. 2). The direction of DNA-binding sites on the genome can be shown with different colors (e.g. red and green), but the directionality is their overall tendency, and it may be difficult to notice the directionality from each binding site.

      I found that the CTCF binding sites examined by a wet experiment in the previous study may not always overlap with the boundary sites of chromatin interactions from Micro-C assay (Guo Y et al. Cell 2015). The chromatin interaction data do not include all interactions due to the high sequencing cost of the assay. The number of the boundary sites may be smaller than that of CTCF binding sites acting as insulators and/or some of the CTCF binding sites may not be locate in the boundary sites. It may be difficult for the boundary location algorithm to identify a short boundary location. Due to the limitations of the chromatin interaction data, I planned to search for insulator-associated DNA-binding proteins without using chromatin interaction data in this study.

      1. The suggested alternative transcripts function, also highlighted in the manuscripts abstract, is only supported by visual inspection of a few cases for several putative DBPs. I believe this is insufficient to support what looks like one of the major claims of the paper when reading the abstract, and a more quantitative and genome-wide analysis must be adopted, although the authors mention it as just an 'observation'.

      Response: According to the reviewer's comment, I performed the genome-wide analysis of alternative transcripts where the DNA-binding sites of insulator-associated proteins are located near splicing sites. The DNA-binding sites of insulator-associated DNA-binding proteins were found within 200bp centered on splice sites more significantly than the other DNA-binding proteins (Fig. 4E and Table 2). I have added the following sentences on lines 329 - 337: We performed the statistical test to estimate the enrichment of insulator-associated DNA-binding sites compared to the other DNA-binding proteins, and found that the insulator-associated DNA-binding sites were significantly more abundant at splice sites than the DNA-binding sites of the other proteins (Fig 4e and Table 2; Mann‒Whitney U test, p value 5. Figure 1 serves no purpose in my opinion and can be removed, while figures can generally be improved (e.g., the browser screenshots in Figs 4 and 5) for interpretability from readers outside the immediate research field.

      Response: I believe that the Figure 1 would help researchers in other fields who are not familiar with biological phenomena and functions to understand the study. More explanation has been included in the Figures and legends of Figs. 4 and 5 to help readers outside the immediate research field understand the figures.

      1. Similarly, the text is rather convoluted at places and should be re-approached with more clarity for less specialized readers in mind.

      Response: Reviewer #2's comments would be related to this comment. I have introduced a more detailed explanation of the method in the Results section, as shown in the responses to Reviewer #2's comments.

      Referee #2

      1. Introduction, line 95: CTCF appears two times, it seems redundant.

      Response: On lines 91-93, I deleted the latter CTCF from the sentence "and examined the directional bias of DNA-binding sites of CTCF and insulator-associated DBPs, including those of known DBPs such as RAD21 and SMC3".

      1. Introduction, lines 99-103: Please stress better the novelty of the work. What is the main focus? The new identified DPBs or their binding sites? What are the "novel structural and functional roles of DBPs" mentioned?

      Response: Although CTCF is known to be the main insulator protein in vertebrates, we found that 97 DNA-binding proteins including CTCF and cohesin are associated with insulator sites by modifying and developing a machine learning method to search for insulator-associated DNA-binding proteins. Most of the insulator-associated DNA-binding proteins showed the directional bias of DNA-binding motifs, suggesting that the directional bias is associated with the insulator.

      I have revised the statement in lines 97-104 as follows: To validate these findings, we demonstrate that the DNA-binding sites of the identified insulator-associated DBPs are located within potential insulator sites, and some of the DNA-binding sites in the insulator site are found without the nearby DNA-binding sites of CTCF and cohesin. Our method and analyses contribute to the identification of insulator- and chromatin-associated DNA-binding sites that influence EPIs and reveal novel functional roles and molecular mechanisms of DBPs associated with transcriptional condensation, phase separation and transcriptional regulation.

      1. Results, line 111: How do the SNPs come into the procedure? From the figures it seems the input is ChIP-seq peaks of DNBPs around the TSS.

      Response: On lines 115-118, to explain the procedure for the SNP of an eQTL, I have added the sentence in the Methods: "If a DNA-binding site was located within a 100-bp region around a single-nucleotide polymorphism (SNP) of an eQTL, we assumed that the DNA-binding proteins regulated the expression of the transcript corresponding to the eQTL".

      1. Again, are those SNPs coming from the different cell lines? Or are they from individuals w.r.t some reference genome? I suggest a general restructuring of this part to let the reader understand more easily. One option could be simplifying the details here or alternatively including all the necessary details.

      Response: On line 113, I have included the explanation of the eQTL dataset of GTEx v8 as follows: "The GTEx v8 dataset, after quality control, consists of 838 donors and 17,382 samples from 52 tissues and two cell lines". On lines 569 and 753, I have added the filename of the eQTL data "(GTEx_Analysis_v8_eQTL.tar)".

      1. Figure 1: panel a and b are misleading. Is the matrix in panel a equivalent to the matrix in panel b? If not please clarify why. Maybe in b it is included the info about the SNPs? And if yes, again, what is then difference with a.

      Response: The reviewer would mention Figure 2, not Figure 1. If so, the matrices in panels a and b in Figure 2 are equivalent. The green boxes in the matrix show the regions with the ChIP-seq peak of a DNA-binding protein overlapping with a SNP of an eQTL. I used eQTL data to associate a gene with a ChIP-seq peak that was more than 2 kb upstream and 1 kb downstream of a transcriptional start site of a gene. For each gene, the matrix was produced and the gene expression levels in cells were learned and predicted using the deep learning method. I have added the following sentences to explain the method in lines 125 - 131: Through the training, the tool learned to select the binding sites of DNA-binding proteins from ChIP-seq assays that were suitable for predicting gene expression levels in the cell types. The binding sites of a DNA-binding protein tend to be observed in common across multiple cell and tissue types. Therefore, ChIP-seq data and eQTL data in different cell and tissue types were used as input data for learning, and then the tool selected the data suitable for predicting gene expression levels in the cell types.

      1. Line 386-388: could the author investigate in more detail this observation? Does it mean that loops driven by other DBPs independent of the known CTCF/Cohesin? Could the author provide examples of chromatin structural data e.g. MicroC?

      Response: As suggested by the reviewer, to help readers understand the observation, I have added Supplementary Fig. S3c to show the distribution of DNA-binding sites of "CTCF, RAD21, and SMC3" and "BACH2, FOS, ATF3, NFE2, and MAFK" around chromatin interaction sites. I have modified the following sentence to indicate the figure on line 417: Although a DNA-binding-site distribution pattern around chromatin interaction sites similar to those of CTCF, RAD21, and SMC3 was observed for DBPs such as BACH2, FOS, ATF3, NFE2, and MAFK, less than 1% of the DNA-binding sites of the latter set of DBPs colocalized with CTCF, RAD21, or SMC3 in a single bin (Fig. S3c).

      In Aljahani A et al. Nature Communications 2022, we find that depletion of cohesin causes a subtle reduction in longer-range enhancer-promoter interactions and that CTCF depletion can cause rewiring of regulatory contacts. Together, our data show that loop extrusion is not essential for enhancer-promoter interactions, but contributes to their robustness and specificity and to precise regulation of gene expression. Goel VY et al. Nature Genetics 2023 mentioned in the abstract: Microcompartments frequently connect enhancers and promoters and though loss of loop extrusion and inhibition of transcription disrupts some microcompartments, most are largely unaffected. These results suggested that chromatin loops can be driven by other DBPs independent of the known CTCF/Cohesin. I added the following sentence on 470: The depletion of cohesin causes a subtle reduction in longer-range enhancer-promoter interactions and that CTCF depletion can cause rewiring of regulatory contacts. The loop extrusion is not essential for enhancer-promoter interactions, but contributes to their robustness and specificity and to precise regulation of gene expression.

      FOXA1 pioneer factor functions as an initial chromatin-binding and chromatin-remodeling factor and has been reported to form biomolecular condensates (Ji D et al. Molecular Cell 2024). CTCF have also found to form transcriptional condensate and phase separation (Lee R et al. Nucleic acids research 2022). FOS was found to be an insulator-associated DNA-binding protein in this study and is potentially involved in chromatin remodeling, transcription condensation, and phase separation with the other factors such as BACH2, ATF3, NFE2 and MAFK. I have added the following sentence on line 467: FOXA1 pioneer factor functions as an initial chromatin-binding and chromatin-remodeling factor and has been reported to form biomolecular condensates.

      1. In general, how the presented results are related to some models of chromatin architecture, e.g. loop extrusion, in which it is integrated convergent CTCF binding sites?

      Response: Goel VY et al. Nature Genetics 2023 identified highly nested and focal interactions through region capture Micro-C, which resemble fine-scale compartmental interactions and are termed microcompartments. In the section titled "Most microcompartments are robust to loss of loop extrusion," the researchers noted that a small proportion of interactions between CTCF and cohesin-bound sites exhibited significant reductions in strength when cohesin was depleted. In contrast, the majority of microcompartmental interactions remained largely unchanged under cohesin depletion. Our findings indicate that most P-P and E-P interactions, aside from a few CTCF and cohesin-bound enhancers and promoters, are likely facilitated by a compartmentalization mechanism that differs from loop extrusion. We suggest that nested, multiway, and focal microcompartments correspond to small, discrete A-compartments that arise through a compartmentalization process, potentially influenced by factors upstream of RNA Pol II initiation, such as transcription factors, co-factors, or active chromatin states. It follows that if active chromatin regions at microcompartment anchors exhibit selective "stickiness" with one another, they will tend to co-segregate, leading to the development of nested, focal interactions. This microphase separation, driven by preferential interactions among active loci within a block copolymer, may account for the striking interaction patterns we observe.

      The authors of the paper proposed several mechanisms potentially involved in microcompartments. These mechanisms may be involved in looping with insulator function. Maz and MyoD1 among the identified insulator-associated DNA-binding proteins form loops without CTCF (Xiao T et al. Proc Natl Acad Sci USA 2021 ; Ortabozkoyun H et al. Nature genetics 2022 ; Wang R et al. Nature communications 2022). If chromatin loop anchors have some structural conformation, as shown in the paper entitled "The structural basis for cohesin-CTCF-anchored loops" (Li Y et al. Nature 2020), directional DNA binding would occur similarly to CTCF binding sites. I have included the following explanation on line 476: Maz and MyoD1 among the identified insulator-associated DNA-binding proteins form loops without CTCF.

      1. Do the authors think that the identified DBPs could work in that way as well?

      Response: Liquid-liquid phase separation was shown to occur through CTCF-mediated chromatin loops and to act as an insulator (Lee, R et al. Nucleic Acids Research 2022). Among the identified insulator-associated DNA-binding proteins, CEBPA has been found to form hubs that colocalize with transcriptional co-activators in a native cell context, which is associated with transcriptional condensate and phase separation (Christou-Kent M et al. Cell Reports 2023). The proposed microcompartment mechanisms are also associated with phase separation. Thus, the same or similar mechanisms are potentially associated with the insulator function of the identified DNA-binding proteins. I have included the following information on line 465: CEBPA in the identified insulator-associated DNA-binding proteins was also reported to be involved in transcriptional condensates and phase separation.

      1. Also, can the authors comment about the mechanisms those newly identified DBPs mediate contacts by active processes or equilibrium processes?

      Response: Snead WT et al. Molecular Cell 2019 mentioned that protein post-transcriptional modifications (PTMs) facilitate the control of molecular valency and strength of protein-protein interactions. O-GlcNAcylation as a PTM inhibits CTCF binding to chromatin (Tang X et al. Nature Communications 2024). I found that the identified insulator-associated DNA-binding proteins tend to form a cluster at potential insulator sites (Supplemental Fig. 2d). These proteins may interact and actively regulate chromatin interactions, transcriptional condensation, and phase separation by PTMs. I have added the following explanation on lines 478-484: Furthermore, protein post-transcriptional modifications (PTMs) facilitate control over the molecular valency and strength of protein-protein interactions. O-GlcNAcylation as a PTM inhibits CTCF binding to chromatin. We found that the identified insulator-associated DNA-binding proteins tend to form a cluster at potential insulator sites (Supplementary Fig. 2d). These proteins may interact and actively regulate chromatin interactions, transcriptional condensation, and phase separation through PTMs.

      1. Can the author provide some real examples along with published structural data (e.g. the mentioned micro-C data) to show the link between protein co-presence, directional bias and contact formation?

      Response: Structural molecular model of cohesin-CTCF-anchored loops has been published by Li Y et al. Nature 2020. The structural conformation of CTCF and cohesin in the loops would be the cause of the directional bias of CTCF binding sites, which I mentioned in lines 454 - 458.

      Referee #3

      1. Some of these TFs do not have specific direct binding to DNA (P300, Cohesin). Since the authors are using binding motifs in their analysis workflow, I would remove those from the analysis.

      Response: When a protein complex binds to DNA, one protein of the complex binds to the DNA directory, and the other proteins may not bind to DNA. However, the DNA motif sequence bound by the protein may be registered as the DNA-binding motif of all the proteins in the complex. The molecular structure of the complex of CTCF and Cohesin showed that both CTCF and Cohesin bind to DNA (Li Y et al. Nature 2020). I think there is a possibility that if the molecular structure of a protein complex becomes available, the previous recognition of the DNA-binding ability of a protein may be changed. Therefore, I searched the Pfam database for 99 insulator-associated DNA-binding proteins identified in this study. I found that 97 are registered as DNA-binding proteins and/or have a known DNA-binding domain, and EP300 and SIN3A do not directory bind to DNA, which was also checked by Google search. I have added the following explanation in line 239 to indicate direct and indirect DNA-binding proteins: Among 99 insulator-associated DBPs, EP300 and SIN3A do not directory interact with DNA, and thus 97 insulator-associated DBPs directory bind to DNA. I have updated the sentence in line 22 of the Abstract as follows: We discovered 97 directional and minor nondirectional motifs in human fibroblast cells that corresponded to 23 DBPs related to insulator function, CTCF, and/or other types of chromosomal transcriptional regulation reported in previous studies.

      1. I am not sure if I understood correctly, by why do the authors consider enhancers spanning 2Mb (200 bins of 10Kb around eSNPs)? This seems wrong. Enhancers are relatively small regions (100bp to 1Kb) and only a very small subset form super enhancers.

      Response: As the reviewer mentioned, I recognize enhancers are relatively small regions. In the paper, I intended to examine further upstream and downstream of promoter regions where enhancers are found. Therefore, I have modified the sentence in line 805 of the Fig. 2 legend as follows: Enhancer-gene regulatory interaction regions consist of 200 bins of 10 kbp between -1 Mbp and 1 Mbp region from TSS, not including promoter.

      1. I think the H3K27me3 analysis was very good, but I would have liked to see also constitutive heterochromatin as well, so maybe repeat the analysis for H3K9me3.

      Response: Following the reviewer's advice, I have added the ChIP-seq data of H3K9me3 as a truck of the UCSC Genome Browser. The distribution of H3K9me3 signal was different from that of H3K27me3 in some regions. I also found the insulator-associated DNA-binding sites close to the edges of H3K9me3 regions, but the number of the sites may be less than for H3K27me3. I took some screenshots of the UCSC Genome Browser of the regions around the sites in Supplementary Fig. 2b. I have modified the following sentence on line 853 in the legend of Fig. 4: a Distribution of histone modification marks H3K27me3 (green color) and H3K9me3 (turquoise color) and transcript levels (pink color) in upstream and downstream regions of a potential insulator site (light orange color). I have also added the following result on lines 313 - 316: The same analysis was performed using H3K9me3 marks, instead of H3K27me3. We found that the distribution of H3K9me3 signal was different from that of H3K27me3 in some regions, and discovered the insulator-associated DNA-binding sites close to the edges of H3K9me3 regions (Fig. S2b).

      1. I was not sure I understood the analysis in Figure 6. The binding site is with 500bp of the interaction site, but micro-C interactions are at best at 1Kb resolution. They say they chose the centre of the interaction site, but we don't know exactly where there is the actual interaction. Also, it is not clear what they measure. Is it the number of binding sites of a specific or multiple DBP insulator proteins at a specific distance from this midpoint that they recover in all chromatin loops? Maybe I am missing something. This analysis was not very clear.

      Response: The resolution of the Micro-C assay is considered to be 100bp and above, as the human nucleome core particle contains 145bp (and 193bp with linker) of DNA. However, internucleosomal DNA is cleaved by endonuclease into fragments of multiples of 10 nucleotides (Pospelov VA et al. Nucleic Acids Research 1979). Highly nested focal interactions were observed (Goel VY et al. Nature Genetics 2023). Base pair resolution was reported using Micro Capture-C (Hua P et al. Nature 2021). Sub-kilobase (20bp resolution) chromatin topology was reported using an MNase-based chromosome conformation capture (3C) approach (Aljahani A et al. Nature Communications 2022). On the other hand, Hi-C data was analyzed at 1kb resolution. (Gu H et al. bioRxiv 2021). If the resolution of Micro-C interactions is at best at 1kb, the binding sites of a DNA-binding protein will not show a peak around the center of the genomic locations of interaction edges. Each panel shows the number of binding sites of a specific DNA-binding protein at a specific distance from the midpoint of all chromatin interaction edges. I have modified and added the following sentences in 487-491: High-resolution chromatin interaction data from a Micro-C assay indicated that most of the predicted insulator-associated DBPs showed DNA-binding-site distribution peaks around chromatin interaction sites, suggesting that these DBPs are involved in chromatin interactions and that the chromatin interaction data has a high degree of resolution. Base pair resolution was reported using Micro Capture-C.

      Minor comments:

      1. PIQ does not consider TF concentration. Other methods do that and show that TF concentration improves predictions (e.g., https://www.biorxiv.org/content/10.1101/2023.07.15.549134v2 or https://pubmed.ncbi.nlm.nih.gov/37486787/). The authors should discuss how that would impact their results.

      Response: The directional bias of CTCF binding sites was identified by ChIA-pet interactions of CTCF binding sites. The analysis of the contribution scores of DNA-binding sites of proteins considering the binding sites of CTCF as an insulator showed the same tendency of directional bias of CTCF binding sites. In the analysis, to remove the false-positive prediction of DNA-binding sites, I used the binding sites that overlapped with a ChIP-seq peak of the DNA-binding protein. This result suggests that the DNA-binding sites of CTCF obtained by the current analysis have sufficient quality. Therefore, if the accuracy of prediction of DNA-binding sites is improved, althought the number of DNA-binding sites may be different, the overall tendency of the directionality of DNA-binding sites will not change and the results of this study will not change significantly.

      As for the first reference in the reviewer's comment, chromatin interaction data from Micro-C assay does not include all chromatin interactions in a cell or tissue, because it is expensive to cover all interactions. Therefore, it would be difficult to predict all chromatin interactions based on machine learning. As for the second reference in the reviewer's comment, pioneer factors such as FOXA are known to bind to closed chromatin regions, but transcription factors and DNA-binding proteins involved in chromatin interactions and insulators generally bind to open chromatin regions. The search for the DNA-binding motifs is not required in closed chromatin regions.

      1. DeepLIFT is a good approach to interpret complex structures of CNN, but is not truly explainable AI. I think the authors should acknowledge this.

      Response: In the DeepLIFT paper, the authors explain that DeepLIFT is a method for decomposing the output prediction of a neural network on a specific input by backpropagating the contributions of all neurons in the network to every feature of the input (Shrikumar A et al. ICML 2017). DeepLIFT compares the activation of each neuron to its 'reference activation' and assigns contribution scores according to the difference. DeepLIFT calculates a metric to measure the difference between an input and the reference of the input.

      Truly explainable AI would be able to find cause and reason, and to make choices and decisions like humans. DeepLIFT does not perform causal inferences. I did not use the term "Explainable AI" in our manuscript, but I briefly explained it in Discussion. I have added the following explanation in lines 503-508: AI (Artificial Intelligence) is considered as a black box, since the reason and cause of prediction are difficult to know. To solve this issue, tools and methods have been developed to know the reason and cause. These technologies are called Explainable AI. DeepLIFT is considered to be a tool for Explainable AI. However, DeepLIFT does not answer the reason and cause for a prediction. It calculates scores representing the contribution of the input data to the prediction.

      Furthermore, to improve the readability of the manuscript, I have included the following explanation in lines 152-158: we computed DeepLIFT scores of the input data (i.e., each binding site of the ChIP-seq data of DNA-binding proteins) in the deep leaning analysis on gene expression levels. DeepLIFT compares the importance of each input for predicting gene expression levels to its 'reference or background level' and assigns contribution scores according to the difference. DeepLIFT calculates a metric to measure the difference between an input and the reference of the input.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Osato and Hamada propose a systematic approach to identify DNA binding proteins that display directional binding. They used a modified Deep Learning method (DEcode) to investigate binding profiles of 1356 DBP from GTRD database at promoters (30 of 100bp bins around TSS) and enhancers (200 bins of 10Kb around eSNPs) and use this to predict expression of 25,071 genes in Fibroblasts, Monocytes, HMEC and NPC. This method achieves a good prediction power (Spearman correlation between predicted and actual expression of 0.74). They then use PIQ, and overlap predicted binding sites with actual ChIP-seq data to investigate the motifs of TFs that are controlling gene expression. They find 99 insulator proteins showing either a specific directional bias or minor non-directional bias, corresponding to 23 DBP previously reported to have insulator function. Of the 23 proteins they identify as regulating enhancer promoter interactions, 13 are associated with CTCF. They also show that there are significantly more insulator proteins binding sites at borders of polycomb domains, transcriptionally active or boundary regions based on chromatin interactions than other proteins.

      Major Comments:

      1. Some of these TFs do not have specific direct binding to DNA (P300, Cohesin). Since the authors are using binding motifs in their analysis workflow, I would remove those from the analysis.
      2. I am not sure if I understood correctly, by why do the authors consider enhancers spanning 2Mb (200 bins of 10Kb around eSNPs)? This seems wrong. Enhancers are relatively small regions (100bp to 1Kb) and only a very small subset form super enhancers.
      3. I think the H3K27me3 analysis was very good, but I would have liked to see also constitutive heterochromatin as well, so maybe repeat the analysis for H3K9me3.
      4. I was not sure I understood the analysis in Figure 6. The binding site is with 500bp of the interaction site, but micro-C interactions are at best at 1Kb resolution. They say they chose the centre of the interaction site, but we don't know exactly where there is the actual interaction. Also, it is not clear what they measure. Is it the number of binding sites of a specific or multiple DBP insulator proteins at a specific distance from this midpoint that they recover in all chromatin loops? Maybe I am missing something. This analysis was not very clear.

      Minor comments:

      1. PIQ does not consider TF concentration. Other methods do that and show that TF concentration improves predictions (e.g., https://www.biorxiv.org/content/10.1101/2023.07.15.549134v2 or https://pubmed.ncbi.nlm.nih.gov/37486787/). The authors should discuss how that would impact their results.
      2. DeepLIFT is a good approach to interpret complex structures of CNN, but is not truly explainable AI. I think the authors should acknowledge this.

      Referee Cross-Commenting

      I would like to mention that I agree with the comments of reviewers 1 and 2.

      Significance

      General assessment:

      This is the first study to my knowledge that attempts to use Deep Learning to identify insulators and directional biases in binding. One of the limitations is that no additional methods were used to show that these DBP have directional binding bias. It is not necessarily to employ additional methods, but it would definitely strengthen the paper.

      Advancements:

      This is a useful catalogue of potential DNA binding proteins of interest, beyond just CTCF. Some known TFs are there, but also new ones are found.

      Audience:

      Basic research mainly, with particular focus on chromatin conformation and TF binding fields.

      My expertise:

      ML/AI methods in genomics, TF binding models, epigenetics and 3D chromatin interactions.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this work, the authors describe a deep learning computational tool to identity binding motifs of DNA binding proteins associated to insulators that led to the discovery of 99 motifs related to insulation. This is in turn related to chromatin architecture and highlight the importance of directional bias in order to form chromatin loops.

      In general, there are some aspects to be clarified and better explored to make stronger conclusions. In particular, there are some aspects to clarify in the text about the Machine Learning procedure (see my points below). In addition, I have some general questions about the biological implications of the discussed findings, listed in detail in the following list.

      Also, I encourage the authors to integrate the current presentation of the data with other (published) data about chromatin architecture, to make more robust the claims and go deeper into the biological implications of the current work. Se my list below.

      It follows a specific list of relevant points to be addressed:

      Specific points:

      1. Introduction, line 95: CTCF appears two times, it seems redundant;
      2. Introduction, lines 99-103: Please stress better the novelty of the work. What is the main focus? The new identified DPBs or their binding sites? What are the "novel structural and functional roles of DBPs" mentioned?
      3. Results, line 111: How do the SNPs come into the procedure? From the figures it seems the input is ChIP-seq peaks of DNBPs around the TSS;
      4. Again, are those SNPs coming from the different cell lines? Or are they from individuals w.r.t some reference genome? I suggest a general restructuring of this part to let the reader understand more easily. One option could be simplifying the details here or alternatively including all the necessary details;
      5. Figure 1: panel a and b are misleading. Is the matrix in panel a equivalent to the matrix in panel b? If not please clarify why. Maybe in b it is included the info about the SNPs? And if yes, again, what is then difference with a.
      6. Line 386-388: could the author investigate in more detail this observation? Does it mean that loops driven by other DBPs independent of the known CTCF/Cohesin? Could the author provide examples of chromatin structural data e.g. MicroC?
      7. In general, how the presented results are related to some models of chromatin architecture, e.g. loop extrusion, in which it is integrated convergent CTCF binding sites?
      8. Do the authors think that the identified DBPs could work in that way as well?
      9. Also, can the authors comment about the mechanisms those newly identified DBPs mediate contacts by active processes or equilibrium processes?
      10. Can the author provide some real examples along with published structural data (e.g. the mentioned micro-C data) to show the link between protein co-presence, directional bias and contact formation?

      Significance

      In this work, the authors describe a deep learning computational tool to identity binding motifs of DNA binding proteins associated to insulators that led to the discovery of 99 motifs related to insulation. This is in turn related to chromatin architecture and highlight the importance of directional bias in order to form chromatin loops.

      In general, chromatin organization is an important topic in the context of a constantly expanding research field. Therefore, the work is timely and could be useful for the community. The paper appears overall well written and the figures look clear and of good quality. Nevertheless, there are some aspects to be clarified and better explored to make stronger conclusions. In particular, there are some aspects to clarify in the text about the Machine Learning procedure (see list of specific points). In addition, I have some general questions about the biological implications of the discussed findings, listed in detail in the above reported points.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The study by Osato and Hamada aims at computationally identifying a set of novel putative insulator-associated DNA binding proteins (DBPs) via estimation of their contribution to the expression of genes in the same chromosome region of their binding sites (+- 1Mbp from TSS). To achieve this, the authors leverage a deep learning architecture already published via which ChIP-seq peaks of DBPs in the TSS of a given gene are used to predict its expression level in four human cell lines.

      Building on this, the authors used another tool called DeepLIFT to evaluate the weight of each DBP binding site on the final gene expression value. Hence they made the assumption that if a given DBP had an insulator function they could restrict the prediction of the gene's expression to the region included between pairs of that DBP binding sites, and evaluate the pair's motif directionality bias in the distribution of weights. They exemplify their approach's validity by the fact that they can predict the known directionality bias of CTCF/cohesin-bound sites as the highest of the lot, with the F-R orientation of the pairs the most enriched, recapitulating what already known in literature: i.e., that F-R chromatin interaction peaks are the most enriched. In addition, they find several new DBPs showing significant directionality bias; hence they could be candidates for insulation activity. They then provide correlation between these putative insulator binding sites and sites of transition between euchromatin and heterochromatin by independently using histone mark and gene expression datasets. This, of course, is not surprising because (a) there is insulation between regions with heterotypic chromatin identities, and (b) it was already known from the first papers describing insulated chromatin domains that their boundaries were well-enriched for active transcription and transcriptional regulators (e.g., Dixon et al, Nature 2012).

      Finally, they use chromatin interaction (looping) sites to check the overlap between CTCF and all other DBPs and define a subset of putative insulator DBPs not overlapping CTCF peaks, suggesting potentially new insulatory mechanisms. These factors were all known transcriptional activators, but this part of the findings carry most of the novelty in the work and have the potential of opening up new directions for research in chromatin organization.

      Overall, the methodology applied here is adequate, clear, and reproducible. The major issue, in our view, is that the entire manuscript's findings relies on the usage of deepLIFT, a tool which was not benchmarked previously or by the current study. In fact, deepLIFT is public as regards its code, and also appears as a preprint from 2017 on biorXiv and published in the Proceedings of Machine Learning Research conference. Also, this key tool was developed by the Kundaje lab (who produce high quality alogrithms), and not by the authors. Therefore, the manuscript is predominantly based on the execution of existing workflows to publicly-available data. This does not take anything away from the interesting question posed here, but at the same time does not provide the community with any new algorithm/workflow.

      Finally, although I appreciate that the authors are purely computational and have likely no capacity for experimental validation of their claims of new DBPs having insulator roles, I would assume that there are RNA-seq and/or ChIP-seq data out there produced after knockdown of one or more of these DBPs that show directional positioning. Using this kind of data, effects on gene expression can at least be tested in regard to the authors' predictions. Moreover, in terms of validation, Figure 6 should be expanded to incorporate analysis of DBPs not overlapping CTCF/cohesin in chromatin interaction data that is important and potentially more interesting than the simple DBPs enrichment reported in the present form of the figure. Critically, I would like to see use of Micro-C/Hi-C data and ChIP-seq from these factors, where insulation scores around their directionally-bound sites show some sort of an effect like that presumed by the authors - and many such datasets are publicly-available and can be put to good use here.

      As secondary issues, we would point out that:

      • The suggested alternative transcripts function, also highlighted in the manuscript;s abstract, is only supported by visual inspection of a few cases for several putative DBPs. I believe this is insufficient to support what looks like one of the major claims of the paper when reading the abstract, and a more quantitative and genome-wide analysis must be adopted, although the authors mention it as just an 'observation'.
      • Figure 1 serves no purpose in my opinion and can be removed, while figures can generally be improved (e.g., the browser screenshots in Figs 4 and 5) for interpretability from readers outside the immediate research field.
      • Similarly, the text is rather convoluted at places and should be re-approached with more clarity for less specialized readers in mind.

      Significance

      The scientific novelty of the work lies primarily in the identification of a set of DBPs that are proposed to confer insulator activity genome-wide. This has been long sought after in human data (whilst it is well understood and defined in Drosophila). The authors produce a quantitative ranking of the putative insulation effect of these DBPs and, most importantly, go on to identify a smaller subset that are apparently non-overlapping with anchors of CTCF-cohesin loop anchors; the presence of strong motif orientation biases in many DBPs can also be of broad interest, especially those that cannot be trivially ascribable to the loop extrusion process.

      However, although these findings open the way for speculation on multiple insulation mechanisms via proteins with multiple regulatory functions, the manuscript provide no experimental or computational means to test the proposed roles of these DBPs - and, as such, this limits the potential impact of the work and mostly targets researchers in the field of genome organization that can test these findings. Having said this, if validated, this work can significantly broaden our understanding of how chromatin is organized in 3D nuclear space.

      I typically identify myself to the authors: A. Papantonis, expertise in 3D genome architecture, chromatin biology, and genomics/bioinformatics.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We sincerely thank all three reviewers for their professional and constructive feedback. We appreciate the thorough evaluation of our manuscript and are committed to revising both the manuscript and supplemental materials based on the suggestions. We have carefully considered each comment and have addressed most of them in the initial revised version, which has been transferred. Additionally, we are currently conducting new experiments to provide the requested data to address a few comments. We are confident that these revision experiments will be completed in a couple of months or so, which will significantly enhance the quality of our study.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In the manuscript by Agarwal and Ghosh, the authors examine yeast Scm3 function in the DNA damage response. They show that Scm3 loss results in DNA damage sensitivity and more Rad52 foci. Importantly, Scm3 is recruited to DSB sites using an HO-cut site and its loss results in an attenuated DNA damage checkpoint as measured by Rad53 phosphorylation. The authors demonstrate convincingly that Scm3, like its human counterpart HJURP, plays a role in the DNA damage response through altered Rad53 activation. However, what its specific role in DNA repair is remains ambiguous.

      Major comments:

      1. It is unusual to see multiple DNA repair foci as those observed in Figure 2B. What is the distribution of cells with 1, 2, 3, 4, or more foci? Are more observed in SCM3-AID cells perhaps suggesting that the DSB ends are not being clustered as would be expected in WT cells exposed to DNA damage?

      Response: As per the reviewer’s comment, we have included a graph (Figure R2) showing the distribution of cells with 1, 2, 3, 4, or more Rad52-GFP foci when they are treated with MMS. There are more cells with 4 or more foci when Scm3 is depleted (SCM3-AID + Auxin) compared to the wild type (SCM3-AID). The average number of Rad52-GFP foci per cell presented in Figure 2B (2.8 in the mutant vs. 1.9 in the wild type) is well in accord with the previous report (Conde and San-Segundo, 2008), where the same was reported as ~2.5 in the cells lacking a methyl transferase Dot1, vs ~ 1.5 in the wild type. More Rad52-GFP foci in MMS-treated cells lacking Scm3 may arise due to the creation of too many damaged sites to be accommodated in 1-2 foci and/or due to the inability of the cells to cluster the DSB ends.

      This result has been incorporated as a new supplementary Figure S4C and new text has been added in the revised manuscript as: “We further quantified the distribution of cells with 1, 2, 3, or >4 Rad52-GFP foci in wild type (SCM3-AID) or Scm3 depleted (SCM3-AID + auxin) cells treated with MMS. Scm3 depleted cells showed a significantly higher number of cells with more than >4 Rad52-GFP foci, suggesting the possibility of the creation of too many damaged sites to be accommodated in 1-2 foci or the inability of such cells to cluster the DSB ends.” in page 7, lines: 237-241.

      2. The peaks with increased Scm3 recruitment by ChIP-seq upon MMS is confusing as MMS does not induce specific damage at genomic locations. Is Scm3 being recruited at other genomic sites that might be more susceptible to DNA damage? Is Scm3 recruited to Pol2 sites for example? Or fragile sites?

      __Response: __We believe that the MMS induced increase in association of Scm3 with the non-centromeric chromatin loci depends on MMS sensitive vulnerable chromosomal sites. We agree with the reviewer that MMS might cause DNA damage at these sites, leading to Scm3 occupancy at those sites. Therefore, we compared the sites of Scm3 occupancy with possible such sites available from the literature that include fragile sites, RNA Pol II binding sites, double strand break hotspots, and coldspots. Based on our analysis, we have included the following lines in the ‘discussion’ section in page 16-17, lines 566-594 as follows:

      “Moreover, an overall increase in the chromatin association of Scm3 in response to MMS also suggests that Scm3 might be recruited to several repair centers or sites that are susceptible to DNA damage, for example, the fragile sites (Figure 3B, C, E, S6). These sites in yeast are DNA regions prone to breakage under replication stress, often corresponding to replication-slow zones (RSZs) (Lemoine et al., 2005). These regions include replication termination (TER) sequences, tRNA genes, long-terminal repeats (LTRs), highly transcribed genes, inverted repeats/palindromes, centromeres, autonomously replicating sequences (ARS), telomeres, and rDNA (Song et al., 2014). Since the helicase Rrm3 is often associated with these fragile regions (Song et al., 2014), we compared Scm3 binding sites with the top 25 Rrm3 binding sites from the literature (Azvolinsky et al., 2009). In untreated cells, Scm3 sites overlapped with three Rrm3 sites on chromosomes X, XII, and XIV. Whereas in MMS treated cells, overlapping was found with four Rrm3 sites, with two (on chromosomes XII and XIV) shared with untreated cells and two new sites were observed on chromosomes II and XII (Table R1). Mapping of the Scm3 sites with the tRNA genes and LTRs revealed that these sites from the untreated cells did not overlap with the LTRs (Raveendranathan et al., 2006). However, the same from the treated cells showed overlap with two LTRs on chromosome XVI. No overlap with tRNA genes was observed in the treated cells (Table R1). We next examined Scm3 occupancy at 71 TERs documented in the literature (Fachinetti et al., 2010). Scm3 was found to bind to 6 TERs in both untreated and MMS-treated cells. Notably, MMS treatment resulted in three new peaks, while three peaks were shared with untreated samples (Table R1). Lastly, we compared Scm3 sites with top 25 RNA Pol II sites obtained from the literature (Azvolinsky et al., 2009). In untreated cells, Scm3 was found at only one of these Pol II sites, whereas after MMS treatment, Scm3 sites overlapped with four such sites (Table R1). We further checked the occupancy of Scm3 at a few DSB hotspots (BUD23, ECM3, and CCT6) and DSB coldspot (YCR093W) as mentioned in the literature (Dash et al., 2024; Nandanan et al., 2021). However, we did not find Scm3 binding to these sites. Overall, in-silico analysis of the binding sites indicates that the non-centromeric enrichment of Scm3 occurs at sites that are amenable to DNA damage.”

      Table R1: The table summarising the occupancy of Scm3 in untreated or MMS treated conditions at the indicated regions

      Region

      Chromosome

      Scm3 occupancy

      Untreated

      MMS treated

      Rrm3 binding sites

      Chr II

      YES

      Chr X

      YES

      Chr XII

      YES

      Chr XII

      YES

      YES

      Chr XIV

      YES

      YES

      LTRs

      Chr XVI

      YES

      Chr XVI

      YES

      tRNA

      Chr XV

      YES

      TERs

      Chr IV

      YES

      Chr V

      YES

      Chr VI

      YES

      YES

      Chr VII

      YES

      Chr X

      YES

      Chr X

      YES

      Chr XIV

      YES

      YES

      Chr XV

      YES

      YES

      Chr XVI

      YES

      Pol II binding sites

      Chr II

      YES

      Chr X

      YES

      Chr XII

      YES

      Chr XII

      YES

      Chr XV

      YES

      The Table R1 has been incorporated as Table S1 in the revised manuscript.

      3. The phosphorylation aspect of Scm3 is intriguing and the authors show that Mec1 is not responsible for mediating its phosphorylation. Tel1 is another kinase that should be examined.

      __Response: __We thank the reviewer for the suggestion. We are in the process of examining the role of Tel1 kinase on Scm3 phosphorylation. The results from the experiment will be incorporated in the manuscript.

      Minor comment: 1. It is hard to see what MMS resistance the authors state is observed in Mif2-depleted cells in Figure S3. Perhaps this could be better explained or the claim removed.

      __Response: __We agree with the comment and have removed the claim from the manuscript.

      2. Protein levels of Scm3 or any of the other factors depleted with AID were never assessed.

      __Response: __We have assessed the protein level of Scm3 and a control protein, tubulin using western blotting as per the reviewer’s suggestion (Figure R3). We did not observe any significant change in the protein levels in SCM3-HA or SCM3-HA-AID cells, suggesting that the AID tagging of Scm3 per se did not make the cells non-functional and the protein was degraded as expected upon addition of auxin. Moreover, the SCM3-AID cells were used previously to examine the effect of Scm3 on kinetochore assembly (Lang et al., 2018).

      This result has been incorporated as Figure S2C, and new text has been added in the revised manuscript as: “The depletion of Scm3 was verified by observing a higher percentage of G2/M arrested cells and by western blot analysis verifying degradation of Scm3-AID after auxin treatment (Figure S2B, C).” in page 5, lines: 150-152.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      This manuscript describes a study implicating the Scm3 protein from budding yeast in the DNA damage response (DDR). Scm3 is a chaperone protein, whose main role is considered to be the loading of CENP-A(Cse4) at centromeres to facilitate chromosome segregation. However, the human ortholog of Scm3, HJURP, is known to have a role in DDR and in this study the authors provide evidence that Scm3 is also involved in the DDR in yeast. For the most part, the results presented support the conclusions made.

      Main Points

      1. Figure 1 Could depletion of Scm3 arrest cells in late G2/M and it is this delay that causes increased sensitivity to DNA damaging agents? A control with nocodazole or other means - that also arrests cells at this point - might provide a nice control for this. Perhaps the other kinetochore mutants, used therein, achieve this control - but cell cycle phase would need to be assessed.

      __Response: __ We thank the reviewer for pointing out to a probable effect of the cell cycle stage on the observed MMS sensitivity. In fact, we were also concerned that the observed DNA damage sensitivity in Scm3 depleted cells might be due to G2/M arrest. To rule out this possibility, we monitored Rad52-GFP foci as a marker for DNA damage in the wild type and Scm3 depleted cells both arrested at G2/M using nocodazole (Figure S4). While Scm3 depleted condition exhibited >20% Rad52-GFP positive cells, less than 10% wild type cells showed the same in the absence of any DNA damaging agents (Figure S4E, no MMS, 60 mins). Upon challenging these cells with MMS in the presence of nocodazole, Scm3 depleted condition exhibited over 40% Rad52-GFP positive cells, whereas less than 20% wild-type cells harboured Rad52-GFP. This significant increase in Rad52-GFP positive cells when Scm3 is depleted clearly indicates that the observed MMS sensitivity in these cells is due to the absence of Scm3 rather than due to an effect of a cell cycle stage. Furthermore, we have also used Cdc20 depleted G2/M arrested cells as a wild type control to test the activation of the DNA damage checkpoint by Rad53 phosphorylation. These cells showed robust Rad53 activation in response to MMS, in contrast to poor activation in Scm3 depleted cells (Figure 6), suggesting that G2/M arrest is not the reason for the DNA damage sensitivity observed in the latter cells.

      However, as per the reviewer's suggestion, we examined the MMS sensitivity of the wild type cells arrested at G2/M by nocodazole. As expected, these cells did not show increased sensitivity which further confirms that the DNA damage sensitivity observed in the scm3 mutant is not due to G2/M arrest (Figure R4B). This result has been incorporated within Figure S3, replacing the earlier Figure S3.

      To include this result, we have included new text, and revised the result section in page 5-6, lines 160-181 as follows:

      “The increased sensitivity of scm3-depleted cells to DNA-damaging agents could be due to the weakening of the kinetochores as Scm3-mediated deposition of Cse4 promotes kinetochore assembly or due to the delay in cell cycle, as Scm3 depleted cells arrest in late G2/M phase (Camahort et al., 2007; Cho and Harrison, 2011). If either of these holds true, perturbation of the kinetochore by degradation of other kinetochore proteins or wild type cells arrested at metaphase must show a similar sensitivity to MMS. In budding yeast, Ndc10 is recruited to the centromeres upstream of Scm3 (Lang et al., 2018), whereas the centromeric localization of Mif2, another essential inner kinetochore protein, depends on Scm3 and Cse4 (Xiao et al., 2017). We constructed NDC10-AID and MIF2-AID strains and used them for our assay to represent the proteins independent or dependent on Scm3 for centromeric localization, respectively. We also included one non-essential kinetochore protein, Ctf19, a protein of the COMA complex, to remove any possible mis-judgement in distinguishing cell-growth-arrest phenotype occurring due to drug-sensitivity vs. auxin-mediated degradation of essential proteins. The COMA complex is directly recruited to the centromeres through interaction with the N terminal tail of Cse4, hence dependent on Scm3 (Chen et al., 2000; Fischböck-Halwachs et al., 2019). Mid-log phase cells were harvested and spotted on the indicated plates, however, we did not observe any increased sensitivity of such cells to MMS (Figure S3). Further, wild type cells, when challenged in the presence of nocodazole and MMS, also did not show any increased sensitivity to MMS. Therefore, the increased sensitivity to MMS in scm3 mutant but not in other kinetochore mutant or metaphase arrested cells indicates that Scm3 possesses an additional function in genome stability besides its role in kinetochore assembly.”

      Further we have also revised the discussion section to include the observed results in page 15, lines 502-510 as follows:

      “However, since the primary function of Scm3 is to promote kinetochore formation by depositing Cse4 at the centromeres, it is important to address if the observed sensitivity is due to perturbation in kinetochores or due to cell cycle delay imposed in the absence of Scm3. Therefore, we similarly partially depleted two essential kinetochore proteins, Ndc10 and Mif2, and deleted one non-essential kinetochore protein, Ctf19, in separate cells and also challenged wild type cells to metaphase block but failed to detect any increased sensitivity to DNA damage stress (Figure S3). These results indicate that the drug sensitivity phenotype of Scm3 depleted cells is not due to weakly formed kinetochores or cell cycle delay.”

      2. Mutants of the HR pathway in yeast (e.g. rad52∆ with mre11∆ for example) are typically epistatic. The observation that Scm3 depletion is not epistatic with rad52∆ (Figure 1C) suggests the Scm3 acts via another pathway than the classic Rad52 HR pathway. This should be pointed out and discussed.

      __Response: __We have now included the discussion “In yeast, although HR is the preferred repair pathway, in the case of perturbed HR, an alternate pathway named non-homologous end joining (NHEJ) can occur. The absence of epistatic interaction between SCM3 and RAD52 (Figure 1C) suggests that Scm3 may function in ways other than the Rad52-mediated classical HR pathway. In this context, it would be interesting to test how Scm3 might interact with the key proteins of the NHEJ pathway, such as Ku70/Ku80 and Lig4 (Gao et al., 2016). It is possible that Scm3 may promote a certain chromatin architecture facilitating the DSB ends to stay together to be accessible for NHEJ-mediated end joining.” in page 16, lines 541-548.

      3. Figure 2 should include auxin treatment of RAD52-GFP cells (without the Scm3 degron) to show that the auxin treatment alone does not increase Rad52 foci.

      __Response: __ We performed the suggested experiment and did not observe any significant increase in Rad52-GFP positive cells when treated the cells with auxin+DMSO as compared to only DMSO (Figure R5).

      This result has been incorporated as a new supplementary Figure S4A,B and new text has been added in the revised manuscript as “To rule out the possibility that auxin treatment alone can cause increased Rad52-GFP foci formation, we challenged the wild type (RAD52-GFP) cells with auxin or DMSO and counted the number of cells with Rad52-GFP foci. We did not observe any increase in Rad52-GFP positive cells when treated with auxin+DMSO as compared to only DMSO (Figure S4A, B).” in page 7, lines: 233-236.

      4. Line 246-247 For the data presented, it seems to me possible that Scm3 depleted cells may indicate a defective DDR pathway (as stated) or may indicate defects in DNA replication or an increase in some other form of DNA damage?

      __Response: __We agree with the reviewer’s comment that the depletion of Scm3 can cause replication error or other form of DNA damage in addition to the defect in DDR pathway. To include this, we have modified the sentence as “Taken together, Scm3 depleted cells exhibit more Rad52 foci, indicating a compromised DDR pathway in these cells. Although, defects in DNA replication or creation of other DNA lesions producing more foci also cannot be ruled out.” in page 8, lines 255-257.

      5. In Figure 1 and throughout, please describe in the figure legends how error bars and p values are derived, and the number of experiments involved.

      __Response: __We have now verified all the figure legends and described how error bars and p values are derived and have mentioned the number of experiments involved.

      Minor points Line 35 replace 'cell survival' with 'cell division' - non-dividing cells can survive fine without chromosome segregation. See also line 62.

      __Response: __We have now changed ‘cell survival’ with ‘cell division’ in lines 35 and 62.

      Line 52 and throughout, I suggest replacing CenH3 with CENP-A or Cse4. The term CenH3 is confusing since regional centromeres contain both CENP-A nucleosomes and H3 nucleosomes - the latter of which can also be called CenH3 nucleosomes.

      __Response: __We have replaced CenH3 with CENP-A or Cse4 at the appropriate locations.

      Lines 69-79 specific references are needed for the sentences starting "HJURP was so named...", "In addition,...", "As a corollary,..." and "Finally,..." The final sentence of this paragraph, starting "Perhaps due to..." is unclear.

      __Response: __We have included the reference as mentioned by the reviewer. Also, we have changed the last line as “Notably, HJURP has been visualized to be diffusely present throughout the nucleus (Dunleavy et al., 2009; Kato et al., 2007), which may be due to its global chromatin binding and involvement in DDR.” in page 3, lines 77-79.

      Line 96 "gross chromatin" is unclear; also line 476.

      __Response: __We have changed gross chromatin to “bulk of the chromatin.” and incorporated it into the main text.

      Line 103 "dimerize"

      __Response: __We have replaced ‘dimerizes’ with ‘dimerize’

      Line 109 "most" and "highly" don't work together - perhaps better to say "the functions appear conserved from humans to yeast".

      __Response: __We have changed the wording as the reviewer suggested.

      Line 175 "grown" to "phase", see also line 223.

      __Response: __We have changed the wording as the reviewer suggested.

      Line 293 delete "besides"

      __Response: __We have deleted the word ‘besides’.

      Figure 5 - panels C & D, please make x axis labels clearer - they are directly underneath the 2kb ChIP. They should include a horizontal bar to indicate that all 5 ChIP experiments are included in each time point.

      __Response: __We have now included a horizontal bar in both Figure 5 and the corresponding supplementary Figure S8, to better represent the ChIP experiments. We thank the reviewer for pointing this out.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary This manuscript studies the role of the Cse4 histone chaperone Scm3 in the S. cerevisiae DNA damage response. The authors show that decreased Scm3 levels exhibit genetic interactions with mutations in the Rad52 gene and sensitivity to MMS. They go on to show that some Scm3 co-localizes with sites of DNA damage using spreads and ChIP-seq techniques and that decreased levels of Scm3 have a reduced DNA damage checkpoint response. The Scm3 protein is also phosphorylated in response to DNA damage. Taken together, the authors propose a model whereby DNA damage recruits the Scm3 protein and Scm3 then helps mediate the checkpoint response. Overall, the data make a case that Scm3 has a relationship to the DNA damage checkpoint but the authors should be careful not to over-conclude that it has a precise role in checkpoint activation based on the data.

      Major Comments 1. The western blots in the paper are not always entirely convincing. In addition, they are not described in enough detail to understand if a membrane was cut or if multiple gels were run. For example, the tubulin loading in Figure 6D is interrupted toward the end of the blot and the bands in Figure 7D go in different directions for the two blots for the MMS treated cells. In figure 6B, there are no detectable phospho-forms of Rad53 detected on the upper blot for the WT and scm3 lanes even though quantification is given on the right. It would be good to present better examples of the westerns or at least better describe what the reader is visualizing so the quantification and conclusions can be understood. How were the blots quantified? How were the westerns run and processed?

      Response: We have now included a separate paragraph in materials and methods regarding the gel run, processing and quantification of the western blots in the revised manuscript for better understanding of the readers:

      “To detect Scm3-6HA, Rad53, and g-H2A, the total proteins isolated from the appropriate cells were run on 12%, 8%, and 15% SDS gels, respectively. The proteins were transferred to the membranes, which were cut to detect the above proteins and the control protein tubulin separately. For the quantification of the bands on the western blots, a region of interest (ROI) was made around the band of interest, and the intensity of the band was calculated using ImageJ. A same ROI from a no-band area of the blot was used to calculate the background intensity. The background intensity was subtracted from the band intensity. The same process was done for the tubulin bands. The intensity of the target bands (Scm3-6HA, Rad53, and g-H2A) was divided by control tubulin band intensity to get the normalized values for the target bands, which were plotted using GraphPad Prism 9.0 (Version 9.4.1) software.” This has been added in page 25, lines: 881-890.

      Furthermore, we will again perform the experiments for a better representation of the western blots in figures 6B, D, and 7D.

      2. The argument that scm3 depletion leads to a defect in DNA damage checkpoint activation is not strongly supported. Monitoring exit from the cell cycle by multibudding is not the most rigorous assay, especially since the image shows one cell with 5 nuclei. The authors should release cells from G1 into auxin and MMS and monitor cell cycle progression at least one other cell cycle marker, such as anaphase onset, DNA replication and/or Pds1 levels.

      Response: As per the reviewer’s suggestion, in order to support our argument that the absence of Scm3 causes a defect in DNA damage checkpoint activation, we will examine if these cells abrogate G2/M arrest and show an early anaphase onset. For this, we will monitor the levels of Pds1, as a marker of anaphase onset, along the cell cycle in wild type and Scm3-depleted cells both deleted for Mad2 to remove any inadvertent effect of spindle assembly checkpoint. The schematics of the experimental workflow is given in Figure R1. Typically, the cells will be released from alpha factor arrest in the absence or presence of auxin (for the depletion of Scm3) and in the absence or presence of MMS. The samples will be harvested at the indicated time points and will be analyzed for:

      1. Western blot: Pds1-Myc (to detect anaphase onset)
      2. Western blot: Rad53 and p-Rad53 (to detect DNA damage activation)
      3. Immunofluorescence: Tubulin (to detect cell cycle stages) The results of the above experiment will be incorporated in the revised manuscript.

      3. The quantification in Figure 3B is not clear. Is it done on a per/nuclei basis? What pools of Scm3 and Ndc10 are being normalized?

      Response: The intensity was calculated as done before (Mittal et al., 2020, Shah et al., 2023). Typically, the intensity was first measured from the total signal of Scm3/Ndc10 from each chromatin mass or spread (DAPI) by making a polygon (ROI) around the Scm3/Ndc10+DAPI signal. The same ROI was dragged to the background area, from where two separate intensities were calculated. The average of the background intensities was then subtracted from the Scm3/Ndc10 intensity obtained from the same spread to get the normalized intensity depicting each dot in the box plot of Figure 3B. At least 30 spreads were quantified in a similar manner.

      We have mentioned this in the materials and methods section under “Microscopic image analysis.” section in page 22, lines 770-777 as follows: “For intensity calculation, a Region of Interest (ROI) was drawn around the Scm3/Ndc10/g-H2A+DAPI signal, and the intensity of Scm3/Ndc10/g-H2A was measured from each chromatin mass or spread (DAPI). An ROI of the same size was put elsewhere in the background area, from where two separate intensities were calculated. The average of the background intensities was then subtracted from the Scm3/Ndc10/g-H2A intensity obtained from the same spread to get the normalized intensity depicting each dot in the box plot of the respective figures as mentioned previously (Mittal et al., 2020; Shah et al., 2023).”

      Minor Comments 1. The model is elegant but there are chromatin pools (beyond the kinetochore pool) of Scm3 that do not contain Rad52 and/or gamma-H2X and vice versa. It would be helpful if the authors could speculate on how to reconcile these different pools. It might be premature to suggest such a detailed model at this point since the function of Scm3 in the checkpoint is still very unclear so I would encourage the authors to make a less detailed model.

      Response: By showing the green hallow, we have depicted the nuclear pool of Scm3, and we have not shown that the pool contains DDR proteins viz., Rad52 or g-H2A. Rather, we have shown the recruitment of these proteins at the DNA damage sites. Since the focus of this manuscript is on the non-centromeric functions of Scm3, we have not shown the kinetochore pool of Scm3. Although the model is a detailed one, the contribution from this work has been mentioned legitimately at every stage so that the readers can judge the merit of this work. We believe that a detailed model would provide a better perspective to the readers to correlate the revealed as well as yet-to-reveal functions of Scm3 in a spatiotemporal manner with the other players of the DDR pathway. Therefore, we prefer to keep the model in a detailed form.

      2. The chip-seq data is not publicly accessible. There is no reference to the data being available to review.

      __Response: __The data will be uploaded to the public domain.

      3. Line 103: not clear what "both the proteins dimerize" means...probably should be "both proteins dimerize"

      __Response: __We have changed the wording to “both proteins dimerize”.

      4. The argument that Ndc10 does not have a growth defect on MMS is a weak conclusion given that almost no control cells grow on auxin in the absence of MMS.

      __Response: __We have now repeated the spotting assay with a lesser concentration of auxin and replaced Figure S3 with a new Figure S3 (Figure R4) to better represent and conclude that the loss of Ndc10 does not cause MMS sensitivity.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      This manuscript studies the role of the Cse4 histone chaperone Scm3 in the S. cerevisiae DNA damage response. The authors show that decreased Scm3 levels exhibit genetic interactions with mutations in the Rad52 gene and sensitivity to MMS. They go on to show that some Scm3 co-localizes with sites of DNA damage using spreads and ChIP-seq techniques and that decreased levels of Scm3 have a reduced DNA damage checkpoint response. The Scm3 protein is also phosphorylated in response to DNA damage. Taken together, the authors propose a model whereby DNA damage recruits the Scm3 protein and Scm3 then helps mediate the checkpoint response. Overall, the data make a case that Scm3 has a relationship to the DNA damage checkpoint but the authors should be careful not to over-conclude that it has a precise role in checkpoint activation based on the data.

      Major Comments

      1. The western blots in the paper are not always entirely convincing. In addition, they are not described in enough detail to understand if a membrane was cut or if multiple gels were run. For example, the tubulin loading in Figure 6D is interrupted toward the end of the blot and the bands in Figure 7D go in different directions for the two blots for the MMS treated cells. In figure 6B, there are no detectable phospho-forms of Rad53 detected on the upper blot for the WT and scm3 lanes even though quantification is given on the right. It would be good to present better examples of the westerns or at least better describe what the reader is visualizing so the quantification and conclusions can be understood. How were the blots quantified? How were the westerns run and processed?

      2. The argument that scm3 depletion leads to a defect in DNA damage checkpoint activation is not strongly supported. Monitoring exit from the cell cycle by multibudding is not the most rigorous assay, especially since the image shows one cell with 5 nuclei. The authors should release cells from G1 into auxin and MMS and monitor cell cycle progression at least one other cell cycle marker, such as anaphase onset, DNA replication and/or Pds1 levels.

      3. The quantification in Figure 3B is not clear. Is it done on a per/nuclei basis? What pools of Scm3 and Ndc10 are being normalized?

      Minor Comments

      1. The model is elegant but there are chromatin pools (beyond the kinetochore pool) of Scm3 that do not contain Rad52 and/or gamma-H2X and vice versa. It would be helpful if the authors could speculate on how to reconcile these different pools. It might be premature to suggest such a detailed model at this point since the function of Scm3 in the checkpoint is still very unclear so I would encourage the authors to make a less detailed model.

      2. The chip-seq data is not publicly accessible. There is no reference to the data being available to review.

      3. Line 103: not clear what "both the proteins dimerize" means...probably should be "both proteins dimerize"

      4. The argument that Ndc10 does not have a growth defect on MMS is a weak conclusion given that almost no control cells grow on auxin in the absence of MMS.

      Significance

      Significance

      This is the first examination of the role of Scm3 in the DNA damage response in S. cerevisiae. My expertise is in the chromatin and segregation fields, but I believe this work will be of interest to the DNA damage field as well. While the homologs of Scm3 are known to have a role in DNA damage, it was unclear if this was conserved in budding yeast. The data in this manuscript are consistent with findings in other organisms but the precise role of the chaperone in the DNA damage response is still unclear.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript describes a study implicating the Scm3 protein from budding yeast in the DNA damage response (DDR). Scm3 is a chaperone protein, whose main role is considered to be the loading of CENP-A(Cse4) at centromeres to facilitate chromosome segregation. However, the human ortholog of Scm3, HJURP, is known to have a role in DDR and in this study the authors provide evidence that Scm3 is also involved in the DDR in yeast. For the most part, the results presented support the conclusions made.

      Main Points:

      1. Figure 1 Could depletion of Scm3 arrest cells in late G2/M and it is this delay that causes increased sensitivity to DNA damaging agents? A control with nocodazole or other means - that also arrests cells at this point - might provide a nice control for this. Perhaps the other kinetochore mutants, used therein, achieve this control - but cell cycle phase would need to be assessed.

      2. Mutants of the HR pathway in yeast (e.g. rad52∆ with mre11∆ for example) are typically epistatic. The observation that Scm3 depletion is not epistatic with rad52∆ (Figure 1C) suggests the Scm3 acts via another pathway than the classic Rad52 HR pathway. This should be pointed out and discussed.

      3. Figure 2 should include auxin treatment of RAD52-GFP cells (without the Scm3 degron) to show that the auxin treatment alone does not increase Rad52 foci.

      4. Line 246-247 For the data presented, it seems to me possible that Scm3 depleted cells may indicate a defective DDR pathway (as stated) or may indicate defects in DNA replication or an increase in some other form of DNA damage?

      5. In Figure 1 and throughout, please describe in the figure legends how error bars and p values are derived, and the number of experiments involved.

      Minor points:

      1. Line 35 replace 'cell survival' with 'cell division' - non-dividing cells can survive fine without chromosome segregation. See also line 62.

      2. Line 52 and throughout, I suggest replacing CenH3 with CENP-A or Cse4. The term CenH3 is confusing since regional centromeres contain both CENP-A nucleosomes and H3 nucleosomes - the latter of which can also be called CenH3 nucleosomes.

      3. Lines 69-79 specific references are needed for the sentences starting "HJURP was so named...", "In addition,...", "As a corollary,..." and "Finally,..." The final sentence of this paragraph, starting "Perhaps due to..." is unclear.

      4. Line 96 "gross chromatin" is unclear; also line 476.

      5. Line 103 "dimerize"

      6. Line 109 "most" and "highly" don't work together - perhaps better to say "the functions appear conserved from humans to yeast".

      7. Line 175 "grown" to "phase", see also line 223.

      8. Line 293 delete "besides"

      9. Figure 5 - panels C & D, please make x axis labels clearer - they are directly underneath the 2kb ChIP. They should include a horizontal bar to indicate that all 5 ChIP experiments are included in each time point.

      Significance

      This is a nice complement to the human work on HJURP and provides convincing evidence that Scm3 can be used to model the function of HJURP. Since yeast is such a tractable model, this work provides a route to study the role of this chaperone in DNA damage repair, which may also be true for human HJURP. The work itself is perhaps not too surprising, but is a solid advance in our understanding of the role of Scm3.

      My own expertise is in yeast DNA repair and chromosome segregation.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In the manuscript by Agarwal and Ghosh, the authors examine yeast Scm3 function in the DNA damage response. They show that Scm3 loss results in DNA damage sensitivity and more Rad52 foci. Importantly, Scm3 is recruited to DSB sites using an HO-cut site and its loss results in an attenuated DNA damage checkpoint as measured by Rad53 phosphorylation. The authors demonstrate convincingly that Scm3, like its human counterpart HJURP, plays a role in the DNA damage response through altered Rad53 activation. However, what its specific role in DNA repair is remains ambiguous.

      Major comment:

      1. It is unusual to see multiple DNA repair foci as those observed in Figure 2B. What is the distribution of cells with 1, 2, 3, 4, or more foci? Are more observed in SCM3-AID cells perhaps suggesting that the DSB ends are not being clustered as would be expected in WT cells exposed to DNA damage?

      2. The peaks with increased Scm3 recruitment by ChIP-seq upon MMS is confusing as MMS does not induce specific damage at genomic locations. Is Scm3 being recruited at other genomic sites that might be more susceptible to DNA damage? Is Scm3 recruited to Pol2 sites for example? Or fragile sites?

      3. The phosphorylation aspect of Scm3 is intriguing and the authors show that Mec1 is not responsible for mediating its phosphorylation. Tel1 is another kinase that should be examined.

      Minor comment:

      1. It is hard to see what MMS resistance the authors state is observed in Mif2-depleted cells in Figure S3. Perhaps this could be better explained or the claim removed.

      2. Protein levels of Scm3 or any of the other factors depleted with AID were never assessed.

      Significance

      As mentioned above, a clear link for Scm3 in DNA damage repair has now been established in this work but its function in this process is descriptive.

  4. Sep 2024
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to Reviewer Comments:


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      *Glaucoma-associated optineurin mutations increase transmitophagy in vertebrate optic nerve.

      Summary In Jeong et al., the authors perform live imaging of the X. laevis optic nerve to track neuronal mitochondrial movement and expulsion in an intact nervous system. The authors observe similar mitochondrial dynamics in vivo as previously described in other systems. They find that stationary mitochondria are more likely to be associated with OPTN, suggestive of mitochondria undergoing mitophagy. Forced expression of OPTN mutations results in a larger pool of stationary mitochondria that colocalize withLC3B, and OPTN. Finally, the authors argue that extra-axonal mitochondria are observed more frequently in OPTN mutants, suggesting that mutations in OPTN that are associated with disease can lead to an increase in the expulsion of mitochondria through exopher-like structures.

      Major Findings and impact: • The authors establish that mitochondria dynamics can be tracked in the X. laevis optic nerve. • OPTN mutations increase the stationary pool of mitochondria and likely result in increased rates of mitophagy. • Exopher-like structures containing mitochondria and LC3 can be expelled from the optic nerve and increase in the presence of OPTN mutations. These structures were observed in a living system and have interesting implications in the context of disease.

      Concerns: • The authors state in their results that the secreted blebs are exophers. While these initial observations are consistent with exophers, additional data are needed to strengthen this claim. For example: what are the sizes of secreted vesicles? Do all express LC3? How frequently do these occur? From where are they expelling? Alternatively, the discussion of exophers could be moved to the discussion.*

      We agree that calling the axon shedding intermediates “exophers” was an overreach on our part. While we believe that in all probability time will demonstrate this to be the case, reviewers are correct in stating that putting our work in the context of exophers is best left to the discussion. We have removed all mention of exophers from the results and graphical abstract and now use the term only once in the discussion. We do provide detail as to the frequency of the structures, what fraction contain mitochondria, and morphological parameters of the contained mitochondria. And while all of these new data support them being exophers, the point remains that the use of the nomenclature “exopher” in the results section was inappropriate.

      • Quantifications in sparse labeling experiments seem quite surprising and concerns related to these findings should be addressed. As the authors used LC3b expression to represent axonal volume, the authors should demonstrate that this is the case using an axonal fill or membrane marker in both the wt and E50K conditions. This is important as it is unclear whether LC3b expression is consistent between the wild type and the E50K conditions. Lower expression of LC3b in E50K could account for the large changes in axonal width that seem to be observed and could confound the measured amount of expelled mitochondria.*
      • *

      We agree that using EGFP-LC3b as a “cell fill” was problematic in a situation where the interventions likely perturb autophagy/mitophagy and therefore might have also perturbed LC3b. We do provide some axon width and LC3b-EGFP intensity data for a partial dataset that had been imaged side-by-side, showing that expression of LC3b is not different in the two conditions. We also provide independent measures of extra-axonal mitochondria based on a membrane-GFP reporter. While in principle there would be value to repeat the studies of Wt vs. E50K in the context of the membrane-GFP reporter, these experiments would involve new constructs and new breedings, and would likely take months to years to complete.

        • Could large amounts of exogenous mitochondria in explant experiments be from cells that died during the plantation?* The concern that some of the exogenous mitochondria signal might derive from degenerating axons is one that we worry much about, and not only in the transplantation experiments. In our sparse labeling experiments we do occasionally see axons undergoing Wallerian degeneration, but it is rare and does not appear to be more common in the expression of the mutated OPTN, at least not at the stage after transgene expression that the analyses were performed. We do provide new data that expression of E50K OPTN does not compromise vision at the time that experiments were carried out, ruling out that extra-axonal mitochondria are the result of large-scale degeneration. However, from other data we know that axon loss would likely need to be very extensive to manifest itself in functional vision loss in our behavioral assay, so milder axon loss contributing some noise to the measures cannot be excluded. But, the point raised is heard, and now we include a sentence in the discussion acknowledging that some of the signal outside of axons could have been due to degenerating axons, but still contend that our documentation of shedding intermediates support the view that many of the axonal mitochondria outside of axons were shed from otherwise intact axons.

      Suggested experiments/quantifications: • In OPTN/MITO/LC3b trafficking experiments, does flux/number of events change? Representative kymograph in Figure 2D seems to show far more OPTN-positive mitochondria which is opposite of what is shown in Figure 2C.

      Multiple reviewers rightfully point out that we did not carry out the flux experiments which would be necessary to make definitive statements regarding the amount of mitophagy. New experiments show that inhibiting lysosomal activity through chloroquine does increase the amount of astrocytic autophagosomes not yet acidified as expected, and that they contain axonal mitochondria signal, supporting the idea that astrocytes are involved in the degradation of axonal mitochondria. However, they did not show changes in the amount of stopped mitochondria, supporting the view that the co-localization of OPTN and mitochondria in axons is not conventional autophagy. This is a very important point that affects the interpretation of our results, and we thank reviewers for suggesting this experiment.

      • Demonstrate that axonal width measured with LC3B is representative of axonal fill/membrane marker in wt and E50K. Axonal area appears to change, is this accurate? This appears to be the case for both figure 3 and figure 4.* Addressed above.

      • Raw images in addition to the reconstruction would be beneficial.* Now include raw images beside the reconstruction at the first use of reconstructions.

      • Further characterization of exopher-like structures.**

      * Addressed above.

      ***Referees cross-commenting**

      I agree with the concerns of the other reviewers, and perhaps was over-optimistic about a timeline for revision. However, I do think the work is worth the effort, and I hope to see a revised manuscript published somewhere, as these observations are novel

      Reviewer #1 (Significance (Required)):

      This work reports potentially novel biology, and thus will be of interest to the field. The strength of the study is that it is an initial description of this biology, rather than a complete analysis. The work raises many more questions than it answers, and much further work on this topic is required to support these initial findings, but the manuscript will likely be of interest to many. Revisions are required to improve the rigor and clarity of the work, but following these revisions we recommend publication to facilitate follow-up work.*

      Fully agree that our study raises far more questions than it answers. Believe that the revisions made to address reviewer comments go a long way to improve rigor and clarity of the work. We hope that the reviewers agree and deem the changes sufficient.

      *Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: This article studied transmitophagy in xenopus optic nerves in the context of overexpressing glaucoma-associated optineurin mutations. Using a series of labeling, imaging and transplantation techniques, the authors found that overexpressing mutated optineurins stops mitochondria movements and potentially induces transmitophagy, and that astrocytes are responsible for taking up the extra-axonal mitochondria. Below are my comments on this article.

      Major comments: 1. Identifying extra-axonal mitochondria is key to this research. In Figure 3, the authors used EGFP-LC3B as a marker for RGC boundaries. However, it is unconvincing how perfect LC3B is as a cell membrane marker. Particularly in the case of OPTN E50K OE, it seems that the optic nerve is thinner than the WT condition, which makes the quantification of extra-axonal OPTN less convincing. The authors should detect extra-axonal mitochondria with an RGC membrane marker or cytosolic marker. In addition, in Figure 3, the extra-axonal mitochondria seem to localize mostly on the dorsal surface. Why is there such a polarity?*

      As stated above, we acknowledge that the use of LC3b as both an autophagosome marker and a cell fill was somewhat problematic and now provide additional experiments ruling out that the LC3b expression or axon thickness in our sparse axon labeling experiments, or that E50K might affect the thickness of the optic nerve. In addition, we also provide additional new data using a bona fide membrane marker together a transgenic labeling or RGC mitochondria that also shows under the “baseline state” extensive mitochondria signal outside the axons on the surface of the optic nerve (New Fig. 6A and new Suppl Fig. 3D). All the new data are consistent with the previous data and support the view that using LC3b potentially could have been problematic, for the reasons reviewers state, but in practice it was not.

      The reviewer observes that the E50K optic nerve appears thinner--this observation is not a consistent difference in optic nerves across the experimental groups. The images we show are always near the mean values for the quantitative results presented, and we rather not include prettier nerves that are not representative of the whole datasets.

      As for why the extra-axonal mitochondria localize mostly to the dorsal surface, it remains undetermined. There are dorsoventral differences in the optic nerve established during development, as developmental Sonic hedgehog signaling emanating from the midline appears to affect dorsoventral aspects of the optic nerve differentially. Early axon loss in humans and some models of glaucoma do show a dorsal bias, and there may be optic nerve lymphatic structure reported in mice that also may be preferentially dorsal. However, it is not known whether any of these observations are connected, so we did not want to speculate beyond what the data say. We do now explicitly mention the dorsoventral difference in the discussion, and state why we think it may be worth further study.

      • The experiment in Figure 5 is very important as it gives direct evidence of transmitophagy. However, one caveat is that the mitotracker injection is done after the transplantation. If in rare cases the dye is leaky after injection and is taken up by astrocytes directly, then the conclusion that mitochondria from RGCs are phagocytosed by astrocytes will be flawed. The authors should either use a transgene in the donor to label mitochondria or inject mitotracker into the donor before the transplantation and repeat the experiments. In addition, in Figure 5E, what is the large membranous structure inside the highlighted astrocyte? Is it associated with phagocytosis?*

      We fully agree that MitoTracker is an imperfect tool, both for the reason stated here that the dye may get into the astrocytes directly (or may label astrocyte mitochondria after it is released from degrading RGC mitochondria), and, also as stated by reviewer 3, that it requires healthy mitochondria for labeling. For this reason, we have added new datasets that rely on RGC mitochondria labeling not by Mitotracker but through a genetic reporter. As to identity of the conspicuous structure shown inside the astrocytes, it remains an open question, and we are avidly pursuing what astrocytic organelles are involved through additional transgenic reporters and correlated-light-EM studies, but those are complicated experiments that are beyond the scope of the current manuscript.

      • This research is entirely based on overexpression of OPTN. Since overexpressing WT OPTN does seem to affect mito trafficking (Figure S2G, and the description in the manuscript is often inconsistent with this result), it is unclear what the increased stalled mitochondria really mean when overexpressing mutated OPTN. Similarly, the authors examined extra-axonal mitochondria in Figures 3 and 4 all in overexpressing conditions, and made the connection that increased stalled mitochondria lead to transmitophagy. However, this conclusion will be better supported by using mutant animals rather than overexpression. The authors should consider using OPTN mutant xenopus if available or using CRISPR to introduce the specific mutations and repeat mitochondria trafficking and transmitophagy.*

      • *

      We thank this reviewer by pointing out an important detail that we failed to highlight, namely that transgenic overexpression of Wt OPTN (and/or Wt LC3B) does have a small but significant effect on mitochondria trafficking. Interestingly, it is affecting just the speed of retrogradely transported mitochondria, which based on the elegant work of Holzbaur and colleagues, include mitochondria destined for degradation. So, we now acknowledge more explicitly that, since our studies involve expression of OPTN and LC3b transgenes (fluorophore tagged human genes, no less), that some caution should be exercised in not overinterpreting the results. Nonetheless, since we show that expression of Wt OPTN behaves similarly to expression of a mitochondria reporter (Tom20-mCherry) in not affecting either stopped mitochondria or extra-axonal mitochondria, we believe that our results still stand. Nonetheless, we now make mention of the effect Wt OPTN on retrograde mitochondria movement. We have embarked on OPTN loss-of-function studies and have some founder animals carrying CRISPR-generated mutations; however, these experiments will take additional time, and based on the results in mammals may or may not show any measurable effects in our assays, not only because of possible redundancy by the other damaged mitochondria adaptors that we mention in the introduction, but also because the mutations that affect the shedding process (as well as cause glaucoma) are thought to be gain-of-function mutations. However, we decided not to dwell on these complexities in the discussion, as the discussion was previously quite extensive and now is even moreso with the added discussion on how our studies relate to those of exophers.

      • On Page 12, the authors claim that even overexpressing WT OPTN causes extra-axonal mitochondria in the optic nerve. However, there is no control condition without OE to support this conclusion. It is thus unclear to what extent extra-axonal mitochondria occur at baseline and how many extra-axonal mitochondria can be induced by overexpression. The authors should include, in Figure 3 and 4, controls without overexpression.*

      We acknowledge that our language was confusing and somewhat misleading on this point. With the caveat mentioned above that WT OPTN expression does perturb the system somewhat (by increasing the speed of mitochondria retrograde transport, perhaps by increasing the proportion of retrograde moving mitochondria tagged for degradation), we still contend that the state observed after WT OPTN expression is close to the “baseline” state. In support of that, in the new data included in response to the LC3b concern, we observe plentiful shedding events in the absence of any OPTN or LC3b transgenes. Indeed, what may be the most surprising finding of our studies is that in the absence of any significant perturbation of OPTN, there is already a large fraction of axonal mitochondria that are outside of axons and inside of astrocytes, which is consistent with what we previously observed in the optic nerve head of mice; however, the current studies provide much more rigorous quantification of the process and live imaging of intermediates, but also provide for an intervention that increases the process. While there are many more questions to answer, we do believe our studies contribute mechanistic insights.

      • A technical question regarding kymographs: Based on Figure 2C, it looks that OPTN and LC3B labeling are pretty diffuse in axons and this makes sense since they may only be associated with damaged mitos. But this raises a question about how accurate the kymograph assay is. It may significantly underestimate the fraction of OPTN/LC3B that is stationary since they appeared diffusedon the kymograph. This may explain why the percentage of stationary OPTN/LC3B is so small when the authors OE WT OPTN in Figure 2E and 2E', compared to the percentage of moving mitochondria shown in Figure 1E.*

      We fully agree that the kymograph studies likely underestimate the amounts of stationary mitochondria for the reasons stated. However, we interpret the discrepancy between Figure 1E and 2E and 2E’ differently. We believe that the value of stopped mitochondria in the sparse labeling experiments are actually more accurate, as the value of stopped mitochondria in the whole nerve experiments likely include mitochondria stopped within the axons, but also mitochondria recently shed either by those or nearby axons which are perceived to be in axons due to limitations of imaging resolution. In the discussion we now make very explicit that all the measures we provide need should be interpreted as estimates, as every experiment relies on assumptions and is subject to technical limitations.

      Minor: 1. Figure 2E and 2E' do not agree with the text on page 7 and page 8. Not only F178A, but also H486R and D474N have no effect on OPTN trafficking. The authors should make their conclusions more accurate.

      F178 was the only mutation that had no effect on either OPTN or LC3b in either F0 or F1 experiments. However, we agree that our language should have been clearer, and now we have made our description of the results (and conclusions) more accurate.

      • Figure S2E-F: why does OE of mutated OPTN in F1s but not in F0s reduce trafficking speed compared to WT?*

      We do not know the reason for this discrepancy. Though it does not wholly agree with the rest of the story, we felt it important to include all relevant data, not only that which perfectly fit our interpretation. One possible reason may be that the F1 data derives from a single integration event, which is the reason why we trust more the F0 data that derive from multiple integrations, in what are essentially outbred animals, which is the reason we present the F0 data as the primary results where possible.

      * In movie 5, fusion of exopher with other structures is not clear and also the GFP signal does not disappear, which is in contrast to the statement in the text that the GFP signal is quenched in acidified environment. To confirm that LC3B leaves RGC axons in exophers, the authors should consider switching the fluorophores and examine LC3B localization during exopher formation.*

      This too is a valid point, and we have amended our description of these results. While swapping fluorophores between OPTN and LC3b is a highly worthy experiment, for technical reasons it likely would take many months to carry out just because of how involved it is to make the relevant constructs (recombineering details provided in the methods section).

      • In figure 6, to better show exopher formation and the pinching-off step, the authors should consider labeling the membrane and mitochondria instead of using the LC3B and OPTN marker.*

      This arguably was the biggest weakness of our initial submission, and now provide new experiments using a bona fide membrane marker. We have not yet captured a pinching-off event with these better reporters, but that is not surprising given how rare they are, which we now quantify. Indeed, a membrane reporter and a mitochondria transgene in sparsely labeled axons are the ideal tool for figuring out the frequency of these structures and what fraction contain mitochondria, data which we now provide.

      ***Referees cross-commenting**

      Generally agree with the criticisms voiced by the other reviewers; in aggregate the reviews indicate the manuscript needs more than just a quick fix.

      Reviewer #2 (Significance (Required)):

      Previous literature has already described the transmitophagy process in the optic nerve. The significance of this paper lies in the observation that overexpressing glaucoma-associated OPTN mutants can induce increased transmitophagy through astrocytes, which points to a potential role of OPTN in glaucoma. A highlight of this paper is the use of correlated light SBEM to directly show transmitophagy in astrocytes. However, the significance of this paper may be limited for the following reasons: 1. everything is based on overexpression of mutated OPTN, which makes it hard to translate the results to real disease conditions; 2. The consequence of increased transmitophagy on RGC survival or visual functions is unclear.

      *

      While we agree that much of the paper is based on OPTN overexpression, we did have experiments and now provide more that that were not based on OPTN overexpression. Some of these still involve expression of a different transgene (Tom20-mCherry) that might in principle perturb the system, though we show that expression of Tom20-mCherry does not affect mitochondria movement parameters as measured by Mitotracker. As to “the consequence of increased transmitophagy”, we do now provide data showing that there is no vision loss suggestive of axon loss or severe dysfunction at the time that the imaging studies were carried out. Whether longer term expression of these OPTN transgenes lead to axon degeneration and visual dysfunction are studies that are ongoing, but those studies involve extensive characterizations and controls that are beyond what could be included in this study.

      *Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary In this work, Jeong et al describe the effect of Optineurin (OPTN) mutations in the transcellular degradation of retinal ganglion cell (RGC) mitochondria by astrocytes at the Optic Nerve (ON), a process previously described this group and referred as "transmitophagy" (Davis et al 2014). Here, authors use Xenopus laevis animal model to image the optic nerve of animals carrying different OPTN mutations associated to disease or with compromised function and explore its effect in mitochondria dynamics at the RGC axons. They find that OPTN mutants lead to increased stationary mitochondria in the nerve and affect their co-localization with mitophagy-related markers, suggesting alterations in this pathway. Finally, they found that mitochondria co-localizing with OPTN can be found in the periphery of the ON under different conditions and this is particularly increased in glaucoma-associated E50K mutation. This extracellular mitochondria are transferred in vesicles to astrocytes, as they previously described in mice (Davis 2014), where they are presumably degraded. Major comments - OPTN levels at a given time point cannot be used as readout for mitophagy level/flux. Both OPTN and LC3b are degraded upon fusion with acidic compartment (i.e. lysosomes, PMID: 33783320, 33634751) and that is the reason why the field of autophagy /mitophagy blocks lysosomal activity to measure autophagy/mitophagy flux (PMID: 33634751). In this document, authors claim that there is low levels of mitophagy in RGC axons at baseline and increased levels of mitophagy in glaucoma associated perturbations just based on increased presence of OPTN+ mitochondria in this condition. This could be also interpreted as an accumulation of non-degraded defective mitochondria due to a mitophagy block in neurons carrying the glaucoma associated mutation, which is the opposite of what they propose. If authors want to evaluate mitophagy levels in this system, mitophagy/autophagy flux experiments should be performed.*

      In response to reviewers, we do now include “lysosome inhibition” experiment, using chloroquine at doses modestly above those used in aquaculture as an anti-parasitic. After testing various chemical means to inhibit lysosome activity, it was the only one that did not adversely affect the animals. We know the chloroquine intervention works because we see the expected increase in autophagosomes using the standard LC3b-tandem reporter, and in those unacidified astrocytic autophagosomes we do indeed find axonal mitochondria signal. However, since the amount of mitochondria signal there is small relative to the total amount of axonal mitochondria in the astrocytes, we do not feel it would be appropriate to make mechanistic claims, for example claiming this to be related to LC3b associated phagocytosis; much more work would be needed to make that claim. However, we were surprised to find no alteration in either stopped mitochondria in axons or axonal mitochondria material within the astrocytes. There are technical reasons why this result might be difficult to interpret, but now having done it (as we should have before), we are even more careful in describing the process as transcellular degradation rather than transmitophagy. We elaborate further on this point in the next response.

      - I find inappropriate the use of the term "transmitophagy". Although this term transmits very well the message that the authors try to strength, the term "mitophagy" refers to the specific elimination of mitochondria through autophagy (PMID: 21179058). There are many reasons why I think that "transmitophagy" is not adequate to describe this phenomena but I will just refer to these three: First, authors do not provide data showing that this mechanism is specific for mitochondria as they have never checked for the presence of other type of cargo in the vesicles produced by RGCs. If these are related to exophers as they suggest in the document, is very probable that they contain other type of cargo; Second, if the final destiny for those particles is the acidic compartment of astrocytes, this process may have nothing to do with autophagy/mitophagy and just share some molecular mediators with those pathways; Third, they should explore if other canonical mitophagy molecular mediators (i.e. Parkin/Pink) are regulating the production or the mitochondria recruitment to this extracellular particles.

      We too struggle with our own “transmitophagy” term, for the very reasons stated. To address this concern, we now refer to the process as “transcellular degradation of mitochondria”, which is how we described it initially in mice as well. We do present new data that show that while the majority of axonal outpocketings contain mitochondria, not all do. This suggests that the others may contain other cargo, which supports the view that what we are dealing with in axons are indeed exophers. And yet, since what we measure is mitochondria, we think most appropriate to describe the process narrowly and not extrapolate to other types of exophers. We agree that what we originally discovered in mice and now live image and perturb in frog, may not be “autophagy” according to the strict definition of the term, but rather a process that uses some of the same molecular machinery, which given the evolutionary link between autophagy and phagocytosis that should be no surprise. Terminology can be tricky, and we thank the reviewer for calling us out on this point. We now use the term “transmitophagy” only once in the discussion section making the link between our work and the emerging field of exopher biology, and use that occasion to elaborate the point that the more descriptive term “transcellular degradation of mitochondria” is more appropriate in our case.

      *- In several experiments, authors use Mitotracker instead of genetic tools to quantify the amount of mitochondria co-localizing with OPTN (Fig2, Fig3) or being transferred to astrocytes (Fig4). A problem here is that Mitotracker needs the mitochondria to be active at the time of injection in order to label them (PMID: 21807856) and it has a clear effect in mitochondria dynamics in their setting, as pointed by the authors. Since most mitochondria transferred to astrocytes would be presumably damaged and not able to import Mitotracker, I am concern about how this is affecting their quantifications and the conclusions.

      *

      We agree. The use of Mitotracker to label the RGC mitochondria can be problematic for the reasons stated by reviewers 1 and 3. Indeed, our opinion is that many of the studies out there that claim to demonstrate transfer of mitochondria between cells likely are just showing the transfer of the dye rather than the mitochondria. While the previous submission included a number of controls to address this concern, we now provide multiple new experiments that measure the transfer of mitochondria through a transgene rather than Mitotracker. The provided experiments use a new Tom20-mCherry transgene which is highly specific to mitochondria due to the use of an SOD2 UTR. We have similar data using RGC-expressed Mito-mCherry and Mito-EGFP-mCherry (using the commonly used Cox8 mitochondria matrix targeting sequence); we do not include such data because we find the provided data sufficiently compelling, and the story is already sufficiently long and complicated.

      - Some conclusions are based on single images with no quantifications or statistics. This is the case for: 1) Page 6) "Most of the mCherry and Mitotracker objects colocalized with each other both in the merged images (Fig. S1C) and kymographs (Fig. S1D), indicating that the mitochondria-targeted transgene and Mitotracker similarly label the RGC axonal mitochondria".

      That is a fair comment. After reanalyzing the original dataset used, it would be very difficult to quantify that statement, largely because the Tom20-mCherry expression was relatively weak in those particular animals. We are confident that we could generate a new dataset to provide support for this statement, but instead chose to just provide side-by-side movies of mitochondria labeled by Mitotracker or the Tom20-mCherry transgenes, which we believe is far more compelling than any quantification we could provide.

      2) Page 8) "In the nerves labeled by Mitotracker, visual inspection of the raw images (Fig. 2C) and the derived kymographs (Fig. 2D) showed that OPTN and the Mitotracker labeled mitochondria often co-localized, particularly in the stopped populations, and more so in the animals expressing E50K OPTN, further suggesting that at least a fraction of the stopped LC3b, OPTN and mitochondria might represent mitophagy occurring in the axons".

      While we have made a minor change to this sentence, we feel that it is appropriate given that it serves just as a justification to carry out the quantitative studies that follow. We would not have quantified the process had it not been obvious to the eye. However, we do not interpret the results as supporting that mitophagy occurs in axons, for the reasons explained above.

      3) Page 14) "We also observed similar axonal dystrophies and exopher-like structures in E50K OPTN under similar imaging settings, but with 2-min intervals and additional Mitotracker labeling (Mov. 6), demonstrating that these structures not only contain OPTN but also mitochondria or mitochondria remnants". Image in video is not clear and there is not quantification for OPTN or OPTN+ mitochondria.*

      *

      We have removed Mov. 6.

      *Minor comments

      • In Figures showing the reconstruction of OPTN+ mitochondria outside nerve (Fig.3 and Fig.4), those seem to be present only in one lateral of the nerve. Is this process polarized in any way (i.e. faced to astrocytes) or is the result of a technical issue (i.e. difference in laser penetration for blue vs Yellow lasers)? I think it will be important to include this in the discussion.*

      This was also pointed out by reviewer 1, and we agree that it is worth including in the discussion, which we now do. While we do not believe it to be a light penetration issue (based on fluorescence intensities and apparent spatial resolution), we also do not yet have an explanation. Having studied dorsoventral differences in the visual pathway both during my graduate and post-doctoral years, I am very interested in this asymmetry, and we have some theories that might explain it, mentioned above. The asymmetry is obvious and thus we think it would have been inappropriate not to show, but it also be inappropriate to be overly speculative.

      - In Pag.13 authors claim "OPTN and mitochondria leave RGC axons in the form of exophers". After "exophers" were coined by the Driscoll lab in 2017, too few people has adopted this terminology and the molecular machinery involved in this process is still under research. It is clear that the particles described here share some similarities with exophers like size (in the range of microns) and cargo (mitochondria), but you have not demonstrated if they share the same origin or are part of the same phenomena. For that reason, I recommend to be more cautious with this statement and point these limitations in the discussion. Additionally, since Exophers are not a consensus or well defined particles, authors should include an introductory paragraph at the beginning of this section for readers to understand what they are talking about.

      We wholly agree with all points. We now have moved all mention of exophers to just the discussion.

      - Exophers described by Monica Driscoll and Andres Hidalgo laboratories are presented as "garbage bags" that help cells to stay fit through elimination of unwanted material. If the extracellular vesicles presented here are part of the same mechanism and potentially beneficial for the RGCs, why are they increased in OPTN mutants? Is it part of RGCs response to a proteomic stress generated by malfunctioning OPTN? I think that is critical to understand this to figure out the relevance of your findings.

      • *

      Our personal opinion is that the OPTN mutants most likely lead to stress focally in the axons, thus triggering exopher generation. We are carrying additional experiments to determine whether too much exopher generation or their insufficient degradation by astrocytes might be deleterious (by causing inflammation). However, those are big stories that would not stand on their own were we not able to first rigorously demonstrate that certain OPTN mutants increase exopher generation, which I believe our study demonstrates, albeit now without calling them exophers.

      - Related to Fig.5G, authors say "The soma of the astrocytes were located at the optic nerve periphery but had processes that extended deep into the parenchyma". This is very interesting and opens the possibility that many mitochondria are directly transferred to astrocytes through that processes instead of the lateral of the nerve, meaning that your quantifications of "transmitophagy" may be underestimated.

      * *We also agree that this. Our limited optical resolution, and limitations intrinsic to carrying out quantifications with Imaris software, are likely the main reasons for the discrepancy between the whole nerve and sparse-labelled-axon estimates of how much axonal material is outside of axons. Our view is that most of the transcellular degradation occurs within fine astrocyte processes, and that only in the case of failure to degrade material in these fine processes that significant amounts accumulate in the cell body (optic nerve periphery), and that in the cell body additional or different degradative pathways are utilized. Experiments using various transgenes and correlated EM as well as perturbation experiments are ongoing attempting to firmly establish what organelles are used in processes versus soma. However, we believe that such studies are well beyond the scope of this manuscript..

      - Reference to Fig. S2G is missing. Now mentioned twice. Thank you.

      - I cannot find in Fig.5 E-I legends what are the cells/structures labelled in Green and Red. Thank you.

      ***Referees cross-commenting**

      In agreement with my colleagues, I think that a revision is needed to support some important points of the paper. The the work is interesting and I think it deserves a chance for revision. Having that said, I am not familiar with the breeding and experimental times when working with Xenopus but, considering the amount of work requested, it may require more than 3 months to have the work done.

      *

      *Reviewer #3 (Significance (Required)):

      Until not very long ago, it was thought that mitochondria could not cross cell barriers. In recent years however, there has been an explosion in the number of works showing mitochondria transfer between different cell types in vivo. This may happen either as an organelle donation to improve energy production or as a quality control mechanism to get rid of damaged mitochondria, as it is the case in this work. The laboratory of Nicholas Marsh-Armstrong was pioneer in this field with a foundational work in 2014 where they show how RGC-derived mitochondria are captured and eliminated by astrocytes in mice (PMID: 24979790). This work was particularly relevant because it proposed for the first time that mitochondrial degradation can occur in RGC axons far from the cell soma, and surrogated in a different cell type, something that changed completely the view of how quality control is maintained in neurons and other cell types. In the present study, Jeong and collaborators explore how Glaucoma-associated Optineurin mutations affect this process, which is of potential interest for the broad cell biologist community due to its possible implications in other tissues and cell types (OPTN is broadly expressed), but especially for those researchers interested in neurobiology, quality control mechanisms and mitochondria biology. Since some OPTN mutations studied here cause disease, they are also relevant for the clinic. This work provides a thorough characterization of how relevant Optineurin mutations affect mitochondria dynamics in RGCs and their transference to astrocytes, as fairly claimed in the title. However, the mechanism by which they result in pathology is not either explored or carefully discussed, making this a descriptive work with no much conceptual insight. In addition, conclusions are often not unambiguously stated and the results part contains a lot of large sentences and unnecessary technical data that hinders reading and difficult the transmission of the key messages. Even if it stands as a descriptive work, the physiological and clinical relevance of these findings is not clear. There are some claims related with mitophagy activity that may require more sophisticated experiments (mitophagy flux with lysosomal inhibitors). Please see comments above. A critical point to understand the relevance of this work would be to demonstrate if alterations in transmitophagy are either causing or involved in the disease generated by these OPTN mutations in any way, or just a correlative phenomenon. To help authors contextualize my point of view, my field of expertise includes cell biology, imaging, quality control pathways, mitochondria biology and phagocytosis, among others. I am not familiar with Xenopus Laevis genetics or the limitations to work with this animal model.*

      • *

      We appreciate both the complements and the critiques. To a fault, we rather undersell than oversell. We are actively pursuing the possibility that dysregulation of this process is disease causing, and not just for glaucoma. However, those studies will not stand without a strong foundation, which we believe this study provides.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      In this work, Jeong et al describe the effect of Optineurin (OPTN) mutations in the transcellular degradation of retinal ganglion cell (RGC) mitochondria by astrocytes at the Optic Nerve (ON), a process previously described this group and referred as "transmitophagy" (Davis et al 2014). Here, authors use Xenopus laevis animal model to image the optic nerve of animals carrying different OPTN mutations associated to disease or with compromised function and explore its effect in mitochondria dynamics at the RGC axons. They find that OPTN mutants lead to increased stationary mitochondria in the nerve and affect their co-localization with mitophagy-related markers, suggesting alterations in this pathway. Finally, they found that mitochondria co-localizing with OPTN can be found in the periphery of the ON under different conditions and this is particularly increased in glaucoma-associated E50K mutation. This extracellular mitochondria are transferred in vesicles to astrocytes, as they previously described in mice (Davis 2014), where they are presumably degraded.

      Major comments

      • OPTN levels at a given time point cannot be used as readout for mitophagy level/flux. Both OPTN and LC3b are degraded upon fusion with acidic compartment (i.e. lysosomes, PMID: 33783320, 33634751) and that is the reason why the field of autophagy /mitophagy blocks lysosomal activity to measure autophagy/mitophagy flux (PMID: 33634751). In this document, authors claim that there is low levels of mitophagy in RGC axons at baseline and increased levels of mitophagy in glaucoma associated perturbations just based on increased presence of OPTN+ mitochondria in this condition. This could be also interpreted as an accumulation of non-degraded defective mitochondria due to a mitophagy block in neurons carrying the glaucoma associated mutation, which is the opposite of what they propose. If authors want to evaluate mitophagy levels in this system, mitophagy/autophagy flux experiments should be performed.
      • I find inappropriate the use of the term "transmitophagy". Although this term transmits very well the message that the authors try to strength, the term "mitophagy" refers to the specific elimination of mitochondria through autophagy (PMID: 21179058). There are many reasons why I think that "transmitophagy" is not adequate to describe this phenomena but I will just refer to these three: First, authors do not provide data showing that this mechanism is specific for mitochondria as they have never checked for the presence of other type of cargo in the vesicles produced by RGCs. If these are related to exophers as they suggest in the document, is very probable that they contain other type of cargo; Second, if the final destiny for those particles is the acidic compartment of astrocytes, this process may have nothing to do with autophagy/mitophagy and just share some molecular mediators with those pathways; Third, they should explore if other canonical mitophagy molecular mediators (i.e. Parkin/Pink) are regulating the production or the mitochondria recruitment to this extracellular particles.
      • In several experiments, authors use Mitotracker instead of genetic tools to quantify the amount of mitochondria co-localizing with OPTN (Fig2, Fig3) or being transferred to astrocytes (Fig4). A problem here is that Mitotracker needs the mitochondria to be active at the time of injection in order to label them (PMID: 21807856) and it has a clear effect in mitochondria dynamics in their setting, as pointed by the authors. Since most mitochondria transferred to astrocytes would be presumably damaged and not able to import Mitotracker, I am concern about how this is affecting their quantifications and the conclusions.
      • Some conclusions are based on single images with no quantifications or statistics. This is the case for:
        1. Page 6) "Most of the mCherry and Mitotracker objects colocalized with each other both in the merged images (Fig. S1C) and kymographs (Fig. S1D), indicating that the mitochondria-targeted transgene and Mitotracker similarly label the RGC axonal mitochondria".
        2. Page 8) "In the nerves labeled by Mitotracker, visual inspection of the raw images (Fig. 2C) and the derived kymographs (Fig. 2D) showed that OPTN and the Mitotracker labeled mitochondria often co-localized, particularly in the stopped populations, and more so in the animals expressing E50K OPTN, further suggesting that at least a fraction of the stopped LC3b, OPTN and mitochondria might represent mitophagy occurring in the axons".
        3. Page 14) "We also observed similar axonal dystrophies and exopher-like structures in E50K OPTN under similar imaging settings, but with 2-min intervals and additional Mitotracker labeling (Mov. 6), demonstrating that these structures not only contain OPTN but also mitochondria or mitochondria remnants". Image in video is not clear and there is not quantification for OPTN or OPTN+ mitochondria.

      Minor comments

      • In Figures showing the reconstruction of OPTN+ mitochondria outside nerve (Fig.3 and Fig.4), those seem to be present only in one lateral of the nerve. Is this process polarized in any way (i.e. faced to astrocytes) or is the result of a technical issue (i.e. difference in laser penetration for blue vs Yellow lasers)? I think it will be important to include this in the discussion.
      • In Pag.13 authors claim "OPTN and mitochondria leave RGC axons in the form of exophers". After "exophers" were coined by the Driscoll lab in 2017, too few people has adopted this terminology and the molecular machinery involved in this process is still under research. It is clear that the particles described here share some similarities with exophers like size (in the range of microns) and cargo (mitochondria), but you have not demonstrated if they share the same origin or are part of the same phenomena. For that reason, I recommend to be more cautious with this statement and point these limitations in the discussion. Additionally, since Exophers are not a consensus or well defined particles, authors should include an introductory paragraph at the beginning of this section for readers to understand what they are talking about.
      • Exophers described by Monica Driscoll and Andres Hidalgo laboratories are presented as "garbage bags" that help cells to stay fit through elimination of unwanted material. If the extracellular vesicles presented here are part of the same mechanism and potentially beneficial for the RGCs, why are they increased in OPTN mutants? Is it part of RGCs response to a proteomic stress generated by malfunctioning OPTN? I think that is critical to understand this to figure out the relevance of your findings.
      • Related to Fig.5G, authors say "The soma of the astrocytes were located at the optic nerve periphery but had processes that extended deep into the parenchyma". This is very interesting and opens the possibility that many mitochondria are directly transferred to astrocytes through that processes instead of the lateral of the nerve, meaning that your quantifications of "transmitophagy" may be underestimated.
      • Reference to Fig. S2G is missing.
      • I cannot find in Fig.5 E-I legends what are the cells/structures labelled in Green and Red.

      Referees cross-commenting

      In agreement with my colleagues, I think that a revision is needed to support some important points of the paper. The the work is interesting and I think it deserves a chance for revision. Having that said, I am not familiar with the breeding and experimental times when working with Xenopus but, considering the amount of work requested, it may require more than 3 months to have the work done.

      Significance

      Until not very long ago, it was thought that mitochondria could not cross cell barriers. In recent years however, there has been an explosion in the number of works showing mitochondria transfer between different cell types in vivo. This may happen either as an organelle donation to improve energy production or as a quality control mechanism to get rid of damaged mitochondria, as it is the case in this work. The laboratory of Nicholas Marsh-Armstrong was pioneer in this field with a foundational work in 2014 where they show how RGC-derived mitochondria are captured and eliminated by astrocytes in mice (PMID: 24979790). This work was particularly relevant because it proposed for the first time that mitochondrial degradation can occur in RGC axons far from the cell soma, and surrogated in a different cell type, something that changed completely the view of how quality control is maintained in neurons and other cell types.

      In the present study, Jeong and collaborators explore how Glaucoma-associated Optineurin mutations affect this process, which is of potential interest for the broad cell biologist community due to its possible implications in other tissues and cell types (OPTN is broadly expressed), but especially for those researchers interested in neurobiology, quality control mechanisms and mitochondria biology. Since some OPTN mutations studied here cause disease, they are also relevant for the clinic.

      This work provides a thorough characterization of how relevant Optineurin mutations affect mitochondria dynamics in RGCs and their transference to astrocytes, as fairly claimed in the title. However, the mechanism by which they result in pathology is not either explored or carefully discussed, making this a descriptive work with no much conceptual insight. In addition, conclusions are often not unambiguously stated and the results part contains a lot of large sentences and unnecessary technical data that hinders reading and difficult the transmission of the key messages.

      Even if it stands as a descriptive work, the physiological and clinical relevance of these findings is not clear. There are some claims related with mitophagy activity that may require more sophisticated experiments (mitophagy flux with lysosomal inhibitors). Please see comments above. A critical point to understand the relevance of this work would be to demonstrate if alterations in transmitophagy are either causing or involved in the disease generated by these OPTN mutations in any way, or just a correlative phenomenon. To help authors contextualize my point of view, my field of expertise includes cell biology, imaging, quality control pathways, mitochondria biology and phagocytosis, among others. I am not familiar with Xenopus Laevis genetics or the limitations to work with this animal model.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: This article studied transmitophagy in xenopus optic nerves in the context of overexpressing glaucoma-associated optineurin mutations. Using a series of labeling, imaging and transplantation techniques, the authors found that overexpressing mutated optineurins stops mitochondria movements and potentially induces transmitophagy, and that astrocytes are responsible for taking up the extra-axonal mitochondria. Below are my comments on this article.

      Major comments:

      1. Identifying extra-axonal mitochondria is key to this research. In Figure 3, the authors used EGFP-LC3B as a marker for RGC boundaries. However, it is unconvincing how perfect LC3B is as a cell membrane marker. Particularly in the case of OPTN E50K OE, it seems that the optic nerve is thinner than the WT condition, which makes the quantification of extra-axonal OPTN less convincing. The authors should detect extra-axonal mitochondria with an RGC membrane marker or cytosolic marker. In addition, in Figure 3, the extra-axonal mitochondria seem to localize mostly on the dorsal surface. Why is there such a polarity?
      2. The experiment in Figure 5 is very important as it gives direct evidence of transmitophagy. However, one caveat is that the mitotracker injection is done after the transplantation. If in rare cases the dye is leaky after injection and is taken up by astrocytes directly, then the conclusion that mitochondria from RGCs are phagocytosed by astrocytes will be flawed. The authors should either use a transgene in the donor to label mitochondria or inject mitotracker into the donor before the transplantation and repeat the experiments. In addition, in Figure 5E, what is the large membranous structure inside the highlighted astrocyte? Is it associated with phagocytosis?
      3. This research is entirely based on overexpression of OPTN. Since overexpressing WT OPTN does seem to affect mito trafficking (Figure S2G, and the description in the manuscript is often inconsistent with this result), it is unclear what the increased stalled mitochondria really mean when overexpressing mutated OPTN. Similarly, the authors examined extra-axonal mitochondria in Figures 3 and 4 all in overexpressing conditions, and made the connection that increased stalled mitochondria lead to transmitophagy. However, this conclusion will be better supported by using mutant animals rather than overexpression. The authors should consider using OPTN mutant xenopus if available or using CRISPR to introduce the specific mutations and repeat mitochondria trafficking and transmitophagy.
      4. On Page 12, the authors claim that even overexpressing WT OPTN causes extra-axonal mitochondria in the optic nerve. However, there is no control condition without OE to support this conclusion. It is thus unclear to what extent extra-axonal mitochondria occur at baseline and how many extra-axonal mitochondria can be induced by overexpression. The authors should include, in Figure 3 and 4, controls without overexpression.
      5. A technical question regarding kymographs: Based on Figure 2C, it looks that OPTN and LC3B labeling are pretty diffuse in axons and this makes sense since they may only be associated with damaged mitos. But this raises a question about how accurate the kymograph assay is. It may significantly underestimate the fraction of OPTN/LC3B that is stationary since they appeared diffusedon the kymograph. This may explain why the percentage of stationary OPTN/LC3B is so small when the authors OE WT OPTN in Figure 2E and 2E', compared to the percentage of moving mitochondria shown in Figure 1E.

      Minor:

      1. Figure 2E and 2E' do not agree with the text on page 7 and page 8. Not only F178A, but also H486R and D474N have no effect on OPTN trafficking. The authors should make their conclusions more accurate.
      2. Figure S2E-F: why does OE of mutated OPTN in F1s but not in F0s reduce trafficking speed compared to WT?
      3. In movie 5, fusion of exopher with other structures is not clear and also the GFP signal does not disappear, which is in contrast to the statement in the text that the GFP signal is quenched in acidified environment. To confirm that LC3B leaves RGC axons in exophers, the authors should consider switching the fluorophores and examine LC3B localization during exopher formation.
      4. In figure 6, to better show exopher formation and the pinching-off step, the authors should consider labeling the membrane and mitochondria instead of using the LC3B and OPTN marker.

      Referees cross-commenting

      Generally agree with the criticisms voiced by the other reviewers; in aggregate the reviews indicate the manuscript needs more than just a quick fix.

      Significance

      Previous literature has already described the transmitophagy process in the optic nerve. The significance of this paper lies in the observation that overexpressing glaucoma-associated OPTN mutants can induce increased transmitophagy through astrocytes, which points to a potential role of OPTN in glaucoma. A highlight of this paper is the use of correlated light SBEM to directly show transmitophagy in astrocytes. However, the significance of this paper may be limited for the following reasons: 1. everything is based on overexpression of mutated OPTN, which makes it hard to translate the results to real disease conditions; 2. The consequence of increased transmitophagy on RGC survival or visual functions is unclear.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Glaucoma-associated optineurin mutations increase transmitophagy in vertebrate optic nerve.

      Summary

      In Jeong et al., the authors perform live imaging of the X. laevis optic nerve to track neuronal mitochondrial movement and expulsion in an intact nervous system. The authors observe similar mitochondrial dynamics in vivo as previously described in other systems. They find that stationary mitochondria are more likely to be associated with OPTN, suggestive of mitochondria undergoing mitophagy. Forced expression of OPTN mutations results in a larger pool of stationary mitochondria that colocalize withLC3B, and OPTN. Finally, the authors argue that extra-axonal mitochondria are observed more frequently in OPTN mutants, suggesting that mutations in OPTN that are associated with disease can lead to an increase in the expulsion of mitochondria through exopher-like structures.

      Major Findings and impact:

      • The authors establish that mitochondria dynamics can be tracked in the X. laevis optic nerve.
      • OPTN mutations increase the stationary pool of mitochondria and likely result in increased rates of mitophagy.
      • Exopher-like structures containing mitochondria and LC3 can be expelled from the optic nerve and increase in the presence of OPTN mutations. These structures were observed in a living system and have interesting implications in the context of disease.

      Concerns:

      • The authors state in their results that the secreted blebs are exophers. While these initial observations are consistent with exophers, additional data are needed to strengthen this claim. For example: what are the sizes of secreted vesicles? Do all express LC3? How frequently do these occur? From where are they expelling? Alternatively, the discussion of exophers could be moved to the discussion.
      • Quantifications in sparse labeling experiments seem quite surprising and concerns related to these findings should be addressed. As the authors used LC3b expression to represent axonal volume, the authors should demonstrate that this is the case using an axonal fill or membrane marker in both the wt and E50K conditions. This is important as it is unclear whether LC3b expression is consistent between the wild type and the E50K conditions. Lower expression of LC3b in E50K could account for the large changes in axonal width that seem to be observed and could confound the measured amount of expelled mitochondria.
      • Could large amounts of exogenous mitochondria in explant experiments be from cells that died during the plantation?

      Suggested experiments/quantifications:

      • In OPTN/MITO/LC3b trafficking experiments, does flux/number of events change? Representative kymograph in Figure 2D seems to show far more OPTN-positive mitochondria which is opposite of what is shown in Figure 2C.
      • Demonstrate that axonal width measured with LC3B is representative of axonal fill/membrane marker in wt and E50K. Axonal area appears to change, is this accurate? This appears to be the case for both figure 3 and figure 4.
      • Raw images in addition to the reconstruction would be beneficial.
      • Further characterization of exopher-like structures.

      Referees cross-commenting

      I agree with the concerns of the other reviewers, and perhaps was over-optimistic about a timeline for revision. However, I do think the work is worth the effort, and I hope to see a revised manuscript published somewhere, as these observations are novel

      Significance

      This work reports potentially novel biology, and thus will be of interest to the field. The strength of the study is that it is an initial description of this biology, rather than a complete analysis. The work raises many more questions than it answers, and much further work on this topic is required to support these initial findings, but the manuscript will likely be of interest to many. Revisions are required to improve the rigor and clarity of the work, but following these revisions we recommend publication to facilitate follow-up work.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Evidence, reproducibility and clarity

      Summary:

      This manuscript by Xu, Hörner, Schüle and colleagues is an RNA-seq study focusing characterization of axonal transcriptomes from human iPSC-derived cortical neurons. The authors have differentiated iPSC into neurons, cultured them in microfluidic devices and isolated axonal RNA, comparing this to corresponding cell soma transcriptomes. Second, axonal transcriptomes are compared between wild type and Kif1c knockout axons to determine Kif1c-dependently localized transcripts. Characterization of the latter allows the authors to suggest differentially expressed transcripts in Kif1c-KO axons can be mRNAs relevant for motor neuron degeneration owing to Kif1c mutations in hereditary spastic paraplegias.

      Major comments: Overall, his manuscript reads like work in (early) progress. This manuscript provides an interesting dataset, but needs substantial additional experimental and/or bioinformatic work to merit publication. The technical complexity of steps that have led to obtaining axonal transcriptomes can be appreciated, the soundness of generating these data is beyond doubt. However, the study stops at the point of generating axonal transcriptomes from wild type and Kif1c axons. No follow-up experiments are performed to study genes of interest found in RNA-seq. This could be compensated by in-depth bioinformatic analysis (e.g. comparisons with the many different datasets in known in the field), but this is clearly lacking as well. The results section only contains minimal bioinformatic analysis and nothing else. Introduction and discussion are well, clearly written and are in good dialogue with the existing body of work. To improve the manuscript, at minimum these two aspects should be addressed: 1. Characterization of the iPSC-derived neurons is missing (immunostaining with neuronal markers, e.g. Tau, MAP2, exclusion of glial markers, and lack of stem cell markers) 2. Validation of candidates of interest (e.g. FISH analysis in axons vs somata, Kif1c vs wt). Very specific requests from the review are useless at this point, as the authors should have the liberty to focus.

      Thank you for the review of our manuscript. We appreciate your recognition of the technical complexity involved in generating axonal transcriptomes and the clarity of our introduction and discussion sections.

      __Characterization of iPSC-derived neurons: __We acknowledge the importance of immunostaining with neuronal markers to ensure the purity of our neuronal population. We included this characterization in our revised manuscript and added it into the results and methods section of the paper (Supplementary Figure S1). Additionally, we included RT-qPCR analysis that confirmed the presence of cortical markers and added these to the results and method section of the paper (Supplementary Figure S2).

      Additional bioinformatic work: We agree that additional bioinformatic work will greatly benefit this paper. Therefore, we compared our datasets to all additional datasets that we were able to retrieve. This was added to the main text (results and discussion) and supplementary material (Supplementary Figure S5 and S6). We believe this strengthens the merit of our paper, and adds a lot of new unpublished information to the manuscript

      __Validation of candidates of interest: __We understand the necessity of validating our RNA-seq findings through experimental approaches such as FISH analysis and comparisons between KIF1C knockout and wild-type neurons. While we appreciate the comment and agree on the importance of high-resolution RNA FISH, we believe it is beyond the scope of this manuscript due to the considerable complexity of these experiments in human iPSC-derived cortical neurons. We will focus on incorporating this aspect into future studies and added a corresponding statement outlining the limitations of our study in the discussion stressing the importance of this.

      Minor comments: 1. Details of RNA seq technicalities are redundant in the results section, e.g. „Our RNA-seq pipeline encompassed read quality control (QC), RNA-seq mapping, and gene quantification" (p. 7) is a trivial description - this and similar details should be skipped or described in methods.

      We will ensure that technical details are appropriately placed in the methods section and avoid redundancy in the results. Technical details included in the results section have been moved to the methods.

      1. Fig1A: Y axis should start from 0

      We adjusted Figure 1A to start the Y-axis from 0.

      1. Too much interpretational voice in figure legends (e.g. see Fig. 1, „PC1 clearly distinguishes the soma (blue)"

      We revised the interpretational voice in the figure legends to maintain objectivity.

      1. PCA analysis seems redundant in Fig. 2C

      We removed the PCA analysis in Fig. 2A (2C corresponds to Gene ontology term enrichment analysis).

      1. Subheading „Human motor axons show a unique transcription factor profile" is misleading - you are not dealing with motor iPS-derived motoneurons (Isl-1 positive), but cortical neurons (again, no marker information provided to assess this!)

      The subheading „Human motor axons show a unique transcription factor profile" was adjusted. Furthermore, validation of neuronal identity has been added to the supplementary figures (Supplementary Figure S1 and S2), as well as main text and methods section.

      1. Fig. 3: Just by comparing top expressed factors in axonal samples is not informative - overall high expression of a certain transcript likely makes it easier for it to be picked up in the axonal compartment. Axon/soma ratios would perhaps be more appropriate.

      After careful consideration, we decided that we will not change the data presentation in Figure 3. Our aim in this figure was not to compare axon and soma but to see highly expressed transcripts in the axon, regardless of whether they are highly expressed in the soma as well. We think that looking at transcripts present in the axon can give information about axonal function, that we might lose when we only consider transcripts that are upregulated compared to the soma. The fact that 25 out of 50 transcription factor RNAs detected in the axon are actually specific to the axons supports this point of view. The comparison between transcripts expressed in axon and soma are presented in Figure 2.

      1. Figure 4 (KIF1C modulates the axonal transcriptome): you should show also data for the same genes in the soma, axonal data only is misleading (is overall expression changed?)

      We appreciate your suggestion. This data was already included in Supplementary Figure S6 (now Supplementary Figure S9). To make this easier to find, we've added a section to the results part to more clearly state how transcript expression changes in the soma.

      Significance

      Axonal transcriptomes have been studied since early 2010s by a number of groups and several datasets exist from different model systems. The authors know these studies well, address their findings and cite them appropriately. Is the dataset in this manuscript novel? Does it contribute to the field? Several axonal transcriptomes have been characterized in thorough studies, and even in the specific niche (human IPS-derived motoneurons) a point of reference exists - as the authors themselves point out, it is the Nijssen 2018 study. With appropriate presentation and follow-up experiments this material could have merit as a replication study.

      Audience: specialized

      We appreciate the reviewer's suggestion to clarify the differences between our findings and previously published data. In response, we have added a dedicated section to the discussion, where we provide a more detailed comparison of our results with existing research. This includes an in-depth examination of the methodologies, experimental conditions, and biological contexts that may explain the observed discrepancies (e.g., variations in methods, neuronal types, and disease contexts). As prior studies primarily focused on mouse-derived neurons, we have included a new section in both the results (Supplementary Figure S6) and the discussion to highlight the limited overlap in gene expression between the axons of mouse- and human-derived neurons. Furthermore, previous studies on human-derived cells either investigated i3 neurons -induced by transcription factors but not fully representative of human-derived CNS-resident neurons - or neurons of the peripheral nervous system (lower motor neurons). In contrast, our study focuses on human-derived CNS-resident cortical neurons (Supplementary Figure S1, S2; comparison shown in Supplementary Figure S5), emphasizing the greater translatability of our findings.

      Moreover, we have expanded our bioinformatic analyses and compared our dataset with additional datasets to further substantiate our conclusions (Supplementary Figure S5, S6)

      We believe that these revisions significantly enhance the clarity, quality, and impact of our manuscript. We sincerely thank the reviewer for their constructive feedback.

      Reviewer #2

      Evidence, reproducibility and clarity

      This study seeks to identify axonal transcriptome by RNA-sequencing of the iPSC-derived cortical neuron axons. This is achieved by comparing the RNA expressions between the axonal and soma compartments using microfluid system. The specific expression of axon specific RNAs in the axonal compartment validate the specificity of the approach. Some unique RNAs including TF specific RNAs are identified. Furthermore, this study compared the KIF1C-knockout neurons (which models hereditary spastic paraplegia characterized by axonal degeneration) with wildtype (WT) control neurons, which led to the identification of specific down-regulated RNAs involved in axonal development and guidance, neurotransmission, and synaptic formation.

      The data of this study are interesting and clearly presented. The major concerns are the lack of characterization of the neuron identities and the examination of functional deficits in the KIF1C-knockout neurons. For example: 1) are these neurons express layer V/VI markers at protein levels, and the proportion of positive neurons (efficiency of cortical neuron differentiation); 2) What are the phenotypic changes in the KIF1C-knockout neurons; are there change sin axonal growth or transport? 3) Day 58 was selected for collecting RNA for sequencing study: how this time point is selected? And are there phenotypic differences between the WT and knockout neurons at this time point?

      We appreciate the favorable review of our manuscript and the insightful comments:

      Characterization of neuron identities: We agree on the importance of validating neuron identities and included protein-level characterization of layer V/VI markers and efficiency of cortical neuron differentiation in our revised manuscript: We conducted immunohistochemical staining for layer V/VI and other neuronal markers, as well as qRT-PCR to validate the identity of the neurons, ensuring a comprehensive characterization of our neuronal population.

      Functional deficits in KIF1C-knockout neurons: We have conducted phenotypic examinations of the neurons but did not observe gross differences in differentiation, axon growth or axon length. We added a corresponding statement to the results section. Neurons were harvested at DAI 58 because at this time we achieved a nearly confluent chamber that yielded enough material for in-depth RNA-sequencing. We did not observe phenotypic differences between wt and KIF1C-KO neurons at this time point. We added a statement to the method section outlining this.

      Some minor comments:1. The protein levels of some critical factors needs to be validated.

      We validated neuronal identities on qRT-PCR level (Supplementary Figure S2). While we understand the necessity of validating our RNA-seq findings on protein level, we believe it is beyond the scope of this manuscript. However, we will focus on incorporating this aspect into future studies and added a corresponding statement outlining the limitations of our study in the discussion stressing the importance of this.

      1. Figure 4C, for the list genes, statistical analyses between WT and knockout groups are required.

      In Figure 4C we only included differentially expressed genes with a p-value We added a corresponding statement in the main text and figure legend.

      1. Page 15, the 5th to last sentence: "nucleus nucleus" (repeat)

      The repeat word on page 15 was deleted.

      1. The sequencing data requires public links to the deposited library

      We will provide public links to the deposited library for the sequencing data once the data is submitted to a journal (depending on journal guidelines).

      Significance

      The strength of this study is the combinations of iPSC differentiation, gene editing (KIF1C knockout iPSC) and microfluidic system. This allows the identification of specific axonal transcriptomes. Moreover, the comparisons of control and KIF1C knockout neurons at both axon and soma compartments enables the identification of RNAs and pathways caused by the loss of KIF1C.

      The limitation is the lack of functional assessment of the iPSC-derived neurons, especially phenotypic changes in the KIF1C-knockout neurons. Only one time point is selected for comparing the WT and KIF1C knockout neurons, and the relationship between this time point and disease phenotypes is unclear.

      This study will be of interest to researchers from both basic and translational fields, and in the fields of stem cells, neuroscience, neurology and genetics.

      My expertise includes stem cells, iPSC modeling, motor neuron diseases, and nerve degeneration.

      We appreciate the favorable significance statement and believe addressing these points will strengthen the scientific rigor and impact of our study. Thank you for your valuable feedback.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):  Using microfluidics chambers and RNA sequencing (RNA-seq) of axons from iPSC-derived human cortical neurons, authors use RNA profiling to investigate the RNAs present in the soma and axons and the impact of KIF1C molecular motor downregulation (KIF1CKO) on the axonal transcriptome. The rationale is that mutations in KIF1C are associated with an autosomal recessive form of hereditary spastic paraplegia, and KIF1C is implicated in the long-range directional transport of APC-dependent mRNAs and RNA-dependent transport of the exon junction complex into neurites.  Employing a well-defined RNA-seq pipeline for analysis, they obtained RNA sequences particular to axonal samples, outperforming previous studies. They detected over 16,000 genes in the soma (which includes axons) and RNA for more than 5,000 genes in axons. A comparison of the list of axonal genes revealed a strong correlation with previous publications, but they detected more genes overall. They identified transcripts enriched in axons compared to somas, notably those for ribosomal and mitochondrial proteins. Indeed, they observed enrichment for ribosomal subunits, respiratory chain complexes, ion transport, and mRNA splicing.  The study also found that human axons exhibit a unique RNA transcription profile of transcription factors (TFs), with TFs such as GTF3A and ATF4 predominant in axons. At the same time, CREB3 was highly expressed in the soma.  Upon analyzing the soma and axon transcriptomes from KIF1CKO cultures, they identified 189 differentially regulated transcripts: 89 downregulated and 100 upregulated in the KIF1CKO condition. Some of these transcripts are critical for synaptic growth and neurotransmission. Notably, only two targets of APC-target RNAs were downregulated, contrary to their expectation. Their data indicates that KIF1C downregulation significantly alters the axonal transcriptome landscape.  Reviewer #3 (Significance (Required)):  The study is well-performed and informative, particularly for researchers interested in the local translation of axonal proteins and the axonal transcriptome. However, the authors did not validate their findings for any transcripts and did not perform any functional assays, so the manuscript lacks mechanistic insight. Interestingly, GTF3A is a transcription factor that stimulates polymerase III transcription of ribosomal proteins, and mRNAs for ribosomal proteins are enriched in human axons. Maybe there is an interesting story there. 

      We appreciate the favorable significance statement and the valuable feedback. We have conducted phenotypic examinations of the neurons but did not observe gross differences in differentiation, axon growth or axon length. We added a corresponding statement to the results section. While we understand the necessity of validating our RNA-seq findings on protein level, we believe it is beyond the scope of this manuscript. However, we will focus on incorporating this aspect into future studies and added a corresponding statement outlining the limitations of our study in the discussion stressing the importance of this.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Using microfluidics chambers and RNA sequencing (RNA-seq) of axons from iPSC-derived human cortical neurons, authors use RNA profiling to investigate the RNAs present in the soma and axons and the impact of KIF1C molecular motor downregulation (KIF1CKO) on the axonal transcriptome. The rationale is that mutations in KIF1C are associated with an autosomal recessive form of hereditary spastic paraplegia, and KIF1C is implicated in the long-range directional transport of APC-dependent mRNAs and RNA-dependent transport of the exon junction complex into neurites.

      Employing a well-defined RNA-seq pipeline for analysis, they obtained RNA sequences particular to axonal samples, outperforming previous studies. They detected over 16,000 genes in the soma (which includes axons) and RNA for more than 5,000 genes in axons. A comparison of the list of axonal genes revealed a strong correlation with previous publications, but they detected more genes overall. They identified transcripts enriched in axons compared to somas, notably those for ribosomal and mitochondrial proteins. Indeed, they observed enrichment for ribosomal subunits, respiratory chain complexes, ion transport, and mRNA splicing. The study also found that human axons exhibit a unique RNA transcription profile of transcription factors (TFs), with TFs such as GTF3A and ATF4 predominant in axons. At the same time, CREB3 was highly expressed in the soma. Upon analyzing the soma and axon transcriptomes from KIF1CKO cultures, they identified 189 differentially regulated transcripts: 89 downregulated and 100 upregulated in the KIF1CKO condition. Some of these transcripts are critical for synaptic growth and neurotransmission. Notably, only two targets of APC-target RNAs were downregulated, contrary to their expectation. Their data indicates that KIF1C downregulation significantly alters the axonal transcriptome landscape.

      Significance

      The study is well-performed and informative, particularly for researchers interested in the local translation of axonal proteins and the axonal transcriptome. However, the authors did not validate their findings for any transcripts and did not perform any functional assays, so the manuscript lacks mechanistic insight. Interestingly, GTF3A is a transcription factor that stimulates polymerase III transcription of ribosomal proteins, and mRNAs for ribosomal proteins are enriched in human axons. Maybe there is an interesting story there.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This study seeks to identify axonal transcriptome by RNA-sequencing of the iPSC-derived cortical neuron axons. This is achieved by comparing the RNA expressions between the axonal and soma compartments using microfluid system. The specific expression of axon specific RNAs in the axonal compartment validate the specificity of the approach. Some unique RNAs including TF specific RNAs are identified. Furthermore, this study compared the KIF1C-knockout neurons (which models hereditary spastic paraplegia characterized by axonal degeneration) with wildtype (WT) control neurons, which led to the identification of specific down-regulated RNAs involved in axonal development and guidance, neurotransmission, and synaptic formation.

      The data of this study are interesting and clearly presented. The major concerns are the lack of characterization of the neuron identities and the examination of functional deficits in the KIF1C-knockout neurons. For example: 1) are these neurons express layer V/VI markers at protein levels, and the proportion of positive neurons (efficiency of cortical neuron differentiation); 2) What are the phenotypic changes in the KIF1C-knockout neurons; are there change sin axonal growth or transport? 3) Day 58 was selected for collecting RNA for sequencing study: how this time point is selected? And are there phenotypic differences between the WT and knockout neurons at this time point?

      Some minor comments:

      1. The protein levels of some critical factors needs to be validated.
      2. Figure 4C, for the list genes, statistical analyses between WT and knockout groups are required.
      3. Page 15, the 5th to last sentence: "nucleus nucleus" (repeat)
      4. The sequencing data requires public links to the deposited library

      Significance

      The strength of this study is the combinations of iPSC differentiation, gene editing (KIF1C knockout iPSC) and microfluidic system. This allows the identification of specific axonal transcriptomes. Moreover, the comparisons of control and KIF1C knockout neurons at both axon and soma compartments enables the identification of RNAs and pathways caused by the loss of KIF1C.

      The limitation is the lack of functional assessment of the iPSC-derived neurons, especially phenotypic changes in the KIF1C-knockout neurons. Only one time point is selected for comparing the WT and KIF1C knockout neurons, and the relationship between this time point and disease phenotypes is unclear.

      This study will be of interest to researchers from both basic and translational fields, and in the fields of stem cells, neuroscience, neurology and genetics.

      My expertise includes stem cells, iPSC modeling, motor neuron diseases, and nerve degeneration.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      This manuscript by Xu, Hörner, Schüle and colleagues is an RNA-seq study focusing characterization of axonal transcriptomes from human iPSC-derived cortical neurons. The authors have differentiated iPSC into neurons, cultured them in microfluidic devices and isolated axonal RNA, comparing this to corresponding cell soma transcriptomes. Second, axonal transcriptomes are compared between wild type and Kif1c knockout axons to determine Kif1c-dependently localized transcripts. Characterization of the latter allows the authors to suggest differentially expressed transcripts in Kif1c-KO axons can be mRNAs relevant for motor neuron degeneration owing to Kif1c mutations in hereditary spastic paraplegias.

      Major comments:

      Overall, his manuscript reads like work in (early) progress. This manuscript provides an interesting dataset, but needs substantial additional experimental and/or bioinformatic work to merit publication. The technical complexity of steps that have led to obtaining axonal transcriptomes can be appreciated, the soundness of generating these data is beyond doubt. However, the study stops at the point of generating axonal transcriptomes from wild type and Kif1c axons. No follow-up experiments are performed to study genes of interest found in RNA-seq. This could be compensated by in-depth bioinformatic analysis (e.g. comparisons with the many different datasets in known in the field), but this is clearly lacking as well.

      The results section only contains minimal bioinformatic analysis and nothing else. Indroduction and discussion are well, clearly written and are in good dialogue with the existing body of work. To improve the manuscript, at minimum these two aspects should be addressed:

      1. Characterization of the iPSC-derived neurons is missing (immunostaining with neuronal markers, e.g. Tau, MAP2, exclusion of glial markers, and lack of stem cell markers)
      2. Validation of candidates of interest (e.g. FISH analysis in axons vs somata, Kif1c vs wt). Very specific requests from the review are useless at this point, as the authors should have the liberty to focus.

      Minor comments:

      1. Details of RNA seq technicalities are redundant in the results section, e.g. „Our RNA-seq pipeline encompassed read quality control (QC), RNA-seq mapping, and gene quantification" (p. 7) is a trivial description - this and similar details should be skipped or described in methods.
      2. Fig1A: Y axis should start from 0
      3. Too much interpretational voice in figure legends (e.g. see Fig. 1, „PC1 clearly distinguishes the soma (blue)"
      4. PCA analysis seems redundant in Fig. 2C
      5. Subheading „Human motor axons show a unique transcription factor profile" is misleading - you are not dealing with motor iPS-derived motoneurons (Isl-1 positive), but cortical neurons (again, no marker information provided to assess this!)
      6. Fig. 3: Just by comparing top expressed factors in axonal samples is not informative - overall high expression of a certain transcript likely makes it easier for it to be picked up in the axonal compartment. Axon/soma ratios would perhaps be more appropriate.
      7. Figure 4 (KIF1C modulates the axonal transcriptome): you should show also data for the same genes in the soma, axonal data only is misleading (is overall expression changed?)

      Significance

      Axonal transcriptomes have been studied since early 2010s by a number of groups and several datasets exist from different model systems. The authors know these studies well, address their findings and cite them appropriately. Is the dataset in this manuscript novel? Does it contribute to the field? Several axonal transcriptomes have been characterized in thorough studies, and even in the specific niche (human IPS-derived motoneurons) a point of reference exists - as the authors themselves point out, it is the Nijssen 2018 study. With appropriate presentation and follow-up experiments this material could have merit as a replication study.

      Audience: specialized

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this study, the authors investigate potential developmental changes in Leishmania infantum isolates from different regions of Brazil (plus a single isolate from Portugal) both in vitro and in vivo. The manuscript present interesting phenotypic comparisons between isolates with a 12-kb deletion of a subtelomeric region on Chr31 and isolates with a complete sequence. The authors suggest that distinctions associated with the presence or absence of that region may potentially be linked with fitness in the wild. A more detailed analysis of such valuable sample cohort would increase the significance of this research. Therefore, I highlight a few important concerns that need to be addressed, as well as a disconnect between the presented data and conclusions made.

      Major issues

      • The % of metacyclics in all the sand fly infection experiments was extremely low ( <6%). The only exception was NonDEL_PI_2972, which showed around 16% metacyclics. This tells me that the parasites did not develop properly inside any of the 3 vectors used. Does an increase from 2% to 4% make any difference in vector competence? If metacyclogenesis is not significantly higher in DEL isolates, then there will most likely not be an increase in fitness concerning transmission. Also, I assume that both the stomodeal valve infection and the metacyclic % were assessed at 192h p.i., but this must be clearly stated in the text. At what time-point were parasites per gut counted at Charles Univ.? The parameters used in all the different sand fly experiments should be clearly labeled.
      • If the metacyclic % data are available for the Fiocruz colony experiments, then it will be important to present those. Conversely, are the #s of parasites/gut available using another time point at Charles Univ.?
      • It is mentioned that at least 3 independent sand fly infections were performed, so the data points should be shown for each of the replicates where applicable. A bar plot without error bars is not ideal for representing the stomodeal valve and metacyclic % data either. Based on the current plots, it is difficult to infer biological significance.
      • The entire presentation of the sand fly infection experiments is confusing and does not provide consistent findings to support the conclusions, which should be toned down. The whole rationale for the different vector species should be better explained.
      • The 3'NT activity in two HTZ isolates is as high as in some NonDEL, while in one HTZ strain it is as low as the DEL isolates. Is there any genomic difference between the HTZ to justify this difference? Since 3254 HTZ was not included in the qPCR analysis presented in Schwabl, et al. 2021, it is difficult to see any correlation.
      • Pg. 16 ln.398: The authors conclude that DEL isolates cause effective, yet less pathogenic infection compared to NonDEL isolates. However, it is not possible to reach that conclusion based on the early-infection data presented. For that, i.v.-infected mice would have to be monitored for several weeks and assessed for visceralization and VL severity.
      • The most relevant data in this work suggest that metacyclics in nonDEL vs DEL might present important differences. The META2 gene expression, the different M0 infection% rather than different # of intracellular amastigotes, and increases in parasite #s in the draining lymph node at 13-16 hours after ear inoculation suggest to me that the metacyclics could have different infection competence. That possibility should be addressed. Do they display similar morphology? Do they differentiate at similar rates to axenic amastigotes? SHERP transcript levels should be quantified in metacyclics from the different isolates.

      Minor issues

      • Gene IDs of the transcripts measured should be listed somewhere.
      • Fig.1C, Fig2A&B: missing p-value labels.
      • Fig.1B needs more detail on the statistical test used. Is it One-way ANOVA? What post-hoc test was used?
      • Figs. 2 & 3: "Mann Whitney Rank Sum", "Mann Whitney t-test". The name of the statistical test is either Mann Whitney U test or Wilcoxon rank-sum test.
      • Figs. 4 & 5: information missing on statistical tests used in both.
      • Pg.8 ln.212: The NT1 median FC in the NonDEL group does not seem to be 0.8 in Fig.2B.

      Typos:

      • Fig.2 title: "promastigotes harvest from" -> harvested
      • Fig2. legend: "Paraflagellar". Is this a paraflagellar rod protein (PFR)? If so, it should be specified throughout the text.
      • Fig 2 & 3: "ns = no significant" -> not significant
      • Different parts of the text: "alfa-tubulin" -> alpha-tubulin
      • Fig.4A: Y-axis reads "Inspected sand flies"

      Significance

      Despite constituting the same species, significant genetic differences exist in Leishmania infantum variants found in the American continent compared to those in Europe, Africa and Asia. Phenotypic comparison studies such as reported here are relevant to the Parasitology field and may lead to new insights on the pathogenesis of specific clinical outcomes of leishmaniasis disease. The authors attempt to associate a major genetic deletion found in specific neotropical Leishmania infantum strains, isolated in Brazil, with phenotypic changes that could potentially lead to fitness increase during cyclical transmission in the wild.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript the authors have investigated the role of sub chromosomal deletion found in new world Leishmania infantum species. The deletion (12 kb) spans over across the four copies of tetrasomic chromosome 31 which includes the loss of four open reading frames (ORF) LinJ.31.2370 (ecto-3ʹ-nucleotidase/nuclease), LinJ.31.2380 (ecto-3'-nucleotidase precursor), LinJ.31.2390 (helicase-like protein) and LinJ.31.2400 (3,2-trans-enoyl-CoA isomerase). The ecto-3'-nucleotidase/nuclease (3'NU/NT) activity has been shown to have an important role in trypanosomatids in the purine salvage pathway implicated as virulence factor affecting the parasite's ability to infect macrophages. The authors showed that having such deletions enhances the metacyclogenesis in vitro but are highly susceptible to killing by neutrophils and macrophages. In addition, they are also less virulent in vivo. The authors claim that enhanced metacyclogenesis increases its transmissibility in the invertebrate host but may not be highly infectious in the vertebrate host. They speculate that that such parasites may provide immune response that may control the infection in endemic population by having large group of asymptomatic individuals. These outcomes are highly speculative. This study is thought provoking but needs to be studied thoroughly.

      Following are the comments:

      1. Leishmania 3"NU/NT plays an important role in virulence of the parasite and its survival,

      a. how do 3'-NU/NT DEL parasites survive in the host which lack all the 4 copies of NT?

      b. what is the advantage of having such parasites circulating in the endemic areas? 2. Since metacyclics are important for the pathogenesis of the parasites,

      a. What is the mechanism of increased metacyclogenesis of L. Infantum 3'-NU/NT DEL parasites in the absence of 3'-NU/NT activity which is essential for the virulence? 3. How does lack of 3'NU/NT enhance transmissibility since such metacyclics from 3'-NU/NT DEL parasites barely survive in vivo?

      a. the parasites are less resistant to NET and are killed easily.

      b. there is reduced recruitment of neutrophils (NT) and monocytes in the ear. 4. Fig.7C: what is the reason for higher parasite load of DEL in dLN? 5. Do you think there is reduced recruitment of NT in the infected site which would have controlled the parasites, hence they migrate quickly in the dLN? To test this possibility the authors should perform an in vitro NT recruitment assay. 6. Line 417: How are the 3'-NU/NT DEL parasites continuous source for infection in sand flies, If such parasites are not infectious and will be cleared by the host? 7. Line 400: Is it possible that having 3'-NU/NT DEL parasites in circulation dampens the infectivity of the NON-DEL and thus over all infection rate in the population goes down? 8. Could the 3'-NU/NT DEL parasites be the source of asymptomatic infections? 9. Is there literature evidence for such a possibility in the endemic region?

      Significance

      The authors claim that enhanced metacyclogenesis increases transmissibility of Leishmania in the invertebrate host but may not be highly infectious in the vertebrate host. They speculate that that such parasites may provide immune response that may control the infection in endemic population by having large group of asymptomatic individuals. These outcomes are highly speculative. This study is thought provoking but needs to be studied thoroughly.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Florencio et al., presented a detailed analysis of Brazilian L. infantum isolates with about half with a 4 gene deletion on chromosome 31 and the other half without that deletion. This deletion was first observed in the context of miltefosine responsiveness by the Mottram group. Here a multinational team is concluding that this deletion is associated with increased metacyclogenesis, increase parasite load in some sandfly vector species factors that contribute to their prevalence in South American. Yet the deleted strains show decreased survival invitro including in macrophages, but from what I understand increased load in an animal model, possibly in line with their reduced efficacy in attracting/activating key immune cells.

      The authors provide an interesting hypothesis on the equilibrium between pathogenesis (virulence) and capacity to transmit. The perfect parasite would like to infect as many hosts as possible while not killing them. I will try to provide the pro and cons.

      1. Obviously (and as acknowledged on p. 344-346) the conclusive experiment would be to add the 4 gene locus (the authors believe that it is the 3'NU/NT gene, so a single gene would even be simpler) in a DEL strain or alternatively KO the locus in a nonDEL strain. This would prove that the phenotype observed is indeed due to the loss of this locus. I understand that this is additional work but it would provide a definite answer. Now we have to rely on associations but alas the associations are not as tight as one would wish.
      2. Not sure if it is a dichotomy but they show decreased survival in the presence of NET (Fig. 6A) in vitro but reducing the attraction of immune cells including neutrophils in vivo (Fig. 7). For the nonDEL strain they observe no effect of NET in vitro (Fig. 6) while they seem to attract more immune cells (Fig. 7) in vivo. The in vitro data seems to reach significance for the three group of strains but not for the in vivo work (one out of three).
      3. The authors can be lauded for adding more pair of strains but this also adds to the complexity and frankly raises questions. The two strains derived from the PI and MT location differs in many aspects compared to the MS-Rj comparators.
      4. It is remarkable that the authors have tested other sandfly vector species (Fig. 5) but what is the conclusion and does it help in our understanding? I see variations in the figure between strains and vectors but not sure what it means.

      Secondary points

      1. Please confirm that the data in Fig. 1D and 1E are all the DEL and nonDEL strains. The number of dots does not match with the number of strains tested.
      2. Do we really need Fig. 2 and 3? Not part of the main message.
      3. Why is Fig. 4A only parasite count and 4B,C parasite count (please specify the number of hours) and additional information about stomodeal valve and metacyclics?
      4. Please explain sentence on p. 374-375. I thought that DEL is associated with increased metacyclics and increase parasite count in the sandfly
      5. Just for curiosity, have the authors try to rescue the phenotype in vitro by adding purines in the medium with the DEL strains?

      Significance

      Florencio et al., presented a detailed analysis of Brazilian L. infantum isolates with about half with a 4 gene deletion on chromosome 31 and the other half without that deletion. This deletion was first observed in the context of miltefosine responsiveness by the Mottram group. Here a multinational team is concluding that this deletion is associated with increased metacyclogenesis, increase parasite load in some sandfly vector species factors that contribute to their prevalence in South American. Yet the deleted strains show decreased survival invitro including in macrophages, but from what I understand increased load in an animal model, possibly in line with their reduced efficacy in attracting/activating key immune cells.

      The authors provide an interesting hypothesis on the equilibrium between pathogenesis (virulence) and capacity to transmit. The perfect parasite would like to infect as many hosts as possible while not killing them.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank both reviewers for their detailed and critical assessment of our work. Below we provide a step-by-step response to your concerns.

      Reviewer #1

      Evidence, reproducibility and clarity

      The manuscript presents data demonstrating the function of BRI1 in removing the H3K27me3 epigenetic marks in genes involved in seed coat development in Arabidopsis. The results support that BRI1 may function here independently of brassinosteroid. The work combines genetics with a large panel of mutant lines, phenotyping by quantitative microscopy and chemical treatment, H3K27me3 profiling by CUT&TAG, and data mining for published gene profiling. The introduction is adequately informative, complete and explaining the state-of-the-art to the readers. The result part may be a bit lengthy (especially the first part) and some parts may be a bit repetitive.

      Thank you for your positive assessment of our work and for the constructive criticism. Below we respond to each of your points.

      Major

      1. Seed size is dependent of multiple factors. And few are explained here, notably the number of seeds per silique, the number of ovule per silique, the position of the silique of the branch (related to the age of the meristem), the total number of produced siliques (fertilised flowers) by the inflorescence meristem and the plant. And maybe if produced by the main and lateral branches. Were the authors consistent in the evaluation of analyzed siliques coming from the same type of branches, same age of the meristem, etc? Especially as some of the analysed mutants are dwarf, which is a sign of different plant fitness compared to WT.

      This is a valid point. We did aim to analyze seeds coming from the main inflorescences of the plants and at similar stages of shoot development. This was harder to achieve in some genotypes, as indeed some BR and JMJ mutants have different plant architectures. However, we did repeat those experiments multiple times and always found consistent differences between the WTs and the mutants. See also our response to your next point and to the first point raised by Reviewer 2, as well as our new Fig. S6.

      1. The seed perimeter measurements in BR mutant seeds (Figure S6) are variable. Are you sue the ovule size does not have any influence? What about presenting the relative size as earlier in the text?

      Yes, this is particularly true for the Col-0 vs dwf5 comparison. The reason for this is that a different growth chamber was used for this experiment (greenhouse vs a climate chamber). We have observed that absolute seed growth phenotypes can change depending on the environmental conditions, which is something we are currently studying. However, importantly, we do not see changes in relative growth of the mutants when compared to the WT, independently of the growth conditions. That is, BR mutants produce consistently smaller seeds than the WT, independently of the conditions in which the plants are grown. To illustrate this point, we now add a new Figure, Fig. S6, where we show four independent biological replicates of assays comparing seed size between WT and det2 or bri1. These replicates were done in different growth chambers.

      Indeed, presenting the data as relative size would solve this issue, but we worried that we would be hiding the "real" values by doing so. However, if the Reviewer and Editor deem it necessary, we could replot the data as relative to WT.

      1. The number of evaluated samples is often {plus minus} n = 30, sometimes less, meaning less than what a silique contains of seeds. Did the authors evaluate the variability and reproductibility of their measurements, e.g, how many siliques per plant, how many plants, how many biological repeats? For example, in Figure S6, the number of measured ovules were as low as 16, which could be the reason why no significant difference in size were observed (low statitical strength). The variation in the Col WT is already visible. Is this variation significant?

      On average we pooled seeds from 6-10 siliques coming from 2-3 different plants of the same genotype. We then took microscopic photos of 60 to 100 random seeds in those pools. Out of those, 30 random photos were used for the measurements. You are right this is an important point. We now added this information to the Methods section.

      Moreover, we did calculate whether the sample size we were using provided enough statistical power. For the differences that we see, of around 50 um in perimeter, 26 samples would have been enough to achieve 80% statistical power, which most studies use as standard. In most of our experiments we used closer to 30 samples, which gives us 95% power.

      Indeed, the left-most panel on Fig S6B is the exception. With that plot we mostly wanted to test if ovules produced by BR mutants were smaller than those of WT plants. That does not seem to be the case, even if the sample size is small. However, if deemed necessary, we can repeat those measurements with a higher sample number.

      1. You indicate (line 149) that REF6 is not expressed in the gametophyte but GFP signal is observed in the cytoplasm for the central cell in Fig 1. The same goes for the expression pattern with the GUS line in Figure S2. (Line 290) One can not exclude expression in the endosperm or embryo with the presented pictures, or in the seed coat in older seeds.

      We interpreted those diffuse signals in the cytoplasm of the gametophyte as background noise, as REF6 should be nuclearly localized. But we could be wrong. We therefore made changes to the text in lines 150-152 to reflect this.

      And you are right that REF6 is expressed in the endosperm and embryo in later stages of development. We mention this in lines 157-159.

      1. Make sure that you do not overstate your result conclusions, or add a reference to some of the statements. For example, line 185, for the choice of 3 DAP time point and the fact that seed coat development is based on cell expansion and interaction with the endosperm. Another example, in line 262, is where it is stated that the jmj mutants are compromised in ovule and pollen development. This was not assessed. You only checked the reduced seed set, not the fitness of the gametophytes. Or in line 337, where you indicate that KLUH is not expressed in all integument layers.

      Thank you for pointing this out. For the claim that seed size at early time points is dictated only by the seed coat and endosperm, and not by the embryo, we added the appropriate reference. For the claim that jmj mutants are compromised in ovule development, this was based on our observations of Fig. S3C. We do see malformed or absent megagametophytes in jmj mutants. For pollen development, you are correct that we did not formally address this. We rephrased the sentence to reflect that. For the statement that KLU is not expressed in all integument cell layers, we added the reference.

      1. Another example of this is in line 289 where you stated "a sporophytic function of JMJs at early stages of seed development, [..] and to a zygotic function at the later stages of seed development". I am not sure on what data do you base this conclusion as in all three categories (endosperm, embryo, seed coat) in Fig 2 and S5, genes are expressed in pre-globular stages. And again in line 475: "seed coat growth genes are expressed independentlyof fertilization". Do you have any evidence, a reference?

      The evidence for a sporophytic JMJ function at early stages of seed development, and zygotic function at later stages, comes from our observations that jmj seed phenotypes are maternal in origin at early stages, but become zygotic later in development. But you are correct that we have to be careful with this interpretation. We now modified that sentence accordingly.

      For the data of Fig 2E and Fig S5, we cannot rule out that some putative REF6 target genes are also expressed even when in the absence of REF6. The expression of those genes is also likely controlled by other factors. The point we wanted to make with those plots is that REF6 may have different target genes in different seed tissues, thus potentially regulating different developmental processes in a tissue-specific manner. We mention this in lines 288-290.

      For your second point, we added the adequate reference.

      1. (around lines 461) I understand that using a 35S promoter is not a good strategy as it would affect many other tissues. Did you consider using a tissue-specific approach as presented in Figure 4?

      We suppose you mean the 35S::ELF6 construct. Yes, this makes sense and we did spend quite some time trying to come up with a good strategy. However, we failed to find a suitable promoter. The issue is that we would need a promoter that is active in all (or most) seed coat layers, but only after fertilization. There are promoters like those of TT genes which are active post-fertilization, but only in one cell layer, and thus likely not useful for our purpose. And there are other promoters, like those of STK or ANT, which are expressed in most integument cell layers, but are also expressed during integument development, and not just after fertilization. So they would have the same issue as the 35S promoter. Unfortunately, so far we have not identified a promoter that would be useful for this kind of experiment, which is why we went with a constitutive promoter, but which is specific to the sporophytic tissues.

      1. You observed that the triple swn clf bri mutant is less dwarf than bri1 mutant and stated in line 483 that it is larger, has more leaves, grow tallerm and flower later and longer. Do you have any qunatitative data? If not, I would state that these observations are qualitative from growing plant aside.

      You are correct that this was based on qualitative assessments, rather than on quantitative data (as it was not the point of the manuscript). We now indicate this in lines 489-490.

      Minor: 1. The title should precise the studies species, here Arabidopsis thaliana. Also the title of one of the part could be rephrased. "in a zygotic manner" sounds strange.

      We modified both title and subtitle, as suggested.

      1. Scale bars are missing in many figures.

      Fixed.

      1. The font size in the graphs is small. The authors may use the empty space of the figures to increase the size of the graphs for clarity. Guidelines could be found here https://tpc.msubmit.net/html/TPC_Detailed_Figure_Guidelines.pdf, as example of good practices.

      You are right. We revised all the figures and increased the font size, especially in the plot labels.

      1. Be consitent in the mutant name, e.g., brz1-D is also presented as brz1-d.

      Fixed.

      1. Figure legend S1: I would not use the word "extremely" while you still have 30% seed set. Extremely would qualifiy for

      We suppose you mean Fig. S3. We corrected the legend.

      1. Figure S8 is missing the WT control for comparison.

      Fixed.

      1. Figure S12, stats are missing

      Fixed.

      1. I would recommend to add a line in the Supplemental tables with the name as this name disappears from the file name during upload. It would help the readers to navigate the data.

      We now made it so the top line is static and is always visible.

      1. Methods: Are all the lines listed used in the study? SR2200 is missing for the method, and please indicate the selection marker for each of the generated lines for open-access of the data if other researchers later use your lines.

      You are right that some references had been left over from a previous document. We now updated the list of lines.

      And indeed, we forgot to mention the use of SR2200. It is now added to the Methods section. We also added the information on the selection markers for the lines we generated.

      1. You have a duplicate for reference Vukašinovíc et al.

      Fixed.

      1. Line 393, remove "s" in embryo and endosperm, in coat (line 674), in size (lines 684, 686

      Fixed.

      1. Line 410, write RPS5A in upper case.

      Fixed throughout the manuscript.

      1. LIne 676, the sentence "...H3K27me3 to be removed from the integuments." I would recomend to be more precise. For example "H3K27mme3 marks to be removed from genes to be expressed in the integuments" or something like that.

      We rephrased this sentence to "We thus hypothesized that BR signaling would be required for JMJ function, allowing for H3K27me3 to be removed from genes necessary for seed coat formation."

      Significance

      The authors provide novel information on the step-wise regulation of seed coat development and its influence on seed size. This is a topic of general interest, beyond the plant model Arabidopsis, especially in the context of reduced seed set caused by (a)biotic stress. The results of this study are valuable to understand seed size regulation in differnet growth context or species. The group previously showed that the auxin phytohormone is necessary after fertilization to initiate seed coat differentiation by inhibiting PRC2. However, as seed coat develops mainly as cell elongation, the epigenetic marks are not diluted by cell division and needs to be actively removed. This study provides insight into this process by identifcation 2 JMJ proteins responsible for removing H3K27me3 marks in the seed coat after fertilization to initiation seed coat development and regulating seed size. BRI1, BES1 and BZR1 are involved in this process, indepently of brassinosteroid, to guide JMJ to their target loci. While the study bring some genetic evidence of this process, molecular insight is still missing. Notably the identification of the target genes and how BRI1 is regulated/activated upon fertilization. Or how auxin and BRI1 co-regulate the process. These questions appear how of scope of this current study.

      Thank you for the assessment. Indeed, the identification of BRI1 downstream genes is out of scope of this work. As you point out earlier in the review, the manuscript is already quite long, and adding such data would make it even more so.

      Reviewer #2

      In this study, Pankaj et al. investigate the role of brassinosteroids and H3K27me3 in seed development, particularly in controlling seed size. They demonstrate that defects in these pathways affect seed size control and suggest that this control occurs in the maternal seed coat. This paper presents novel findings that merit publication and would be of interest to the plant community. However, the data interpretation and presentation could be improved. Additionally, I have a few comments that necessitate further analysis and revision.

      Thank you for the careful and critical assessment of our work. Below we respond to each of the points you raised.

      Major Comments

      1. My main concern is the use of seed size measurement as a proxy for seed coat development. Mature seed size measurements can vary significantly with growth conditions, so it is crucial that the authors present at least three independent experiments (wild type and mutant grown in parallel) in a single box plot to ensure data reliability. Additionally, due to the high number of seeds analyzed, significant changes are often observed, though they are not always reproducible. The authors should standardize their seed measurements, using either seed area or seed perimeter.

      You are right that we do see some variation in seed size between experiments. And, indeed, we suspect this is due to slightly different plant growth conditions, for example when different growth chambers are used. As you suggest, we now show data from four independent biological replicates of seed size comparisons of WT and BR mutants. This is in the new Fig. S6. As you can see, although we do see variations in absolute seed sizes, depending on the growth conditions, there is a consistent difference between WT and mutant seeds across experiments.

      1. It would be beneficial to include data on cell division and cell elongation in the seed coat if the authors aim to extend the seed size phenotype to a seed coat phenotype.

      This is indeed a good point. However, we already showed in a previous publication that seed coat growth is driven by cell elongation and not cell division (https://elifesciences.org/articles/20542). But you are right that this is important to point out. We mention it in lines 66-67.

      1. It is challenging to be fully convinced by the seed coat specificity of the phenotype, as the authors observe variations in total seed set and phenotypic differences in self-crosses and when the mutants are used paternally. Some of the observed phenotypes do not support their hypothesis. In all mutant analyses, the authors should complement their phenotype analysis using seed coat-specific promoters and include heterozygote measurements, as done in some figures.

      We assume you mean the effect of jmj mutations. For BR mutants, we do show data supporting a seed coat effect (Fig. 4). For PRC2 mutants, that has also been previously described (doi.org/10.7554/eLife.20542 and doi.org/10.1073/pnas.1117111108).

      For the JMJ mutants, you are right that we cannot be 100% sure that their effect is purely sporophytic. We now modified the text accordingly to reflect this (see also the response to point 6 of Reviewer 1). We indeed show that REF6 and ELF6 are expressed in the sporophytic tissues of the ovule and that the double mutant has seed coat defects (smaller seed coats and defects in accumulation of proanthocyanidins). And although we can say that those defects are maternal in nature, we can not 100% conclude that they are simply due to the effect of those JMJs in the sporophyte. There may be gametophytic effects that we cannot rule out, even though we do not see either protein expressed in embryo sacs. Thank you for pointing this out.

      Doing a tissue-specific rescue of these phenotypes would be very informative indeed, but also very hard. As we mention in the response to point 7 of Reviewer 1, we do not currently have suitable promoters for this. So we simply cannot run such experiments in a reasonable time frame.

      Overall, we now tried to be more careful in our conclusions and avoid claiming that the effect of JMJs is purely sporophytic. We can make that argument for the BR machinery and for PRC2, but not necessarily for JMJs. You are correct in that assessment.

      1. The authors need to include a fluorescent reporter for ELF6; tissue-specific expression cannot be conclusively determined with the GUS reporter.

      We did obtain an ELF6::GFP line from Caroline Dean's lab (https://www.pnas.org/doi/full/10.1073/pnas.1605733113), but could not see much expression during endosperm or seed coat development. As you can see from that publication, even in embryos and in roots the expression of ELF6:GFP is very blurry. It seems ELF6 is simply expressed at very low levels. We therefore used the GUS reporter, as a more sensitive means to visualize where ELF6 is expressed. You are right that the results are not as precise as that obtained with a fluorescent reporter. However, note that we simply claim that ELF6 is expressed in the integuments and seed coat (line 155). This can be clearly seen in Fig. 1B. The blue product of the β-glucuronidase reaction should be immotile and not travel between tissues (also note that there are no plasmodesmata between endosperm and seed coat). Therefore, we believe that GUS is a suitable reporter to test the seed coat expression of ELF6.

      1. Text editing: In some places, the text is unclear and could benefit from simplification. The authors should replace the term "seed coat formation," as developmentally, integuments are already present before fertilization. The authors are not studying the formation of the seed coat but rather its growth. They should also clarify the term "PRC2 removal." It is unclear whether the authors mean PRC2 lack of expression in the integument, PRC2 eviction from chromatin, or removal of H3K27me3.

      Thank you for noting that. It is very important to us that the text is clear to the reader. If you could indicate where the text is unclear, we are happy to simplify it.

      Regarding the wording, we refer to "seed coat formation" because the seed coat only indeed forms after fertilization. Before fertilization, the sporophytic tissues that cover the megagametophyte are called integuments, and not seed coat. Therefore, we see the seed coat as "forming" from the integuments (i.e., the integuments become seed coat via growth and differentiation).

      With PRC2 removal we indeed mean reduction of expression of PRC2 components. We now make this clear in lines 54-55.

      Minor Comments

      1. L151: Is REF6 expressed in zygotic tissues?

      Reviewer 1 also raised this question. We now added this information to lines 148-150.

      1. Confirm mutant complementation with the different reporter lines.

      All mutant lines that we used have been previously described to be either loss-of-function or hypomorphic mutants. We did not use any mutant line that has not been previously described. We added all references to the corresponding publications in the Methods.

      1. Confirm by qPCR that JMJ13 is indeed not expressed in seeds.

      We tested JMJ13 as a possible factor involved in H3K27me3 removal in the seed coat due to it being described, together with ELF6 and REF6, as one of the three main H3K27 demethylases. But there are, in fact, transcriptomic datasets showing that the expression of JMJ13 is indeed very low or absent in seeds: see RNAseq data in Table S3 in doi.org/10.3389/fpls.2022.998664. Moreover we checked CPMs on published seed scRNAseq datasets (doi.org/10.1038/s41477-021-00922-0) and JMJ13 (AT5G46910) has zero transcript counts in these datasets.

      Because of these two independent instances showing that the expression of JMJ13 is extremely low in seeds (or even totally absent), together with the analysis that we did of the fluorescent reporter line, we believe this is sufficient evidence that this JMJ is specific to the pollen during reproductive development. Note that the reporter that we used is strongly expressed in pollen grains, as had been previously described (doi.org/10.1038/s41556-020-0515-y).

      Even so, if the Reviewer and the Editor deem it necessary that we check JMJ13 expression by qPCR, we can of course do so.

      1. Fig1a and Fig1b: Align the panels in the figure.

      Done.

      1. L183-189: This section is unclear.

      I am sorry that the section is not clear. If you direct us to the points that need to be cleared, we are happy to make changes.

      1. There may be a PDF artifact, but most figures have unattractive misaligned boxes.

      We went through every figure and made slight modifications to avoid such artifacts. We hope they now appear more clear in the new version.

      1. Change the color in Fig 2a.

      Fixed.

      1. The introduction is heavily self-cited. The authors should try to include a broader range of literature.

      It is not clear to us why the Reviewer sees it like that. We only refer to three of our publications in the Introduction. One review manuscript and two research manuscripts. We cite almost 40 manuscripts in the introduction. Therefore, citing three of our works does not seem out of line to us, especially since those manuscripts laid the foundation for this work.

      1. Fig3F: Typo in "microM."

      Fixed.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Review of "BRI1-mediated removal of seed coat H3K27me3 marks is a brassinosteroid-independent process" In this study, Pankaj et al. investigate the role of brassinosteroids and H3K27me3 in seed development, particularly in controlling seed size. They demonstrate that defects in these pathways affect seed size control and suggest that this control occurs in the maternal seed coat. This paper presents novel findings that merit publication and would be of interest to the plant community. However, the data interpretation and presentation could be improved. Additionally, I have a few comments that necessitate further analysis and revision.

      Major Comments

      • My main concern is the use of seed size measurement as a proxy for seed coat development. Mature seed size measurements can vary significantly with growth conditions, so it is crucial that the authors present at least three independent experiments (wild type and mutant grown in parallel) in a single box plot to ensure data reliability. Additionally, due to the high number of seeds analyzed, significant changes are often observed, though they are not always reproducible. The authors should standardize their seed measurements, using either seed area or seed perimeter.

      • It would be beneficial to include data on cell division and cell elongation in the seed coat if the authors aim to extend the seed size phenotype to a seed coat phenotype.

      • It is challenging to be fully convinced by the seed coat specificity of the phenotype, as the authors observe variations in total seed set and phenotypic differences in self-crosses and when the mutants are used paternally. Some of the observed phenotypes do not support their hypothesis. In all mutant analyses, the authors should complement their phenotype analysis using seed coat-specific promoters and include heterozygote measurements, as done in some figures.

      • The authors need to include a fluorescent reporter for ELF6; tissue-specific expression cannot be conclusively determined with the GUS reporter.

      • Text editing: In some places, the text is unclear and could benefit from simplification. The authors should replace the term "seed coat formation," as developmentally, integuments are already present before fertilization. The authors are not studying the formation of the seed coat but rather its growth. They should also clarify the term "PRC2 removal." It is unclear whether the authors mean PRC2 lack of expression in the integument, PRC2 eviction from chromatin, or removal of H3K27me3.

      Minor Comments:

      • L151: Is REF6 expressed in zygotic tissues?

      • Confirm mutant complementation with the different reporter lines.

      • Confirm by qPCR that JMJ13 is indeed not expressed in seeds.

      • Fig1a and Fig1b: Align the panels in the figure.

      • L183-189: This section is unclear.

      • There may be a PDF artifact, but most figures have unattractive misaligned boxes.

      • Change the color in Fig 2a.

      • The introduction is heavily self-cited. The authors should try to include a broader range of literature.

      • Fig3F: Typo in "microM."

      Cross-commenting:

      I think our reviews highlight the same issues. For me, the first point is definitely the most critical.

      Significance

      This paper presents novel findings that merit publication and would be of interest to the plant community. However, the data interpretation and presentation could be improved. Additionally, I have a few comments that necessitate further analysis and revision.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript presents data demonstrating the function of BRI1 in removing the H3K27me3 epigenetic marks in genes involved in seed coat development in Arabidopsis. The results support that BRI1 may function here independently of brassinosteroid. The work combines genetics with a large panel of mutant lines, phenotyping by quantitative microscopy and chemical treatment, H3K27me3 profiling by CUT&TAG, and data mining for published gene profiling. The introduction is adequately informative, complete and explaining the state-of-the-art to the readers. The result part may be a bit lengthy (especially the first part) and some parts may be a bit repetitive.

      Major comments:

      1. Seed size is dependent of multiple factors. And few are explained here, notably the number of seeds per silique, the number of ovule per silique, the position of the silique of the branch (related to the age of the meristem), the total number of produced siliques (fertilised flowers) by the inflorescence meristem and the plant. And maybe if produced by the main and lateral branches. Were the authors consistent in the evaluation of analyzed siliques coming from the same type of branches, same age of the meristem, etc? Especially as some of the analysed mutants are dwarf, which is a sign of different plant fitness compared to WT.

      2. The seed perimeter measurements in BR mutant seeds (Figure S6) are variable. Are you sue the ovule size does not have any influence? What about presenting the relative size as earlier in the text?

      3. The number of evaluated samples is often {plus minus} n = 30, sometimes less, meaning less than what a silique contains of seeds. Did the authors evaluate the variability and reproductibility of their measurements, e.g, how many siliques per plant, how many plants, how many biological repeats? For example, in Figure S6, the number of measured ovules were as low as 16, which could be the reason why no significant difference in size were observed (low statitical strength). The variation in the Col WT is already visible. Is this variation significant?

      4. You indicate (line 149) that REF6 is not expressed in the gametophyte but GFP signal is observed in the cytoplasm for the central cell in Fig 1. The same goes for the expression pattern with the GUS line in Figure S2. (Line 290) One can not exclude expression in the endosperm or embryo with the presented pictures, or in the seed coat in older seeds.

      5. Make sure that you do not overstate your result conclusions, or add a reference to some of the statements. For example, line 185, for the choice of 3 DAP time point and the fact that seed coat development is based on cell expansion and interaction with the endosperm. Another example, in line 262, is where it is stated that the jmj mutants are compromised in ovule and pollen development. This was not assessed. You only checked the reduced seed set, not the fitness of the gametophytes. Or in line 337, where you indicate that KLUH is not expressed in all integument layers.

      6. Another example of this is in line 289 where you stated "a sporophytic function of JMJs at early stages of seed development, [..] and to a zygotic function at the later stages of seed development". I am not sure on what data do you base this conclusion as in all three categories (endosperm, embryo, seed coat) in Fig 2 and S5, genes are expressed in pre-globular stages. And again in line 475: "seed coat growth genes are expressed independentlyof fertilization". Do you have any evidence, a reference?

      7. (around lines 461) I understand that using a 35S promoter is not a good strategy as it would affect many other tissues. Did you consider using a tissue-specific approach as presented in Figure 4?

      8. You observed that the triple swn clf bri mutant is less dwarf than bri1 mutant and stated in line 483 that it is larger, has more leaves, grow tallerm and flower later and longer. Do you have any qunatitative data? If not, I would state that these observations are qualitative from growing plant aside.

      Minor comments:

      1. The title should precise the studies species, here Arabidopsis thaliana. Also the title of one of the part could be rephrased. "in a zygotic manner" sounds strange.

      2. Scale bars are missing in many figures.

      3. The font size in the graphs is small. The authors may use the empty space of the figures to increase the size of the graphs for clarity. Guidelines could be found here https://tpc.msubmit.net/html/TPC_Detailed_Figure_Guidelines.pdf, as example of good practices.

      4. Be consitent in the mutant name, e.g., brz1-D is also presented as brz1-d.

      5. Figure legend S1: I would not use the word "extremely" while you still have 30% seed set. Extremely would qualifiy for <5%, I guess.

      6. Figure S8 is missing the WT control for comparison.

      7. Figure S12, stats are missing

      8. I would recommend to add a line in the Supplemental tables with the name as this name disappears from the file name during upload. It would help the readers to navigate the data.

      9. Methods: Are all the lines listed used in the study? SR2200 is missing for the method, and please indicate the selection marker for each of the generated lines for open-access of the data if other researchers later use your lines.

      10. You have a duplicate for reference Vukašinovíc et al.

      11. Line 393, remove "s" in embryo and endosperm, in coat (line 674), in size (lines 684, 686

      12. Line 410, write RPS5A in upper case.

      13. LIne 676, the sentence "...H3K27me3 to be removed from the integuments." I would recomend to be more precise. For example "H3K27mme3 marks to be removed from genes to be expressed in the integuments" or something like that.

      Cross-commenting:

      I have been comparing our peer-review reports of the manuscript and found much similarity on our assessment:

      1. The seed size assemment and how this relates to seed coat development

      2. The GUS expression of ELF6 is not sufficient for the provided conclusion of the ELF6 expression

      3. The same would be for REP6

      4. Use of tissue-specific (seed coat specific) promoters to confirm the conclusion.

      Significance

      The authors provide novel information on the step-wise regulation of seed coat development and its influence on seed size. This is a topic of general interest, beyond the plant model Arabidopsis, especially in the context of reduced seed set caused by (a)biotic stress. The results of this study are valuable to understand seed size regulation in differnet growth context or species. The group previously showed that the auxin phytohormone is necessary after fertilization to initiate seed coat differentiation by inhibiting PRC2. However, as seed coat develops mainly as cell elongation, the epigenetic marks are not diluted by cell division and needs to be actively removed. This study provides insight into this process by identifcation 2 JMJ proteins responsible for removing H3K27me3 marks in the seed coat after fertilization to initiation seed coat development and regulating seed size. BRI1, BES1 and BZR1 are involved in this process, indepently of brassinosteroid, to guide JMJ to their target loci. While the study bring some genetic evidence of this process, molecular insight is still missing. Notably the identification of the target genes and how BRI1 is regulated/activated upon fertilization. Or how auxin and BRI1 co-regulate the process. These questions appear how of scope of this current study.

      My filed of expertise: hormones, plant reproduction, Arabidopis, oilseed rape, microscopy, transformation

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Thank you very much for your editorial handling of our manuscript entitled 'A conserved fungal Knr4/Smi1 protein is vital for maintaining cell wall integrity and host plant pathogenesis'. We have taken on board the reviewers' comments and thank them for their diligence and time in improving our manuscript.

      Please find our responses to each of the comments below.

      Reviewer(s)' comments

      Reviewer #1


      Major comments:


      __1.1. As a more critical comment, I find the presentation of the figures somewhat confusing, especially with the mixing of main figures, supplements to the main figures, and actual supplemental data. On top of that, the figures are not called up in the right order (e.g. Figure 4 follows 2D, while 3 comes after 4; Figure 6 comes before 5...), and some are never called up (I think) (e.g. Figure 1B, Figure 2B). __


      __Response: __The figure order has been revised according to the reviewer's suggestion, while still following eLife's formatting guidelines for naming supplementals. Thank you.

      1.2. I agree that there should be more CWI-related genes in the wheat module linked to the FgKnr4 fungal module, or, vice-versa, CW-manipulating genes in the fungal module. It would at least be good if the authors could comment further on if they find such genes, and if not, how this fits their model.


      Response: Thank you for your insightful suggestion regarding the inclusion of more CWI-related genes in the wheat module linked to the FgKnr4 fungal module F16, or vice versa. We did observe a co-regulated response between the wheat module W05 which is correlated to the FgKnr4 module F16. Namely, we observed an enrichment of oxidative stress genes including respiratory burst oxidases and two catalases (lines 304 - 313) in the correlated wheat module (W05). Early expression of these oxidative stress inducing genes likely induces the CWI pathway in the fungus, which is regulated by FgKnr4. Knr4 functions as both a regulatory protein in the CWI pathway and as a scaffolding protein across multiple pathways in S. cerevisiae (Martin-Yken et al., 2016, https://onlinelibrary.wiley.com/doi/10.1111/cmi.12618 ). Scaffolding protein-encoding genes are typically expressed earlier than the genes they regulate to enable pre-assembly with their interacting partners, ensuring that signaling pathways are ready to activate when needed. In this context, the CWI integrity MAPKs Bck1 and Mkk1 are part of module F05, which includes two chitin synthases and a glucan synthase. This module is highly expressed during the late symptomless phase. The MAPK Mgv1, found in module F13, is expressed consistently throughout the infection process, which aligns with the expectation that MAPKs are mainly post-transcriptionally regulated. Thank you for bringing our attention to this, this is now included in the discussion (lines 427 - 443) along with eigengene expression plots of all modules added to the supplementary (Figure 3 - figure supplement 1).

      To explore potential shared functions of FgKnr4 with other genes in its module, we re-analyzed the high module membership genes within module F16, which includes FgKnr4, using Knetminer (Hassani-Pak et al., 2021; https://onlinelibrary.wiley.com/doi/10.1111/pbi.13583 ). This analysis revealed that 8 out of 15 of these genes are associated with cell division and ATP binding. Four of the candidate genes are also part of a predicted protein-protein interaction subnetwork of genes within module F16, which relate to cell cycle and ATP binding. In S. cerevisiae, the absence of Knr4 results in cell division dysfunction (Martin-Yken et al., 2016, https://onlinelibrary.wiley.com/doi/10.1111/cmi.12618 ). Accordingly, we tested sensitivity of ΔFgknr4 to microtubule inhibitor benomyl (a compound commonly used to identify mutants with cell division defects; Hoyt et al., 1991 https://www.cell.com/cell/pdf/0092-8674(81)90014-3.pdf). We found that the ΔFgknr4 mutant was more susceptible to benomyl, both when grown on solid agar and in liquid culture. This data has now been added Figure 7, and referred to in lines 338-348.

      __Specific issues: __


      1.3. In the case of figure 5, I generally find it hard to follow. In the text (line 262/263), the authors state that 5C shows "eye-shaped lesions" caused by ΔFgknr4 and ΔFgtri5, but I can't see neither (5C appears to be a ΔFgknr4 complementation experiment). The figure legend also states nothing in this regard.

      __Response: __Thank you for your suggestion. We have amended the manuscript to include an additional panel that shows the dissected spikelet without its outer glumes, making the eye shaped diseased regions more visible in Figure 5.

      __1.4. Figure 5D supposedly shows 'visibly reduced fungal burden' in ΔFgknr4-infected plants, but I can't really see the fungal burden in this picture, but the infected section looks a lot thinner and more damaged than the control stem, so in a way more diseased. __


      Response: __Thank you for your insight. We have revised our conclusions based on this image to state that while ΔFgknr4 can colonise host tissue, it does so less effectively compared to the wild-type strain as we are unable to quantitatively evaluate fungal burden using image-colour thresholding due to the overlapping colours of the fungal cells and wheat tissues. Decreased host colonisation is evidenced by (i) reduced fungal hyphae proliferation, particularly in the thicker adaxial cell layer, (ii) collapsed air spaces in wheat cells, and (iii) increased polymer deposition at the wheat cell walls, indicating an enhanced defence response. __Figure 5 has been amended to include these observations in the corresponding figure legend and the resin images now include insets with detailed annotation.

      __1.5. The authors then go on to state (lines 272-273) that they analyzed the amounts of DON mycotoxin in infected tissues, but don't seem to show any data for this experiment. __

      Response: __We have amended this to now include the data in __Figure 5 - figure supplement 2B, thank you.

      Reviewer #2


      __Major issues: __


      2.1 If Knf4 is involved in the CWI pathway, what other genes involved in the CWI pathway are in this fungal module? one of the reasons for developing modules or sub-networks is to assign common function and identify new genes contributing to the function. since FgKnr4 is noted to play a role in the CWI pathways, then genes in that module should have similar functions. If WGCN does not do that, what is the purpose of this exercise?


      Response: __Thank you for raising this point regarding the role of FgKnr4 in the CWI pathway and the expectations for genes of shared function within the FgKnr4 module F16. We did observe that the module containing FgKnr4 (F16) was also correlated to a wheat module (W05) which was significantly enriched for oxidative stress genes. This pathogen-host correlated pattern led us to study module F16, which otherwise lacks significant gene ontology term enrichment, unique gene set enrichments, and contains few characterised genes. This is now highlighted in __lines 233-246. This underscores the strength of the WGCNA. By using high-resolution RNA-seq data to map modules to specific infection stages, we identified an important gene that would have otherwise been overlooked. This approach contrasts with other network analyses that often rely on the guilt-by-association principle to identify novel virulence-related genes within modules containing known virulence factors, potentially overlooking significant pathways outside the scope of prior studies. Therefore, our analysis has already benefited from several advantages of WGCNA, including the identification of key genes with high module membership that may be critical for biological processes, as well as generating a high-resolution, stage-specific co-expression map of the F. graminearum infection process in wheat. This point is now emphasised in lines 233-252. As discussed in response to reviewer 1, Knr4 functions as both a regulatory protein in the CWI pathway and as a scaffolding protein across multiple pathways in S. cerevisiae (Martin-Yken et al., 2016, https://onlinelibrary.wiley.com/doi/10.1111/cmi.12618 ) which would explain its clustering separate from the CWI pathway genes. The high module membership genes within module F16 containing FgKnr4 were re-analysed using Knetminer (Hassani-Pak et al., 2021; https://onlinelibrary.wiley.com/doi/10.1111/pbi.13583 ), which found that 8/15 of these genes were related to cell division and ATP binding. Four of the candidate genes are also part of a predicted protein-protein interaction subnetwork of genes within module F16, which relate to cell cycle and ATP binding. In S. cerevisiae, the absence Knr4 leads to dysfunction in cell division. Accordingly, we tested sensitivity of ΔFgknr4 to the microtubule inhibitor benomyl (a compound commonly used to identify mutants with cell division defects; Hoyt et al., 1991 https://www.cell.com/cell/pdf/0092-8674(81)90014-3.pdf). We found that the ΔFgknr4 mutant was more susceptible to benomyl, both when grown on solid agar and in liquid culture. This data has now been added as Figure 7 and referred to in lines 338-348.


      2.2. Due to development defects in the Fgknr1 mutant, I would not equate to as virulence factor or an effector gene.


      __Response: __We are in complete agreement with the reviewer and are not suggesting that FgKnr4 is an effector or virulence factor, we have been careful with our wording to indicate that FgKnr4 is simply necessary for full virulence and its disruption results in reduced virulence and have outlined how we believe FgKnr4 participates in a fungal signaling pathway required for infection of wheat.


      2.3. What new information is provided with WGCN modules compared with other GCN network in Fusarium (examples of GCN in Fusarium is below) ____https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5069591/ https://doi.org/10.1186/s12864-020-6596-y____ DOI: 10.1371/journal.pone.0013021. The GCN networks from Fusarium have already identified modules necessary/involved in pathogenesis.

      Response: __The 2016 New Phytologist gene regulatory network (GRN) by Guo et al. is large and comprehensive. However, only three of the eleven datasets are in planta, with just one dataset focusing on F. graminearum infection on wheat spikes. The other two in planta datasets involve barley infection and Fusarium crown rot. By combining numerous in planta and in vitro datasets, the previous GRNs lack the fine resolution needed to identify genetic relationships under specific conditions, such as the various stages of symptomatic and symptomless F. graminearum infection of mature flowering wheat plants. This limitation is highlighted in the 2016 paper itself. This network is expanded in the Guo et al., 2020 BMC genomics paper where it includes one additional in planta and nine in vitro datasets. However, the in planta dataset involves juvenile wheat coleoptile infection, which serves as an artificial model for wheat infection but is not on mature flowering wheat plants reminiscent of Fusarium Head Blight of cereals in the field. This model differs significantly in the mode of action of F. graminearum, notably DON mycotoxin is not essential for virulence in this context (Armer et al. 2024, https://pubmed.ncbi.nlm.nih.gov/38877764/ ). The Guo et al., 2020 paper still faces the same issues in terms of resolution and the inability to draw conclusions specific to the different stages of F. graminearum infection. Additionally, these GRNs use Affymetrix data, which miss over 400 genes (~ 3 % of the genome) from newer gene models. In contrast, our study addresses these limitations by analysing a meticulously sampled, stage- and tissue-specific in planta RNA-seq dataset using the latest reference annotation. Our approach provides higher resolution and insights into host transcriptomic responses during the infection process. The importance of our study in the context of these GRNs is now addressed in the introduction (__lines 85-92).


      2.4. Ideally, the WGCN should have been used identify plant targets of Fusarium pathogenicity genes. This would have provided credibility and usefulness of the WGCN. Many bioinformatic tools are available to identify virulence factors and the utility of WGCN in this regard is not viable. However, if the authors had overlapped the known virulence factors in a fungal module to a particular wheat module, the impact of the WGCN would be great. The module W12 has genes from numerous traits represented and WGCN could have been used to show novel links between Fg and wheat. For example, does tri5 mutant affect genes in other traits?

      __Response: __Thank you for your suggestions. In this study we have shown the association between the main fungal virulence factor of F. graminearum, DON mycotoxin, with wheat detoxification responses. Through this we have identified a set of tri5 responsive genes and validated this correlation in two genes belonging to the phenylalanine pathway and one transmembrane detoxification gene. Although we could validate more genes in this tri5 responsive wheat module, our paper aimed to investigate previously unstudied aspects of the F. graminearum infection process and how the fungus responded to changing conditions within the host environment. We accomplished this by characterising a gene within a fungal module that had limited annotation enrichment and few characterised genes. Tri5 on the other hand is the most extensively studied gene in F. graminearum and while the network we generated may offer new insights into tri5 responsive genes, this is beyond the scope of our current study. In addition to the tri5 co-regulated response, we have also demonstrated the coordinated response between the fungal module F16, which contains FgKnr4 that is necessary for tolerance to oxidative stress, and the wheat module W05, which is enriched for oxidative stress genes.


      While our co-expression network approach can be used to explore and validate other early downstream signaling and defense components in wheat cells, several challenges must be considered: (a) the poor quality of wheat gene calls, (b) genetic redundancy due to both homoeologous genes and large gene families, and (c) the presence of DON, which can inhibit translation and prevent many transcriptional changes from being realised within the host responses. Additionally, most plant host receptors are not transcriptionally upregulated in response to pathogen infection (most R gene studies for the NBS-LRR and exLRR-kinase classes), making their discovery through a transcriptomics approach unlikely. These points will be included in our discussion (lines 408-413), thank you.

      Specific issues

      • *

      2.5. Since tri5 mutant was used a proof of concept to link wheat/Fg modules, it would have been useful to show that TRI14, which is not involved DON biosynthesis, but involved in virulence ( https://doi.org/10.3390/applmicrobiol4020058____) impact the wheat module genes.


      Response: __Our goal was to show that wheat genes respond to the whole TRI cluster, not just individual TRI genes. Therefore, the tri5 mutant serves as a solid proof-of-concept, because TRI5 is essential for DON biosynthesis, the primary function of the TRI gene cluster, thereby representing the function of the cluster as a whole. This is now clarified in __lines 217-219. Additionally, the uncertainties surrounding other TRI mutants would complicate the question we were addressing-namely, whether a wheat module enriched in detoxification genes is responding to DON mycotoxin, as implied by shared co-expression patterns with the TRI cluster. For instance, the referenced TRI14 paper indicates that DON is produced in the same amount in vitro in a single media. Although the difference is not significant, the average DON produced is lower for the two Δtri14 transformants tested. Therefore, we cannot definitively rule out that TRI14 is involved in DON biosynthesis and extrapolate this to DON production in planta. Despite this, the suggestion is interesting, and would make a nice experiment but we believe it does not contribute to the overall aim of this study.

      2.6. Moreover, prior RNAseq studies with tri5 mutant strain on wheat would have revealed the expression of PAL and other phenylpropanoid pathway genes?

      __Response: __We agree that this would be an interesting comparison to make but unfortunately no dataset comparing in planta expression of the tri5 mutant within wheat spikes exists.

      2.7. Table S1 lists 15 candidate genes of the F16 module; however, supplementary File 1 indicates 74 genes in the same module. The basis of exclusion should be explained. The author has indicated genes with high MM was used as representative of the module. The 59 remaining genes of this module did not meet this criteria? Give examples.


      Response: __The 15 genes with the highest module membership were selected as initial candidates for further shortlisting from the 74 genes within module F16. In WGCNA, genes with high module membership (MM) (i.e. intramodular connectivity) are predicted to be central to the biological functions of the module (Langfelder and Horvath, 2008; https://bmcbioinformatics.biomedcentral.com/articles/10.1186/1471-2105-9-559 ) and continues to be a metric to identify biologically significant genes within WGCN analyses (https://bmcplantbiol.biomedcentral.com/articles/10.1186/s12870-024-05366-0 Tominello-Ramirez et al., 2024; https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9151341/ ;Zheng et al., 2022; https://www.nature.com/articles/s41598-020-80945-3 Panahi and Hejazi et al 2021). Following methods by Mateus et al. (2019) (https://academic.oup.com/ismej/article/13/5/1226/7475138 ) key genes were defined as those exhibiting elevated MM within the module, which were also strongly correlated (R > |0.70|) with modules of the partner organism (wheat). We have clarified this point in the manuscript. Thank you for the suggestion. (__Lines 253-263).

      2.____8. A list from every module that pass this criteria will be useful resource for functional characterization studies.


      __Response: __A supplementary spreadsheet has been generated which includes full lists of the top 15 genes with the highest module membership within the five fungal modules correlated to wheat modules and a summary of shared attributes among them. Thank you for this suggestion.

      2.9. Figure 3 indicates TRI genes in the module F12; your PHI base in Supp File S2 lists only TRI14. Why other TRI genes such as TRI5 not present in this File?


      Response: For clarity, the TRI genes in module F12 are TRI3, TRI4, TRI11, TRI12, and TRI14 which was stated in Table 1. TRI5 clusters with its neighboring regulatory gene TRI6 in module F11, which exhibits a similar but reduced expression pattern compared to module F12. To improve clarity on this the TRI genes in module F12 are also listed in-text in line 168 and added to Figure 4. The enrichment and correlated relationship of W12 to a cluster's expression still imply a correlated response of the wheat gene to the TRI cluster's biosynthetic product (DON), which is absent in the Δtri5 mutant.

      TRI14 and TRI12 are listed in PHI-base. TRI12 was mistakenly excluded due to an unmapped Uniprot ID, which were added separately in the spreadsheet. We will recheck all unmapped ID lists to ensure all PHI-base entries are included in the final output. Thank you for pointing out this error.


      2.10. What is purpose of listing the same gene multiple times? Example, osp24 (a single gene in Fg) is listed 13 times in F01 module.


      __Response: __This is a consequence of each entry having a separate PHI ID, which represents different interactions including inoculations on different cultivar. Cultivar and various experimental details were omitted from the spreadsheet to reduce information density, however the multiple PHI base ID's will be kept separate to make the data more user friendly when working with the PHI-base database. An explanation for this is now provided in the file's explanatory worksheet, thank you.

      Reviewer #3:


      3.1. Why only use of high confidence transcripts maize to map the reads and not the full genome like Fusarium graminearum? I have never analyzed plant transcriptome.


      __Response: __ In the wheat genome, only high-confidence gene calls are used by the global community (Choulet et al., 2023; https://link.springer.com/chapter/10.1007/978-3-031-38294-9_4 ) until a suitable and stable wheat pan-genome becomes available.

      3.2. The regular output of DESeq are TPMs, how did the authors obtain the FPKM used in the analysis?


      Response: FPKM was calculated using the GenomicFeatures package and included on GitHub to enhance accessibility for other users. However, the input for WGCNA and this study as a whole was normalised counts rather than FPKM. The FPKM analysis was done to improve interoperability of the data for future users and made available on Github. To complement this, the information regarding FPKM calculation is now included in the methods section of the revised manuscript (line 491).

      3.3. Do the authors have a Southern blot to prove the location of the insertion and number of insertions in Zymoseptoria tritici mutant and complemented strains?


      __Response: __No, but the phenotype is attributed to the presence or absence of ZtKnr4, as the mutant was successfully complemented in multiple phenotypic aspects. This satisfies Koch's postulates which is the gold standard for reverse genetics experimentation (Falkow 1988; https://www.jstor.org/stable/4454582 ).

      __3.4. Boxplots and bar graphs should have the same format. In Figures 5 B and F and supplementary figure 6.3 the authors showed the distribution of samples but it is lacking in figure 3 B and all bar graphs. __


      __Response: __Graphs have been modified to display the distribution of all samples, thank you.

      3.5. Line 247 FGRAMPH1_0T23707 should be FGRAMPH1_01T23707


      __Response: __Thank you this has now been amended.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The authors of the manuscript entitled "A conserved fungal Knr4/Smi1 protein is vital for maintaining cell wall integrity and host plant pathogenesis" used a weighted gene co-expression network to identify Fusarium graminearum genes highly expressed during early symptomless infection of wheat. Based on its sequence and previous studies, authors selected FgKnr4 from the early symptomless Fusarium modules. The characterization of knockout strains revealed a role in morphogenesis, growth, cell wall stress tolerance, and virulence in F. graminearum and the phylogenetically distant fungus Zymoseptoria tritici.

      The methods are properly described and statistical analysis are reasonable so reproducibility is possible. The RNA-seq dataset is already published and the authors provided a repository with the code used to create the co-expression network. However, I have the following questions:

      • Why only use of high confidence transcripts maize to map the reads and not the full genome like Fusarium graminearum? I have never analyzed plant transcriptome.
      • The regular output of DESeq are TPMs, how did the authors obtain the FPKM used in the analysis?
      • Do the authors have a southern blot to prove the location of the insertion and number of insertions in Zymoseptoria tritici mutant and complemented strains?
      • Boxplots and bar graphs should have the same format. In Figures 5 B and F and supplementary figure 6.3 the authors showed the distribution of samples but it is lacking in figure 3 B and all bar graphs.
      • Line 247 FGRAMPH1_0T23707 should be FGRAMPH1_01T23707

      Referees cross-commenting

      I agree with reviewer 1, the order in which the figures are called in the text is confusing. Regardless of figures 5C-D I am no expert in the field therefore I can only say they look like they have not been edited.

      I agree with reviewer 1, data of DON mycotoxin production in infected issues is need it to support statement in line 272-273.

      I agree with Reviewer 2, the criteria to exclude genes from the final selection list should be explained.

      Significance

      The study showed, once again, that a weighted gene co-expression network is a great method to identify new genes of interest regardless of the organism or condition even if not very popular in the fungal pathogen field yet. The study proved that functions identified in a WGCN module from a pathogen have their opposite in the host module. The authors go beyond the theory and demonstrate the effect of the highest expressed gene during the early symptomless stage of infection in maize and wheat fungal pathogens.

      Fungal pathogen, RNA-seq, metabolic models, metabolism, comparative genomics

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: The authors in this manuscript use "dual weighting" to identify clusters or modules of genes from the fungus F. graminearum (Fg) with coordinated expression patterns with genes in wheat modules - potentially uncover key regulators or pathways linking Fg genes with plant traits, including plant pathogenesis. As proof of concept, the authors use one of the fungal genes FgKnr4 identified in a fungal module that has strong link with the wheat module. They were able to show that this gene is likely involved in CWI pathway and affects virulence properties of the fungus

      Major comments:

      Does the WGCN provide useful framework to link fungal genes affecting plant traits? If Knf4 is involved in the CWI pathway, what other genes involved in the CWI pathway are in this fungal module? This is not forthcoming. Due to development defects in the Fgknr1 mutant, I would not equate to as virulence factor or an effector gene.

      Since tri5 mutant was used a proof of concept to link wheat/Fg modules, it would have been useful to show that TRI14, which is not involved DON biosynthesis, but involved in virulence ( https://doi.org/10.3390/applmicrobiol4020058) impact the wheat module genes. Moreover, prior RNAseq studies with tri5 mutant strain on wheat would have revealed the expression of PAL and other phenylpropanoid pathway genes?

      Table S1 lists 15 candidate genes of the F16 module; however, supplementary File 1 indicates 74 genes in the same module.

      The basis of exclusion should be explained. The author has indicated genes with high MM was used as representative of the module. The 59 remaining genes of this module did not meet this criteria? Give examples. Did similar exclusion criteria used for other modules and if so, how many genes in each module pass the criteria? For example, Did TRI5 in module F12 pass this criteria. A list from every module that pass this criteria will be useful resource for functional characterization studies.

      Minor comments:

      Figure 3 indicates TRI genes in the module F12; your PHI base in Supp File S2 lists only TRI14. Why other TRI genes such as TRI5 not present in this File? What is purpose of listing the same gene multiple times? Example, osp24 (a single gene in Fg) is listed 13 times in F01 module.

      Referees cross-commenting

      agree with both reviewers regarding clarification of Figures.

      one of the reasons for developing modules or sub-networks is to assign common function and identify new genes contributing to the function. since FgKnr4 is noted to play a role in the CWI pathways, then genes in that module should have similar functions. If WGCN does not do that, what is the purpose of this exercise?

      Significance

      What new information is provided with WGCN modules compared with other GCN network in Fusarium (examples of GCN in Fusarium is below)

      https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5069591/ https://doi.org/10.1186/s12864-020-6596-y DOI: 10.1371/journal.pone.0013021

      The GCN networks from Fusarium have already identified modules necessary/involved in pathogenesis. Ideally, the WGCN should have been used identify plant targets of Fusarium pathogenicity genes. This would have provided credibility and usefulness of the WGCN.

      Many bioinformatic tools are available to Identify virulence factors and the utility of WGCN in this regard is not viable. However, if the authors had overlapped the known virulence factors in a fungal module to a particular wheat module, the impact of the WGCN would be great. The module W12 has genes from numerous traits represented and WGCN could have been used to show novel links between Fg and wheat. For example, does tri5 mutant affect genes in other traits?

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      A public mRNA-seq dataset from Dilks et al. (2019) for wheat spikelets infected by Fusarium graminearum was used to generate a dual weighted gene co-expression network (WGCN). Since colonization of the spike by F. graminearum progresses from spikelet to spikelet, thereby forming an infection-gradient from early to late stages, quasi spatio-temporal resolution for the transcriptomic dataset can be achieved by cutting the spike into equal pieces along this gradient (in this case cuts were done at rachis internodes 1-2, 3-4, 5-6, and 7-8. The authors created co-expression networks for both, fungal and plant genes, and cross-correlated them. They identify several modules specific for each infection stage. For further analysis, the authors focus on two module pairs. (1) the wheat module 12 (W12), which correlates to Fusarium module 12 (F12), and (2) the Fusarium module 16 (F16) and the correlated wheat modules 1 and 5 (W01/W05). The W12/F12 modules were deemed of interest because they were specific to the transition from symptomless to symptomatic infection stage. Here, the authors find genes related to mycotoxin production to be upregulated in the F12 module, while the W12 is enriched in genes involved in detoxification. F16 and W01/W05 are specific to the earliest stages of infection, and thus most likely involved in fungal virulence. Here, one of the key genes identified is FgKnr4, which the authors show to be important for fungal virulence, as gene knockout leads to a premature stop of disease progression. As the authors show that FgKnr4 is involved in activating cell wall-integrity mechanisms, and may function in oxidative stress-resistance, this reduced virulence may be the result a reduced ability of the fungus to withstand plant defense mechanisms. Interestingly, knocking out an orthologue of FgKnr4 in Zymoseptoria tritici led to similarly reduced virulence of this pathogenic fungus on wheat plant.

      Comments:

      Overall, I find the WGCN analysis to be very interesting and informative, especially because of the different stages of infection. As the dataset is made public (I believe), I think that this will be a really important resource for the community. The exemplary functional analysis of the F16/W01/W05 modules via FgKnr4 is very interesting and demonstrates that novel genes involved in virulence can be identified via this approach. A similar more detailed analysis of the W12/F12 modules with a focus on detoxification mechanisms in the plant (i.e. the W12 module) would be a very interesting bonus, but as much as I would be interested in reading about it, functional gene analyses in wheat are obviously time-consuming, and it is not essential to this manuscript. As a more critical comment, I find the presentation of the figures somewhat confusing, especially with the mixing of main figures, supplements to the main figures, and actual supplemental data. On top of that, the figures are not called up in the right order (e.g. Figure 4 follows 2D, while 3 comes after 4; Figure 6 comes before 5...), and some are never called up (I think) (e.g. Figure 1B, Figure 2B). In the case of figure 5, I generally find it hard to follow. In the text (line 262/263), the authors state that 5C shows "eye-shaped lesions" caused by ΔFgknr4 and ΔFgtri5, but I can't see neither (5C appears to be a ΔFgknr4 complementation experiment). The figure legend also states nothing in this regard. Figure 5D supposedly shows 'visibly reduced fungal burden' in ΔFgknr4-infected plants, but I can't really see the fungal burden in this picture, but the infected section looks a lot thinner and more damaged than the control stem, so in a way more diseased. The authors then go on to state (lines 272-273) that they analyzed the amounts of DON mycotoxin in infected tissues, but don't seem to show any data for this experiment. In contrast to the sometimes confusing data presentation, I find the table of correlated modules (table 1) very helpful, and obviously am happy to see that all data is available in the first author's GitHub account.

      Referees cross-commenting

      just to clarify in regards to my comment on Figures 5C-D, and Reviewer #3's comment "Regardless of figures 5C-D I am no expert in the field therefore I can only say they look like they have not been edited." - I didn't want to insinuate that the images have been edited. Based on the images provided, I just can't see what the authors state is shown. So this is not about editing/manipulation - just about image quality/choice. The phenotypic descriptions by the authors are quite detailed ("eye-shaped lesions", 'visibly reduced fungal burden'...), but at least for me, the images aren't good enough to illustrate and underpin their statements. Maybe better images are needed, maybe magnifications of the exact regions showing the phenotypes? But this is simply a matter of presentation, not of editing/manipulation.

      Second, I agree that there should be more CWI-related genes in the wheat module linked to the FgKnr4 fungal module, or, vice-versa, CW-manipulating genes in the fungal module. It would at least be good if the authors could comment further on if they find such genes, and if not, how this fits their model.

      Significance

      In summary, I think that the presented WGCN analysis of mRNA-seq data with quasi-spatio-temporal resolution is a very helpful approach to identify novel fungal virulence and plant immunity genes, and with the created datasets made public, this will be an interesting and valuable resource for the community. The identification and functional analysis of FgKnr4 works as proof-of-principle. If the data presentation is improved, I believe that this will be an interesting publication.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This study identified Cpn1, a fission yeast ortholog of human Caprin1, to be involved in heterochromatin re-establishment in S. pompe by potentially regulating heterochromatic transcript stability and localization. Moreover, Cpn1 was shown to be important for stress granule formation. Although the role of Cpn1 for heterochromatin establishment and granule formation is strong, the point that there is a crosstalk between the two is weaker and could be explained through multiple independent effects. Overall, the manuscript provides interesting new observations with regards to Cpn1 function that are adequate for publication, however, a few additional validations need to be performed to strengthen the crosstalk conclusion, which could be a new significant connection.

      The following comments could be addressed before publication to improve the manuscript.

      Specific Comments

      1. The point that there is a "Crosstalk between heterochromatin integrity and cytoplasmic RNP granule formation" could be strengthened before publication, or the tone of the manuscript revised to reflect the issues below.

      A) There is no data in this section showing a direct crosstalk between heterochromatin integrity and granules. The results show that certain mutants alter heterochromatin integrity the formation of PABP-containing granules. However, those proteins could have different functions in the nucleus, cytoplasm and upon stress. Thus, granule formation could be independent of heterochromatin perturbation. E.g. Loss of Ago1 could impact cytoplasmic RNA abundance/ stability and this therefore influence granule assembly. Similarly, Cpn1/Caprin could be a multifunctional protein in S. pombe and affect various aspects of RNA metabolism. The possibility, that these proteins have multiple different functions in the cell and heterochromatin integrity and cytoplasmic RNP granule formation could be due to different functions of those proteins and not due to a crosstalk should be discussed as well.

      B) Figure 5 title says that "disruption of heterochromatin alters the formation of PABP-containing RNP granules". There is no data in this figure that shows that "disruption of heterochromatin" directly causes granules. Its rather that heterochromatin mutants show altered PABP-containing RNP granules. While this is fine, it should be pointed out that this is a correlation, with a direct connection being inferred.

      C) The strongest connection of heterochromatin integrity to RNP granules is the accumulation of heterochromatic transcripts in those granules. Therefore, the manuscript could be strengthened by rearranging the sections/figures.

      In addition, Fig. 5A shows PABP containing granules in unstressed conditions for rik1, clr4, ago1 loss. This suggests that those granules contain lots of polyadenylated RNA. Evidence is needed that the mutations studied are not affecting global cytoplasmic translation or mRNA decay (e.g. by puromycin staining and smRNA-FISH staining or qPCR (e.g. GAPDH)). Moreover, it needs to be shown that endogenous expression of Cpn1 is unchanged. If those perturbations affect Cpn1 (or Nxt3) levels, the granule phenotype could be solely due to changes in stress granule promoting proteins. 2. The work would be strengthened by adding some additional experiments. Specifically:

      A) "Previous studies by ourselves and others have provided evidence that accumulation of transcripts on chromatin can impair heterochromatin assembly, possibly through increased formation of RNA-DNA hybrids". Increased RNA-DNA hybrids/ R-loop structures were shown to lead to genomic instability, which could lead to micronuclei formation and nuclei leakage in the cytoplasm during stress. It would be nice to provide close-up view images of those stress induced cytoplasmic granules, that contain cenRNA. Do they contain weaker DAPI signal in those granules, which would be indicative of micronuclei? Moreover, providing an additional stain using either an R-loop antibody, cGAS, or a nuclei membrane marker such as LAMIN B1 could be used to rule out micronuclei/ membranous assemblies.

      B) RNA FISH and quantification for PABP foci in unstressed and stressed clr4Δ dhp1-2 cells is key, which should show the highest changes in foci and RNA accumulation in foci. Are those cells showing an even stronger effect on heterochromatin establishment?

      C) Compared to clr4Δ cpn1Δ, do clr4Δ dhp1-2 cpn1Δ (or cpn1Δ and dhp1-2 cpn1Δ) form more stress induced granules? And do those granules contain more heterochromatic transcripts?

      D) Is heterochromatic transcripts localization in -glucose induced granules also seen with heat shock? 3. Additional tests and quantification is needed to support that conclusion: "...whereas heterochromatin mutants show increased Pabp granule formation in absence of stress, they show a significant reduction in the average number of Pabp foci formed in the presence of stress,...". Different proteins could regulate/loss of proteins impact granule assembly in different manners, for example RNA localization, assembly, disassembly, foci number, foci size, % cells with foci etc. It looks like rik1Δ shows larger foci. Therefore, upon stress, there could be indeed fewer foci because they are larger. A quantification of foci area and total foci area per cell should support or reject that conclusion. Moreover, is the same trend also observed with starvation stress, or only heat shock? 4. The field generally believes G3BP1 is not an endonuclease, and therefore the authors might want to edit the section: " ...these could include, for example, Cpn1 binding partner Nxt3, the human ortholog of which, G3BP1, has been shown to function as an endoribonuclease for degradation of selected RNAs (63)." Although, it was shown that G3BP1 has endoribonuclease activity in that reference, this was never reproduced and is generally now accepted in the field that G3BP1 does not function as a endoribonuclease to regulate RNA homeostasis.

      Minor comment:

      1. Fig. 2 is a bit confusing. Maybe it could help if the controls - to rule out heterochromatin maintenance (B, C,D) - could be better grouped together or put into supplementary.
      2. Fig 4 A, figures for Cpn1R6A-GFP upon glucose starvation is missing.
      3. Fig 4 D, intensity is weaker in mkt1Δ, its difficult to see the granules. It looks like based on the image that there are less granules, but the quantification shows unchanged granules.
      4. Fig 5 D/ Supl S5A, images with stress are missing for rik1 loss (the ones leading to qualifications in 5D)
      5. Fig 5 C, rik1Δ, ago1Δ, co-colocalised with Cpn1-GFP are missing
      6. Fig.6D, one arrow showing cytoplasmic foci is shifted. PABP stain needs to be added to highlight these are cytoplasmic PABP foci.
      7. cen(dg) transcripts show lack of localization in granules in unstressed cells. The explanation why is a bit unclear. "It remained possible that these foci might rather be associated with RNA degradation, ...". Or RNA degradation happens in the nucleoplasm or cytoplasm. Moreover, in the discussion: "Although we were not able to detect stable accumulation of heterochromatic RNAs in these granules by RNA-FISH, we suspect that this may be because these granules are associated with RNA turnover, although it is also possible that they arise as a result of altered RNP homeostasis more broadly." This is confusing. If those granules form due to accumulation of heterochromatic RNAs, then they cannot be the sites for their degradation, because then they would disassemble, if heterochromatic RNAs are degraded.
      8. Clr4Δ and other conditions shows accumulation of cen(dg) RNA-FISH at the centromeres. It would be informative to see if Cpn1-GFP shows colocalization, which could provide additional evidence for the two models in Fig. 7.
      9. Fig. 6D/E: Its difficult to see changes of cen(dg) RNA-FISH intensity at the centromeres in the images. A close-up view should be provided without overexposure to indicate differences.

      Significance

      This manuscript begins to address a possible relationship between stress granule formation and the regulation of heterochromatin. This is an interesting connection, although the mechanism by which such these two processes are connected is unclear at this time.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In their manuscript titled "Fission yeast Caprin protein is required for efficient heterochromatin establishment", Zhang and colleagues describe the role of Caprin1 (Cpn1), the fission yeast ortholog of mammalian CAPRIN1, in the de novo establishment of heterochromatin. Using reporter assays for heterochromatin formation, they found that deletion of Cpn1 reduces H3K9 methylation, thereby impairing the de novo establishment of silencing, while the maintenance of silencing at centromeric repeats remains unaffected.

      The authors demonstrate that Cpn1 interacts with the fission yeast orthologs of known human CAPRIN1 interaction partners, Nxt3 and Ubp3. These factors are then tested for their influence on the establishment of silencing.

      Using microscopy to observe tagged Cpn1 and known stress granule markers, the authors show that Cpn1 localizes to stress granules. They also quantitatively assess the impact of Cpn1 interactors on stress granule formation. Additionally, the authors note that different RNAi mutants have stress granules even in the absence of envirmental stresses.

      Finally, they investigate the subcellular localization of the cen(dg) transcript using single-molecule RNA FISH. They find that in heterochromatin mutant cells, cen(dg) transcripts localize to the nucleus and exhibit both nuclear and cytoplasmic localization under glucose starvation conditions. The authors also show, through quantification of RNA FISH and RT-qPCR, that cen(dg) transcripts accumulate in Cpn1 mutants.

      Major comments:

      The authors suggest that Cpn1 is requried for efficient degradation of RNA which in-turn helps the establishemnt of heterochromatin potentially by preventing the formation of R-loops. Yet the localization of Cpn1 under non-stressed conditions is cytoplasmic. Additionally the authors show that Cpn1 helps to limit the accumulation of heterochromatic trancripts on chromatin, yet there is no evidence for a nuclear pool of Cpn1 in these condtions.

      In order to strengthen the link between these nuclear processes it is important that the authors follow up on some of the aspects detailed below.

      Is Cpn1 also present in the nucleus in non-stressed conditions? This would be a pre-requesit for a direct mechanisic link for the model that the authors suggest. This could be, for example, adressed by LeptomycinB treatment followed by imaging of Cpn1. Such an experiment could reveal if the protein is shutteling between the nucleus and the cytoplasm.

      In Fig 6 C the authors show co-localization of dh-transcripts with Papb1 under glucose starvation conditions. To strengthen their hypothesis of cen(dg) RNA binding/regulation by Cpn1 show and quantify co-localization of Cpn1 and cen(dg) transcripts.

      The authors observe cytoplasmic cenRNA in Clr4-delta dhp1-2 cells. To substantiate the hypothesis that Cpn1 binds such transcript co-localization of Cpn1-GFP with cen(dg) transcripts should be examined.

      Can the authors rule out that deletion of Cpn1 affects the RNA levels of proteins important for heterochromatin establishment? As Cpn1 could regulate the stability of mRNAs in the cytoplasm it might be worthwhile to consider also an indirect effect of Cpn1 deletion on the process of heterochromatin establishment. The study would benefit from a genomic characterization of Cpn1-delta cells using RNAseq or an extended discussion of this potential caveat.

      Minor comments:

      RNA-FISH images are labeled cen RNA, please provide consistent lables for the transcript throught the manuscript (cen(dg)).

      Please display individual data points for replicate experiments when displaying qPCR results. This would give the reader more opportunities to judge the distribution of the data points. (Fig2 C,D,F,G,H, Fig6 F)

      Fig 2 G: it is not immediately obvious that the two bar plots display different HOOD amplicons. The presentation of the data could be improved.

      Significance

      In the presented manuscript Zhang and colleauges place Cpn1 as a novel factor into the fission yeast RNAi pathway. This study suggests a link between RNAi and stress granule biology which would provide a novel connection of these fields. The manuscript will be of interest to a specialized audience, both in RNAi/heterochromation formation and stress granule biology and additionally would provide a novel function of a CAPRIN ortholog.

      The manuscript is well written and overall the data is presented well. Furthermore the authors provide a solid genetic characterization of Cpn1's effect on heterochromatin establishment. While the manuscript provides several interesting observations their mechanistic link remains unclear and I belive that it would be very important to substaniate their observations with experiments supporting a direct mechanistic link between heterochromatin establishemnt and Cpn1.

      Field of expertise: small RNA mediated heterochromatin formation, RNA biology, chromatin biology

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this study, Zhang et al. identified a novel factor named Cpn1 promoting de novo heterochromatin establishment in the fission yeast Schizosaccharomyces pombe. The authors established an elegant genetic approach to identify mutants impaired in the de novo heterochromatin assembly, allowing the quantitative assessment of heterochromatin establishment in a highly reproducible manner. This approach allowed them to revisit potential candidates for heterochromatin establishment from a previous study, leading to the identification of Cpn1. Cells lacking Cpn1 display primarily defects in heterochromatin establishment but not maintenance at constitutive heterochromatin. While the function of Cpn1 was unknown, the authors established a functional link to stress granule formation, demonstrating that Cpn1 is the ortholog of the human RNA-binding protein CAPRIN1, likewise forming a complex with two other factors, Nxt3 and Ubp3. Mutating a putative RNA-binding RRG motif in Cpn1 largely phenocopied the establishment defect seen in the deletion mutant. Moreover, Cpn1 and its complex members co-localize with the stress granule marker poly(A)-binding protein, Pabp. Conversely, deleting Cpn1 or mutating its RRG motif resulted in a reduced number of stress granule formation.

      Providing a further link between heterochromatin and stress granules, the authors showed that heterochromatin-deficient mutants accumulate Papb foci in the absence of stress cells, which was largely dependent on Cpn1. Conversely, the number of stress granules was reduced under stress conditions in these mutants, suggesting that heterochromatic transcripts compete with canonical RNPs that form stress granules. The molecular mechanism by which Cpn1 contributes to heterochromatin establishment remains unclear, though. In contrast to Mkt1, another establishment factor previously studied by the authors, no heterochromatic transcripts were found to be associated with Cpn1 when performing RNA-IP (RIP) experiments. The authors then analyzed pericentromeric transcripts (cen RNA) by smRNA-FISH. Deleting the H3K9 methyltransferase Clr4 resulted in the formation of nuclear foci that co-localized with the centromeric histone variant CENP-A. Glucose starvation increased the number of foci, which were also found in the cytoplasm under this stress condition and partially co-localized with Pabp. Notably, inactivating the exoribonuclease Dph1/Xrn2 also resulted in increased nuclear foci formation and accumulation in the cytosol, which was prevented when Cnp1 was absent. Hence, the authors proposed a model, by which Cpn1 limits accumulation of heterochromatic transcripts on chromatin by facilitating their export and cytoplasmic degradation by Dhp1.

      Major comments

      1. The authors suggest that Cnp1 contributes to heterochromatin establishment by facilitating the removal of excessive heterochromatic transcripts from chromatin. Nevertheless, despite the accumulation of pericentromeric transcripts inside the nucleus in clr4∆, direct evidence for their accumulation on chromatin remains elusive. While the authors cautiously avoid assigning Cnp1 a definite role in heterochromatic transcripts removal, investigating RNA:DNA hybrid accumulation through DNA-RNA immunoprecipitation (DRIP) could strengthen their conclusions. Given the successful application of DRIP by the authors in their Mkt1 study (Taglini et al., 2020) and its prior used by others (PMID: 28404620), this approach appears feasible and judicious when appropriate controls are implemented.
      2. While the data support the hypothesis that Cpn1 binds RNA, the authors could not detect Cpn1 association with heterochromatin transcripts. This could be due to transient interactions and their fast turnover, as the authors suggested. The authors could repeat the RIP experiments in mutants that prevents turnover of transcripts, for instance in dhp1-2 and rrp6∆ mutants.

      Minor comments:

      1. How was the quantification of Pabp and Cpn1 foci performed? Little information is provided ("images were [...] exported to ImageJ analysis"). Given the presence of additional (diffuse) signals even under stress conditions, I'm wondering how foci were distinguished from background? Was there a threshold for signals considered to be 'foci' versus background? The authors should give a more detailed description in the figure legends or Materials & Methods section.
      2. How was the relative sm-FISH intensity in Figure 6D and E determined? Have there been internal controls to ensure that hybridization efficiency was comparable for different strains/samples?

      Significance

      The is an intriguing study that provides several functional links between heterochromatin establishment and stress responses, using a combination of elegant yeast genetics, imaging, biochemical approaches and proteomics. While it was previously shown that heterochromatic transcripts can accumulate on chromatin interfering with heterochromatin assembly (Broenner et al., 2017), this study conceptionally advances our understanding of this process by describing a potential role for Cpn1 in facilitating nuclear export of heterochromatic transcripts. This study further describes the conservation of the human CAPRIN1 complex and its role in cytosolic stress granule assembly in yeast and therefore will be of broad interest for researchers interested in heterochromatin assembly and RNP homeostasis. A limitation of this study is the lack of a distinct molecular mechanism by which Cpn1 promotes heterochromatin establishment. Performing additional experiments could strengthen the authors' arguments and contribute to a better understanding of the underlying mechanism(s).

      I have a longstanding expertise in heterochromatin assembly, transcriptional silencing and yeast genetics using S. pombe and S. cerevisiae as model systems.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2024-02546

      Corresponding author: Woo Jae, Kim

      1. General Statements

      The goal of this study is to provide the insights of one specific neuron ‘SIFa’ controls interval timing behavior by its receptor ‘SIFaR’ through neuropeptide relay. Interval timing, or the sense of time in the seconds to hours range, is important in foraging, decision making, and learning in humans via activation of cortico-striatal circuits. Interval timing requires completely distinct brain processes from millisecond or circadian timing. In summary, interval timing allows us to subjectively sense the passage of physical time, allowing us to integrate action sequences, thoughts, and behavior, detect developing trends, and predict future consequences.

      Many researchers have tried to figure out how animals, including humans, can estimate time intervals with such precision. However, most investigations have been conducted in the realm of psychology rather than biology thus far. Because the study of interval timing was limited in its ability to intervene in the human brain, many psychologists concentrated on developing convincing theoretical models to explain the known occurrence of interval timing.

      To overcome the limits of studying interval timing in terms of genetic control, we have reported that the time investment strategy for mating in Drosophila males can be a suitable behavioral platform to genetically dissect the principle of brain circuit mechanism for interval timing. For example, we previously reported that males prolong their mating when they have previously been exposed to rivals (Kim, Jan & Jan, "Contribution of visual and circadian neural circuits to memory for prolonged mating induced by rivals" Nature Neuroscience, 2012) (Kim et al, 2012), and this behavior is regulated by visual stimuli, clock genes, and neuropeptide signaling in a subset of neurons (Kim, Jan & Jan, “A PDF/NPF Neuropeptide Signaling Circuitry of Male Drosophila melanogaster Controls Rival-Induced Prolonged Mating” Neuron, 2013) (Kim et al, 2013). And we also reported that the sensory inputs are required for sexual experienced males to shorten their mating time (Lee, Sun, et al, “Taste and pheromonal inputs govern the regulation of time investment for mating by sexual experience in male Drosophila melanogaster” PLOS genetics, 2023) (Lee et al, 2023).

      Throughout their lives, all animals must make decisions in order to optimize their utility function. Male reproductive success is determined by how many sperms successfully fertilize an egg with a restricted number of investment resources. To optimize male reproductive fitness, a time investment strategy has been devised. As a consequence, we believe that the flexible responses of mating duration to different environmental contexts in Drosophila males might be an excellent model to investigate neural circuits for interval timing.

      The most well-known features of mammalian modulating energy homeostasis between the gut and the brain is one of the most intensively studied neuro-modulatory circuits via the neuronal relay of neuropeptides. In this article, we report that SIFa controls two alternate interval timing behaviors through neuropeptide relay signaling by SIFaR and other important neuropeptides and transmits the internal states of the male brain into decision making. According to our findings, male Drosophila utilize SIFa-SIFaR signaling modulating LMD and SMD behaviors. During our investigation in this regulation, we found a subset of cells that express SIFaR in SOG and AG region are important for the modulation of interval timing behaviors. Furthermore, we discovered a neuropeptide named Corazonin (Crz) which expressed in SIFaR is important for both LMD and SMD behaviors.

      Our discovery of neuropeptide relay of SIFa-SIFaR-Crz-CrzR in male Drosophila in modulating interval timing behaviors will be a huge step forward in our knowledge of interval timing behavior.

      2. Point-by-point description of the revisions

      Reviewer #1

      Comment 1. The authors are to be commended for the sheer quantity of data they have generated, but I was often overwhelmed by the figures, which try to pack too much into the space provided. As a result, it is often unclear what components belong to each panel. Providing more space between each panel would really help.

      __ Answer:__ We are grateful for the insightful feedback regarding the structure of our data presentation. In response to your valuable suggestion, we have made adjustments in this revised version by downsizing the diagram and ensuring the spacing between the panels.

      Comment 2. The use of three independent RNAi lines to knock down SIFaR expression is experimentally solid, as the common phenotype observed with all 3 lines supports the conclusion that the SIFaR is important for mating duration choice. However, the authors have not tested whether these lines effectively reduce SIFaR expression, nor whether the GAL80 constructs used to delimit knockdown are able to effectively do so. This makes it hard to make definitive conclusions with these manipulations, especially in the face of negative results. A lack of complete knockdown is suggested by the fact that the F24F06 driver rescues lethality when used to express SIFaR in the B322 mutant background, but does not itself produce lethality when used to express SIFaR RNAi. The authors should either conduct experiments to determine knockdown efficiency or explicitly acknowledge this limitation in drawing conclusions from their experiments. A similar concern relates to the CrzR knockdown experiments (eg Figure 7).

         __Answer:__ We appreciate the reviewer's attention to the details of our experimental design. Indeed, the validation of SIFaR-RNAi efficiency is crucial for interpreting our results accurately. In our initial experiments, we focused on the consistent phenotypic outcomes across the three independent RNAi lines, which collectively suggest the importance of SIFaR in LMD and SMD behaviors. However, we recognize the importance of confirming the effectiveness of our RNAi constructs in reducing SIFaR expression. Initially, we incorporated experiments utilizing *elav-GAL80* to demonstrate that the SIFaR knockdown mediated by the *elavc155* driver is sufficient to eliminate LMD and SMD behaviors. The corresponding results are presented in Figure 1C-D, with a detailed description provided in the manuscript as detailed below.
      

      "The inclusion of elav-GAL80, which suppresses GAL4 activity in a pan-neuronal context, was found to restore both LMD and SMD behaviors when SIFaR was knocked down by a pan-neuronal elavc155 driver (Fig. 1C-D). This observation suggests that the reduction in SIFaR expression mediated by the elavc155 driver is sufficient to significantly impair LMD and SMD behaviors."

         In response to the comments, we have conducted a thorough reevaluation in our revised manuscript. Specifically, we have confirmed the efficiency of the SIFaR-RNAi line HMS00299, which exhibited the most pronounced phenotype when co-expressed with the tub-GAL4 and nSyb-GAL4 drivers, using quantitative real-time PCR (qRT-PCR). It has come to our attention that we omitted mentioning the embryonic lethality induced by the HMS00299 line when combined with either tub-GAL4 or nSyb-GAL4 drivers, which is consistent with the homozygous lethality observed in the *SIFaRB322* mutant. To address this, we have performed qRT-PCR experiments by crossing the HMS00299 line with tub-GAL4; tub-GAL80ts, allowing for the temporary knockdown of SIFaR specifically during the adult stage. We utilized w-/SIFaR-RNAis as a control in these experiments. The outcomes are illustrated in Figure 1E, and we have made the necessary modifications and additions to the manuscript to accurately reflect the efficiency of the SIFaR-RNAi line as detailed below.
      

      "To ensure that RNAi did not have an off-target effect, we tested three independent RNAi strains and found that all three RNAi successfully disrupted LMD/SMD when expressed in neuronal populations. (Fig. S1E-J). We chose to use the HMS00299 line as SIFaR-RNAi for all our experiments because it efficiently disrupts LMD/SMD without UAS-dicer expression. Employment of broad drivers, including the tub-GAL4 and the strong neuronal driver nSyb-GAL4, with HMS00299 line consistently results in 100% embryonic lethality (data not shown). This phenotype mirrors the homozygous lethality observed in the SIFaRB322 mutant. The efficiency of HMS00299 SIFaR-RNAi lines was also validated through quantitative PCR analysis (Fig. 1E). Consequently, we infer that the knockdown of SIFaR using the HMS00299 line nearly completely diminishes the levels of the SIFaR protein."

      We also examined the knockdown efficiency of CrzR in the experiments related to Figure 8 (revised version), following a similar approach (Fig. S7K).

      Comment 3. Most of the behavioral experiments lack traditional controls, for example flies that contain either the GAL4 or UAS elements alone. The authors should explain their decision to omit these control experiments and provide an argument for why they are not necessary to correctly interpret the data. In this vein, the authors have stated in the methods that stocks were outcrossed at least 3x to Canton-S background, but 3 outcrosses is insufficient to fully control for genetic background.

      • *Answer: We sincerely thank the reviewer for insightful comments regarding the absence of traditional genetic controls in our study of LMD and SMD behaviors. We acknowledge the importance of such controls and wish to clarify our rationale for not including them in the current investigation. The primary reason for not incorporating all genetic control lines is that we have previously assessed the LMD and SMD behaviors of GAL4/+ and UAS/+ strains in our earlier studies. Our past experiences have consistently shown that 100% of the genetic control flies for both GAL4 and UAS exhibit normal LMD and SMD behaviors. Given these findings, we deemed the inclusion of additional genetic controls to be non-essential for the present study, particularly in the context of extensive screening efforts. Consequently, in accordance with the reviewer's recommendation, we conducted genetic validation experiments on novel genetic crosses, including SIFaR-RNAi/+, CrzR-RNAi/+, and GAL4NP5270/+, and incorporated the results in the supplementary figures (Supplementary information 1). We have made the necessary modifications and additions to the manuscript as below.

      "Given those genetic controls, as evidenced by consistent exhibition of normal LMD and SMD behaviors (Supplemental information 1), the observed reduction in SIFaR expression, driven by elavc155, is deemed sufficient to induce significant disruptions in LMD and SMD behaviors."

      However, we understand the value of providing a clear rationale for our methodology choices. To this end, we have added a detailed explanation in the "MATERIALS AND METHODS" section and the figure legends of Figure 1. This clarification aims to assist readers in understanding our decision to omit traditional controls, as outlined below.

      "__Mating Duration Assays for Successful Copulation__The mating duration assay in this study has been reported (Kim et al. 2012; Kim et al. 2013; Lee et al. 2023). To enhance the efficiency of the mating duration assay, we utilized the Df(1)Exel6234 (DF here after) genetic modified fly line in this study, which harbors a deletion of a specific genomic region that includes the sex peptide receptor (SPR) (Parks et al. 2004; Yapici et al. 2008). Previous studies have demonstrated that virgin females of this line exhibit increased receptivity to males (Yapici et al. 2008). We conducted a comparative analysis between the virgin females of this line and the CS virgin females and found that both groups induced SMD. Consequently, we have elected to employ virgin females from this modified line in all subsequent studies. For group reared (naïve) males, 40 males from the same strain were placed into a vial with food for 5 days. For single reared males, males of the same strain were collected individually and placed into vials with food for 5 days. For experienced males, 40 males from the same strain were placed into a vial with food for 4 days then 80 DF virgin females were introduced into vials for last 1 day before assay. 40 DF virgin females were collected from bottles and placed into a vial for 5 days. These females provide both sexually experienced partners and mating partners for mating duration assays. At the fifth day after eclosion, males of the appropriate strain and DF virgin females were mildly anaesthetized by CO2. After placing a single female into the mating chamber, we inserted a transparent film then placed a single male to the other side of the film in each chamber. After allowing for 1 h of recovery in the mating chamber in 25℃ incubators, we removed the transparent film and recorded the mating activities. Only those males that succeeded to mate within 1 h were included for analyses. Initiation and completion of copulation were recorded with an accuracy of 10 sec, and total mating duration was calculated for each couple. Genetic controls with GAL4/+ or UAS/+ lines were omitted from supplementary figures, as prior data confirm their consistent exhibition of normal LMD and SMD behaviors (Kim et al. 2012; Kim et al. 2013; Lee et al. 2023; Huang et al. 2024; Zhang et al. 2024). Hence, genetic controls for LMD and SMD behaviors were incorporated exclusively when assessing novel fly strains that had not previously been examined. In essence, internal controls were predominantly employed in the experiments, as LMD and SMD behaviors exhibit enhanced statistical significance when internally controlled. Within the LMD assay, both group and single conditions function reciprocally as internal controls. A significant distinction between the naïve and single conditions implies that the experimental manipulation does not affect LMD. Conversely, the lack of a significant discrepancy suggests that the manipulation does influence LMD. In the context of SMD experiments, the naïve condition (equivalent to the group condition in the LMD assay) and sexually experienced males act as mutual internal controls for one another. A statistically significant divergence between naïve and experienced males indicates that the experimental procedure does not alter SMD. Conversely, the absence of a statistically significant difference suggests that the manipulation does impact SMD. Hence, we incorporated supplementary genetic control experiments solely if they deemed indispensable for testing. All assays were performed from noon to 4 PM. We conducted blinded studies for every test."

         We appreciate the reviewer's inquiry regarding the genetic background of our experimental lines. In response to the comments, we would like to clarify the following. All of our GAL4, UAS, or RNAi lines, which were utilized as the virgin female stock for outcrosses, have been backcrossed to the Canton-S (CS) genetic background for over ten generations. The majority of these lines, particularly those employed in LMD assays, have been maintained in a CS backcrossed status for several years, ensuring a consistent genetic background across multiple generations. Our experience has indicated that the genetic background, particularly that of the X chromosome inherited from the female parent, plays a pivotal role in the expression of certain behavioral traits. Therefore, we have consistently employed these fully outcrossed females as virgins for conducting experiments related to LMD and SMD behaviors. It is noteworthy that, in contrast to the significance of genetic background for LMD behaviors, we have previously established in our work (Lee *et al*, 2023) that the genetic background does not significantly influence SMD behaviors. This distinction is important for the interpretation of our findings. To provide a comprehensive understanding of our experimental design, we have detailed the genetic background considerations in the __"Materials and Methods"__ section, specifically in the subsection __"Fly Stocks and Husbandry"__ as outlined below.
      

      "To reduce the variation from genetic background, all flies were backcrossed for at least 3 generations to CS strain. For the generation of outcrosses, all GAL4, UAS, and RNAi lines employed as the virgin female stock were backcrossed to the CS genetic background for a minimum of ten generations. Notably, the majority of these lines, which were utilized for LMD assays, have been maintained in a CS backcrossed state for long-term generations subsequent to the initial outcrossing process, exceeding ten backcrosses. Based on our experimental observations, the genetic background of primary significance is that of the X chromosome inherited from the female parent. Consequently, we consistently utilized these fully outcrossed females as virgins for the execution of experiments pertaining to LMD and SMD behaviors. Contrary to the influence on LMD behaviors, we have previously demonstrated that the genetic background exerts negligible influence on SMD behaviors, as reported in our prior publication (Lee et al, 2023). All mutants and transgenic lines used here have been described previously."

      Comment 4. Throughout the manuscript, the authors appear to use a single control condition (sexually naïve flies raised in groups) to compare to both males raised singly and males with previous sexual experience. These control conditions are duplicated in two separate graphs, one for long mating duration and one for short mating duration, but they are given different names (group vs naïve) depending on the graph. If these are actually the same flies, then this should be made clear, and they should be given a consistent name across the different "experiments".

      * * Answer: We are grateful to the reviewer for highlighting the potential for confusion among readers regarding the visualization methods used in our figures. In response to this valuable feedback, we have now included a more detailed explanation of the graph visualization techniques in the legends of Figure 1, as detailed below. This additional information should enhance the clarity and understanding of the figure for all readers.

      "In the mating duration (MD) assays, light grey data points denote males that were group-reared (or sexually naïve), whereas blue (or pink) data points signify males that were singly reared (or sexually experienced). The dot plots represent the MD of each male fly. The mean value and standard error are labeled within the dot plot (black lines). Asterisks represent significant differences, as revealed by the unpaired Student’s t test, and ns represents non-significant differences (*p* *

      Comment 5.* The authors have consistently conflated overlap of neuronal processes with synaptic connections. Claims of synaptic connectivity deriving solely from overlap of processes should be tempered and qualified.

      • For example, they say (Lines 201-202) "These findings suggest that SIFa neurons and GAL424F06-positive neurons form more synapses in the VNC than in the brain." This is misleading. Overlap of 24F06-LexA>CD8GFP and SIFa-GAL4>CD8RFP tells us nothing about synapse number, or even whether actual synapses are being formed.*

      • *Answer: We sincerely thank the reviewer for their insightful and constructive feedback regarding the interpretation of our data. We acknowledge the important point raised about the limitations of inferring synapse numbers from the overlap of membrane GFP and RFP signals. We fully concur that more specific techniques, such as the GRASP method, are necessary to accurately quantify synapse numbers, as we have demonstrated in subsequent sections of our manuscript. In the section where we describe the SIFa-SIFaR neuronal architecture labeled with membrane GFP and RFP, we recognize the need for caution in not overstating the implications of these findings as indicative of synapse formation. In light of the reviewer's comments, we have revised our discussion to more accurately reflect the nature of the SIFa-SIFaR neuronal arborizing patterns, as detailed below. This revision aims to provide a more nuanced interpretation of our observations and to align with the current scientific understanding of synaptic quantification.

      "As previously reported, SIFa neurons arborize extensively throughout the CNS, but the neuronal processes of GAL424F06-positive neurons are enriched in the optic lobe (OL), sub-esophageal ganglion (SOG), and abdominal ganglion (AG) (GFP signal in Fig. 2F). Neuronal processes that are positive for SIFa and SIFaR strongly overlap in the prow (PRW), prothoracic and metathoracic neuromere (ProNm and MesoNm), and AG regions (yellow signals in Fig. S3A). We quantified these overlapping neuronal processes between SIFa- and SIFaR-positive neurons and found that approximately 18% of SIFa neurons and 52% of GAL424F06-positive neurons overlap in brain (Fig. S3B, C), whereas approximately 48% of SIFa and 54% of GAL424F06-positive neurons overlap in VNC (Fig. S3D, E). These findings suggest that SIFa neurons and GAL424F06-positive neurons form more neuronal processes in the VNC than in the brain."

      • * Lines 210-211: "The overlap of DenMark and syt.EGFP signals was highly enriched in both SOG and ProNm regions, indicating that these regions are where GAL424F06 neurons form interconnected networks". This is misleading. Overlap of DenMark and syt.EGFP does not indicate synapses (especially since these molecules can be expressed outside the expected neuronal compartment if driven at high enough levels).*

      • *Answer: We are grateful for the reviewer's critical insights regarding our interpretation of the DenMark and syt.eGFP experiments. We acknowledge the reviewer's point that the overlap of DenMark and syt.eGFP signals does not conclusively indicate synapses and that some of these signals can be expressed outside the expected neuronal compartments, particularly at high levels.

        It is important to note that DenMark and syt.eGFP are markers of synaptic polarity. In the original publication of DenMark, the authors demonstrated that while these two markers are closely apposed, they do not necessarily overlap, as seen in the labeled yellow areas. They concluded that these areas could represent closely apposed regions where "LNv neurons establish presynaptic contacts within the aMe, suggesting that these contacts are on the postsynaptic sites of the LNv neurons themselves. (Nicolaï et al, 2010)" The authors also observed that DenMark-enriched structures appear juxtaposed to, rather than coexpressed with, syt.eGFP, indicating a potential for synapse formation between R neurons within the eb. In contrast, projections to the suboesophageal ganglion, which show strong Syt–GFP expression, are devoid of DenMark, suggesting a different interpretation of the signals (Nicolaï et al, 2010).

        Building on these findings, we have reanalyzed our data with caution (as shown in Figure S3). In the SOG region, where we observed strong yellow signals, these were not limited to cell bodies but also extended to the middle region filled with neural processes. Upon close examination of the DenMark and syt.eGFP signals, we confirmed that these yellow signals are closely juxtaposed, suggesting the possibility of synapse formation between SIFaR24F06 neurons within the SOG. We emphasize that this interpretation is based on the original findings from the DenMark study. To provide clarity for general readers, we have added further explanations regarding the interpretation of these signals, as detailed below. We believe that our revised analysis and the additional explanations will help to clarify the potential implications of our findings, while also acknowledging the limitations and the need for further investigation.

      "DenMark-enriched structures, localized within the SOG, are observed in close apposition to syt.eGFP signals, as indicated by the white-dashed circles (Fig. S3Fa). This spatial relationship suggests that SIFaR-expressing neurons, identified by GAL424F06 labeling, may form synapses with one another within the SOG. The colocalization of yellow signals resulting from the interaction between DenMark and syt.eGFP has been previously interpreted and validated by other researchers, supporting our observation (Nicolaï 2010,Kennedy 2018). In contrast to the yellow signals observed in the SOG, which are indicative of neural processes, the yellow signals detected in the ProNm appear to be associated with cell bodies rather than neural processes, as DenMark signals are often observed to leak out (as shown in Fig. S3Fb) (Nicolaï 2010,Kennedy 2018). Despite the presence of juxtaposed DenMark and syt.GFP signals in the ProNm, the interpretation of the yellow signals as potential synapses between SIFaR neurons remains an open question (indicated by the question mark in Fig. S3K).

        • Lines 320-322: "Neurons expressing Crz exhibit robust synaptic connections with SIFaR24F06 neurons located in the PRW region of the SOG in the brain (panels of Brain and SOG in Fig. 5A)". This is again misleading. They are not actually measuring synapses here, but instead looking at area of overlap between neuronal processes of Crz and SIFaR cells.*

        Answer: We sincerely appreciate the reviewer's critical feedback regarding our initial data interpretation. We acknowledge the important distinction that overlapping membrane markers do not provide a direct measure of synapse formation. In line with the reviewer's suggestion, we have revised the relevant sentence to more accurately reflect this understanding, as detailed below.

      "Neurons expressing Crz were observed in close proximity to SIFaR24F06-expressing neurons within the PRW-SOG of the brain (panels of Brain and SOG in Fig. 6A)."

      • * In Figs 3B and S4A, they are claiming that all neuronal processes within a given delineated brain area are synapses. The virtual fly brain and hemibrain resource have a way to actually identify synapses. This should be used in addition to the neuron skeleton. Otherwise, it is misleading to label these as synapses.*

        Answer: We are grateful for the reviewer's insightful comments that highlighted the potential for misleading information in our previous submission. Upon careful reexamination of the virtual fly brain model, we have made the necessary corrections and updated the figures in our revised manuscript (Figure 3B and S4B). This reanalysis has allowed us to further substantiate our findings, confirming that SIFa neurons indeed establish dense synaptic connections with multiple regions of the central brain.

      • * Furthermore, measuring the area of GRASP signal is not the same as quantifying synapses. We don't know if synapse number changes (eg in lines 240-242).*

      • Answer: We sincerely appreciate the reviewer's valuable suggestion regarding our quantification methods for assessing synaptic changes using GRASP signals. We acknowledge the reviewer's accurate observation that GRASP signals alone cannot provide an exact quantification of synapse number changes. In response to this feedback, we have employed the 'Particle analysis' function of ImageJ to infer the number of synapses from GRASP signals, clearly labeling them as 'number of particles' (as exemplified in Figures S4G and S4J). Additionally, we have compared the average size of each particle to enable a more precise comparison of synapse number changes (as shown in Figures S4H and S4K). While it is true that GRASP signals should not be directly equated with synapse counts, the quantification of GRASP signal intensity can still provide insights into the underlying synaptic connectivity, as described in the original GRASP paper (Feinberg et al, 2008a). Following this approach, previous studies have used signal intensity quantifications to draw conclusions about changes in synaptic specificity in various mutants. Since our methods for measuring GRASP intensity are consistent with the original techniques, we have updated our Y-axis labeling to reflect 'normalized GFP intensity (Norm. GFP Int.)', as exemplified in Figure 4. This change aims to provide a clearer and more accurate representation of our data.

      Comment 6.* In general, the first part of the manuscript (implicating SIFaR in mating duration) is much stronger than the second part, which attempts to demonstrate that SIFa acts through Crz-expressing neurons to induce its effects. The proof that SIFa acts through Crz-expressing neurons to modify mating duration is tenuous. The most direct evidence of this, achieved via knockdown on Crz in SIFaR-expressing cells, is relegated to supplemental figures. The calcium response of the Crz neurons to SIFa neuron activation (Fig. 6) is more of a lack of a decrease that is observed in controls. Also, this is only done in the VNC. Why not look in the brain, because the authors previously stated a hypothesis that the "transmission of signals through SIFaR in Crz-expressing neurons is limited to the brain" (lines 381-382)?

      Furthermore, the authors suggest that Crz acts on cells in the heart to regulate mating duration. It would be useful to add a discussion/speculation as to possible mechanisms for heart cells to regulate mating decisions. Is there evidence of CrzR in the heart? The SCope data presented in Fig. 7I-L and S7G-H is hard to read.*

      • Answer: We sincerely appreciate the reviewer's constructive feedback on the section of our manuscript that discusses the role of the SIFa-SIFaR connection in regulating mating duration. We understand that the initial presentation may not have been sufficiently convincing. As we detailed in our previous biorXiv preprint (Wong et al, 2019), we conducted a comprehensive screen of numerous neuropeptides and their receptors that mediate SIFa signals through SIFaR and added those data in Supplementary Table S1 and S2. Among these, Crz was identified as a key neuropeptide in this pathway and is also well-documented for its role in mating duration (Tayler et al, 2012). Our data clearly demonstrate that Crz neurons are responsive to the activity of SIFa neurons, supporting the validity of this connection. Additionally, in another manuscript focusing on the input signals for SIFa (Kim et al*, 2024), we established that CrzR does not function in SIFa neurons, confirming the bidirectional nature of SIFa-to-Crz signaling.

        Inadvertently, we had relegated the Crz knockdown results to supplementary figures, under the assumption that our screening results regarding the relationship between SIFaR and neuropeptides were already well-covered (Wong et al, 2019). In light of the reviewer's comments, we have now relocated the Crz knockdown results, particularly those involving SIFaR-expressing cells, to the main figures (Figure 6F-G). We have also included a more detailed description of our previous screening results within the manuscript, as outlined below, to provide a more comprehensive understanding of our findings.

      "Furthermore, the Crz peptide and Crz-expressing neurons have been characterized as pivotal relay signals in the SIFa-to-SIFaR pathway, which is essential for modulating interval timing behaviors (Wong 2019)."

         We greatly appreciate the reviewer's critical and constructive feedback regarding the detection of SIFa-to-Crz long-distance signaling, particularly their observation that this signaling is detectable from the brain to the VNC but not between brain regions. In response to the reviewer's suggestions, we have made the following adjustments to our manuscript:
      
      1. We have relocated our SIFa-Crz GCaMP data pertaining to the VNC region to the Figures 6L-O) to maintain focus on the primary findings within the main text.
      2. Our deeper analysis has led to the identification of two cells in the Super Intermediate Protocerebrum (SIP) regions that coexpress both Crz and SIFaR24F06, as well as OL cells (Figure 6D-E).
      3. We have included GCaMP data from the brain region in the main figure to provide a comprehensive view of the signaling dynamics (Fig. 6P-R and Fig. S6N-P).
      4. Upon examining the SIFa-to-Crz signaling through GCaMP calcium imaging, we observed that the calcium levels in Crz+/SIFaR+ SIP neurons consistently decreased upon SIFa activation (Figure 6P-R). In contrast, the calcium signals in Crz+/SIFaR+ OL neurons increased upon SIFa activation, similar to the pattern observed in Crz+ AG neurons in the VNC (Figure 6M-O and Figure S6N-P).
      5. We have summarized these findings in Figure 6S and provided a detailed description of the results in the manuscript, as outlined below. "To elucidate the direct response of Crz neurons to the activity of SIFa neurons, we conducted live calcium (Ca2+) imaging in the Super Intermediate Protocererbrum (SIP), OL and AG region of the VNC, where Crz neurons are situated (Fig. 6D, Fig. S6M). Upon optogenetic stimulation of SIFa neurons, we observed a significant increase in the activity of Crz in OL and AG region (Fig. 6L-O, Fig. S6N-P), evidenced by a sustained elevation in intracellular Ca2+ levels that persisted in a high level before gradually declining to baseline levels, where the cells in top region of the SIP exhibit consistently drop down after stimulated the SIFa neurons (Fig. 6P-R). These calcium level changes were in contrast to the control group (without all-trans retinal, ATR) (Fig. 6L-R, Fig. S6N-P). These findings confirm that Crz neurons in OL and AG are activated in response to SIFa neuronal activity, but the activity of Crz neurons in SIP are inhibited by the activition of SIFa neuron, reinforcing their role as postsynaptic effectors in the neural circuitry governed by SIFa neurons. Moreover, these results provide empirical support for the hypothesis that SIFa-SIFaR/Crz-CrzR long-range neuropeptide relay underlies the neuronal activity-based measurement of interval timing."

        We are grateful for the reviewer's opportunity to elaborate on the intriguing findings concerning the expression of CrzR in the heart and its potential link to mating duration. In the context of traditional interval timing models (Meck et al, 2012; Matell, 2014; Buhusi & Meck, 2005), the role of a pacemaker in generating a temporal flow for measuring time is considered essential. The heart, being a well-known pacemaker organ in animals, provides a compelling framework for our discussion. In response to the reviewer's insightful comments, we have expanded upon our hypotheses in the DISCUSSION section, exploring the possible connections between cardiac function and the regulation of mating duration. Our reflections on this topic are detailed as follows:


      "It has been reported that the interaction between the brain and the heart can influence time perception in humans (Khoshnoud et al, 2024). Heart rate is governed by intrinsic mechanisms, such as the muscle pacemaker, as well as extrinsic factors including neural and hormonal inputs (Andersen et al, 2015). Moreover, the pacemaker function is essential for the generation of interval timing capabilities (Meck et al, 2012; Matell, 2014; Buhusi & Meck, 2005), with the heart being recognized as the primary pacemaker organ within the animal body. Consequently, the CrzR in the fly heart may respond to the Crz signal sent by SIFaR+/Crz+ cells and modulate the heart rate, thereby impacting the perception of time in male flies."

         We appreciate the reviewer's interest in the expression of CrzR in the heart and its potential implications for our study. In response to the reviewer's comments, we have conducted a thorough examination of the fly SCope RNAseq dataset. Our analysis revealed that CrzR is indeed broadly expressed in heart tissue, particularly in areas where the Hand gene is also expressed. This significant finding has been incorporated into our manuscript and is depicted in Figure 8L. As illustrated in Figures 8I-L, which present the SCope tSNE plot for various cell types including neurons, glial cells, muscle systems, and heart, the heart tissue exhibits the most robust expression of CrzR. This observation suggests that the Hand-GAL4 mediated CrzR knockdown experiments may provide insights into the role of CrzR expression in the heart and its influence on the interval timing behavior of male fruit flies. We have expanded upon this interpretation in the relevant sections of our manuscript to ensure a clear and comprehensive understanding of our results.
      
      • *

      Comment 7. In several cases, the effects of being raised single are opposite the effects of sexual experience. For example, in Fig. 4T, calcium activity is increased in the AG following sexual experience, but decreased in flies raised singly. Likewise, Crz-neurons in the OL have increased CaLexA signal in singly-raised flies but reduced signals in flies with previous sexual experience. In some cases, manipulations selectively affect LMD or SMD. It would be useful to discuss these differences and consider the mechanistic implications of these differential changes, when they all result in decreased mating duration. This could help to clarify the big picture of the manuscript.

         __Answer:__ We sincerely appreciate the reviewer's insightful suggestions regarding the potential mechanistic underpinnings of how differential calcium activities may modulate LMD and SMD behaviors. In response to this valuable input, we have expanded our discussion to include a hypothesis on how neuropeptide relays could potentially induce context-dependent modulation of synaptic changes and calcium activities within distinct neuronal subsets. This addition aims to provide a more comprehensive understanding of the complex interactions at play, as detailed in the revised manuscript.
      

      "Employing two distinct yet comparable models of interval timing behavior, LMD and SMD, we demonstrated that differential SIFa to SIFaR signaling is capable of modulating context-dependent behavioral responses. Synaptic strengths between SIFa and SIFaR neurons was notably enhanced in group-reared naive males. However, these synaptic strengths specifically diminished in the OL, CB, and AG when males were singly reared, with a particular decrease in the AG region when males were sexually experienced (Fig. 4A-J). Intriguingly, overall calcium signaling within SIFaR24F06 neurons was significantly reduced in group-reared naive males, yet these signals surged dramatically in the OL with social isolation and in the AG with sexual experience (Fig. 4K-T). These calcium signals, as reported by the transcriptional calcium reporter CaLexA, were corroborated by GCaMP live imaging in both the AG and OL regions (Fig. 6L-O and Fig. S6N-P), indicating a close association between elevated calcium levels and LMD and SMD behaviors. The modulation of context-dependent synaptic plasticity and calcium dynamics by the SIFa neuropeptide through a single SIFaR receptor raises the question of how a single receptor can elicit such diverse responses. Recent neuroscientific studies in Drosophila have shown that individual neurons can produce multiple neurotransmitters and that neuropeptides are often colocalized with small molecule neurotransmitters (Nässel 2018,Deng 2019,Croset 2018,Kondo 2020). Consistent with this, we have previously reported that SIFa neurons utilize a variety of neurotransmitters, including glutamate, dopamine, and tyramine (Kim 2024). Therefore, we propose that the SIFa-SIFaR-Crz-CrzR neuropeptidergic relay circuitry may interact with different neurotransmitters in distinct neuronal subpopulations to regulate context-dependent behaviors. Supporting this hypothesis, glutamate, known to function as an inhibitory neurotransmitter in the olfactory pathway of Drosophila (Liu 2013), may be one such candidate. We speculate that neuropeptide cotransmission could underlie the mechanisms facilitating these complex, context-dependent behavioral patterns. Further research is warranted to elucidate how such cotransmission contributes to the intricate behavioral repertoire of the fly."

      Minor Comments: Comment 8. For CaLexA experiments (eg Fig 7A-D), signal intensity should be quantified in addition to area covered. Increased intensity would indicate greater calcium activity within a particular set of neurons.

      • *Answer: We appreciate the reviewer's insightful comments and acknowledge the importance of using intensity measurements in our analysis of CaLexA signals. We concur that the intensity of these signals is indeed correlated with the area measurements, which is a critical factor to consider. In response to the reviewer's valuable suggestion, we have revised our approach and now present our data based on intensity measurements. These have been incorporated as a primary dataset in all our CaLexA results to provide a more accurate representation of our findings. Additionally, we have updated the labeling of our Y-axis to "Norm. GFP Int.", which stands for "normalized GFP intensity". This change ensures clarity and consistency in the presentation of our data, aligning with the reviewer's recommendations and enhancing the overall quality of our manuscript.

      Comment 9. In Figure 5K: quantification of cell overlap is missing. In the text they state that there are ~100 neurons that co-express SIFaR24F06 and Crz. How was this determined? Is there a graph or numerical summary of this assertion?

          __Answer:__ We sincerely thank the reviewer for pointing out the oversight in our initial submission regarding the quantification data. In response to this valuable feedback, we have now included the quantification of neurons co-expressing SIFaR24F06 and Crz in the optic lobe (OL) within Figure 6E. This addition ensures that the figure is complete and provides the necessary numerical support for our observations.
      

      Comment 10. In lines 709-711: "Our experience suggests that the relative mating duration differences between naïve and experienced condition and singly reared are always consistent; however, both absolute values and the magnitude of the difference in each strain can vary. So, we always include internal controls for each treatment as suggested by previous studies." I had trouble understanding this section of methods. What is done with the data from the internal controls?

         __Answer:__ We appreciate the reviewer's attention to the methodology of our study, particularly regarding the use of internal controls in our mating duration assays. As referenced in our cited work by Bretman et al. (2011) (Bretman *et al*, 2011), our internal control strategy involves a comparison of mating durations between males that have been presented with specific sensory cues and those that have not. This approach includes assessing both males that have been exposed to signals and those that have not, which serves as an internal control for each experimental setup. The purpose of this design is to isolate the effects of our manipulations from other potential confounding factors. In response to the reviewer's comments, we have provided a more detailed description of our mating duration assay in the Methods section. We have also expanded our explanation to clarify how this internal control mechanism ensures that any observed differences in mating duration are attributable to the experimental manipulations and not to extraneous variables. This additional information should provide a clearer understanding of our methodology and the rationale behind our experimental design.
      

      Comment 11. Could the authors comment on why the brain GRASP signal is so different in Figures 3A and 4A? I realize that different versions of GRASP were used in these experiments, but I would expect broad agreement between the different approaches.

         __Answer: __We appreciate the reviewer’s insight. GRASP and t-GRASP are similar technologies that can clearly show the synaptic connection between neurons. GRASP technology was first generated and performed in *C. elegens* (Feinberg *et al*, 2008b). In 2018, the researchers developed a targeted GFP Reconstitution Across Synaptic Partners method, t-GRASP, which resulted in a strong preferential GRASP signal in synaptic regions.
      
         In our study, we utilized both techniques because of the limitations of the chromosomes where GAL4 and lexA lines located. We also found that during data processing, our method could clearly distinguish the changes in GRASP and t-GRASP signals across three different conditions (naïve, single, and exp.). Therefore, we do not have a particular preference for one technique over the other; both methods are applicable to our experiment.
      
         The genotype we used in Figure 3A is *SIFa2A-lexA, GAL424F06; lexAop-nSyb-spGFP1-10, UAS-CD4-spGFP11*, where the synaptic transmission occurs from *SIFa2A-lexA *to* GAL424F06*. In Figure 4A, the genotype we used is *GAL4SIFa.PT, lexASIFaR-2A; lexAop-2-post-t-GRASP, UAS-pre-t-GRASP*, where the synaptic transmission occurs from SIFa. PT to SIFaR-2A.
      
         In our back-to-back submission paper, “Peptidergic neurons with extensive branching orchestrate the internal states and energy balance of male *Drosophila melanogaster,*” (Kim *et al*, 2024) we identified that *SIFa2A* can label posterior-ventral SIFa neurons (SIFaVP), which can only project to ellipsoid body and fan-shaped body. Combining the GRASP technique, Figure 3A cannot show a strong signal as in Figure 4A. We’ve shown in Figure 1G that *SIFaR-2A *covers almost the whole CNS in *Drosophila*. Thus, the synaptic transmission from SIFa. PT (label 4 SIFa neurons) to SIFaR-2A shows a strong signal under the use of the t-GRASP technique. In this case, the GRASP signals in Figure 3A and Figure 4A are so different because of the usage of different GRASP techniques and different fly lines. We appreciate the reviewer's attention to the clarity of our presentation. In response to the comments, we have taken the opportunity to meticulously revise the figure legends to ensure that the differences are explicitly highlighted and easily understood by the readers.
      

      __ __


      Reviewer #2

      Major concerns: Comment 1.* It is highly interesting that the duration of mating behavior is dependent on external and motivational factors. In fact, that provides an elegant way to study which neuronal mechanisms orchestrate these factors. However, it remains elusive why the authors link the differentially motivated durations of mating behavior to the psychological concept of interval timing. This distracts from the actually interesting neurobiology, and is not necessary to make the study interesting. *

      * * Answer: We are grateful for the opportunity provided by the reviewer to elaborate on our rationale for utilizing the mating duration of male fruit flies as an exemplary genetic model for studying interval timing. At the outset, we would like to acknowledge that mating duration has gained recognition as a valuable genetic model for interval timing, as evidenced by the NIH-NIGMS R01 grant awarded to Michael Crickmore. This grant, which can be reviewed at the provided link (https://grantome.com/grant/NIH/R01-GM134222-01), underscores the significance of this model. Crickmore and colleagues have described in the grant's abstract that "mating duration in Drosophila offers a powerful system for exploring changes in motivation over time as behavioral goals are achieved," and it has the potential to provide "the first mechanistic description of a neuronal interval timing system."

         In light of this, we have incorporated our rationale into the INTRODUCTION section of our manuscript, as detailed below. We believe that our argumentation, supported by the grant's emphasis on the topic, will not only address the reviewer's concerns but also demonstrate to the broader scientific community the significance of the fruit fly's mating duration as a model for interval timing. This concept has been a cornerstone in the historical development of neuroscientific understanding of time perception. We hope that our expanded discussion will effectively convey the potential of the fruit fly mating duration as a genetic model to offer profound insights into the neural mechanisms underlying interval timing, a concept of enduring importance in the field of neuroscience.
      

      "The dimension of time is the fundamental basis for an animal's survival. Being able to estimate and control the time between events is crucial for all everyday activities (RICHELLE & LEJEUNE, 1980). The perception of time in the seconds-to-hours range, referred to as ‘interval timing’, is involved in foraging, decision making, and learning via activation of cortico-striatal circuits in mammals (Golombek et al, 2014). Interval timing requires entirely different neural mechanisms from millisecond or circadian timing (Meck et al, 2012; Merchant et al, 2012; Buhusi & Meck, 2005). There is abundant psychological research on time perception because it is a universal cognitive dimension of experience and behavioral plasticity. Despite decades of research, the genetic and neural substrates of temporal information processing have not been well established except for the molecular bases of circadian timing (Buhusi et al, 2009; Tucci et al, 2014). Thus, a simple genetic model system to study interval timing is required. Considering that the mating duration in fruit flies, which averages approximately 20 minutes, is well within the range addressed by interval timing mechanisms, this behavioral parameter provides a relevant context for examining the neural circuits that modulate the Drosophila's perception of time intervals. Such an investigation necessitates an understanding of the extensive neural and behavioral plasticity underlying interval timing (Thornquist et al, 2020; Gautham et al, 2024; Crickmore & Vosshall, 2013)."

      Comment 2.* In figure 4 A and 4K, fluorescence microscopy images of brains and ventral nerve chords are shown, one illustrating GRASP experiments, and one showing CaLexA experiments. The extreme difference between the differentially treated flies (bright fluorescence versus almost no fluorescence) is - in its drastic form- surprising. Online access to the original confocal microscopy images (raw data) might help to convince the reader that these illustrations do not reflect the most drastic "representative" examples out of a series of brain stainings. *

      * * Answer: We sincerely appreciate the reviewer's thoughtful suggestion to enhance the accessibility of our microscopy images for readers who may be interested. In response to this valuable feedback, we have compiled all of our quantified image files into zip format and included them as Supplementary Information 2 and 3. We believe that this additional material will be beneficial for readers seeking a more in-depth view of our data.

      Comment 3. In particular for behavioral experiments, genetic controls should always be conducted. That is, both the heterozygous Gal4-line as well as the heterozygous UAS-line should be used as controls. This is laborious, but important.

         __Answer:__ We sincerely appreciate the reviewer's critical feedback regarding the genetic controls in our study. We acknowledge the importance of this aspect and wish to clarify that we have indeed conducted a substantial number of genetic control experiments for both LMD and SMD behaviors. It is worth noting that much of this data has been previously published in other works. Recognizing the interest from another reviewer on the same topic, we have chosen to reiterate our response here for clarity and convenience. Our comprehensive approach to genetic controls ensures the robustness of our findings, and we believe that the published data further substantiates the reliability of our experimental procedures.
      
         We sincerely thank the reviewer for insightful comments regarding the absence of traditional genetic controls in our study of LMD and SMD behaviors. We acknowledge the importance of such controls and wish to clarify our rationale for not including them in the current investigation. The primary reason for not incorporating all genetic control lines is that we have previously assessed the LMD and SMD behaviors of GAL4/+ and UAS/+ strains in our earlier studies. Our past experiences have consistently shown that 100% of the genetic control flies for both GAL4 and UAS exhibit normal LMD and SMD behaviors. Given these findings, we deemed the inclusion of additional genetic controls to be non-essential for the present study, particularly in the context of extensive screening efforts. However, in accordance with the reviewer's recommendation, we conducted genetic validation experiments on novel genetic crosses, including SIFaR-RNAi/+, CrzR-RNAi/+, and GAL4NP5270/+, and incorporated the results in the supplementary figures (Supplementary information 1). We have made the necessary modifications and additions to the manuscript as below.
      

      "Given those genetic controls, as evidenced by consistent exhibition of normal LMD and SMD behaviors (Supplemental information. 1), the observed reduction in SIFaR expression, driven by elavc155, is deemed sufficient to induce significant disruptions in LMD and SMD behaviors."

      We understand the value of providing a clear rationale for our methodology choices. To this end, we have added a detailed explanation in the "MATERIALS AND METHODS" section and the figure legends of Figure 1. This clarification aims to assist readers in understanding our decision to omit traditional controls, as outlined below.

      "__Mating Duration Assays for Successful Copulation__The mating duration assay in this study has been reported (Kim et al. 2012; Kim et al. 2013; Lee et al. 2023). To enhance the efficiency of the mating duration assay, we utilized the Df(1)Exel6234 (DF here after) genetic modified fly line in this study, which harbors a deletion of a specific genomic region that includes the sex peptide receptor (SPR) (Parks et al. 2004; Yapici et al. 2008). Previous studies have demonstrated that virgin females of this line exhibit increased receptivity to males (Yapici et al. 2008). We conducted a comparative analysis between the virgin females of this line and the CS virgin females and found that both groups induced SMD. Consequently, we have elected to employ virgin females from this modified line in all subsequent studies. For group reared (naïve) males, 40 males from the same strain were placed into a vial with food for 5 days. For single reared males, males of the same strain were collected individually and placed into vials with food for 5 days. For experienced males, 40 males from the same strain were placed into a vial with food for 4 days then 80 DF virgin females were introduced into vials for last 1 day before assay. 40 DF virgin females were collected from bottles and placed into a vial for 5 days. These females provide both sexually experienced partners and mating partners for mating duration assays. At the fifth day after eclosion, males of the appropriate strain and DF virgin females were mildly anaesthetized by CO2. After placing a single female into the mating chamber, we inserted a transparent film then placed a single male to the other side of the film in each chamber. After allowing for 1 h of recovery in the mating chamber in 25℃ incubators, we removed the transparent film and recorded the mating activities. Only those males that succeeded to mate within 1 h were included for analyses. Initiation and completion of copulation were recorded with an accuracy of 10 sec, and total mating duration was calculated for each couple. Genetic controls with GAL4/+ or UAS/+ lines were omitted from supplementary figures, as prior data confirm their consistent exhibition of normal LMD and SMD behaviors (Kim et al. 2012; Kim et al. 2013; Lee et al. 2023; Huang et al. 2024; Zhang et al. 2024). Hence, genetic controls for LMD and SMD behaviors were incorporated exclusively when assessing novel fly strains that had not previously been examined. In essence, internal controls were predominantly employed in the experiments, as LMD and SMD behaviors exhibit enhanced statistical significance when internally controlled. Within the LMD assay, both group and single conditions function reciprocally as internal controls. A significant distinction between the naïve and single conditions implies that the experimental manipulation does not affect LMD. Conversely, the lack of a significant discrepancy suggests that the manipulation does influence LMD. In the context of SMD experiments, the naïve condition (equivalent to the group condition in the LMD assay) and sexually experienced males act as mutual internal controls for one another. A statistically significant divergence between naïve and experienced males indicates that the experimental procedure does not alter SMD. Conversely, the absence of a statistically significant difference suggests that the manipulation does impact SMD. Hence, we incorporated supplementary genetic control experiments solely if they deemed indispensable for testing. All assays were performed from noon to 4 PM. We conducted blinded studies for every test."

      Minor comments: Comment 4.* Line 75: word missing ("...including FEEDING-RELATED BEHAVIOR, courtship, ..."). *

         __Answer:__ We appreciate your vigilance in identifying this error. We have made the necessary correction to ensure the accuracy of our manuscript.*
      

      *

      Comment 5.* Line 120: word missing ("SIFaR expression in adult neurons BUT not glia..."). *

         __Answer:__ We appreciate your careful review and attention to detail. Thank you for bringing this to our notice. We have made the necessary corrections to address the error.*
      

      *

      Comment 6.* I find the figures often to be quite overloaded, and anatomical details often very small (e.g., figure 7A). *

      * * Answer: We appreciate the constructive critique on the layout of our data presentation. Following your insightful recommendation, we have revised the manuscript to enhance clarity. Specifically, we have resized the diagram to be more compact and have also increased the spacing between the panels for better readability.


      __ __


      Reviewer #3

      Major Comments Comment 1.* Are the key conclusions convincing? The key conclusions are intriguing but require more robust data to be fully convincing. While the study presents compelling evidence for the involvement of SIFa and SIFaR in mating behaviors, additional experiments are needed to firmly establish the proposed mechanisms. *

      * * Answer: We are deeply grateful for the insightful and constructive feedback provided by the reviewer on the SIFa-to-SIFaR signaling pathway. We are particularly encouraged by the reviewer's agreement with our findings that support the role of SIFa and SIFaR in regulating mating duration. We concur with the reviewer's suggestion that additional experiments and mechanistic insights are essential to substantiate our conclusions. To this end, we have conducted and included several new experiments, particularly GCaMP data, in the main figures (Figure 6 and S6). Our focus has been intensified on the SIFa-to-Crz signaling, given Crz's established role in controlling mating duration behavior. Below is a summary of the additional experiments we have incorporated:

      1. We have repositioned the SIFa-Crz GCaMP data related to the VNC to Figures 6L-O to ensure that the main text highlights our primary findings.
      2. Our more detailed analysis has identified two cells in the Super Intermediate Protocerebrum (SIP) regions that co-express Crz and SIFaR24F06, along with OL cells (Figure 6D-E).
      3. To provide a complete view of the signaling dynamics, we have included GCaMP data from the brain region in the main figure (Figure 6P-R and Supplementary Figure S6N-P).
      4. Through GCaMP calcium imaging to assess SIFa-to-Crz signaling, we found that calcium levels in Crz+/SIFaR+ SIP neurons consistently decreased with SIFa activation (Figure 6P-R). Conversely, calcium signals in Crz+/SIFaR+ OL neurons increased with SIFa activation, mirroring the pattern seen in Crz+ AG neurons in the VNC (Figure 6M-O and Supplementary Figure S6N-P).
      5. A synthesis of these results is presented in Figure 6S, and we have elaborated on these findings in the manuscript with a detailed description, as detailed below. "To elucidate the direct response of Crz neurons to the activity of SIFa neurons, we conducted live calcium (Ca2+) imaging in the Super Intermediate Protocererbrum (SIP), OL and AG region of the VNC, where Crz neurons are situated (Fig. 6D, Fig. S6M). Upon optogenetic stimulation of SIFa neurons, we observed a significant increase in the activity of Crz in OL and AG region (Fig. 6L-O, Fig. S6N-P), evidenced by a sustained elevation in intracellular Ca2+ levels that persisted in a high level before gradually declining to baseline levels, where the cells in top region of the SIP exhibit consistently drop down after stimulated the SIFa neurons (Fig. 6P-R). These calcium level changes were in contrast to the control group (without all-trans retinal, ATR) (Fig. 6L-R, Fig. S6N-P). These findings confirm that Crz neurons in OL and AG are activated in response to SIFa neuronal activity, but the activity of Crz neurons in SIP are inhibited by the activition of SIFa neuron, reinforcing their role as postsynaptic effectors in the neural circuitry governed by SIFa neurons. Moreover, these results provide empirical support for the hypothesis that SIFa-SIFaR/Crz-CrzR long-range neuropeptide relay underlies the neuronal activity-based measurement of interval timing."

        We are truly grateful for the reviewer's perceptive recommendations concerning the possible mechanisms of LMD and SMD behaviors. In light of this constructive feedback, we have enhanced our discussion to encompass a theoretical framework on the potential role of neuropeptide relays in mediating context-dependent adjustments of synaptic plasticity and calcium signaling within specific neuronal populations. This supplementary perspective is designed to elucidate the intricate dynamics involved, as further elaborated in the updated manuscript.

      "Employing two distinct yet comparable models of interval timing behavior, LMD and SMD, we demonstrated that differential SIFa to SIFaR signaling is capable of modulating context-dependent behavioral responses. Synaptic strengths between SIFa and SIFaR neurons was notably enhanced in group-reared naive males. However, these synaptic strengths specifically diminished in the OL, CB, and AG when males were singly reared, with a particular decrease in the AG region when males were sexually experienced (Fig. 4A-J). Intriguingly, overall calcium signaling within SIFaR24F06 neurons was significantly reduced in group-reared naive males, yet these signals surged dramatically in the OL with social isolation and in the AG with sexual experience (Fig. 4K-T). These calcium signals, as reported by the transcriptional calcium reporter CaLexA, were corroborated by GCaMP live imaging in both the AG and OL regions (Fig. 6L-O and Fig. S6N-P), indicating a close association between elevated calcium levels and LMD and SMD behaviors. The modulation of context-dependent synaptic plasticity and calcium dynamics by the SIFa neuropeptide through a single SIFaR receptor raises the question of how a single receptor can elicit such diverse responses. Recent neuroscientific studies in Drosophila have shown that individual neurons can produce multiple neurotransmitters and that neuropeptides are often colocalized with small molecule neurotransmitters (Nässel 2018,Deng 2019,Croset 2018,Kondo 2020). Consistent with this, we have previously reported that SIFa neurons utilize a variety of neurotransmitters, including glutamate, dopamine, and tyramine (Kim 2024). Therefore, we propose that the SIFa-SIFaR-Crz-CrzR neuropeptidergic relay circuitry may interact with different neurotransmitters in distinct neuronal subpopulations to regulate context-dependent behaviors. Supporting this hypothesis, glutamate, known to function as an inhibitory neurotransmitter in the olfactory pathway of Drosophila (Liu 2013), may be one such candidate. We speculate that neuropeptide cotransmission could underlie the mechanisms facilitating these complex, context-dependent behavioral patterns. Further research is warranted to elucidate how such cotransmission contributes to the intricate behavioral repertoire of the fly."

      • *

      Comment 2. Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? The authors should qualify certain claims as preliminary or speculative. Specifically, the proposed SIFa-SIFaR/Crz-CrzR neuropeptide relay pathway is only investigated via imaging approach. More experiments using behavioral tests are needed to confirm that Crz relays the SIFa signaling pathway. For example, Crz-Gal4>UAS-SIFaR RNAi should be done to show that SIFaR+ Crz+ cells are necessary for LMD and SMD.

         __Answer:__ We are grateful for the reviewer's constructive suggestion regarding the need to provide additional behavioral assays using RNAi knockdown to substantiate the SIFa-SIFaR/Crz-CrzR neuropeptide relay. Following the reviewer's advice, we have conducted experiments involving SIFaR24F06/Crz-RNAi and Crz-GAL4/SIFaR-RNAi. The outcomes of these experiments have been detailed and are now presented in a clear and comprehensive manner.
      
         To further aid in the understanding of our results, we have also included a summary diagram in Figure 6S, which illustrates the key findings from these assays. This visual representation is intended to provide a concise overview of the data and to highlight the significance of the SIFa-SIFaR/Crz-CrzR neuropeptide relay in the context of our study.
      

      Comment 3.* Would additional experiments be essential to support the claims of the paper? Yes, additional experiments are essential. Detailed molecular and imaging studies are needed to support claims about synaptic reorganization. For example: ○ More controls are needed for RNAi and Gal80ts experiments, such as Gal4-only control, RNAi-only control, etc. *

         __Answer:__ We sincerely appreciate the reviewer's critical feedback regarding the genetic controls in our study. We acknowledge the importance of this aspect and wish to clarify that we have indeed conducted a substantial number of genetic control experiments for both LMD and SMD behaviors. It is worth noting that much of this data has been previously published in other works. Recognizing the interest from another reviewer on the same topic, we have chosen to reiterate our response here for clarity and convenience. Our comprehensive approach to genetic controls ensures the robustness of our findings, and we believe that the published data further substantiates the reliability of our experimental procedures.
      
         We sincerely thank the reviewer for insightful comments regarding the absence of traditional genetic controls in our study of LMD and SMD behaviors. We acknowledge the importance of such controls and wish to clarify our rationale for not including them in the current investigation. The primary reason for not incorporating all genetic control lines is that we have previously assessed the LMD and SMD behaviors of GAL4/+ and UAS/+ strains in our earlier studies. Our past experiences have consistently shown that 100% of the genetic control flies for both GAL4 and UAS exhibit normal LMD and SMD behaviors. Given these findings, we deemed the inclusion of additional genetic controls to be non-essential for the present study, particularly in the context of extensive screening efforts. However, in accordance with the reviewer's recommendation, we conducted genetic validation experiments on novel genetic crosses, including SIFaR-RNAi/+, CrzR-RNAi/+, and GAL4NP5270/+, and incorporated the results in the supplementary figures (Supplementary information 1). We have made the necessary modifications and additions to the manuscript as below.
      

      "Given those genetic controls, as evidenced by consistent exhibition of normal LMD and SMD behaviors (Supplemental information 1), the observed reduction in SIFaR expression, driven by elavc155, is deemed sufficient to induce significant disruptions in LMD and SMD behaviors."

      We understand the value of providing a clear rationale for our methodology choices. To this end, we have added a detailed explanation in the "MATERIALS AND METHODS" section and the figure legends of Figure 1. This clarification aims to assist readers in understanding our decision to omit traditional controls, as outlined below.

      "__Mating Duration Assays for Successful Copulation__The mating duration assay in this study has been reported (Kim et al. 2012; Kim et al. 2013; Lee et al. 2023). To enhance the efficiency of the mating duration assay, we utilized the Df(1)Exel6234 (DF here after) genetic modified fly line in this study, which harbors a deletion of a specific genomic region that includes the sex peptide receptor (SPR) (Parks et al. 2004; Yapici et al. 2008). Previous studies have demonstrated that virgin females of this line exhibit increased receptivity to males (Yapici et al. 2008). We conducted a comparative analysis between the virgin females of this line and the CS virgin females and found that both groups induced SMD. Consequently, we have elected to employ virgin females from this modified line in all subsequent studies. For group reared (naïve) males, 40 males from the same strain were placed into a vial with food for 5 days. For single reared males, males of the same strain were collected individually and placed into vials with food for 5 days. For experienced males, 40 males from the same strain were placed into a vial with food for 4 days then 80 DF virgin females were introduced into vials for last 1 day before assay. 40 DF virgin females were collected from bottles and placed into a vial for 5 days. These females provide both sexually experienced partners and mating partners for mating duration assays. At the fifth day after eclosion, males of the appropriate strain and DF virgin females were mildly anaesthetized by CO2. After placing a single female into the mating chamber, we inserted a transparent film then placed a single male to the other side of the film in each chamber. After allowing for 1 h of recovery in the mating chamber in 25℃ incubators, we removed the transparent film and recorded the mating activities. Only those males that succeeded to mate within 1 h were included for analyses. Initiation and completion of copulation were recorded with an accuracy of 10 sec, and total mating duration was calculated for each couple. Genetic controls with GAL4/+ or UAS/+ lines were omitted from supplementary figures, as prior data confirm their consistent exhibition of normal LMD and SMD behaviors (Kim et al. 2012; Kim et al. 2013; Lee et al. 2023; Huang et al. 2024; Zhang et al. 2024). Hence, genetic controls for LMD and SMD behaviors were incorporated exclusively when assessing novel fly strains that had not previously been examined. In essence, internal controls were predominantly employed in the experiments, as LMD and SMD behaviors exhibit enhanced statistical significance when internally controlled. Within the LMD assay, both group and single conditions function reciprocally as internal controls. A significant distinction between the naïve and single conditions implies that the experimental manipulation does not affect LMD. Conversely, the lack of a significant discrepancy suggests that the manipulation does influence LMD. In the context of SMD experiments, the naïve condition (equivalent to the group condition in the LMD assay) and sexually experienced males act as mutual internal controls for one another. A statistically significant divergence between naïve and experienced males indicates that the experimental procedure does not alter SMD. Conversely, the absence of a statistically significant difference suggests that the manipulation does impact SMD. Hence, we incorporated supplementary genetic control experiments solely if they deemed indispensable for testing. All assays were performed from noon to 4 PM. We conducted blinded studies for every test."

      *○ Using synaptic markers and high-resolution imaging to observe synaptic changes directly. *

         __Answer:__ We sincerely appreciate the reviewer's constructive suggestion to provide high-resolution imaging for a more direct observation of synaptic changes. While we have already included high-resolution imaging data showcasing postsynaptic and presynaptic alterations using Denmark and syt.eGFP (Figure S3), GRASP (Figure 3A-D), and tGRASP (Figure 4A-J), we recognize the value of further elucidation. Consequently, we have conducted additional experiments to examine the presynaptic changes in SIFaR24F06 neurons under varying social contexts, as presented in Figure 5A-G. We are confident that the comprehensive dataset we have now provided, which includes these new findings, will not only address the reviewer's concerns but also effectively convey to the readers the dynamic and critical nature of SIFa-SIFaR synaptic changes in modulating interval timing behaviors.
      

      *○ Electrophysiological recordings from neurons expressing SIFa and SIFaR to analyze their functional connectivity and activity patterns. *

         __Answer:__ We sincerely appreciate the reviewer's constructive suggestions regarding the inclusion of electrophysiological recordings from neurons expressing SIFa and SIFaR to analyze functional connectivity and activity patterns. In response to this valuable feedback, we have conducted *in vivo* calcium imaging using the GCaMP indicator. The results have been incorporated into our manuscript, demonstrating SIFa-SIFaR connectivity and alterations in activity patterns (Figure 5H-L), as well as SIFa-Crz connectivity and changes in activity patterns (Figure 6 and Figure S6). We are confident that these additional data provide compelling evidence supporting the notion that the SIFa-SIFaR/Crz-CrzR neuropeptide relay circuits are robustly interconnected and exhibit activity changes in concert with the observed neuronal modifications.
      
      • *

      Comment 4.* Are the suggested experiments realistic in terms of time and resources? The suggested experiments are realistic but will require considerable time and resources. Detailed molecular interaction studies, imaging synaptic plasticity, and electrophysiological recordings could take several months to over a year, depending on the complexity and availability of necessary equipment and expertise. The cost would be moderate to high, involving expenses for reagents, imaging equipment, and animal husbandry for maintaining Drosophila stocks. *

      * * Answer: We are grateful for the reviewers' understanding and support for our additional analysis in the revision experiments. While we have already conducted a multitude of experiments pertinent to this manuscript, we are well-positioned to provide a comprehensive revision of the data within a relatively short timeframe.

      Comment 5. Are the data and the methods presented in such a way that they can be reproduced? The methods are generally described in detail, allowing for potential reproducibility. However, more precise documentation of certain experimental conditions, such as the timing and conditions of RNAi induction and temperature controls, is necessary. The methods about imaging analysis are too detailed. The exact steps about how to use ImageJ should be removed.

      * * Answer: We sincerely appreciate the reviewer's meticulous comments regarding the omission of certain methodological details in our manuscript. In response, we have now included a detailed description of the temperature control procedures for conditional RNAi induction in the "Fly Stocks and Husbandry" section, as detailed below.

      "For temperature-controlled experiments, including those utilizing the temperature-sensitive tub-GAL80ts driver, the flies were initially crossed and maintained at a constant temperature of 22℃ within an incubator. The temperature shift was initiated post-eclosion. Once the flies had emerged, they were transferred to an incubator set at an elevated temperature of 29℃ for a defined period, after which the experimental protocols were carried out. Wild-type flies were Canton-S (CS)."

         We appreciate the reviewer's guidance on refining our manuscript. In response to the suggestion, we have streamlined the image analysis methods section, removing excessive details to present the information in a more concise and clear manner as below.
      

      "Quantitative analysis of fluorescence intensity

      To ascertain calcium levels and synaptic intensity from microscopic images, we dissected and imaged five-day-old flies of various social conditions and genotypes under uniform conditions. The GFP signal in the brains and VNCs was amplified through immunostaining with chicken anti-GFP primary antibody. Image analysis was conducted using ImageJ software. For the quantification of fluorescence intensities, an investigator, blinded to the fly's genotype, thresholded the sum of all pixel intensities within a sub-stack to optimize the signal-to-noise ratio, following established methods (Feinberg 2008). The total fluorescent area or region of interest (ROI) was then quantified using ImageJ, as previously reported. For CaLexA signal quantification, we adhered to protocols detailed by Kayser et al. (Kayser et al, 2014), which involve measuring the ROI's GFP-labeled area by summing pixel values across the image stack. This method assumes that changes in the GFP-labeled area are indicative of alterations in the CaLexA signal, reflecting synaptic activity. ROI intensities were background-corrected by measuring and subtracting the fluorescent intensity from a non-specific adjacent area, as per Kayser et al. (Kayser et al, 2014). For the analysis of GRASP or tGRASP signals, a sub-stack encompassing all synaptic puncta was thresholded by a genotype-blinded investigator to achieve the optimal signal-to-noise ratio. The fluorescence area or ROI for each region was quantified using ImageJ, employing a similar approach to that used for CaLexA quantification (Feinberg 2008)."

      Comment 6. Are the experiments adequately replicated and statistical analysis adequate? Most figures in the manuscript need to be re-plotted. The right y-axis "Difference between means" is not necessary, if not confusing. The image panels are too small to see, while the quantification of overlapping cells are unnecessarily large. The figures are too crowded with labels and texts, which makes it extremely difficult to comprehend the data.

         __Answer:__ We appreciate the reviewer's suggestion to refine our figures, and we have indeed reformatted them to provide clearer presentation and improved readability. Regarding the removal of dot blot membranes (DBMs), we have given this considerable thought. While we understand the recommendation, we have chosen to retain the DBMs in our manuscript. Our decision is based on the fact that our analysis encompasses not only traditional t-tests but also incorporates estimation statistics, which have been demonstrated to be effective for biological data analysis (Claridge-Chang & Assam, 2016). The inclusion of DBMs is essential for the accurate interpretation of these estimation statistics, ensuring a comprehensive representation of our findings.
      

      Minor Comments Comment 7. Specific experimental issues that are easily addressable. Clarify the timing of RNAi induction and provide more detailed figure legends for better understanding and reproducibility.

         __Answer:__ We sincerely appreciate the reviewer's suggestion aimed at enhancing our manuscript. As previously addressed in our response to __*Comment 5*__, we have incorporated additional details regarding the timing of RNAi induction within the Methods section. Furthermore, we have expanded upon the figure legends to provide a clearer understanding of our findings, ensuring that the content is accessible to a broader readership.
      

      * Comment 8. Are prior studies referenced appropriately? Yes. *

      __ Answer:__ We are grateful for the reviewer's acknowledgment that our references have been appropriately included and integrated into the manuscript.

      * Comment 9. Are the text and figures clear and accurate? The text is generally clear, but the figures need re-work. See comment above. *

      • *Answer: We appreciate the feedback from the reviewers regarding the clarity of our figures. In response to other reviewers' concerns about the figures appearing too crowded, we have carefully revised the layout of all figures to ensure they are more spacious and aesthetically improved for better readability and visual appeal.

      * Comment 10. Suggestions to improve the presentation of data and conclusions. Use smaller fonts in the bar plots and make the plots smaller. Enlarge the imaging panels and let the pictures tell the story. *

      * * Answer: We sincerely appreciate the reviewer's constructive suggestion. In response, we have revised the figures by enlarging the images and adjusting the font sizes in the bar plots to enhance readability and clarity.

      REFERENCES

      Andersen JL, MacMillan HA & Overgaard J (2015) Temperate Drosophila preserve cardiac function at low temperature. J Insect Physiol 77: 26–32

      Bretman A, Westmancoat JD, Gage MJG & Chapman T (2011) Males Use Multiple, Redundant Cues to Detect Mating Rivals. Curr Biol 21: 617–622

      Buhusi CV, Aziz D, Winslow D, Carter RE, Swearingen JE & Buhusi MC (2009) Interval Timing Accuracy and Scalar Timing in C57BL/6 Mice. Behav Neurosci 123: 1102–1113

      Buhusi CV & Meck WH (2005) What makes us tick? Functional and neural mechanisms of interval timing. Nat Rev Neurosci 6: 755–765

      Claridge-Chang A & Assam PN (2016) Estimation statistics should replace significance testing. Nat Methods 13: 108–109

      Crickmore MA & Vosshall LB (2013) Opposing Dopaminergic and GABAergic Neurons Control the Duration and Persistence of Copulation in Drosophila. Cell 155: 881–893

      Feinberg EH, VanHoven MK, Bendesky A, Wang G, Fetter RD, Shen K & Bargmann CI (2008a) GFP Reconstitution Across Synaptic Partners (GRASP) Defines Cell Contacts and Synapses in Living Nervous Systems. Neuron 57: 353–363

      Feinberg EH, VanHoven MK, Bendesky A, Wang G, Fetter RD, Shen K & Bargmann CI (2008b) GFP Reconstitution Across Synaptic Partners (GRASP) Defines Cell Contacts and Synapses in Living Nervous Systems. Neuron 57: 353–363

      Gautham AK, Miner LE, Franco MN, Thornquist SC & Crickmore MA (2024) Molecular control of temporal integration matches decision-making to motivational state. bioRxiv: 2024.03.01.582988

      Golombek DA, Bussi IL & Agostino PV (2014) Minutes, days and years: molecular interactions among different scales of biological timing. Philosophical Transactions Royal Soc B Biological Sci 369: 20120465

      Khoshnoud S, Leitritz D, Bozdağ MÇ, Igarzábal FA, Noreika V & Wittmann M (2024) When the heart meets the mind: Exploring the brain-heart interaction during time perception. J Neurosci: e2039232024

      Kim WJ, Jan LY & Jan YN (2012) Contribution of visual and circadian neural circuits to memory for prolonged mating induced by rivals. Nat Neurosci 15: 876–883

      Kim WJ, Jan LY & Jan YN (2013) A PDF/NPF Neuropeptide Signaling Circuitry of Male Drosophila melanogaster Controls Rival-Induced Prolonged Mating. Neuron 80: 1190–1205

      Kim WJ, Song Y, Zhang T, Zhang X, Ryu TH, Wong KC, Wu Z, Wei Y, Schweizer J, Nguyen K-NH, et al (2024) Peptidergic neurons with extensive branching orchestrate the internal states and energy balance of male Drosophila melanogaster. bioRxiv: 2024.06.04.597277

      Lee SG, Sun D, Miao H, Wu Z, Kang C, Saad B, Nguyen K-NH, Guerra-Phalen A, Bui D, Abbas A-H, et al (2023) Taste and pheromonal inputs govern the regulation of time investment for mating by sexual experience in male Drosophila melanogaster. PLOS Genet 19: e1010753

      Matell MS (2014) Neurobiology of Interval Timing. Adv Exp Med Biol: 209–234

      Meck WH, Doyère V & Gruart A (2012) Interval Timing and Time-Based Decision Making. Frontiers Integr Neurosci 6: 13

      Merchant H, Harrington DL & Meck WH (2012) Neural Basis of the Perception and Estimation of Time. Annu Rev Neurosci 36: 313–336

      Nicolaï LJJ, Ramaekers A, Raemaekers T, Drozdzecki A, Mauss AS, Yan J, Landgraf M, Annaert W & Hassan BA (2010) Genetically encoded dendritic marker sheds light on neuronal connectivity in Drosophila. Proc National Acad Sci 107: 20553–20558

      RICHELLE M & LEJEUNE H (1980) Time in Animal Behaviour. Part III: Mech: 188–199

      Tayler TD, Pacheco DA, Hergarden AC, Murthy M & Anderson DJ (2012) A neuropeptide circuit that coordinates sperm transfer and copulation duration in Drosophila. Proc National Acad Sci 109: 20697–20702

      Thornquist SC, Langer K, Zhang SX, Rogulja D & Crickmore MA (2020) CaMKII Measures the Passage of Time to Coordinate Behavior and Motivational State. Neuron 105: 334-345.e9

      Tucci V, Buhusi CV, Gallistel R & Meck WH (2014) Towards an integrated understanding of the biology of timing. Philosophical Transactions Royal Soc B Biological Sci 369: 20120470

      Wong K, Schweizer J, Nguyen K-NH, Atieh S & Kim WJ (2019) Neuropeptide relay between SIFa signaling controls the experience-dependent mating duration of male Drosophila. Biorxiv: 819045

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      The article investigates the role of the neuropeptide SIFa and its receptor SIFaR in regulating two distinct mating duration behaviors in male Drosophila melanogaster, Longer-Mating-Duration (LMD) and Shorter-Mating-Duration (SMD). The study reveals that SIFaR expression in specific neurons is required for both behaviors. It shows that social context and sexual experience lead to synaptic reorganization between SIFa and SIFaR neurons, altering internal brain states. The SIFa-SIFaR/Crz-CrzR neuropeptide relay pathway is essential for generating these behaviors, with Crz neurons responding to SIFa neuron activity. Furthermore, CrzR expression in non-neuronal cells is critical for regulating LMD and SMD behaviors. The study utilizes neuropeptide RNAi screening, chemoconnectome (CCT) knock-in, and genetic intersectional methods to elucidate these findings.

      Major Comments

      • Are the key conclusions convincing? The key conclusions are intriguing but require more robust data to be fully convincing. While the study presents compelling evidence for the involvement of SIFa and SIFaR in mating behaviors, additional experiments are needed to firmly establish the proposed mechanisms.
      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? The authors should qualify certain claims as preliminary or speculative. Specifically, the proposed SIFa-SIFaR/Crz-CrzR neuropeptide relay pathway is only investigated via imaging approach. More experiments using behavioral tests are needed to confirm that Crz relays the SIFa signaling pathway. For example, Crz-Gal4>UAS-SIFaR RNAi should be done to show that SIFaR+ Crz+ cells are necessary for LMD and SMD.
      • Would additional experiments be essential to support the claims of the paper? Yes, additional experiments are essential. Detailed molecular and imaging studies are needed to support claims about synaptic reorganization. For example:
        • More controls are needed for RNAi and Gal80ts experiments, such as Gal4-only control, RNAi-only control, etc.
        • Using synaptic markers and high-resolution imaging to observe synaptic changes directly.
        • Electrophysiological recordings from neurons expressing SIFa and SIFaR to analyze their functional connectivity and activity patterns.
      • Are the suggested experiments realistic in terms of time and resources? The suggested experiments are realistic but will require considerable time and resources. Detailed molecular interaction studies, imaging synaptic plasticity, and electrophysiological recordings could take several months to over a year, depending on the complexity and availability of necessary equipment and expertise. The cost would be moderate to high, involving expenses for reagents, imaging equipment, and animal husbandry for maintaining Drosophila stocks.
      • Are the data and the methods presented in such a way that they can be reproduced? The methods are generally described in detail, allowing for potential reproducibility. However, more precise documentation of certain experimental conditions, such as the timing and conditions of RNAi induction and temperature controls, is necessary. The methods about imaging analysis are too detailed. The exact steps about how to use ImageJ should be removed.
      • Are the experiments adequately replicated and statistical analysis adequate? Most figures in the manuscript need to be re-plotted. The right y-axis "Difference between means" is not necessary, if not confusing. The image panels are too small to see, while the quantification of overlapping cells are unnecessarily large. The figures are too crowded with labels and texts, which makes it extremely difficult to comprehend the data.

      Minor Comments

      • Specific experimental issues that are easily addressable. Clarify the timing of RNAi induction and provide more detailed figure legends for better understanding and reproducibility.
      • Are prior studies referenced appropriately? Yes.
      • Are the text and figures clear and accurate? The text is generally clear, but the figures need re-work. See comment above.
      • Suggestions to improve the presentation of data and conclusions. Use smaller fonts in the bar plots and make the plots smaller. Enlarge the imaging panels and let the pictures tell the story.

      Significance

      Nature and Significance of the Advance

      This study aims to advance understanding of how neuropeptides modulate context-dependent behaviors in Drosophila. It provides novel insights into the role of SIFa and SIFaR in interval timing behaviors, contributing to the broader field of neuropeptide research and behavioral neuroscience. However, the significance of the findings is limited by the preliminary nature of some claims and the need for additional supporting data.

      Context in Existing Literature

      The work builds on previous studies that identified various roles of neuropeptides in behavior modulation but lacked detailed mechanistic insights. By elucidating the SIFa-SIFaR/Crz-CrzR pathway, this study attempts to fill a gap in the literature, but more robust evidence is required to solidify its contributions.

      Interested Audience

      The findings will interest neuroscientists, behavioral biologists, and researchers studying neuropeptides and their roles in behavior and neural circuitry. Additionally, this research may have implications for understanding neuropeptidergic systems in other organisms, making it relevant to a broader audience in the fields of neurobiology and physiology.

      Field of Expertise

      Keywords: Neuropeptides, Drosophila melanogaster, Behavioral Neuroscience. Areas without sufficient expertise: courtship behavior.

      Recommendation

      I recommend a major revision of this manuscript. The study presents intriguing findings, but several key claims are preliminary and require additional experiments for support. The data is poorly presented and the figures can be significantly improved. Detailed molecular and imaging studies, as well as more rigorous statistical analyses, are necessary to strengthen the conclusions. Addressing these concerns will significantly improve the robustness and impact of the paper.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Zhang et al., "Long-range neuropeptide relay as a central-peripheral communication mechanism for the context-dependent modulation of interval timing behaviors".

      The authors investigate mating behavior in male fruit flies, Drosophila melanogaster, and test for a role of the SIFamide receptor (SIFaR) in this type of behavior, in particular mating duration in dependence of social isolation and prior mating experience. The anatomy of SIFamide-releasing neurons in comparison with SIFamide receptor-expressing neurons is characterized in a detail-rich manner. Isolating males or exposing them to mating experience modifies the anatomical organization of SIFamidergic axon termini projecting onto SIFamide receptor-expressing neurons. This structural synaptic plasticity is accompanied by changes in calcium influx. Lastly, it is shown that corazonin-releasing neurons are modulated by SIFamide releasing neurons and impact the duration of mating behavior.

      Overall, this highly interesting study advances our knowledge about the behavioral roles of SIFamide, and contributes to an understanding how motivated behavior such as mating is orchestrated by modulatory peptides. The approach to take the entire organism, including peripheral tissue, into consideration, is very good and a rather unique point. The manuscript has only some points that are less convincing, and these should be addressed.

      Major concerns:

      • It is highly interesting that the duration of mating behavior is dependent on external and motivational factors. In fact, that provides an elegant way to study which neuronal mechanisms orchestrate these factors. However, it remains elusive why the authors link the differentially motivated durations of mating behavior to the psychological concept of interval timing. This distracts from the actually interesting neurobiology, and is not necessary to make the study interesting.
      • In figure 4 A and 4K, fluorescence microscopy images of brains and ventral nerve chords are shown, one illustrating GRASP experiments, and one showing CaLexA experiments. The extreme difference between the differentially treated flies (bright fluorescence versus almost no fluorescence) is - in its drastic form- surprising. Online access to the original confocal microscopy images (raw data) might help to convince the reader that these illustrations do not reflect the most drastic "representative" examples out of a series of brain stainings.
      • In particular for behavioral experiments, genetic controls should always be conducted. That is, both the heterozygous Gal4-line as well as the heterozygous UAS-line should be used as controls. This is laborious, but important.

      Minor comments:

      • Line 75: word missing ("...including FEEDING-RELATED BEHAVIOR, courtship, ...").
      • Line 120: word missing ("SIFaR expression in adult neurons BUT not glia...").
      • I find the figures often to be quite overloaded, and anatomical details often very small (e.g., figure 7A).

      Significance

      Overall, this highly interesting study advances our knowledge about the behavioral roles of SIFamide, and contributes to an understanding how motivated behavior is orchestrated by modulatory peptides. The approach to take the entire organism, including peripheral tissue, into consideration, is very good and a rather unique point.

      Since decades it has been investigated how sensory stimuli are processed and encoded by the brain, and how behavioral actions are executed. Likewise, principles underlying learning and memory, sleep, orentation, circadian rhythms, etc. are subject to intense investigation. However, how motivational factors (sleep pressure, hunger, sexual drive) are actually "encoded", signaled and finally used to orchstrate behavior and guide decision-making is, to a very large degree, unknown - in any species. The model use here (Drosophila and its peptidergic system wit SIFamide as a central hub) represents actually a ideal entry point to study just this question. In this sense, the manuscript is at the forefront of modern, state-of-the-art neurobiology.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This manuscript from Zhang et al. primarily investigates the contribution of the SIFa neuropeptide receptor (SIFaR) to mating duration in male fruit flies. Through RNAi-mediated downregulation, they show that SIFaR receptor is necessary for previous experience to alter mating duration. Using cell-specific knockdown and rescue of the SIFaR receptor, they identify a population of ~400 neurons that could underlie this effect. This is still a large number of cells but is narrowed from the ~1,200 total SIFaR-expressing neurons. They then use the GRASP synaptic labeling technique to show that SIFa+ neurons form synapses onto the relevant SIFaR-expressing population, and that the area of synaptic contact is systematically altered depending on the fly's past mating history. Finally, they provide evidence to argue that SIFa neurons act through SIFaR neurons that release the neuropeptide corazonin to regulate mating duration. Overall, the authors have used an impressive array of techniques in their attempt to define the neural circuits and molecules involved in changing internal state to modify the duration of mating.

      Major Comments:

      1. The authors are to be commended for the sheer quantity of data they have generated, but I was often overwhelmed by the figures, which try to pack too much into the space provided. As a result, it is often unclear what components belong to each panel. Providing more space between each panel would really help.
      2. The use of three independent RNAi lines to knock down SIFaR expression is experimentally solid, as the common phenotype observed with all 3 lines supports the conclusion that the SIFaR is important for mating duration choice. However, the authors have not tested whether these lines effectively reduce SIFaR expression, nor whether the GAL80 constructs used to delimit knockdown are able to effectively do so. This makes it hard to make definitive conclusions with these manipulations, especially in the face of negative results. A lack of complete knockdown is suggested by the fact that the F24F06 driver rescues lethality when used to express SIFaR in the B322 mutant background, but does not itself produce lethality when used to express SIFaR RNAi. The authors should either conduct experiments to determine knockdown efficiency or explicitly acknowledge this limitation in drawing conclusions from their experiments. A similar concern relates to the CrzR knockdown experiments (eg Figure 7).
      3. Most of the behavioral experiments lack traditional controls, for example flies that contain either the GAL4 or UAS elements alone. The authors should explain their decision to omit these control experiments and provide an argument for why they are not necessary to correctly interpret the data. In this vein, the authors have stated in the methods that stocks were outcrossed at least 3x to Canton-S background, but 3 outcrosses is insufficient to fully control for genetic background.
      4. Throughout the manuscript, the authors appear to use a single control condition (sexually naïve flies raised in groups) to compare to both males raised singly and males with previous sexual experience. These control conditions are duplicated in two separate graphs, one for long mating duration and one for short mating duration, but they are given different names (group vs naïve) depending on the graph. If these are actually the same flies, then this should be made clear, and they should be given a consistent name across the different "experiments".
      5. The authors have consistently conflated overlap of neuronal processes with synaptic connections. Claims of synaptic connectivity deriving solely from overlap of processes should be tempered and qualified.
        • For example, they say (Lines 201-202) "These findings suggest that SIFa neurons and GAL424F06-positive neurons form more synapses in the VNC than in the brain." This is misleading. Overlap of24F06-LexA>CD8GFP and SIFa-GAL4>CD8RFP tells us nothing about synapse number, or even whether actual synapses are being formed.
        • Lines 210-211: "The overlap of DenMark and syt.EGFP signals was highly enriched in both SOG and ProNm regions, indicating that these regions are where GAL424F06 neurons form interconnected networks". This is misleading. Overlap of DenMark and syt.EGFP does not indicate synapses (especially since these molecules can be expressed outside the expected neuronal compartment if driven at high enough levels).
        • Lines 320-322: "Neurons expressing Crz exhibit robust synaptic connections with SIFaR24F06 neurons located in the PRW region of the SOG in the brain (panels of Brain and SOG in Fig. 5A)". This is again misleading. They are not actually measuring synapses here, but instead looking at area of overlap between neuronal processes of Crz and SIFaR cells.
        • In Figs 3B and S4A, they are claiming that all neuronal processes within a given delineated brain area are synapses. The virtual fly brain and hemibrain resource have a way to actually identify synapses. This should be used in addition to the neuron skeleton. Otherwise, it is misleading to label these as synapses.
        • Furthermore, measuring the area of GRASP signal is not the same as quantifying synapses. We don't know if synapse number changes (eg in lines 240-242).
      6. In general, the first part of the manuscript (implicating SIFaR in mating duration) is much stronger than the second part, which attempts to demonstrate that SIFa acts through Crz-expressing neurons to induce its effects. The proof that SIFa acts through Crz-expressing neurons to modify mating duration is tenuous. The most direct evidence of this, achieved via knockdown on Crz in SIFaR-expressing cells, is relegated to supplemental figures. The calcium response of the Crz neurons to SIFa neuron activation (Fig. 6) is more of a lack of a decrease that is observed in controls. Also, this is only done in the VNC. Why not look in the brain, because the authors previously stated a hypothesis that the "transmission of signals through SIFaR in Crz-expressing neurons is limited to the brain" (lines 381-382)?

      Furthermore, the authors suggest that Crz acts on cells in the heart to regulate mating duration. It would be useful to add a discussion/speculation as to possible mechanisms for heart cells to regulate mating decisions. Is there evidence of CrzR in the heart? The SCope data presented in Fig. 7I-L and S7G-H is hard to read. 7. In several cases, the effects of being raised single are opposite the effects of sexual experience. For example, in Fig. 4T, calcium activity is increased in the AG following sexual experience, but decreased in flies raised singly. Likewise, Crz-neurons in the OL have increased CaLexA signal in singly-raised flies but reduced signals in flies with previous sexual experience. In some cases, manipulations selectively affect LMD or SMD. It would be useful to discuss these differences and consider the mechanistic implications of these differential changes, when they all result in decreased mating duration. This could help to clarify the big picture of the manuscript.

      Minor Comments:

      1. For CaLexA experiments (eg Fig 7A-D), signal intensity should be quantified in addition to area covered. Increased intensity would indicate greater calcium activity within a particular set of neurons.
      2. In Figure 5K: quantification of cell overlap is missing. In the text they state that there are ~100 neurons that co-express SIFaR24F06 and Crz. How was this determined? Is there a graph or numerical summary of this assertion?
      3. In lines 709-711: "Our experience suggests that the relative mating duration differences between naïve and experienced condition and singly reared are always consistent; however, both absolute values and the magnitude of the difference in each strain can vary. So, we always include internal controls for each treatment as suggested by previous studies." I had trouble understanding this section of methods. What is done with the data from the internal controls?
      4. Could the authors comment on why the brain GRASP signal is so different in Figures 3A and 4A? I realize that different versions of GRASP were used in these experiments, but I would expect broad agreement between the different approaches.

      Significance

      This study will be most relevant to researchers interested in understanding neuronal control of behavior. The manuscript offers a conceptual advance in identifying cell types and molecules that influence mating duration decisions. The strength of the manuscript is the number of different assays used; however, there is a sense that this has occurred at the cost of providing a cohesive narrative. The first part of the manuscript (detailing the role of SIFaR in LMD and SMD) is relatively stronger and more conclusive.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      Reviewer #1 (Evidence, reproducibility, and clarity (Required)): This is an interesting manuscript from the Jagannathan laboratory, which addresses the interaction proteome of two satellite DNA-binding proteins, D1 and Prod. To prevent a bias by different antibody affinities they use GFP-fusion proteins of D1 and Prod as baits and purified them using anti GFP nanobodies. They performed the purifications in three different tissues: embryo, ovary and GSC enriched testes. Across all experiments, they identified 500 proteins with surprisingly little overlap among tissues and between the two different baits. Based on the observed interaction of prod and D1 with members of the canonical piRNA pathway the authors hypothesized that both proteins might influence the expression of transposable elements. However, neither by specific RNAi alleles or mutants that lead to a down regulation of D1 and Prod in the gonadal soma nor in the germline did they find an effect on the repression of transposable elements. They also did not detect an effect of a removal of piRNA pathway proteins on satellite DNA clustering, which is mediated by Prod and D1. However, they do observe a mis-localisation of the piRNA biogenesis complex to an expanded satellite DNA in absence of D1, which presumably is the cause of a mis-regulation of transposable elements in the F2 generation.This is an interesting finding linking satellite DNA and transposable element regulation in the germline. However, I find the title profoundly misleading as the link between satellite DNA organization and transgenerational transposon repression in Drosophila has not been identified by multi-tissue proteomics but by a finding of the Brennecke lab that the piRNA biogenesis complex has a tendency to localise to satellite DNA when the localisation to the piRNA locus is impaired. Nevertheless, the investigation of the D1 and Prod interactome is interesting and might reveal insights into the pathways that drive the formation of centromeres in a tissue specific manner.

      We thank the reviewer for the overall positive comments on our manuscript. As noted above, we have performed a substantial number of revision experiments and improved our text. We believe that our revised manuscript demonstrates a clear link between our proteomics data and the transposon repression. We would like to make three main points,

      1. Our proteomics data identified that D1 and Prod co-purified transposon repression proteins in embryos. To test the functional significance of this association, we have used Drosophila genetics to generate flies lacking embryonic D1. In adult ovaries from these flies, we observe a striking elevation in transposon expression and Chk2-dependent gonadal atrophy. Along with the results from the control genotypes (F1 D1 mutant, F2 D1 het), our data clearly indicate that embryogenesis (and potentially early larval development) are a period when D1 establishes heritable TE silencing that can persist throughout development.
      2. Based on the newly acquired RNA-seq and small RNA seq data, we have edited our title to more accurately reflect our data. Specifically, we have substituted the word 'transgenerational' with 'heritable', meaning that the presence of D1 during early development alone is sufficient to heritably repress TEs at later stages of development.
      3. In addition, our RNA seq and small RNA seq experiments demonstrate that D1 makes a negligible contribution to piRNA biogenesis and TE repression in adults, despite the significant mislocalization of the RDC complex. In this regard, our results are substantially different from the recent Kipferl study from the Brennecke lab (Baumgartner et al. 2022).

        Major comments Unfortunately, the proteomic data sets are not very convincing. Not even the corresponding baits are detected in all assays. I wonder whether the extraction procedure really allows the authors to analyze all functionally relevant interactions of Prod and D1. It would be good to see a western blot or an MS analysis of the soluble nuclear extract they use for purification compared to the insoluble chromatin. It may well be that a large portion of Prod or D1 is lost in this early step. I also find the description of the proteomic results hard to follow as the authors mostly list which proteins the find as interactors of Prod and D1 without stating in which tissue or with what bait they could purify them (i.e. p7: Importantly, our hits included known chromocenter-associated or pericentromeric heterochromatin-associated proteins, such as Su(var)3-9[52], ADD1[23,24], HP5[23,24],mei-S332[53], Mes-4[23], Hmr[24,39,54], Lhr[24,39], and members of the chromosome passenger complex, such as borr and Incenp[55]). It would be interesting to at least discuss tissue specific interactions.

      Out of six total AP-MS experiments in this manuscript (D1 x 3, Prod x 2 and Piwi), we observe a strong enrichment of the bait in 5/6 attempts (log2FC between 4-12). In the initial submission, the lack of a third high-quality biological replicate for the D1 testis sample meant that only the p-value (0.07) was not meeting the cutoff. To address this, we have repeated this experiment with an additional biological replicate (Fig. S2A), and our data now clearly show that D1 is significantly enriched in the testis sample.

      As suggested by the reviewer, we have also assessed our lysis conditions (450mM NaCl and benzonase) and the solubilization of our baits post-lysis. In Fig. S1D, we have blotted equivalent fractions of the soluble supernatant and insoluble pellet from GFP-Piwi embryos and show that both GFP-Piwi and D1 are largely solubilized following lysis. In Fig. S1E, we also show that our IP protocol works efficiently.

      GFP-Prod pulldown in embryos is the only instance in which we do not detect the bait by mass spectrometry. Here, one reason could be relatively low expression of GFP-Prod in comparison to GFP-D1 (Fig. S1E), which may lead to technical difficulties in detecting peptides corresponding to Prod. However, we note that Prod IP co-purified proteins such as Bocks that were previously suggested as Prod interactors from high-throughput studies (Giot et al. 2003; Guruharsha et al. 2011). In addition, Prod IP from embryos also co-purified proteins known to associate with chromocenters such as Hcs and Saf-B. Finally, the concordance between D1 and Prod co-purified proteins from embryo lysates (Fig. 2A, C) suggest that the Prod IP from embryos is of reasonable quality.

      We also acknowledge the reviewer's comment that the description of the proteomic data was hard to follow. Therefore, we have revised our text to clearly indicate which bait pulled down which protein in which tissue (lines 148-156). We have also highlighted and discussed bait-specific and tissue-specific interactions in the text (lines 162-173).

      Minor comments The authors may also want to provide a bit more information on the quantitation of the proteomic data such as how many values were derive from the match-between runs function and how many were imputed as this can severely distort the quantification.

      Figure 1: Distribution of data after imputation in embryo (left), ovary (middle) and testis (right) datasets. Imputation is performed with random sampling from the 1% least intense values generated by a normal distribution.

      To ensure the robustness of our data analysis, we considered only those proteins that were consistently identified in all replicates for at least one bait (GFP-D1, GFP-Prod or NLS-GFP). This approach resulted in a relative low number of missing values. However, it is also important to bear in mind that in an AP-MS experiment, the number of missing values is higher, as interactors are not identified in the control pulldown. Importantly, the imputation of missing values during the data analysis did not alter the normal distribution of the dataset (Fig. 1, this document). Detailed information regarding the imputed values is also provided (Table 1, this document). The coding script used for the data analysis is available in the PRIDE submission of the dataset (Table 2, this document). This information has been added to our methods section under data availability.

      Table 1: ____Number of match-between-runs and imputations for embryo, ovary and testis datasets

      Dataset

      #match-between-runs

      %match-between-runs

      %imputation

      Embryo

      5541/27543

      20.11%

      8.36%

      Ovary

      1936/9530

      20.30%

      8.18%

      Testis

      1748/7168

      24.39%

      3.12%

      Table 2: ____Access to the PRIDE submission of the datasets

      Name

      ID PRIDE

      UN reviewer

      PW reviewer

      IP-MS of D1 from Testis tissue

      PXD044026

      reviewer_pxd044026@ebi.ac.uk

      ydswDQVW

      IP-MS of Piwi from Embryo tissue

      PXD043237

      reviewer_pxd043237@ebi.ac.uk

      TMCoDsdx

      IP-MS of Prod and D1 proteins from Ovary tissue

      PXD043236

      reviewer_pxd043236@ebi.ac.uk

      VOHqPmaS

      IP-MS of Prod and D1 proteins from Embryo tissue

      PXD043234

      reviewer_pxd043234@ebi.ac.uk

      L77VXdvA

      **Referee Cross-Commenting** I agree with the two other reviewers that the connection between the interactome and the transgenerational phenotype is unclear. This is also what I meant i my comment that the title is somewhat misleading. A systematic analysis of the D1 and Prod knock down effect on mRNAs and small Rnas would indeed be helpful to better understand the interesting effect.

      As suggested by the reviewer, we have performed RNA seq and small RNA seq in control and D1 mutant ovaries (Fig. 4) to fully understand the contribution of D1 in piRNA biogenesis and TE repression. Briefly, the mislocalization of RDC complex in D1 mutant ovaries does not significantly affect TE-mapping piRNA biogenesis (Fig. 4C, E). In addition, loss of D1 does not substantially alter TE expression in the ovaries (Fig. 4B) or alter the expression of genes involved in TE repression (Fig. 4F). Along with the results presented in Fig. 5 and Fig. 6, our data clearly indicate that embryogenesis (and potentially early larval development) is a critical period during which D1 makes an important contribution to TE repression.

      Reviewer #1 (Significance (Required)): Nevertheless, the investigation of the D1 and Prod interactome is interesting and might reveal insights into the pathways that drive the formation of centromeres in a tissue specific manner. It may be mostly interesting for the Drosophila community but could also be exiting for a broader audience interested in the connection of heterochromatin and its indirect effect on the regulation of transposable elements.

      We thank the reviewer again for the helpful and constructive comments, which have enabled us to significantly improve our study. We are excited by the results from our study, which illuminate unappreciated aspects of transcriptional silencing in constitutive heterochromatin.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): Chavan et al. set out to enrich our compendium of pericentric heterochromatin-associated proteins - and to learn some new biology along the way. An ambitious AP-Mass baited with two DNA satellite-binding proteins (D1 and Prod), conducted across embryos, ovaries, and testes, yielded hundreds of candidate proteins putatively engaged at chromocenters. These proteins are enriched for a restricted number of biological pathways, including DNA repair and transposon regulation. To investigate the latter in greater depth, the authors examine D1 and prod mutants for transposon activity changes using reporter constructs for multiple elements. These reporter constructs revealed no transposon activation in the adult ovary, where many proteins identified in the mass spec experiments restrict transposons. However, the authors suggest that the D1 mutant ovaries do show disrupted localization of a key member of a transposon restriction pathway (Cuff), and infer that this mislocalization triggers a striking, transposon derepression phenotype in the F2 ovaries.

      The dataset produced by the AP-Mass Spec offers chromosome biologists an unprecedented resource. The PCH is long-ignored chromosomal region that has historically received minimal attention; consequently, the pathways that regulate heterochromatin are understudied. Moreover, attempting to connect genome organization to transposon regulation is a new and fascinating area. I can easily envision this manuscript triggering a flurry of discovery; however, there is quite a lot of work to do before the data can fully support the claims.

      We appreciate the reviewer taking the time to provide thoughtful comments and constructive suggestions to improve the manuscript. We believe that we have addressed all the comments raised to the significant advantage of our paper.

      Major comments 1. The introduction requires quite a radical restructure to better highlight the A) importance of the work and B) limit information whose relevance is not clear early in the manuscript. A. Delineating who makes up heterochromatin is a long-standing problem in chromosome biology. This paper has huge potential to contribute to this field; however, it is not the first. Others are working on this problem in other systems, for example PMID:29272703. Moreover, we have some understanding of the distinct pathways that may impact heterochromatin in different tissues (e.g., piRNA biology in ovaries vs the soma). Also, the mutant phenotypes of prod and D1 are different. Fleshing out these three distinct points could help the reader understand what we know and what we don't know about heterochromatin composition and its special biology. Understanding where we are as a field will offer clear predictions about who the interactors might be that we expect to find. For example, given the dramatically different D1 and prod mutant phenotypes (and allele swap phenotypes), how might the interactors with these proteins differ? What do we know about heterochromatin formation differences in different tissues? And how might these differences impact heterochromatin composition?

      The reviewer brings up a fair point and we have significantly reworked our introduction. We share the reviewer's opinion that improved knowledge of the constitutive heterochromatin proteome will reveal novel biology.

      1. The attempt to offer background on the piRNA pathway and hybrid dysgenesis in the Introduction does not work. As a naïve reader, it was not clear why I was reading about these pathways - it is only explicable once the reader gets to the final third of the Results. Moreover, the reader will not retain this information until the TE results are presented many pages later. I strongly urge the authors to shunt the two TE restriction paragraphs to later in the manuscript. They are currently a major impediment to understanding the power of the experiment - which is to identify new proteins, pathways, and ultimately, biology that are currently obscure because we have so little handle on who makes up heterochromatin.

      We agree with this suggestion. We have introduced the piRNA pathway in the results section (lines 205 - 216), right before this information is needed. We've also removed the details on hybrid dysgenesis, since our new data argues against a maternal effect from the D1 mutant.

      The implications of the failure to rescue female fertility by the tagged versions of both D1 and Prod are not discussed. Consequently, the reader is left uneasy about how to interpret the data.

      We understand this point raised by the reviewer. However, in our proteomics experiments, we have used GFP-D1 and GFP-Prod ovaries from ~1 day old females (line 579, methods). These ovaries are morphologically similar to the wild type (Fig. S1C) and their early germ cell development appears to be intact. Moreover, chromocenter formation in female GSCs is comparable to the wildtype (Fig. 1C-D). These data, along with the rescue of the viability of the Prod mutant (Fig. S1A-B), suggest that the presence of a GFP tag is not compromising D1 or Prod function in the early stages of germline development and is consistent with published and unpublished data from our lab. In addition, D1 and Prod from ovaries co-purify proteins such as Rfc38 (D1), Smn (D1), CG15107 (Prod), which have been identified in previous high-throughput screens (Guruharsha et al. 2011; Tang et al. 2023). For these reasons, we believe that GFP-D1 and GFP-Prod ovaries are a good starting point for the AP-MS experiment. We speculate that the failure to completely rescue female fertility may be due to improper expression levels of GFP-D1 or GFP-Prod flies at later stages of oogenesis, which are not present in ovaries from newly eclosed females and therefore unlikely to affect our proteomic data.

      1. How were the significance cut-offs determined? Is the p-value reported the adjusted p-value? As a non-expert in AP-MS, I was surprised to find that the p-value, at least according to the Methods, was not adjusted based on the number of tests. This is particularly relevant given the large/unwieldy(?) number of proteins that were identified as signficant in this study. Moreover, the D1 hit in Piwi pull down is actually not significant according to their criteria of p We used a standard cutoff of log2FC>1 and p2FC and low p-value) since these are more likely to be bona fide interactors. Third, we have used string-DB for functional analyses where we can place our hits in existing protein-protein interaction networks. Using this approach, we detect that Prod (but not D1) pulls down multiple members of the RPA complex in the embryo (RPA2 and RpA-70, Fig. S2B) while embryonic D1 (but not Prod) pulls down multiple components of the origin recognition complex (Orc1, lat, Orc5, Orc6, Fig. S2C) and the condensin I complex (Cap-G, Cap-D2, barr, Fig. S2D). Altogether, these filtering strategies allow us to eliminate as many false positives as possible while making sure to minimize the loss of true hits through multiple testing correction.

      How do we know if the lack of overlap across tissues is indeed germline- or soma-specialization rather than noise?

      To address this part of the comment, we have amended our text (lines 162-173) as follows,

      'We also observed a substantial overlap between D1- and Prod-associated proteins (yellow points in Fig. 2A, B, Table S1-3), with 61 hits pulled down by both baits (blue arrowheads, Fig. 2C) in embryos and ovaries. This observation is consistent with the fact that both D1 and Prod occupy sub-domains within the larger constitutive heterochromatin domain in nuclei. Surprisingly, only 12 proteins were co-purified by the same bait (D1 or Prod) across different tissues (magenta arrowheads, Fig. 2C, Table S1-3). In addition, only a few proteins such as an uncharacterized DnaJ-like chaperone, CG5504, were associated with both D1 and Prod in embryos and ovaries (Fig. 2D). One interpretation of these results is that the protein composition of chromocenters may be tailored to cell- and tissue-specific functions in Drosophila. However, we also note that the large number of unidentified peptides in AP-MS experiments means that more targeted experiments are required to validate whether certain proteins are indeed tissue-specific interactors of D1 and Prod.'

      To make this inference, conducting some validation would be required. More generally, I was surprised to see no single interactor validated by reciprocal IP-Westerns to validate the Mass-Spec results, though I am admittedly only adjacent to this technique. Note that colocalization, to my mind, does not validate the AP-MS data - in fact, we would a priori predict that piRNA pathway members would co-localize with PCH given the enrichment of piRNA clusters there.

      Here, we would point out that we have conducted multiple validation experiments with a specific focus on the functional significance of the associations between D1/Prod and TE repression proteins in embryos. While the reviewer suggests that piRNA pathway proteins may be expected to enrich at the pericentromeric heterochromatin, this is not always the case. For example, Piwi and Mael are present across the nucleus (pulled down by D1/Prod in embryos) while Cutoff, which is present adjacent to chromocenters in nurse cells, was not observed to interact with either D1 or Prod in the ovary samples.

      Therefore, for Piwi, we performed a reciprocal AP-MS experiment in embryos due to the higher sensitivity of this method (GFP-Piwi AP-MS, Fig. 3B). Excitingly, this experiment revealed that four largely uncharacterized proteins (CG14715, CG10208, Ugt35D1 and Fit) were highly enriched in the D1, Prod and Piwi pulldowns and these proteins will be an interesting cohort for future studies on transposon repression in Drosophila (Fig. 3C).

      Furthermore, we believe that determining the localization of proteins co-purified by D1/Prod is an important and orthogonal validation approach. For Sov, which is implicated in piRNA-dependent heterochromatin formation, we observe foci that are in close proximity to D1- and Prod-containing chromocenters (Fig. 3A).

      As for suggestion to validate by IP-WBs, we would point out that chromocenters exhibit properties associated with phase separated biomolecular condensates. Based on the literature, these condensates tend to associate with other proteins/condensates through low affinity or transient interactions that are rarely preserved in IP-WBs, even if there are strong functional relationships. One example is the association between D1 and Prod, which do not pull each other down in an IP-WB (Jagannathan et al. 2019), even though D1 and Prod foci dynamically associate in the nucleus and mutually regulate each other's ability to cluster satellite DNA repeats (Jagannathan et al. 2019). Similarly, IP-WB using GFP-Piwi embryos did not reveal an interaction with D1 (Fig. S4B). However, our extensive functional validations (Figures 4-6) have revealed a critical role for D1 in heritable TE repression.

      The AlphaFold2 data are very interesting but seem to lack of negative control. Is it possible to incorporate a dataset of proteins that are not predicted to interact physically to elevate the impact of the ones that you have focused on? Moreover, the structural modeling might suggest a competitive interaction between D1 and piRNAs for Piwi. Is this true? And even if not, how does the structural model contribute to your understanding for how D1 engages with the piRNA pathway? The Cuff mislocalization?

      In the revised manuscript, we have generated more structural models using AlphaFold Multimer (AFM) for proteins (log2FC>2, p0.5 and ipTM>0.8), now elaborated in lines 175-177. Despite the extensive disorder in D1 and Prod, we identified 22 proteins, including Piwi, that yield interfaces with ipTM scores >0.5 (Table S4, Table S8). These hits are promising candidates to further understand D1 and Prod function in the future.

      For the predicted model between Prod/D1 and Piwi (Fig. S4A), one conclusion could indeed be competition between D1/Prod and piRNAs for Piwi. Another possibility is that a transient interaction mediated by disordered regions on D1/Prod could recruit Piwi to satellite DNA-embedded TE loci in the pericentromeric heterochromatin. These types of interactions may be especially important in the early embryonic cycles, where repressive histone modifications such as H3K9me2/3 must be deposited at the correct loci for the first time. We suggest that mutating the disordered regions on D1 and Prod to potentially abrogate the interaction with Piwi will be important for future studies.

      The absence of a TE signal in D1 and Prod mutant ovaries would be much more compelling if investigated more agnostically. The observation that not all TE reporter constructs show a striking signal in the F2 embryos makes me wonder if Burdock and gypsy are not regulated by these two proteins but possibly other TEs are. Alternatively, small RNA-seq would more directly address the question of whether D1 and Prod regulate TEs through the piRNA pathway.

      We completely agree with this comment from the reviewer. We have performed RNA seq on D1 heterozygous (control) and D1 mutant ovaries in a chk26006 background. Since Chk2 arrests germ cell development in response to TE de-repression and DNA damage(Ghabrial and Schüpbach 1999; Moon et al. 2018), we reasoned that the chk2 mutant background would prevent developmental arrest of potential TE-expressing germ cells and we observed that both genotypes exhibited similar gonad morphology (Fig. 4A). From our analysis, we do not observe a significant effect on TE expression in the absence of D1, except for the LTR retrotransposon tirant (Fig. 4B). We also do not observe differential expression of TE repression genes (Fig. 4F).

      We have complemented our RNA seq experiment with small RNA profiling from D1 heterozygous (control) and D1 mutant ovaries. Here, overall piRNA production and antisense piRNAs mapping to TEs were largely unperturbed (Fig. 4C-E).

      Overall, our data is consistent with the TE reporter data (Fig. S7) and suggests that zygotic depletion of D1 does not have a prominent role in TE repression. However, we have uncovered that the presence of D1 during embryogenesis is critical for TE repression in adult gonads (Fig. 6, Fig. S9).

      I had trouble understanding the significance of the Cuff mis-localization when D1 is depleted. Given Cuff's role in the piRNA pathway and close association with chromatin, what would the null hypothesis be for Cuff localization when a chromocenter is disrupted? What is the null expectation of % Cuff at chromocenter given that the chromocenter itself expands massively in size (Figure 4D). The relationship between these two factors seems rather indirect and indeed, the absence of Cuff in the AP would suggest this. The biggest surprise is the absence of TE phenotype in the ovary, given the Cuff mutant phenotype - but we can't rule out given the absence of a genome-wide analysis. I think that these data leave the reader unconvinced that the F2 phenotype is causally linked to Cuff mislocalization.

      We apologize that this data was not more clearly represented. In a wild-type context, Cuff is distributed in a punctate manner across the nurse cell nuclei, with the puncta likely representing piRNA clusters (Fig. 5A-B). We find that a small fraction of Cuff (~5%) is present adjacent to the nurse cell chromocenter (inset, Fig. 5A and Fig. 5D). In the absence of D1, the nurse cell chromocenters increase ~3-4 fold in size. However, the null expectation is still that ~5% of total Cuff would be adjacent to the chromocenter, since the piRNA clusters are not expected to expand in size. In reality, we observe ~27% of total Cuff is mislocalized to chromocenters. Our data indicate that the satellite DNA repeats at the larger chromocenters must be more accessible to Cuff in the D1 mutant nurse cells. This observation is corroborated by the significant increase in piRNAs corresponding to the 1.688 satellite DNA repeat family (Fig. 4E).

      The lack of TE expression in the F1 D1 mutant was indeed surprising based on the Cuff mislocalization but as the reviewers pointed out, we only analyzed two TE reporter constructs in the initial submission. In the revised manuscript, we have performed both RNA seq and small RNA seq. Surprisingly, our data reveal that the Cuff mislocalization does not significantly affect piRNA biogenesis (Fig. 4C, D) and piRNAs mapping to TEs. As a result, both TE repression (Fig. 4B) and fertility (Fig. 6D) are largely preserved in the absence of D1 in adult ovaries.

      Finally, we thank the reviewer for their excellent suggestion to incorporate the F2 D1 heterozygote (Fig. S9) in our analysis! This important control has revealed that the maternal effect of the D1 mutant is negligible for gonad development and fertility (Fig. 6B-D). Rather, our data clearly emphasize embryogenesis (or early larval development) as a key period during which D1 can promote heritable TE repression. Essentially, D1 is not required during adulthood for TE repression if it was present in the early stages of development.

      Apologies if I missed this, but Figure 5 shows the F2 D1 mutant ovaries only. Did you look at the TM6 ovaries as well? These ovaries should lack the maternally provisioned D1 (assuming that females are on the right side) but have the zygotic transcription.

      As mentioned above, this was a great suggestion and we've now characterized this important control in the context of germline development and fertility, to the significant advantage of our paper.

      Minor comments 9. Add line numbers for ease of reference

      We apologize for this. Line numbers have been added in the full revision.

      1. The function of satellite DNA itself is still quite controversial - I would recommend being a bit more careful here - the authors could refer instead to genomic regions enriched for satellite DNA are linked to xyz function (see Abstract line 2 and 7, for example.)

      The abstract has been rewritten and does not directly implicate satellite DNA in a specific cellular function. However, we have taken the reviewer's suggestion in the introduction (line 57-58).

      "Genetic conflicts" in the introduction needs more explanation.

      This sentence has been removed from the introduction in the revised manuscript.

      "In contrast" is not quite the right word. Maybe "However" instead (1st line second paragraph of Intro)

      Done. Line 57 of the revised manuscript.

      Results: what is the motivation for using GSC-enriched testis?

      We use GSC-enriched testes for practical reasons. First, they contain a relatively uniform population of mitotically dividing germ cells unlike regular testes which contain a multitude of post-mitotic germ cells such as spermatocytes, spermatids and sperm. Second, GSC-enriched testes are much larger than normal testes and reduced the number of dissections that were needed for each replicate.

      1. Clarify sentence about the 500 proteins in the Results section - it's not clear from context that this is the union of all experiments.

      Done. Lines 145-149 in the revised manuscript.

      The data reported are not the first to suggest that satellite DNA may have special DNA repair requirements. e.g., PMID: 25340780

      We apologize if we gave the impression that we were making a novel claim. Specialized DNA repair requirements at repetitive sequences have indeed been previously hypothesized(Charlesworth et al. 1994) and studied and we have altered the text to better reflect this (lines 193-195). We have cited the study suggested by the reviewer as well as studies from the Chiolo(Chiolo et al. 2011; Ryu et al. 2015; Caridi et al. 2018) and Soutoglou(Mitrentsi et al. 2022) labs, which have also addressed this fascinating question.

      Page 10: indicate-> indicates.

      Done.

      1. Page 14: revise for clarity: "investigate a context whether these interactions could not take place"

      We've incorporated this suggestion in the revised text (lines 383-386).

      1. Might be important to highlight the 500 interactions are both direct and indirect. "Interacting proteins" alone suggests direct interactions only.

      Done. Lines 145-149.

      The effect of the aub mutant on chromocenter foci did not seem modest to me - however, the bar graphs obscure the raw data - consider plotting all the data not just the mean and error?

      Done. This data is now represented by a box-and-whisker plot (Fig. S5), which shows the distribution of the data.

      Reviewer #2 (Significance (Required)):

      The dataset produced by the AP-Mass Spec offers chromosome biologists an unprecedented resource. The PCH is long-ignored chromosomal region that has historically received minimal attention; consequently, the pathways that regulate heterochromatin are understudied. Moreover, attempting to connect genome organization to transposon regulation is a new and fascinating area. I can easily envision this manuscript triggering a flurry of discovery; however, there is quite a lot of work to do before the data can fully support the claims.

      This manuscript represents a significant contribution to the field of chromosome biology.

      We thank the reviewer for the positive evaluation of our manuscript, and we really appreciate the great suggestion for the F2 D1 heterozygote control! Overall, we believe that our manuscript is substantially improved with the addition of RNA seq, small RNA seq and important genetic controls. Moreover, we are excited by the potential of our study to open new doors in the study of pericentromeric heterochromatin.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)): In the manuscript entitled "Multi-tissue proteomics identifies a link between satellite DNA organization and transgenerational transposon repression", the authors used two satellite DNA-binding proteins, D1 and Prod, as baits to identify chromocenter-associated proteins through quantitative mass spectrometry. The proteomic analysis identified ~500 proteins across embryos, ovaries, and testes, including several piRNA pathways proteins. Depletion of D1 or Prod did not directly contribute to transposon repression in ovary. However, in the absence of maternal and zygotic D1, there was a dramatic increase of agametic ovaries and transgenerational transposon de-repression. Although the study provides a wealth of proteomic data, it lacks mechanistic insights into how satellite DNA organization influence the interactions with other proteins and their functional consequences.

      We thank the reviewer for highlighting that this study will be a valuable resource for future studies on the composition and function of pericentromeric heterochromatin. Based on the reviewer's request for more mechanistic knowledge into how satellite DNA organization affects transposon repression, we have performed RNA seq and small RNA seq, added important genetic controls and significantly improved our text. In the revised manuscript, our data clearly demonstrate that embryogenesis (and potentially early larval development) is a critical time period when D1 contributes to heritable TE repression. Flies lacking D1 during embryogenesis exhibit TE expression in germ cells as adults, which is associated with Chk2-dependent gonadal atrophy. We are particularly excited by these data since TE loci are embedded in the satellite DNA-rich pericentromeric heterochromatin and our study demonstrates an important role for a satellite DNA-binding protein in TE repression.

      Major____ comments 1. While the identification of numerous interactions is significant, it would be helpful to acknowledge that lots of these proteins were known to bind DNA or heterochromatin regions. To strengthen the study, I recommend conducting functional validation of the identified interactions, in addition to the predictions made by Alphfold 2.

      We are happy to take this comment on board. In fact, we believe that the large number of DNA-binding and heterochromatin-associated proteins identified in this study are a sign of quality for the proteomic datasets. In the revised manuscript, we have highlighted known heterochromatin proteins co-purified by D1/Prod in lines 148-151 as well as proteins previously suggested to interact with D1/Prod from high-throughput studies in lines 153-156 (Guruharsha et al. 2011; Tang et al. 2023). In this study, we have focused on the previously unknown associations between D1/Prod and TE repression proteins and functionally validated these interactions as presented in Figures 3-6.

      The observation of transgenerational de-repression is intriguing. However, to better support this finding, it would be better to provide a mechanistic explanation based on the data presented.

      We appreciate this comment from the reviewer, which is similar to major comment #6 raised by reviewer #2. To generate mechanistic insight into the underlying cause of gonad atrophy in the F2 D1 mutant, we have performed RNA seq, small RNA seq and analyzed germline development and fertility in the F2 D1 heterozygous control (Fig. S9).

      For the RNA seq, we used D1 heterozygous (control) and D1 mutant ovaries in a chk26006 background. Since Chk2 arrests germ cell development in response to TE de-repression and DNA damage(Ghabrial and Schüpbach 1999; Moon et al. 2018), we reasoned that the chk2 mutant background would prevent developmental arrest of potential TE-expressing germ cells and we observed that both genotypes exhibited similar gonad morphology (Fig. 4A). From our analysis, we do not observe a significant effect on TE expression in the absence of D1, except for the LTR retrotransposon tirant (Fig. 4B). We also do not observe differential expression of TE repression genes (Fig. 4F).

      We have complemented our RNA seq experiment with small RNA profiling from D1 heterozygous (control) and D1 mutant ovaries. Here, overall piRNA production and antisense piRNAs mapping to TEs were largely unperturbed (Fig. 4C-E). Together, these data are consistent with the TE reporter data (Fig. S7) and suggests that zygotic depletion of D1 does not have a prominent role in TE repression.

      However, we have uncovered that the presence of D1 during embryogenesis is critical for TE repression in adult gonads (Fig. 6, Fig. S9). Essentially, either only maternal deposited D1 (F1 D1 mutant) or only zygotically expressed D1 (F2 D1 het) were sufficient to ensure TE repression and fertility. In contrast, a lack of D1 during embryogenesis (F2 D1 mutant) led to elevated TE expression and Chk2-mediated gonadal atrophy.

      Our results also explain why previous studies have not implicated either D1 or Prod in TE repression, since D1 likely persists during embryogenesis when using depletion approaches such as RNAi-mediated knockdown or F1 generation mutants.

      Minor____ comments 3. Given the maternal effect of the D1 mutant, in Figure 4, I suggest analyzing not only nurse cells but also oocytes to gain a more comprehensive understanding.

      We agree with the reviewer that this experiment can be informative. In the F2 D1 mutant ovaries, germ cell development does not proceed to a stage where oocytes are specified, thus limiting microscopy-based approaches. Nevertheless, we have gauged oocyte quality in all the genotypes using a fertility assay (Fig. 6D) since even ovaries that have a wild-type appearance can produce dysfunctional gametes. In this experiment, we observe that fertility is largely intact in the F1 D1 mutant and F2 D1 heterozygote strains, suggesting that it is the presence of D1 during embryogenesis (or early larval development) that is critical for heritable TE repression and proper oocyte development.

      The conclusion that D1 and Prod do not directly contribute to the repression of transposons needs further analysis from RNA-seq data. Alternatively, the wording could be adjusted to indicate that D1 and Prod are not required for specific transposon silencing, such as Burdock and gypsy.

      Agreed. We have performed RNA-seq in D1 heterozygous (control) and D1 mutant ovaries in a chk26006 background (Fig. 4A, B) as described above.

      As D1 mutation affects Cuff nuclear localization, it would be insightful to sequence the piRNA in the ovaries.

      Agreed. We have performed small RNA profiling from D1 heterozygous (control) and D1 mutant ovaries. Despite the significant mislocalization of the RDC complex, overall piRNA production and antisense piRNAs mapping to TEs were largely unaffected (Fig. 4C-E). However, we observed significant changes in piRNAs mapping to complex satellite DNA repeats (Fig. 4D), but these changes were not associated with a maternal effect on germline development or fertility (F2 D1 heterozygote, Fig. 6B-D).

      **Referee Cross-Commenting**

      I appreciate the valuable insights provided by the other two reviewers regarding this manuscript. I concur with their assessment that substantial improvements are needed before considering this manuscript for publication.

      1. The proteomics data has the potential to be a valuable resource for other scientific community. However, I share the concerns raised by reviewer 1 about the current quality of the data sets. Addressing this, it will augment the manuscript with additional data to show the success of the precipitation. Moreover, as reviewer 2 and I suggested, additional co-IP validations of the IP-MS results are needed to enhance the reliability.

      In the revised manuscript, we have performed multiple experiments to address the quality of the MS datasets based on comments raised by reviewer #1. They are as follows,

      Out of six total AP-MS experiments in this manuscript (D1 x 3, Prod x 2 and Piwi), we observe a strong enrichment of the bait in 5/6 attempts (log2FC between 4-12, Fig. 2A, B, Fig. S2A, Table S1-S3, Table S7). In the D1 testis sample from the initial submission, the lack of a third biological replicate meant that only the p-value (0.07) was not meeting the cutoff. To address this, we have repeated this experiment with an additional biological replicate (Fig. S2A), and our data now clearly show that D1 is also significantly enriched in the testis sample.

      As suggested by the reviewer #1, we have assessed our lysis conditions (450mM NaCl and benzonase) and the solubilization of our baits post-lysis. In Fig. S1D, we have blotted equivalent fractions of the soluble supernatant and insoluble pellet from GFP-Piwi embryos and show that both GFP-Piwi and D1 are largely solubilized following lysis. In Fig. S1E, we also show that our IP protocol works efficiently.

      The only instance in which we do not detect the bait by mass spectrometry is for GFP-Prod pulldown in embryos. Here, one reason could be relatively low expression of GFP-Prod in comparison to GFP-D1 (Fig. S1E), which may lead to technical difficulties in detecting peptides corresponding to Prod. However, we note that Prod IP from embryos co-purified proteins such as Bocks that were previously suggested as Prod interactors from high-throughput studies (Giot et al. 2003; Guruharsha et al. 2011). In addition, Prod IP from embryos also co-purified proteins known to associate with chromocenters such as Hcs(Reyes-Carmona et al. 2011) and Saf-B(Huo et al. 2020). Finally, the concordance between D1 and Prod co-purified proteins from embryo lysates (Fig. 2A, C) suggest that the Prod IP from embryos is of reasonable quality.

      As for the IP-WB validations, we would point out that chromocenters exhibit properties associated with phase separated biomolecular condensates. In our experience, these condensates tend to associate with other proteins/condensates through low affinity or transient interactions that are rarely preserved in IP-WBs, even if there are strong functional relationships. One example is the association between D1 and Prod, which do not pull each other down in an IP-WB (Jagannathan et al. 2019), even though D1 and Prod foci dynamically associate in the nucleus and mutually regulate each other's ability to cluster satellite DNA repeats (Jagannathan et al. 2019). Similarly, IP-WB using GFP-Piwi embryos did not reveal an interaction with D1 (Fig. S4B). However, our extensive functional validations (Figures 4-6) have revealed a critical role for D1 in heritable TE repression.

      I agree with reviewer 2 that the present conclusion is not appropriate regarding D1 and Prod do not contribute to transposon silencing. As reviewer 2 and I suggested, the systematical analysis by using both mRNA-seq and small RNA-seq is required to draw the conclusion.

      Agreed. We have performed RNA seq and small RNA seq as elaborated above. Our conclusions on the role of D1 in TE repression are now much stronger.

      1. The transgenerational phenotype is an intriguing aspect of the study. I agree with reviewer 2 that the inclusion of TM6 ovaries would be a nice control. Further, it is hard to connect this phenotype with the interactions identified in this manuscript. It would be appreciated if the author could provide a mechanistic explanation.

      We have significantly improved these aspects of our study in the revised manuscript. Through the analysis of germline development in the F2 D1 heterozygotes as suggested by reviewer #2, in addition to the recommended RNA seq and small RNA seq, we have now identified embryogenesis (and potentially early larval development) as a time period when D1 makes an important contribution to TE repression. Loss of D1 during embryogenesis leads to TE expression in adult germline cells, which is associated with Chk2-dependent gonadal atrophy. Taken together, we have pinpointed the specific contribution of a satellite DNA-binding protein to transposon repression.

      Reviewer #3 (Significance (Required)):

      Although this study successfully identified several interactions, the authors did not fully elucidate how these interactions contribute to the transgenerational silencing of transposons or ovary development.

      We thank the reviewer for the thoughtful comments and overall positive evaluation of our study, especially the proteomic dataset. We are confident that the revised manuscript has improved our mechanistic understanding of the contribution made by a satellite DNA-binding protein in TE repression.

      References

      Baumgartner L, Handler D, Platzer SW, Yu C, Duchek P, Brennecke J. 2022. The Drosophila ZAD zinc finger protein Kipferl guides Rhino to piRNA clusters eds. D. Bourc'his, K. Struhl, and Z. Zhang. eLife 11: e80067.

      Caridi CP, D'Agostino C, Ryu T, Zapotoczny G, Delabaere L, Li X, Khodaverdian VY, Amaral N, Lin E, Rau AR, et al. 2018. Nuclear F-actin and myosins drive relocalization of heterochromatic breaks. Nature 559: 54-60.

      Charlesworth B, Sniegowski P, Stephan W. 1994. The evolutionary dynamics of repetitive DNA in eukaryotes. Nature 371: 215-220.

      Chiolo I, Minoda A, Colmenares SU, Polyzos A, Costes SV, Karpen GH. 2011. Double-strand breaks in heterochromatin move outside of a dynamic HP1a domain to complete recombinational repair. Cell 144: 732-744.

      Ghabrial A, Schüpbach T. 1999. Activation of a meiotic checkpoint regulates translation of Gurken during Drosophila oogenesis. Nat Cell Biol 1: 354-357.

      Giot L, Bader JS, Brouwer C, Chaudhuri A, Kuang B, Li Y, Hao YL, Ooi CE, Godwin B, Vitols E, et al. 2003. A protein interaction map of Drosophila melanogaster. Science 302: 1727-1736.

      Guruharsha KG, Rual JF, Zhai B, Mintseris J, Vaidya P, Vaidya N, Beekman C, Wong C, Rhee DY, Cenaj O, et al. 2011. A protein complex network of Drosophila melanogaster. Cell 147: 690-703.

      Huo X, Ji L, Zhang Y, Lv P, Cao X, Wang Q, Yan Z, Dong S, Du D, Zhang F, et al. 2020. The Nuclear Matrix Protein SAFB Cooperates with Major Satellite RNAs to Stabilize Heterochromatin Architecture Partially through Phase Separation. Molecular Cell 77: 368-383.e7.

      Jagannathan M, Cummings R, Yamashita YM. 2019. The modular mechanism of chromocenter formation in Drosophila eds. K. VijayRaghavan and S.A. Gerbi. eLife 8: e43938.

      Mitrentsi I, Lou J, Kerjouan A, Verigos J, Reina-San-Martin B, Hinde E, Soutoglou E. 2022. Heterochromatic repeat clustering imposes a physical barrier on homologous recombination to prevent chromosomal translocations. Molecular Cell 82: 2132-2147.e6.

      Moon S, Cassani M, Lin YA, Wang L, Dou K, Zhang ZZ. 2018. A Robust Transposon-Endogenizing Response from Germline Stem Cells. Dev Cell 47: 660-671 e3.

      Pascovici D, Handler DCL, Wu JX, Haynes PA. 2016. Multiple testing corrections in quantitative proteomics: A useful but blunt tool. PROTEOMICS 16: 2448-2453.

      Reyes-Carmona S, Valadéz-Graham V, Aguilar-Fuentes J, Zurita M, León-Del-Río A. 2011. Trafficking and chromatin dynamics of holocarboxylase synthetase during development of Drosophila melanogaster. Molecular Genetics and Metabolism 103: 240-248.

      Ryu T, Spatola B, Delabaere L, Bowlin K, Hopp H, Kunitake R, Karpen GH, Chiolo I. 2015. Heterochromatic breaks move to the nuclear periphery to continue recombinational repair. Nat Cell Biol 17: 1401-1411.

      Tang H-W, Spirohn K, Hu Y, Hao T, Kovács IA, Gao Y, Binari R, Yang-Zhou D, Wan KH, Bader JS, et al. 2023. Next-generation large-scale binary protein interaction network for Drosophila melanogaster. Nat Commun 14: 2162.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In the manuscript entitled "Multi-tissue proteomics identifies a link between satellite DNA organization and transgenerational transposon repression", the authors used two satellite DNA-binding proteins, D1 and Prod, as baits to identify chromocenter-associated proteins through quantitative mass spectrometry. The proteomic analysis identified ~500 proteins across embryos, ovaries, and testes, including several piRNA pathways proteins. Depletion of D1 or Prod did not directly contribute to transposon repression in ovary. However, in the absence of maternal and zygotic D1, there was a dramatic increase of agametic ovaries and transgenerational transposon de-repression. Although the study provides a wealth of proteomic data, it lacks mechanistic insights into how satellite DNA organization influence the interactions with other proteins and their functional consequences.

      Major

      1. While the identification of numerous interactions is significant, it would be helpful to acknowledge that lots of these proteins were known to bind DNA or heterochromatin regions. To strengthen the study, I recommend conducting functional validation of the identified interactions, in addition to the predictions made by Alphfold 2.
      2. The observation of transgenerational de-repression is intriguing. However, to better support this finding, it would be better to provide a mechanistic explanation based on the data presented. Minor
      3. Given the maternal effect of the D1 mutant, in Figure 4, I suggest analyzing not only nurse cells but also oocytes to gain a more comprehensive understanding.
      4. The conclusion that D1 and Prod do not directly contribute to the repression of transposons needs further analysis from RNA-seq data. Alternatively, the wording could be adjusted to indicate that D1 and Prod are not required for specific transposon silencing, such as Burdock and gypsy.
      5. As D1 mutation affects Cuff nuclear localization, it would be insightful to sequence the piRNA in the ovaries.

      Referee Cross-Commenting

      I appreciate the valuable insights provided by the other two reviewers regarding this manuscript. I concur with their assessment that substantial improvements are needed before considering this manuscript for publication.

      1. The proteomics data has the potential to be a valuable resource for other scientific community. However, I share the concerns raised by reviewer 1 about the current quality of the data sets. Addressing this, it will augment the manuscript with additional data to show the success of the precipitation. Moreover, as reviewer 2 and I suggested, additional co-IP validations of the IP-MS results are needed to enhance the reliability.
      2. I agree with reviewer 2 that the present conclusion is not appropriate regarding D1 and Prod do not contribute to transposon silencing. As reviewer 2 and I suggested, the systematical analysis by using both mRNA-seq and small RNA-seq is required to draw the conclusion.
      3. The transgenerational phenotype is an intriguing aspect of the study. I agree with reviewer 2 that the inclusion of TM6 ovaries would be a nice control. Further, it is hard to connect this phenotype with the interactions identified in this manuscript. It would be appreciated if the author could provide a mechanistic explanation.

      Significance

      Although this study successfully identified several interactions, the authors did not fully elucidate how these interactions contribute to the transgenerational silencing of transposons or ovary development.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Chavan et al. set out to enrich our compendium of pericentric heterochromatin-associated proteins - and to learn some new biology along the way. An ambitious AP-Mass baited with two DNA satellite-binding proteins (D1 and Prod), conducted across embryos, ovaries, and testes, yielded hundreds of candidate proteins putatively engaged at chromocenters. These proteins are enriched for a restricted number of biological pathways, including DNA repair and transposon regulation. To investigate the latter in greater depth, the authors examine D1 and prod mutants for transposon activity changes using reporter constructs for multiple elements. These reporter constructs revealed no transposon activation in the adult ovary, where many proteins identified in the mass spec experiments restrict transposons. However, the authors suggest that the D1 mutant ovaries do show disrupted localization of a key member of a transposon restriction pathway (Cuff), and infer that this mislocalization triggers a striking, transposon derepression phenotype in the F2 ovaries.

      The dataset produced by the AP-Mass Spec offers chromosome biologists an unprecedented resource. The PCH is long-ignored chromosomal region that has historically received minimal attention; consequently, the pathways that regulate heterochromatin are understudied. Moreover, attempting to connect genome organization to transposon regulation is a new and fascinating area. I can easily envision this manuscript triggering a flurry of discovery; however, there is quite a lot of work to do before the data can fully support the claims.

      Major

      1. The introduction requires quite a radical restructure to better highlight the A) importance of the work and B) limit information whose relevance is not clear early in the manuscript. A. Delineating who makes up heterochromatin is a long-standing problem in chromosome biology. This paper has huge potential to contribute to this field; however, it is not the first. Others are working on this problem in other systems, for example PMID:29272703. Moreover, we have some understanding of the distinct pathways that may impact heterochromatin in different tissues (e.g., piRNA biology in ovaries vs the soma). Also, the mutant phenotypes of prod and D1 are different. Fleshing out these three distinct points could help the reader understand what we know and what we don't know about heterochromatin composition and its special biology. Understanding where we are as a field will offer clear predictions about who the interactors might be that we expect to find. For example, given the dramatically different D1 and prod mutant phenotypes (and allele swap phenotypes), how might the interactors with these proteins differ? What do we know about heterochromatin formation differences in different tissues? And how might these differences impact heterochromatin composition? B. The attempt to offer background on the piRNA pathway and hybrid dysgenesis in the Introduction does not work. As a naïve reader, it was not clear why I was reading about these pathways - it is only explicable once the reader gets to the final third of the Results. Moreover, the reader will not retain this information until the TE results are presented many pages later. I strongly urge the authors to shunt the two TE restriction paragraphs to later in the manuscript. They are currently a major impediment to understanding the power of the experiment - which is to identify new proteins, pathways, and ultimately, biology that are currently obscure because we have so little handle on who makes up heterochromatin.
      2. The implications of the failure to rescue female fertility by the tagged versions of both D1 and Prod are not discussed. Consequently, the reader is left uneasy about how to interpret the data.
      3. How were the significance cut-offs determined? Is the p-value reported the adjusted p-value? As a non-expert in AP-MS, I was surprised to find that the p-value, at least according to the Methods, was not adjusted based on the number of tests. This is particularly relevant given the large/unwieldy(?) number of proteins that were identified as signficant in this study. Moreover, the D1 hit in Piwi pull down is actually not significant according to their criteria of p <0.05 (D1 is p=0.05).
      4. How do we know if the lack of overlap across tissues is indeed germline- or soma-specialization rather than noise? To make this inference, conducting some validation would be required. More generally, I was surprised to see no single interactor validated by reciprocal IP-Westerns to validate the Mass-Spec results, though I am admittedly only adjacent to this technique. Note that colocalization, to my mind, does not validate the AP-MS data - in fact, we would a priori predict that piRNA pathway members would co-localize with PCH given the enrichment of piRNA clusters there.
      5. The AlphaFold2 data are very interesting but seem to lack of negative control. Is it possible to incorporate a dataset of proteins that are not predicted to interact physically to elevate the impact of the ones that you have focused on? Moreover, the structural modeling might suggest a competitive interaction between D1 and piRNAs for Piwi. Is this true? And even if not, how does the structural model contribute to your understanding for how D1 engages with the piRNA pathway? The Cuff mislocalization?
      6. The absence of a TE signal in D1 and Prod mutant ovaries would be much more compelling if investigated more agnostically. The observation that not all TE reporter constructs show a striking signal in the F2 embryos makes me wonder if Burdock and gypsy are not regulated by these two proteins but possibly other TEs are. Alternatively, small RNA-seq would more directly address the question of whether D1 and Prod regulate TEs through the piRNA pathway.
      7. I had trouble understanding the significance of the Cuff mis-localization when D1 is depleted. Given Cuff's role in the piRNA pathway and close association with chromatin, what would the null hypothesis be for Cuff localization when a chromocenter is disrupted? What is the null expectation of % Cuff at chromocenter given that the chromocenter itself expands massively in size (Figure 4D). The relationship between these two factors seems rather indirect and indeed, the absence of Cuff in the AP would suggest this. The biggest surprise is the absence of TE phenotype in the ovary, given the Cuff mutant phenotype - but we can't rule out given the absence of a genome-wide analysis. I think that these data leave the reader unconvinced that the F2 phenotype is causally linked to Cuff mislocalization.
      8. Apologies if I missed this, but Figure 5 shows the F2 D1 mutant ovaries only. Did you look at the TM6 ovaries as well? These ovaries should lack the maternally provisioned D1 (assuming that females are on the right side) but have the zygotic transcription.

      Minor

      1. Add line numbers for ease of reference
      2. The function of satellite DNA itself is still quite controversial - I would recommend being a bit more careful here - the authors could refer instead to genomic regions enriched for satellite DNA are linked to xyz function (see Abstract line 2 and 7, for example.)
      3. "Genetic conflicts" in the introduction needs more explanation.
      4. "In contrast" is not quite the right word. Maybe "However" instead (1st line second paragraph of Intro)
      5. Results: what is the motivation for using GSC-enriched testis?
      6. Clarify sentence about the 500 proteins in the Results section - it's not clear from context that this is the union of all experiments.
      7. The data reported are not the first to suggest that satellite DNA may have special DNA repair requirements. e.g., PMID: 25340780
      8. Page 10: indicate-> indicates.
      9. Page 14: revise for clarity: "investigate a context whether these interactions could not take place"
      10. Might be important to highlight the 500 interactions are both direct and indirect. "Interacting proteins" alone suggests direct interactions only.
      11. The effect of the aub mutant on chromocenter foci did not seem modest to me - however, the bar graphs obscure the raw data - consider plotting all the data not just the mean and error?

      Significance

      The dataset produced by the AP-Mass Spec offers chromosome biologists an unprecedented resource. The PCH is long-ignored chromosomal region that has historically received minimal attention; consequently, the pathways that regulate heterochromatin are understudied. Moreover, attempting to connect genome organization to transposon regulation is a new and fascinating area. I can easily envision this manuscript triggering a flurry of discovery; however, there is quite a lot of work to do before the data can fully support the claims.

      This manuscript represents a significant contribution to the field of chromosome biology.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is an interesting manuscript from the Jagannathan laboratory, which addresses the interaction proteome of two satellite DNA-binding proteins, D1 and Prod. To prevent a bias by different antibody affinities they use GFP-fusion proteins of D1 and Prod as baits and purified them using anti GFP nanobodies. They performed the purifications in three different tissues: embryo, ovary and GSC enriched testes. Across all experiments, they identified 500 proteins with surprisingly little overlap among tissues and between the two different baits. Based on the observed interaction of prod and D1 with members of the canonical piRNA pathway the authors hypothesized that both proteins might influence the expression of transposable elements. However, neither by specific RNAi alleles or mutants that lead to a down regulation of D1 and Prod in the gonadal soma nor in the germline did they find an effect on the repression of transposable elements. They also did not detect an effect of a removal of piRNA pathway proteins on satellite DNA clustering, which is mediated by Prod and D1. However, they do observe a mis-localisation of the piRNA biogenesis complex to an expanded satellite DNA in absence of D1, which presumably is the cause of a mis-regulation of transposable elements in the F2 generation.

      This is an interesting finding linking satellite DNA and transposable element regulation in the germline. However, I find the title profoundly misleading as the link between satellite DNA organization and transgenerational transposon repression in Drosophila has not been identified by multi-tissue proteomics but by a finding of the Brennecke lab that the piRNA biogenesis complex has a tendency to localise to satellite DNA when the localisation to the piRNA locus is impaired.

      Nevertheless, the investigation of the D1 and Prod interactome is interesting and might reveal insights into the pathways that drive the formation of centromeres in a tissue specific manner.

      Major comments

      Unfortunately, the proteomic data sets are not very convincing. Not even the corresponding baits are detected in all assays. I wonder whether the extraction procedure really allows the authors to analyze all functionally relevant interactions of Prod and D1. It would be good to see a western blot or an MS analysis of the soluble nuclear extract they use for purification compared to the insoluble chromatin. It may well be that a large portion of Prod or D1 is lost in this early step. I also find the description of the proteomic results hard to follow as the authors mostly list which proteins the find as interactors of Prod and D1 without stating in which tissue or with what bait they could purify them (i.e. p7: Importantly, our hits included known chromocenter-associated or pericentromeric heterochromatin-associated proteins, such as Su(var)3-9[52], ADD1[23,24], HP5[23,24],mei-S332[53], Mes-4[23], Hmr[24,39,54], Lhr[24,39], and members of the chromosome passenger complex, such as borr and Incenp[55]). It would be interesting to at least discuss tissue specific interactions.

      Minor comments

      The authors may also want to provide a bit more information on the quantitation of the proteomic data such as how many values were derive from the match-between runs function and how many were imputed as this can severely distort the quantification.

      Referee Cross-Commenting

      I agree with the two other reviewers that the connection between the interactome and the transgenerational phenotype is unclear. This is also what I meant i my comment that the title is somewhat misleading. A systematic analysis of the D1 and Prod knock down effect on mRNAs and small Rnas would indeed be helpful to better understand the interesting effect.

      Significance

      Nevertheless, the investigation of the D1 and Prod interactome is interesting and might reveal insights into the pathways that drive the formation of centromeres in a tissue specific manner. It may be mostly interesting for the Drosophila community but could also be exiting for a broader audience interested in the connection of heterochromatin and its indirect effect on the regulation of transposable elements.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript by Kehrer et al., use an elegant Apex2 BioID method to identify novel putative microneme proteins by mass-spectrometry and pick one candidate for further characterization. They identify a novel putative microneme protein they name Akratin which they characterize through targeted gene deletion and a series of complementation experiments. This reveals first that akratin appears to be functioning in male gamete egress, and though complementation using a putative trafficking mutant, also in midgut traversal.

      Overall the study is thoroughly performed but some of the conclusions are not fully supported.

      1)The newly identified microneme protein is still putative in my mind as the authors have not co-localized it with another marker. This is crucial for conclusions about its putative function and crucial for the trafficking experiment as explained below. It is also important given the high number of putative false positives in the BioID experiment.

      2)I would consider it essential to also localise the Apex2 tagged SOAP protein as the authors cannot be sure that there is a partial mislocalisation of the protein leading to false positives.

      3)I am not convinced by the trafficking defect. This could be because the localsation in the images are not easy to distinguish and it may be much clearer looking down the microscope. I think co-localisation with another microneme marker would go a long way and demonstrating that akratin upon mutation actually localises elsewhere is important. It is even more important since there is no phenotype in male egress, but then later in ookinetes, which is a bit surprising if this is a proper conserved trafficking motif.

      4)The candidate selection section is poorly described. A flow chart or clearer inclusion/ exclusion criteria would be useful.

      5)I understand the approach to focus on more abundant biotinylated proteins, however, I think it may not be the best approach to use peptide counting. Apex2 labelling as the authors rightfully say, is mainly based on tyrosine labelling of surface exposed areas, so the abundance of proteins in the IP will depend on accessible tyrosines, protein abundance, distance from the bait, size of the protein and how many tryptic peptides can be generated. Reproducible results between 2 conditions are more likely to show true positives and may be the best way to prioritize, or assign confidence. Also: cOuld the authors use mean intensity values for the peptides covering proteins as a metric for abundance using label free quantification? This is not a requirement but may allow quantification in a slightly better way. I am not sure about the Table S1 colour scheme (the legend does not explain green, purple and blue shading). Are all green ones confirmed microneme proteins? Please add a proper descripton of the table and columns.

      6)Figure 2C and D are from PlasmoDB and should ideally not be included as figure panels. This is misleading and could either be mentioned in the text, or put into supplementary data with a clear note that the authors have not aquired these data. I would also suggest to move figures 3A-C into figure 2 and present the KO with the complementation data for a direct comparison.

      Minor:

      1)When the authors say "numbers of peptides identified": is this unique peptides or does it include non-unique peptides?

      2)Figures 1 I-K could move into supplementary as they are somewhat non-informative given the nature of BioID described in the main points.

      3)Line 253: Whether akratin is involved in membrane lysis directly, or important for microneme secretion so this is a knock-on effect is not known. This could be added to the discussion, but there is no evidence for this statement in the results section.

      4)Line 274: Refers to Figure 3F, which does not exist.

      5)Line 333: Overall I think this is a bit of an overstatement. The use of Apex2 in these conditions is definitely nice to see but for now the authors have validated none of the microneme proteins by co-localization. So we are still a bit in the dark how well the method works in terms of false positives. The targeting motif in my mind is not yet confirmed in the absence of co-localisations with other markers. An alternative explanation could be that the c-terminus of the protein is important for its function in one stage, but not another but that trafficking is not- or only marginally affected.

      Significance

      The significance of the manuscript in my mind lies in the application of Apex2 in Plasmodium parasites, which will be an advance for the field. However, we do not learn about labeling times, how short it can be so its potential is not fully looked at.

      The list of the putative micronemes will of course be of high interest for the community, but because of the limited validation in this study will require further validation by others.

      The identification of the dual function of this protein in transmission in egress and ookinete traversal is interesting and surely leads to further studies. The identification of a putative differential trafficking motif is intruiging, if, as stated in the major concerns, this can be validated.

      My expertise lies in Plasmodium biology with good knowledge of mass-spectrometry approaches.

      Referees Cross-commenting

      I agree with the assessment of the other reviewer, a slightly more detailed discussion of the hits would be desireable (exported proteins, why are they there). This could be a drawback of the system used, and mentioned.

      Western blot of the GFP is a very good idea to clarify whether the localization is maybe, in parts, GFP that is not fused to the full lenght protein, either by cleavage, or a breakdown product.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Proximity labelling using BirA has emerged as a highly successful approach to identify interaction partners and proteins in compartments of a protein of interest in the living cell. A number of recent studies applied this approach with malaria parasites and demonstrated its usefulness. However, a drawback of using BirA is the time required to obtain good yields of biotinylated proteins (usually many hours). APEX is a much faster alternative to BirA that however has so far not been used in apicomplexan parasites. Here Kehrer and colleagues use APEX2 to identify proteins of the micronemes of mature Ookinetes, a task that due to the short time available for labelling, would have been difficult with BirA. From the obtained hits the authors chose a protein they named akratin and carried out a detailed functional analysis. They found that akratin is needed for microgametogenesis and for ookinete migration. Complementation with either the Pb or Pf GFP-tagged version of akratin rescued all defects of the mutant, despite the low sequence similarity and differing number of predicted transmembrane segments in the P. falciparum protein. Interestingly a mutant form of this protein with an ablated putative trafficking motif in its C-terminus also caused an ookinete migration phenotype but microgametogenesis was unaffected, hinting at intricacies in its function.

      This is a clearly written and interesting paper reporting the first use of APEX-based BioID in malaria parasites and demonstrates that it is possible to take advantage of BioID in short lived stages. The ookinete microneme proteome reported here compares favourably with that derived from organelle purifications and will provide a resource to identify the proteins and understand their involvement in migration and adhesion of this and possibly other parasite stages. The example protein chosen for functional analysis in this work nicely illustrates this. A validation of some more of the unknown hits to be true microneme proteins would have been beneficial, but given the high number of known true positives in the hit list this is not absolutely essential for the paper. Overall this manuscript therefore provides a nice piece of work adding a list of proteins for future study and a new player important for mosquito infection.

      Minor points:

      1.several proteins taken as true hits in TableS1 are not obvious to me, such as plasmepsin V and several exported proteins. Are they expected to be trafficked via the micronemes in ookinetes?

      2.Fig. 1C: '+ Streptavidin', this should be '+ streptavidin beads'

      3please insert a referral to Figure S2 (similarity of the Plasmodium akratin homologs) somewhere around line 180, the only reference to this figure I could find in the text was in line 214. Figure S3 (line 184) is mentioned in a context to support the absence of homologues outside Plasmodium spp but shows the generation of the akratin KO. Maybe this should have been a citation to Figure S2, but then something is missing from that figure.

      4.line 192, it might be useful to clarify in the text what 10 and 7 blood stage growth means (multiplication rate?).

      5.the observation that akratin as a multi TM protein with signal peptide seems to be soluble in the parasite is rather unlikely. Is there a precedent for this? My best guess would be that this is GFP alone due to degradation or processing of the akratin-GFP fusion. A Western blot, if sufficient material is available, would clarify this. In regards to the localisation of akratin in the different stages it should also be taken into account that calling anything 'vesicular' based on fluorescent microscopy is rather speculative.

      6.line 199, how can the low number of ookinetes affect their speed? Could this remark have been intended for the next sentence (199-201/Fig. 2I) as the lack of transmission to mosquitoes may have been due to the reduced number of ookinetes rather than a deficiency of individual ookinetes? To exclude the latter, were parasite number used matched to exclude that this was the case in Fig. 2I?

      7.line 272-4, is there a way to quantify the phenotype in ookinetes, e.g. using intensity profile plots showing the distribution across the cell? Many cells still seem to show peripheral staining in the trafficking mutant. Could it again be that some of the protein is processed and the increased cytoplasmic staining represents GFP alone (see also comment 5)? In that case the level of processing rather than trafficking (alone) might be affected in the mutant.

      8.lines 396 and 397, there is a minuscule -1 before the chemicals in my pdf.

      9.line 58. remove 'not' and 'or': neither been identified nor been characterised

      10.lines 88 to 94: English could be polished some more in this part. Line 91, replace 'ones', e.g. with 'proteins'

      11.lines 245 and 253: add commas after (Figure 4B) and (Figure 4D), respectively

      12.line 254; change possibly into possible

      13.line 296-71: it might be helpful to mention in the text that this was again the complementation strategy (used for the Pb and Pf akratin)

      14.while going beyond the scope of this manuscript, would it be possible to use the APEX introduced here to label structures in EM? A comment on this might be useful for readers interested in using this domain.

      Significance

      This is the first use of APEX2 in apicomplexans and this will be very useful for the field. The ookinete microneme proteome will provide a resource to study key aspects of this stage. The unknown proteins in the hit list still have to be confirmed to be true positives, but given the high number of true positives among the known proteins, the success rate should be acceptable. Akratin is a new protein among several known to be important for transmission to the mosquito host. As illustrated by this and other studies, many important questions remain how microgametes egress from the cell and how in vitro gliding and mosquito gut wall penetration relate. Akratin, together with the previously known proteins, will be instrumental in solving these questions.

      My expertise (to put this assessment into perspective): I am a cell biologist working with P. falciparum blood stages and have no or limited experience with mosquito stages and P. berghei. My grasp of the technical difficulties to work with ookinetes and other mosquito stages as presented in this manuscript are therefore limited. Nevertheless, it appears to me that the work is well done and that the overall outcome from this paper is significant and will be of great interest to the field.

      Referees Cross-Commenting

      I think the other reviewer's request to carry out a co-loc experiment with a microneme marker to show that akratin and SOAP-APEX are in the correct location is very reasonable and should be done. I also share the view about the trafficking domain. This is reflected in my comment (points 5 and 7) on quantifying the phenotype of the 'trafficking mutant' and the possbility of not looking at the full length protein as it may be a processed/degraded version of the protein that still contains GFP. Western blots, if possible with these parasite stages, might clear this up.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2024-02378

      Corresponding author(s): Angelika Böttger

      [Please use this template only if the submitted manuscript should be considered by the affiliate journal as a full revision in response to the points raised by the reviewers.

      • *

      If you wish to submit a preliminary revision with a revision plan, please use our "Revision Plan" template. It is important to use the appropriate template to clearly inform the editors of your intentions.]

      1. General Statements [optional]

      After we have carefully studied the four reviews we have received, we made some major revisions to the manuscript. These included the following main points:

      • Concerns regarding clarity of the manuscript: we have substantially edited the abstract, introduction and discussion part of the manuscript and added many more references to previous work by other authors, especially Cazet 2021, Tursch 2022 and Gahan 2017. We focused our introduction and discussion on organizer function and on the Gierer-Meinhardt-Model for pattern formation. We think that the conclusions are of great general interest because they suggest a function of the Hydra head organizer according to the original definition by Hans Spemann, that is “harmonious interlocking of separate processes which makes up development”. Notch signaling, in our interpretation, is an instrument for this function of the organizer. Comparison with Craspedacusta compellingly illustrates this idea.
      • Concerns regarding Craspedacusta experiments: we have isolated four Craspedacusta transcripts (CsSp5, CsWnt3, CsAlx and CsNOWA) and analyzed their response to DAPT during head regeneration in Craspedacusta. This revealed that DAPT did not inhibit CsWnt3 expression, in accordance with it not having an effect on head regeneration in Craspedacusta However, DAPT inhibited expression of the other potential CsNotch target genes, confirming that DAPT generally works in Craspedacusta polyps as Notch-inhibitor.
      • Concerns regarding HyKayak function: we have conducted a rescue experiment to show the function of Hykayak as a target for Notch-regulated repressor genes and a local inhibitor of Wnt-3 expression, which revealed that the expected up-regulation of HyWnt3 in DAPT-treated animals was very weak and did not rescue the DAPT-regeneration phenotype-this was discussed, but data were not included.

      2. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Major: • The introduction is lacking a full description of what is known about transcriptional changes during Hydra regeneration and in particular the role of Wnt signalling in this process. Of note the authors do not cite several important studies from recent years including (but not limited to):

      *https://doi.org/10.1073/pnas.2204122119 *

      *https://doi.org/10.1186/s13072-020-00364-6 *

      *https://doi.org/10.1101/587147 *

      *https://doi.org/10.7554/eLife.60562 This problem is further compounded later when the authors do not cite/discuss work which has performed the same or similar analyses to their own. The authors should endeavor to give a more comprehensive background knowledge. *

      Answer:

      Our work focuses on the role of Notch-signalling during Hydra head regeneration, specifically when the head is removed at an apical position. We therefore now have included additional information about transcriptional changes during this process in the introduction. In addition, we have included the suggested citations in the introduction to give a more general background knowledge.

      e.g. .Following decapitation, the expression of Hyβ-catenin and HyTcf was upregulated earliest, followed by local activation of Wnt genes. Among these, HyWnt3 and HyWnt11 appeared within 1.5 h of head removal, followed by HyWnt1, HyWnt9/10c, HyWnt16, and HyWnt7, indicating their role in the formation of the Hydra head organizer (Hobmayer et al., 2001; Lengfeld et al., 2009; Philipp et al., 2009; Tursch et al., 2022).

      • The authors do not cite or reference at all the study by Cazet et al. which used iCRT14 along with RNAseq and ATACseq to probe the role of Wnt signaling during early regeneration. This is a major issue. Although I appreciate that the authors have done much longer time courses and that their data therefore add something to our understanding it will still be important to discuss here. For example, the authors show that Wnt3 is activated normally in iCRT14 animals. Is this also seen in the RNAseq from Cazet et al.*
      • *

      Answer:

      iCRT14 was used in Hydra regeneration experiments by Gufler et al (which we did cite) and Cazet et al, but the specific aspects of hypostome and tentacle regeneration were not addressed. Cazet et al. have analyzed HyWnt3expression after iCRT treatment during the first 12 hrs of regeneration. Our data show, in addition that HyWnt3 is not controlled by TCF-dependent transcription during Hydra head regeneration after apical cuts throughout the whole regeneration process including the morphogenesis state. Nevertheless, the other Wnt-genes, which are indicated in canonical Wnt-signalling are affected by iCRT14 also in our study.

      We have now included comparison of Cazet- and our data, we wrote:

      HyWnt3 and Wnt9/10c expression are swiftly induced by injuries. When HyWnt3 and HyWnt9/10 activities are sustained, organizers can be formed, which induce ectopic heads when the original organizer tissue (the head) is removed (Cazet et al., 2021; Tursch et al., 2022).”

      The effect of iCRT14 had been analyzed in previous studies (Cazet et al., 2021; Gufler et al., 2018; Tursch et al., 2022). All showed b-catenin-dependency for down-regulation of head specific genes in foot regenerates at time points up to 12 hrs after head removal, including HyWnt3. They also stated a failure of head regeneration in the presence of iCRT14 but, in accordance with our study, did not reveal that HyWnt3 expression at future heads depended on b-catenin. None of these studies analyzed the regeneration of tentacles and hypostomes separately and they did not report whether* the regeneration of hypostomes 48 hrs after head removal occurred normally upon iCRT14 treatment. *

      • The visualizations used in Figure 3 are quite difficult to interpret and do to in all cases match descriptions in the text. The way the same type data is displayed in figure 5 so much nicer. It is also better to treat the same types of data in the same manner consistently throughout the paper. For Hes, for example, the authors state that there is a reduction although the data shows that this is very small and taking into account the 95% confidence interval does not seem to be significant. If this is the case then the positive control is not working in this experiment. This would be much clear if individual time points were compared like in figure 5 and statistical tests shown. The authors then state that Alx is not affected but there is actually a larger effect than what they deemed significant for Hes (the axes are notably different between these two and I think a more consistent axis would make the genes more comparable). Similarly, Gsc is described as being not affected at 8 hours but it appears again to change more that the positive control Hes. Given this I would call into question the validity of this dataset and/or the interpretation by the authors. A more thorough analysis including taking better into account statistical significance would go a long way to increasing confidence in this data. • The same issues in interpretation described for Figure 3 also apply to figure 4. The authors state that Wnt7 is affected less than Wnt1 and 3 but this is not evident in the figure and no comparative analysis is performed to confirm this. The same for Wnt 11 and 9/10c where what the authors description is very difficult to see in the figure. Sp5 is apparently upregulated, but this is not discussed. Again the axes are notably different making it even more difficult to compare between samples. __Answer*____:__

      We have now presented the data by simple scatter blots with significance information for every data point. This allows comparison between samples as requested by the reviewer. The GAMs were moved to the supplement. We believe that some readers may appreciate GAM-representation of the data because of the accessibility of the confidence interval over time.

      Concerning DAPT:

      “We now performed RT-qPCR analysis to compare gene expression dynamics of these genes during head regeneration 0, 8, 24, 36 and 48 hrs after head removal. Animals were either treated with 30 µM DAPT in 1% DMSO, or 1% DMSO as control for the respective time frames. Timepoint 0 was measured immediately after head removal. The results of these analyses revealed that HyHes expression was clearly inhibited by DAPT during the first 36 hrs after head removal (Fig. 3B), confirming previously published data which had indicated HyHes as a direct target for NICD (Münder et al., 2010). HyAlx expression levels were slightly up-regulated after 24 hrs, but later partially inhibited by DAPT (Fig. 3C). CnGsc expression under DAPT treatment initially (8hrs) was comparable to control levels, but then it was strongly inhibited (Fig. 3D). This goes along with the observed absence of organizer activity in regenerating Hydra tips (Münder et al., 2013). Interestingly, a similar result was seen for HySp5 expression, which was also normal at 8 hrs but was then inhibited by DAPT at later time points (Fig. 3E). HyKayak, while expression is normal after 8 hrs, was strongly overexpressed between 24 and 36 hrs of regeneration in DAPT-treated polyps in comparison to control regenerates (Fig. 3F).

      Concerning iCRT14

      Next, following the same procedure as described for DAPT, we compared the gene expression dynamics of iCRT14-treated regenerates with control regenerates. We found that the expression of HyWnt3 was not inhibited by iCRT14. In fact, it even appeared slightly up-regulated at the 8 hrs time point (Fig. 4A). Normal HyWnt3-expression at the end of the regeneration period was confirmed by in-situ hybridization for HyWnt3 as shown in Fig. 1D, indicating that HyWnt3 expression patterns and expression levels in ecto- and endodermal cells of the hypostome were faithfully regenerated (Fig. 4A). In contrast, HyAlx expression was completely abolished by iCRT14 (Fig. 4B), consistent with the observation that iCRT14-treated head regenerates did not regenerate any tentacles (Fig. 1A). HySp5 expression was not significantly affected by iCRT14 treatment at any time point (Fig. 4C).

      Furthermore, we found that CnGsc levels in iCRT14 remained similar to control regenerates up to 24 hrs, but were attenuated at later time points (Fig. 4D), very similar to the expression dynamics of the Notch-target gene HyHes (Fig. 4E). The expression of HyKayak was decreased at 8 hrs after head removal in the presence of iCRT14, but then increased above control levels after 48 hrs (Fig. 4F). There were no significant changes in the expression dynamics of HyBMP2/4 and HyBMP5/8b between iCRT14-treated regenerates and controls (Fig. 4G, H).”

      The precise number of biological replicates can be seen in the individual diagrams, they included for most genes three biological replicates, with always three technical replicates for each data point. Biological replicates were obtained over several years by different researchers. For some genes, we obtained very consistent data with high confidence in every experiment (e.g. HyWnt3, HyBMP4). We illustrate this in table 1, where three arrows indicate all such cases. With some genes we observed greater variation, which we interpret as no effect or a minor effect in table 1. Some of these variations may be explained by our observation of wave-like patterns in the expression dynamics. Therefore, we have included the following statement:

      “In addition, the gene expression dynamics for many of the analyzed genes appears in wave-like patterns in some experiments (see Figs S3 and S4). As we have only four time points measured, we cannot draw strong conclusions from these observations, except that some of the deviations in our data points (e.g. 48 hrs HyHes)”

      • In their description of figure 4 the authors completely omit to discuss the Cazet et al dataset which has the exact same early timepoints for iCRT14 treatment. This must be discussed and compared and any difference noted. * Answer:

      We included the iCRT14 results from Cazet et al., in our revised manuscript (see above).

      • End of page 11: The authors propose a model thereby the role of Notch in Wnt3 expression may be due to the presence of a repressor. However, I don't see any putative evidence at that stage. The authors also do not cite relevant work from both Cazet et al. and Tursch et al which show that Wnt3 is likely upregulated by bZIP TFs. In both these cases the authors show evidence of bZIP TF binding sites in the Wnt3 promoter along with other analyses. This is very relevantto the model presented by the authors here and must be discussed and compared. - * In particular the authors put forward HyKayak as an inhibitor of Wnt3 and this should be discussed along with the previous work.

      Answer:

      Tursch et al. 2022 did not claim that HyWnt3 is upregulated by bZiP TFs. They showed that HyWnt3 was strongly upregulated in a position-independent manner upon inhibition of the p38 or JNK (c-Jun N-terminal kinase) pathways (i.e., stress-induced MAPK pathways). This would rather support our hypothesis that HyKayak (AP-1 protein) might be a repressor of Wnt3-expression.

      Cazet et al have indicated that injury-responsive bZIP TFs are the most plausible regulators of canonical Wnt-signalling components during the early generic wound response. They identified CRE-elements, which can be bound by bZIP TFs, in the putative regulatory sequences of HyWnt3. However, they focused on the early stage of regeneration (0-12hpa), and showed that bZIP TFs, including jun, fos and creb are transiently upregulated at 3hpa and hypothesise that they could induce the upregulation of HyWnt3 at this stage as an injury response. We have to point out that the Hydra fos-homolog Hykayak, which our work is concerned with, is not identical with the fos-gene described in Cazet’s paper. In addition, the Hykayak gene was downregulated by Notch signalling during the morphogenesis state of regeneration (24-36 hrs), which is not the same stage investigated by Cazet et al. To avoid confusion, we have now included the Cazet-fos-sequence in our sequence comparison in Fig. S1 (fos_Cazet_HYDVU). Moreover, we have included more information about fos_Cazet in the context of a comparison with HyKayak.

      • *

      Different bZiP transcriptional factors (TFs) may have different effects on the expression of Wnt genes, and these effects are context-dependent. In previous research, Cazet et al. identified another Hydra fos gene (referred to as fos_cazet) and bZiP TF binding sites in the putative regulatory sequences of HyWnt3 and HyWnt9/10c. They showed that bZiP TF-genes, including Jun and fos, were transiently upregulated 3 hrs after amputation, therefore they hypothesized that bZiP TFs could induce TCF-independent upregulation of HyWnt3 during the early generic wound response (Cazet et al., 2021). However, in our study HyKayak expression continuously increased throughout the entire head regeneration process (Fig. 3E and 4E) including the morphogenesis stages (24-48 hrs post-amputation). Another study reported that inhibition of the JNK pathway (which disrupts formation of the AP-1 complex) resulted in upregulation of HyWnt3 expression in both, head and foot regenerates (Tursch et al., 2022). This result might support our hypothesis, but it only included the first 6 hours after amputation, similar to Cazet’s research. Therefore, it appears that HyKayak and fos_Cazet may have opposing roles in the regulation of Wnt-gene expression and are possibly activated by different signaling pathways depending on the stages of regeneration.

      • On page 12 the authors conclude based on gene expression in inhibitor treatment that there is a “change in complex composition of the two transcription factors.” This is something which would require biochemical evidence and I therefore suggest they remove this entirely. * Answer:

      we have removed this sentence

      • The authors use experiments in Craspedacusta to test their hypothesis of the role of Wnt and Notch signaling in Hydra. There is, in my opinion, an incorrect logic here. Regardless of the outcome, the roles of Wnt and Notch could conceivably be different in the two species and therefore testing hypothesis from one is not possible in the other. The authors should reframe their discussion of this to be more of a comparative framework. Moreover, the results do not necessarily indicate what the authors say. In Hydra Notch signaling is required to form the hypostome/mouth and this is not the case in Craspedacusta while Wnt signaling is required. The authors do not cite an important study from another Hydrozoan Hydractinia (Gahan at al.,2017). In that study the authors show that DAPT inhibits tentacles during regeneration but that the hypostome (or at least the arrangement of neurons and cnidocytes around the mouth) forms normally. This would indicate that in Hydractinia the process of head formation is different to in Hydra and would be congruent with what is shown here in Craspedacusta. This should be more thoroughly discussed, and all relevant literature cited.* Answer:

      We have concentrated our Craspedacusta work on Notch-signalling now. We only show that DAPT does not inhibit the regeneration of Craspedacusta heads. We have included new data showing that nevertheless it has an effect on the expression of hypothetical Notch target genes, but not on CsWnt3 (new Fig. 7). We have re-written our discussion accordingly and included the Hydractinia-work about Notch (Gahan2017). Although the Hydractinia paper lacks gene expression studies making it difficult to directly compare with the Hydra data, it supports our claim that Notch is required for regeneration of polyps with head and tentacles. We indeed do not know anything about Wnt-signalling in Craspedacusta. Our new results show that it is probably expressed in the head, because we observe very low levels of expression in the polyps after head removal, which increases considerably during regeneration of the head. This was included in the results:

      Results:

      “Finally, we investigated the expression of the Craspedacusta Wnt3-gene and its response to DAPT treatment during head regeneration. We observed low expression level of CsWnt3 after head removal (t=0), which dramatically increased as the head regenerated, suggesting that Wnt3 is expressed in the head of Craspedacusta polyps as it is in the head of other cnidarians including Hydra, Hydractinia and Nematostella (Hobmayer et al., 2000; Kusserow et al., 2005; Plickert et al., 2006). In accordance with having no effect on head regeneration, DAPT also did not inhibit CsWnt3 expression during this process in Craspedacusta. This is opposite to the situation in Hydra. If CsWnt3 would be involved in the Craspedacusta head regeneration, this could explain the failure of DAPT to interfere with this process”.

      Discussion part

      “Head regeneration also occurs in the colonial sea water hydrozoan Hydractinia. Colonies consist of stolons covering the substrate and connecting polyps, including feeding polyps, which have hypostomes and tentacles, and are capable of head regeneration, similar to Hydra polyps. Wnt3 is expressed at the tip of the head and by RNAi mediated knockdown it was shown that this gene is required for head regeneration (Duffy et al., 2010). In the presence of DAPT, Gahan et al observed that proper heads did not regenerate, similar to Hydra. However, they observed regeneration of the nerve ring around the hypostome indicating the possibility that hypostomes had been regenerated. Unfortunately, this study did not include gene expression data and therefore it is not clear whether Wnt3 expression was affected or not (Gahan et al., 2017).

      …..

      An interesting question was whether regeneration of cnidarian body parts, which are only composed of one module, also requires Notch-signalling. This is certainly true for the Hydra foot, which regenerates fine in the presence of DAPT (Käsbauer et al. 2007). Moreover, we tested head regeneration in Craspedacusta polyps, which do not have tentacles, and show that DAPT does not have an effect on this regeneration process. This corroborates our idea that Notch is required for regeneration in cnidarians, when this process involves two pattern forming processes to produce two independent structures, which are controlled by different signalling modules. This would be the case for the Hydra and for the Hydractinia heads, but not for Craspedacusta. ”

      Moreover, we indicate at the end of our discussion that further studies about head regeneration in Craspedacusta and the genes involved would be desirable. We believe this would be beyond the scope of the current paper and we are working on a new Craspedacusta study.

      “Future studies on expression patterns of the genes that control formation of the Hydra head, including Sp5 and Alx in Craspedacusta could provide insights into the evolution of cnidarian body patterns. Sp5 and Alx appear to be conserved targets of Notch-signalling in the two cnidarians we have investigated. Wnt-3, while being inhibited by Notch-inhibition in Hydra head regenerates, is not a general target of Notch signalling. It was not affected by DAPT in our comparative transcriptome analysis (Moneer et al. 2021b) on uncut Hydra polyps, and it was also not affected by DAPT in regenerating heads of Craspedacusta.”

      • From reading the manuscript I do not fully understand the model the authors put forward. It is unclear what "coordinating two independent pattern forming systems" really means. It might be beneficial to make a schematic illustration of the model and how it goes wrong in both sets of inhibitor treatments. * Answer:

      We have edited the manuscript considerably and explained what we mean with the two pattern forming systems. It starts with the abstract:

      Hydra head regeneration consists of two parts, hypostome/organizer and tentacle development.”

      Thus, in accordance with regeneration of two head structures we find two signaling and gene expression modules with HyWnt3 and HyBMP4 part of a hypostome/organizer module, and BMP5/8, HyAlx and b-catenin part of a tentacle module. We conclude that Notch functions as an inhibitor of tentacle production in order to allow regeneration of hypostome/head organizer.

      “Polyps of Craspedacusta do not have tentacles and thus, after head removal only regenerate a hypostome with a crescent of nematocytes around the mouth opening. This corroborates the idea that Notch-signaling mediates between two pattern forming processes during Hydra head regeneration”

      We have included the description of the organizer concept in the introduction, because we consider this relevant for our model:

      “The “organizer effect” entails a “harmonious interlocking of separate processes which makes up development”, or a side-by-side development of structures independently of each other (Spemann, 1935). In addition to inducing the formation of such structures, the organizer must ensure their patterning (Anderson and Stern, 2016). With reference to Hydra’s hydranth formation after head removal or transplantation, this involves the side-by side induction of hypostome tissue and tentacle tissue. Moreover, it includes the establishment of a regularly organized ring of tentacles with the hypostome doming up in the middle. The function of the Hydra“center of organization” would then be to pattern hypostome and tentacles and to allow for their harmonious re-formation after head removal”.

      In the discussion we integrate the organizer concept with the Gierer-Meinhardt reaction-diffusion models which still explain many aspects of Hydra development.

      Is Notch part of the organizer? The organizer is defined as a piece of tissue with inductive and structuring capacity. Notch is expressed in all cells of Hydra polyps (Prexl et al., 2011) and overexpression of NICD does not induce second axes all over the Hydra body column (Pan et al., 2024), as seen with overexpression of stabilized b-catenin (Gee et al., 2010). Moreover, Notch functions differently during regeneration after apical and basal cuts. Phenotypically during head regeneration in DAPT, we clearly recognize a missing inhibition of tentacle tissue after apical cuts and missing inhibition of head induction after basal cuts (Pan et al., 2024). We would thus rather suggest that the organizer activity of Hydra tissue utilizes Notch-signaling as a mediator of inhibition. As our study of transgenic NICD overexpressing and knockdown polyps had suggested, the localization of Notch signaling cells depends on relative concentrations of Notch- and Notch-ligand proteins, which are established by gradients of signaling molecules that define the Hydra body axis (Pan et al., 2024; Sprinzak et al., 2010) . This is in very good agreement with a ”reaction-diffusion-model” provided by Alfred Gierer and Hans Meinhardt (Gierer and Meinhardt, 1972; Meinhardt and Gierer, 1974) suggesting a gradient of positional values across the Hydra body column. This gradient may determine the activities of two activation/inhibition systems, one for tentacles and one for the head. When the polyps regenerate new heads, Notch could provide inhibition for either system, depending on the position of the cut.

      We provide a new Fig. 8., which clearly illustrates the effects of DAPT and iCRT14 on hypostome and tentacle regeneration.

      Minor: • The abstract could be rewritten to have more of an introduction to the problem rather than jumping directly into results. It would also be beneficial if the abstract followed the logic of the paper.

      Answer: We agree and have re-written the abstract.

      • In Figure 3 and 4 it is not clear why they are divided into A and B. It appears that the categorization of genes into different groups lacks a clear rationale. This seems totally unnecessary. In addition, the order in which the genes are described in the text does not match what is seen in the figure making it confusing to follow. • In Figure 5 the authors use two different types of charts and I would stick with one. B is much better as it shows the individual data points as well as other information. I would use this throughout including in Figure 3 and 4. *

      __Answer: __

      We changed Fig. 3, 4 and 5 according to these comments and now present the data in one format over all three figures, in scatterplots (more detailed answer above).

      We are now describing the results in the order of the figures, with A and B omitted.

      Figure S3 is missing a description of panel C.

      In figure S3 it is not clear why the inhibitor was removed and not kept on throughout the experiment. Please discuss. __Answer: __

      Fig. S3 was removed.

      Figure S4 has no A or B in the figure, only in the legend. __Answer: __

      We have included A and B…

      *Reviewer #1 (Significance (Required)):

      Although some of the authors data appear to be novel I find the study makes only minor progress on the questions. In particular the authors do not properly cite the relevant literature and to put their manuscript into the correct context. The new model proposed by the authors is based entirely on qPCR data which is not thoroughly analyzed and are not strong enough in the absence of information about the spatial expression the genes they discuss. The proposal of HyKayak as a negative regulator of Wnt3 is interesting but the authors do not provide any solid direct evidence for this (ChIP, EMSA etc) and it is somewhat in disagreement with other models of bZIP function in the literature (which again are not discussed).*

      The manuscript is of limited general interest. It has a number of interesting observations which would be of interest to the Hydra community and the broader cnidarian community. The study lacks contextualization within a broader framework, whether it be in the context of regeneration or Wnt/Notch signaling. This limitation may narrow the overall interest in it.

      Answer:

      Our previous analysis of the effect of Notch on head regeneration in Hydra (Münder 2013) had suggested the inhibition model, which is part of Fig. 8. We show now that during head regeneration in Hydra formation of two structures is guided by different signaling/transcription modules, one using Wnt3 and BMP4, but not b-catenin; and one using BMP5/8 and b-catenin. We suggest that Notch functions as an inhibitor “of use” to the organizer when the “two-part” head structure is regenerated.

      We agree that our original manuscript was not well enough written to clearly put it into developmental context. We now focus the discussion of our work sharply on the organizer problem and think that the conclusions are of great general interest. In a simple view they suggest that the function of the Hydra head organizer is to allow harmonious development of head and tentacles, which we consider separate, and on a molecular basis independently regulated parts of the Hydra head. Notch signaling, in our interpretation, is an instrument of the organizer. Our comparison with Craspedacusta illustrates this idea. Craspedacusta only regenerates one head structure, which is possible in the absence of this instrument (also see reviewers 3 and 4).

      Concerning HyKayak, there is no disagreement with other authors as we analyze a fos-gene different from the one discussed by Cazet et al (see above). We have conducted a rescue experiments as suggested by reviewer 3 with the Kayak-inhibitor and with HyKayak shRNAi knockdown, however, rescue of the phenotype was not achieved although HyWnt3 was upregulated after DAPT treatment in the knockdown group. We attribute this to the very strong effect of DAPT. We have adjusted our hypothesis and only suggest that HyKayak could be a target for the Notch-induced repressor genes (e.g.HyHes). We mentioned this failed rescue in the manuscript (answer for see reviewer 3). Further experiments, e.g Chip/EMSA constitute a new project on the basis of these ideas and should be reserved for further studies of the Kayak-function in Hydra.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      *The study investigates the role of Notch and beta-catenin signaling in coordinating head regeneration in Hydra. It combines gene expression dynamics, inhibitor treatments, and comparisons with Craspedacusta polyps to propose a lateral inhibition model for Notch function during Hydra head regeneration, mediating between two pattern-forming systems.

      Three main concerns arise from this work:*

      • Lack of spatial expression data: The study proposes a model based on pattern-forming systems but falls short of providing direct spatial expression data for the genes under consideration in both control and treated scenarios. This gap weakens the empirical support for the proposed model. __Answer:*__

      The expression patterns for most of the presented genes including HyAlx and HyWnt3 in the presence and absence of DAPT have been published before (Münder 2013). Expression patterns for all other genes during regeneration (except Hykayak) are already known from literature. For Hykayak we have included expression data from Siebert et al (single cell transcriptome analysis) in the supplementary material. For iCRT14 treatment, we carried out a FISH-experiment and showed that HyWnt3 is expressed in the normal pattern at the hypostome. For further genes after DAPT and iCRT-treatment in situ hybridisation data are indeed lacking (e.g. BMP5/8). However, we have analyzed some very strongly downregulated regulated genes (e.g. HyAlx completely downregulated by iCRT14, all HyWnts and BMP2-4completely downregulated by DAPT) and for those in situ hybridisation could (1) be difficult due to low expression in treated samples and (2) may not be informative.

      • Clarity and relevance of Craspedacusta comparisons: The section discussing the regeneration in Craspedacusta polyps appears somewhat disjointed from the main narrative, with its contribution to the overarching story of Hydra regeneration remaining unclear. *

      Answer:

      We had not intended to explain gene expression during Craspedacusta head regeneration but wanted to prove our hypothesis that Notch is needed to allow side-by-side development of two newly arising structures, which use different signalling modules during head regeneration. That Notch is __not __needed for the regeneration of Craspedacusta, a polyp without tentacles, appears to strengthen our main hypothesis. In order to connect this point more clearly to the narrative we have included new data. We show that CsWnt3 expression lowers after head removal and rises when the head regenerates, indicating CsWnt3-expression in the head of Craspedacusta polyps. Moreover, we show now that Notch in Craspedacusta may have similar target genes as in Hydra (e.g. Sp5 and Alx), might also affect nematocyte differentiation as in Hydra, but does not inhibit Wnt3 expression. We also acknowledge that a precise understanding of the molecular pathways for head regeneration in Craspedacusta requires further work and have removed the results of iCRT14 treatment because of our lack of knowledge about the role of b-catenin in Craspedacusta patterning. Citations from our changed text are found in the answer to reviewer 1.

      • Accessibility of the text: The study's presentation, including its title, abstract, and main text, presents challenges in terms of clarity and accessibility, making it difficult for readers to follow and understand the research's scope, methodologies, and conclusions.*

      • *

      Answer:

      We agree and have completely re-written the abstract, and large parts of the introduction and discussion (also see above answer for reviewer 1).

      Reviewer #2 (Significance (Required)):

      In conclusion, while the study aims to advance our understanding of the complex signaling pathways governing Hydra head regeneration, it necessitates significant revisions. Enhancing the empirical evidence through detailed spatial patterning data, clarifying the comparative analysis with Craspedacusta polyps, and __refining the narrative __to improve accessibility are critical steps needed to solidify the study's contributions to the field.

      Answer:

      By including Kayak-expression data from Siebert et al and indicating the places of major expression of all analysed genes schematically in the Figs describing the qPCR data we revised our manuscript. We have added new data about Craspedacusta and believe that our re-written manuscript refines the narrative by focusing on the organizer (see answer to reviewer 1).



      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Major comments:

      - In the abstract, the authors assert that their findings "indicate competing pathways for hypostome and regeneration." However, the nature of this competition and its resolution is not adequately elucidated within the manuscript. The term "competition" lacks context and clarity, leaving the reader without a clear understanding of what pathways are competing, for what, and how this competition is resolved during regeneration. Furthermore, this concept is not further explored or referenced throughout the remainder of the manuscript, leaving it somewhat disconnected from the main body of the research. It is recommended that the authors either revise the statement in the abstract to provide more clarity on the competing pathways and their implications for regeneration, or alternatively, if the authors believe there is sufficient evidence to support the claim of competing pathways, they should expand upon this point within the main body of the manuscript. Additional argumentation and evidence would be necessary to substantiate such a claim and provide a deeper understanding of the mechanisms underlying regeneration in Hydra.

      Answer:

      We agree and have removed any reference to “competing” pathways from the abstract and the main text.

      - The abstract makes a significant assertion regarding the mechanism by which Notch signaling impacts the expression of HyWnt3, suggesting that it operates by inhibiting HyKayak-mediated repression of HyWnt3 rather than directly activating transcription at the HyWnt3 promoter. This claim is central to the goals outlined in the study, which aim to elucidate the functioning of Notch signaling in HyWnt3 expression. To bolster this assertion, it would be prudent for the authors to conduct experiments demonstrating the mediating role of Kayak. Specifically, demonstrating that downregulation of Kayak through RNAi can rescue the DAPT-mediated downregulation of Wnt3 would provide strong support for the authors' claim. Additionally, while not strictly necessary, it would be beneficial to investigate whether chemical inhibition of Wnt can rescue the phenotype resulting from Kayak RNAi. Conducting and analyzing such experiments within a 2-3-month revision period should be feasible given that the authors already possess all necessary materials and have developed the required methods. These additional experiments would not only strengthen the evidence supporting the authors' claim but also provide further insights into the regulatory mechanisms at play in Notch signaling and HyWnt3 expression.

      • *

      Answer:

      We have conducted the suggested rescue experiments with the kayak-inhibitor, however, rescue was not achieved. We also tried rescue experiments by combining DAPT treatment and Kayak shRNA knockdown. HyWnt3 was slightly upregulated after DAPT treatment in the Kayak knockdown group but the phenotype could not be rescued. We are therefore now only state that HyKayak could be a target for the Notch-induced repressor genes (e.g.HyHes). We mentioned the failed rescue experiments in the manuscript:

      Results:

      *The up-regulation of HyKayak by DAPT suggests that HyKayak may serve as a potential target gene for Notch-regulated repressors including HyHes and CnGsc, potentially acting as a repressor of HyWnt3 gene transcription. *

      Discussion:

      We therefore suggest that the Hydra Fos-homolog HyKayak inhibits HyWnt3 expression and can be a target for a Notch-induced transcriptional repressor (like HyHes) in the regenerating Hydra head. Nevertheless, we were not able to rescue the DAPT-phenotype by inhibiting HyKayak, neither by using the inhibitor nor by shRNA-treatment, probably due to the strength of the DAPT effect. Therefore, we cannot exclude that Notch activates HyWnt3 directly, or that it represses unidentified Wnt-inhibitors through HyHes or CnGsc.

      - The usage of the term "lateral inhibition" in the title and abstract of the manuscript carries specific implications, as it is commonly associated with distinct mechanisms in the context of Notch signaling and reaction-diffusion systems. Notably, in the Notch signaling context, lateral inhibition typically refers to the amplification of small differences between neighboring cells through direct interactions, facilitated by the limitations of Notch signaling to immediate neighbors. Conversely, in reaction-diffusion systems, such as the Gierer-Meinhardt model, lateral inhibition describes long-range inhibition associated with pattern formation.

      Given this discrepancy, it is crucial for the authors to clarify their interpretation of "lateral inhibition" to avoid ambiguity and ensure accurate understanding. If they are referring to Notch-specific lateral inhibition, they should provide evidence of adjacent localization of Notch and Delta cells to support their argument. Alternatively, if they are invoking the concept of long-range inhibition described by the Gierer-Meinhardt model, they must explain how a membrane-tethered ligand like Notch can exert effects beyond one cell diameter from the signaling center.

      * Regardless of the interpretation chosen by the authors, addressing this clarification will have significant implications for the subsequent treatment of their arguments. Depending on their chosen interpretation, experimental demonstrations may be necessary to substantiate their claims, which could be laborious and time-consuming. However, such demonstrations are essential for establishing the validity of the term "lateral inhibition" as used in the title and abstract of the manuscript.*

      Answer:

      We agree with the reviewer concerning the term “lateral inhibition” and have now removed it. Instead, we have emphasized that our data clearly show during apical regeneration a Notch-mediated inhibition of tentacle tissue formation. We also discuss data from our most recent publication (Pan 2024) showing that this is the opposite at basal cuts, where the loss of Notch function leads to the regeneration of two heads. We then discuss this in the context of the Gierer-Meinhardt Model and in the context of the organizer (also see above in answer to reviewer 1):

      It is true that it is difficult to reconcile the long-range signaling processes, on which the Gierer-Meinhardt model is based with the cell-cell interactions mediated by Notch-signaling. We have now published a mathematical model to explain our understanding of this for the role of Notch during budding and in steady state animals (Pan2024), which is based on work by Sprinzak et al 2010. For head regeneration, we do not have such a model yet. Here we are looking at expression patterns changing over time. Therefore, we assume waves of gene expression, relying on the autoinhibitory function of the HyHes-repressor. This is included in the discussion:

      In addition, the gene expression dynamics for many of the analyzed genes appears in wave-like patterns in some experiments (see Figs S3 and S4). As we have only four time points measured, we cannot draw strong conclusions from these observations, except that some of the deviations in our data points (e.g. 48 hrs HyHes) might be caused by oscillations. Nevertheless, we propose that the dynamic development of gene expression patterns over the time course of regeneration hint at a wave like expression of Notch-target genes (e.g. HyAlx, (Münder et al., 2013; Smith et al., 2000)). Hes-genes have been implicated in mediating waves of gene expression, e.g. during segmentation and as part of the circadian clock (Kageyama et al., 2007). This property is due to the capability of Hes-proteins to inhibit their own promoter. Future models for head regeneration in Hydra should consider the function of Notch to inhibit either module of the regeneration process and the potential for the Notch/Hes system to cause waves of gene expression. Such waves intuitively seem necessary to change the gene expression patterns underlying morphogenesis during the time course of head regeneration.

      - The utilization of Craspedacusta as a comparative model in the argumentation of the manuscript appears somewhat unclear. The authors posit that Notch is essential for organizer emergence in Hydra, while Wnt is not necessary, as indicated by the observed effects of iCRT14 beta-catenin/TCF inhibition. However, in Craspedacusta, which lacks tentacles but possesses an organizer, one might anticipate a conserved requirement for organizer formation but not tentacle development. Therefore, it would be reasonable to expect that Craspedacusta would still form an organizer under iCRT14 treatment but would not depend on Notch signaling, as the necessity to separate tentacle formation from organizer formation is absent. The authors' observation that Craspedacusta fails to form an organizer under iCRT14 treatment partially aligns with these expectations. However, the complexity of the results suggests a need for a deeper understanding of the involvement of different pathways in Craspedacusta. Before applying inhibitors, it would be crucial to elucidate the spatiotemporal differences in the expression of relevant Wnt and Notch pathway components between Hydra and Craspedacusta. This knowledge would provide valuable insights into the specific roles of these pathways in organizer formation and tentacle development in both species, helping to clarify the observed differences in response to iCRT14 treatment. Additionally, considering the possibility of differential sensitivity to iCRT14 (see comment below) between Hydra and Craspedacusta would be essential for accurately interpreting the results and drawing meaningful conclusions regarding the involvement of Notch and Wnt signaling pathways in these processes.

      Answer:

      We have clarified in our re-written manuscript that the organizer functions in Hydra heads and head regeneration to harmonize the development of two independent structures (see answer for reviewer 1) and that Notch-signalling is an instrument to achieve this. Craspedacusta polyps do not have tentacles, thus we do not see two independent structures. Correspondingly, we see that they do not need Notch-signaling. We do not know whether they have organizer tissue, because they are too small to perform transplantation experiments. Similarly, in situ hybridisation experiments to look for CsWnt expression are hard to envisage. What we have now done during the revision of this paper are RT-qPCR experiments to follow the expression of CsWnt3 after head removal until a new head is formed. This indicated the localization of CsWnt3 expression in the head (citations in response to reviewer 1).

      We agree that the role of Wnt/b-catenin for Craspedacusta cannot be sufficiently described with our iCRT14 experiment and therefore removed it. To strengthen the DAPT data, we also examined Craspedacusta homologs of the Hydra Notch-target genes that we had previously identified (Moneer2021). We found that expression of CsSp5 and CsAlx were inhibited by DAPT. This was also true for the nematocyte gene NOWA (see new Fig. 7). In Hydra, DAPT blocks one important differentiation step of nematocytes and therefore the expression of all genes expressed in differentiating capsule precursors, including NOWA is inhibited, while the number of mature capsules does not change. To see the same DAPT effect on NOWA-expression in Craspedacusta reassured us that DAPT had entered the animals and might also have a similar effect on nematocytes as in Hydra.

      Minor comments - The concentration-dependent effects of iCRT14 on beta-catenin signaling, as demonstrated by Gufler et al. 2018, suggest that the efficacy of inhibition may vary depending on the concentration used. Specifically, Gufler et al. found that a concentration of 10µM was sufficient for efficient inhibition of beta-catenin signaling. However, in the current study, the authors utilized a concentration of 5µM of iCRT14. Given the central role of the observed effects, particularly the persistence of Wnt3 expression, in the argumentation of the manuscript, it is plausible that these effects could be attributed to partial inhibition resulting from the lower concentration of iCRT14 used in the study. To address this potential limitation, the authors could consider conducting a quick examination of the effects of 10µM iCRT14 or utilizing other inhibitors of beta-catenin/TCF interaction, such as iCRT3. By comparing the effects of different concentrations or alternative inhibitors, the authors could ascertain whether the observed effects are indeed attributable to partial inhibition from 5µM iCRT14, or if they persist despite higher concentrations or alternative inhibitors. This additional experimentation would provide valuable insights into the specificity and efficacy of the inhibition and strengthen the validity of the conclusions drawn regarding the role of beta-catenin signaling in the observed phenomena.

      Answer:

      The iCRT14 concentration was adjusted to 5 µM because the initial 10µM proved to be too toxic. 5µM also produced the same phenotypes and results as seen before. Cazet et al. also used 5 µM iCRT14 in their study.

      - The use of Generalized Additive Models (GAMs) in Figures 3 and 4 to present the time series qPCR results may introduce some challenges in interpretation due to the potential for distortion of values at specific time points based on neighboring ones. Given the relatively low time resolution of the data, this approach could lead to a distorted depiction of the temporal dynamics. For instance, in Figure 3B, where Wnt3 peaks at 10 hours, the absence of measurements between 8 and 24 hours introduces uncertainty regarding the accuracy and reliability of this peak at 10 hours.

      * To address these concerns and enhance clarity, it may be advisable for the authors to consider presenting the data using simple boxplots instead of GAMs. Boxplots provide a more straightforward visualization of the distribution of data at each time point, allowing for a clearer interpretation of trends and fluctuations over time. This approach would mitigate the potential for distortion introduced by GAMs and provide a more accurate representation of the temporal dynamics observed in the qPCR results*

      • *

      Answer:

      We agree and have changed the data representation to simple scatterplots (see answers for reviewer 1).

      - The comparison of the effects of iCRT14 versus DAPT treatments would benefit from having consistent gene expression data across both treatments. However, in Figure 4A, there are fewer genes tested compared to Figure 3A, with Hes and Kayak omitted. While the authors interpretation suggests that these genes may not change after iCRT14 treatment due to their upstream position in the signaling pathways, it is essential to empirically demonstrate this relationship, as it is central to the conclusions drawn. To address this gap in the analysis, it would be valuable for the authors to provide a time series of differential expression for Hes and Kayak following iCRT14 treatment.

      Answer:

      We have provided a time series for expression of HyHes and HyKayak in responses to iCRT14 treatment during regeneration (see Fig.4).

      “We found that the expression the Notch-target gene HyHes remained similar to control regenerates up to 24 hrs, but then was attenuated (Fig. 4A), possibly due to failure of tentacle boundary formation, the tissue where HyHes is strongly expressed…The expression of HyKayak was decreased at 8 hrs after head removal in the presence of iCRT14, came back to normal up to 36 hrs and was suddenly increased after 48 hrs (Fig. 4E), correlating with inhibition of the HyHes repressor. There were no significant changes in the expression dynamics of HyBMP2/4 and HyBMP5/8b between iCRT14-treated regenerates and controls (Fig. 4F, G).”

      - The analysis of the impact of chemical inhibition of Notch and Wnt signaling in Figure 7 schematic highlights changes in spatial expression patterns of the target genes. However, the interpretation of their impact primarily relies on qPCR data. As evident from Figure 7, when Notch is inhibited, it is anticipated that Kayak expression will shift from the area of the tentacles to the tip. This spatial shift in expression patterns is a critical aspect of the authors' arguments, especially considering the centrality of Kayak in their findings. Notably, similar spatial expression patterns have been demonstrated for Alx using FISH in Pan et al., available on BioRxiv. Given the importance of Kayak in the presented arguments, it is advisable to also investigate its spatial expression patterns using techniques such as FISH.

      • *

      Answer:

      We have, instead of FISH-experiments, included expression data for HyKayak from Siebert et al 2019 (single cell transcriptome data) in Fig. S1D, which show its expression in head- and battery cells (tentacle cells). This is similar to HyAlx. Therefore, Kayak-FISH would be expected to reveal expression of the gene at the tip of the regenerate the whole time, similar to HyAlx, because tentacle gene inhibition or patterning does not occur (see Münder 2013). Due to the failure of our rescue experiment to demonstrate the function of kayak we have omitted kayak from Fig. 8 and only mention in the discussion that it could be a target for Notch activated transcriptional repressors, like HyHes or CnGsc.

      Reviewer #3 (Significance (Required)):

      *The paper introduces novelties to the field of regeneration and developmental biology by leveraging Craspedacusta polyp as a novel model system for investigating the evolutionary and developmental dynamics of tentacles. In doing so, it sheds new light on the intricate mechanisms underlying tentacle formation and patterning. Furthermore, the study implicates Kayak in the regulation of Wnt3, adding a fresh perspective to our understanding of the molecular pathways governing Hydra regeneration. Notably, the results of the research challenge the prevailing notion of autoregulation of Wnt3, which has long been considered fundamental to organizer formation in Hydra. While these findings offer intriguing insights, further investigation will be crucial to conclusively ascertain the validity of this assertion. *

      • *

      Despite the clarity of the data presented, the interpretation and integration of these findings in the manuscript are lacking. The narrative at times feels disjointed, with different storylines loosely connected. While the findings are intriguing and merit publication, a substantial revision of the manuscript is necessary to provide a more coherent and illuminating interpretation of the results. *The implications of this research extend beyond the specific confines of Craspedacusta polyp and Hydra biology. It holds significant relevance for both the Hydra biology community and the broader field of Notch signaling research. *

      By highlighting the pivotal role of Notch signaling in regeneration and patterning within Hydra, the study enriches our comprehension of this model organism and its evolutionary adaptations. Moreover, it provides a valuable lens through which the evolution of Notch signalling cascades can be examined. This interdisciplinary approach underscores the interconnectedness of diverse biological systems and underscores the importance of exploring novel model organisms to unravel the complexities of evolution and development.

      • *

      Answer:

      We have edited the manuscript considerably and re-written the introduction and the discussion parts. We are focusing on integrating this work with the organizer concept in developmental biology, and on the Gierer-Meinhardt-model, and point out that Notch-signaling is required for the development of two head structures by inhibiting the development of either one during head regeneration, which is necessary to enable the development of the other one. Which one is inhibited depends on the positional value of the tissue where the cut occurs. Craspedacusta polyps do only have one structure. We suggest that this is why head regeneration does not require Notch-signalling in Craspedacusta. In contrast, as we have included in our discussion now, Hydractinia polyps, again with head/mouth and tentacles, require Notch-signaling for head regeneration (according to Gahan 2019), see also answers for reviewers 1 and 2.



      Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      Major comments:

      The conclusions from the experiments are drawn accurately, not overstating the results. The main conclusion, that in Hydra Notch pathway mediates between two patterning modules, hypostome and tentacle forming modules, is supported by in situ hybridization and qPCR analyses of hypostome and tentacle specific genes.

      OPTIONAL. Authors hypothesize, that Notch maintains expression of Wnt3 vie its targets, transcriptional repressors Goosecoid or Hes, which halt the expression of Wnt3 repressor HyKayak. Epistatic relationships between Notch, Goosecoid or Hes and HyKayak could be tested, first, by combining pharmacological inhibition of Notch by DAPT with shRNA-mediated knockdown and second, in double knockdowns generated by electroporating shRNAs for two genes simultaneously. If the proposed in the pathway relationships are correct the repressive effect of DAPT treatment on an organizer regeneration should be rescued in HyKayak shRNA-mediated knockdown. Regeneration of an organizer also should occur in Notch/HyKayak and Goosecoid (Hes)/HyKayak shRNA-mediated double knockdowns. Electroporation of shRNAs for multiple genes is an effective and quick way to generate double and triple knockdowns. The proposed experiments will much strengthen the conclusions drawn from this study. Given that the authors have successfully used shRNA-mediated technique to generate HyKayak knockdown animals, they should be able to complete the proposed experiments within in a couple of months. Answer:

      We very much like the suggested strategy to probe the regeneration pathways by shRNA-mediated knockdown experiments- this will be a basis for future investigations.

      We conducted the suggested rescue experiment by combining the DAPT treatment and Kayak shRNA knockdown. HyWnt3 was slightly upregulation after DAPT treatment in the Kayak knockdown group. However, this upregulation did not rescue the organizer’s regeneration. We think that the effect of DAPT is too strong. We have included this in the discussion of our results (see answer for reviewer 2).

      • The data are presented in a logical and clear manner. The paper is easy to read, and the conclusions are explicit for each experimental section. The methodology is described in detail and should be easy to reproduce.*

      • All experiments are done with multiple biological and technical replicates. However, the description of statistical analysis used in each case is missing, p values and error bars are missing in Fig. 2B and Fig. S4. Author should add this information in the main text or in the figure legends.*

      Answer:

      The statistical information was now added in the methods section.

      Minor comments:

      • Fig. 1E. It would be more convincing to present tentacle and hypostome regeneration data separately, comparing hypostome regeneration in treated animals with DMSO control, and in a separate analysis comparing tentacle regeneration with control. Provide the description of statistical method, p values and error bars. If authors prefer to stick to the current way of presenting they should also provide description of statistical analysis used and statistical data.*
      • *

      Answer:

      We changed the representation in Fig. 1E. We now use scatter plots in the main text with p-values added, and explained the statistics of the GAM representation in the supplementary material.

      • Results, section 4 Kayak. Authors use T5424 inhibitor to block the potential interactions between HyKayak with HyJun. The resulted increase in Wnt3 expression measured by qPCR clearly supports the idea of HyKayak being a repressor of Wnt3. However, authors are going further and present the phenotype of T5424 treatment, shortening of the tentacles. Many factors can influence the length of the tentacles. For example, shortening of tentacles is a strong indication of poisoning or animal being in general unwell. At a concentration double of the one used in the experiment T5424 causes a disintegration of the animals (Fig. 3S). It would be more convincing if the authors could provide an in situ hybridization image showing an expansion of Wnt3 expression domain down the hypostome. This is the result one would expect from the inhibition of HyKayak which, according to the proposed mechanism, restricts Wnt3 spatial expression to the most apical portion of the regenerating tip. Alternatively, authors could try to see if T5424 rescues the inhibition of an organizer formation resulted by DAPT treatment. The latter experiment might be difficult to perform due to a possible toxic effect of multidrug treatment. I suggest that authors either include the proposed experiments or leave the results of the Fig S3 out.*

      Answer:

      According to this suggestion we have removed the phenotypes of polyps after treatment with T5424.

      • Results, section 3.2, paragraph 4. 'This also applies for the suggested Hydra organizer gene CnGsc, and BMP2/4 (Broun, Sokol et al. 1999). Please, insert the citation for BMP2/4.*

      • *

      Answer:

      We inserted the citation for BMP2-4 (Watanabe 2014).


      Reviewer #4 (Significance (Required)):

      *Significance:

      The current study is a continuation of the author's previous work where they have characterized Notch pathway in Hydra and showed its role in the regeneration of an organizer and patterning of Hydra head. Here, they present the study of Notch pathway in the context of b-catenin pathway, a pathway that has been shown to be essential for the axis induction and patterning in Hydra. The authors challenge this dogma and show, that during head regeneration b-catenin transcriptional activity is not required either to maintain the expression of wnt3 nor to acquire an inductive activity of the regenerating organizer. Second, they show, that transcriptional fos-related factor Kayak is negatively regulated by Notch-signaling and, in turn, represses transcription of Wnt3. Based on those findings authors propose a function of the Notch pathway in Hydra head regeneration, particularly in spatial separation of the hypostome/organizer module from the tentacle module. The role of Notch pathway in lateral inhibition is well documented in bilaterians. However, in Cnidaria, a sister group to Bilateria, the function of Notch was so far restricted to neurogenesis. This study is very important for our understanding of the evolution of morphogenesis as it shows the ancient role that the Notch pathway is playing in axial patterning, possibly, through lateral inhibition.

      This study can be of a great interest to both researchers specializing in cnidarian development and to a broader audience interested in the evolution of morphogenesis.*

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      This is a very nice study exploring the function of Notch pathway in Hydra, a member of early Metazoa Cnidaria. The main conclusions of the study are:

      1. -catenin pathway is required not for regeneration of Hydra hypostome/ the organizer, as previously thought, but rather for regeneration of tentacles and correct patterning of Hydra head.
      2. During head regeneration Notch pathway, possibly, through lateral inhibition, blocks tentacle fate at the most apical region of the regenerating tip, allowing a hypostome/an organizer to develop. This might occur via the targets of Notch pathway, transcriptional repressors Goosecoid and Hes.
      3. During head regeneration Notch pathway is required to maintain the expression of Hydra organizer gene Wnt3 as well as other canonical wnt genes. This occurs, possibly, through repression of HyKayak, Hydra homologue of a transcriptional fos-related factor Kayak, that, in turn, represses Wnt3.

      Major comments:

      1. The conclusions from the experiments are drawn accurately, not overstating the results. The main conclusion, that in Hydra Notch pathway mediates between two patterning modules, hypostome and tentacle forming modules, is supported by in situ hybridization and qPCR analyses of hypostome and tentacle specific genes.
      2. OPTIONAL. Authors hypothesize, that Notch maintains expression of Wnt3 vie its targets, transcriptional repressors Goosecoid or Hes, which halt the expression of Wnt3 repressor HyKayak. Epistatic relationships between Notch, Goosecoid or Hes and HyKayak could be tested, first, by combining pharmacological inhibition of Notch by DAPT with shRNA-mediated knockdown and, second, in double knockdowns generated by electroporating shRNAs for two genes simultaneously. If the proposed in the pathway relationships are correct the repressive effect of DAPT treatment on an organizer regeneration should be rescued in HyKayak shRNA-mediated knockdown. Regeneration of an organizer also should occur in Notch/HyKayak and Goosecoid(Hes)/HyKayak shRNA-mediated double knockdowns. Electroporation of shRNAs for multiple genes is an effective and quick way to generate double and triple knockdowns. The proposed experiments will much strengthen the conclusions drawn from this study. Given that the authors have successfully used shRNA-mediated technique to generate HyKayak knockdown animals, they should be able to complete the proposed experiments within in a couple of months.
      3. The data are presented in a logical and clear manner. The paper is easy to read, and the conclusions are explicit for each experimental section. The methodology is described in detail and should be easy to reproduce.
      4. All experiments are done with multiple biological and technical replicates. However, the description of statistical analysis used in each case is missing, p values and error bars are missing in Fig. 2B and Fig. S4. Author should add this information in the main text or in the figure legends.

      Minor comments:

      1. Fig. 1E. It would be more convincing to present tentacle and hypostome regeneration data separately, comparing hypostome regeneration in treated animals with DMSO control, and in a separate analysis comparing tentacle regeneration with control. Provide the description of statistical method, p values and error bars. If authors prefer to stick to the current way of presenting they should also provide description of statistical analysis used and statistical data.
      2. Results, section 4 Kayak. Authors use T5424 inhibitor to block the potential interactions between HyKayak with HyJun. The resulted increase in Wnt3 expression measured by qPCR clearly supports the idea of HyKayak being a repressor of Wnt3. However, authors are going further and present the phenotype of T5424 treatment, shortening of the tentacles. Many factors can influence the length of the tentacles. For example, shortening of tentacles is a strong indication of poisoning or animal being in general unwell. At a concentration double of the one used in the experiment T5424 causes a disintegration of the animals (Fig. 3S). It would be more convincing if the authors could provide an in situ hybridization image showing an expansion of Wnt3 expression domain down the hypostome. This is the result one would expect from the inhibition of HyKayak which, according to the proposed mechanism, restricts Wnt3 spatial expression to the most apical portion of the regenerating tip. Alternatively, authors could try to see if T5424 rescues the inhibition of an organizer formation resulted by DAPT treatment. The latter experiment might be difficult to perform due to a possible toxic effect of multidrug treatment. I suggest that authors either include the proposed experiments or leave the results of the Fig S3 out.
      3. Results, section 3.2, paragraph 4. 'This also applies for the suggested Hydra organizer gene CnGsc, and BMP2/4 (Broun, Sokol et al. 1999). Please, insert the citation for BMP2/4.

      Significance

      The current study is a continuation of the author's previous work where they have characterized Notch pathway in Hydra and showed its role in the regeneration of an organizer and patterning of Hydra head. Here, they present the study of Notch pathway in the context of -catenin pathway, a pathway that has been shown to be essential for the axis induction and patterning in Hydra. The authors challenge this dogma and show, that during head regeneration -catenin transcriptional activity is not required either to maintain the expression of wnt3 nor to acquire an inductive activity of the regenerating organizer. Second, they show, that transcriptional fos-related factor Kayak is negatively regulated by Notch signaling and, in turn, represses transcription of Wnt3. Based on those findings authors propose a function of the Notch pathway in Hydra head regeneration, particularly in spatial separation of the hypostome/organizer module from the tentacle module. The role of Notch pathway in lateral inhibition is well documented in bilaterians. However, in Cnidaria, a sister group to Bilateria, the function of Notch was so far restricted to neurogenesis. This study is very important for our understanding of the evolution of morphogenesis as it shows the ancient role that the Notch pathway is playing in axial patterning, possibly, through lateral inhibition.

      This study can be of a great interest to both researchers specializing in cnidarian development and to a broader audience interested in the evolution of morphogenesis.

      The reviewer's field of expertise includes cnidarian development, axial patterning and morphogenesis in Hydra, biochemical pathways in Hydra axial patterning, Hippo pathway regulation and tissue patterning in multiple organisms

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In the reviewed study by Steichele et al., the authors investigate the roles of Notch and Wnt signaling pathways in the regeneration process of Hydra hypostome and tentacles. They observe that treatment with DAPT prevents the emergence of an organizer and hypostome, whereas inhibition of beta-catenin/TCF interaction does not affect organizer formation but hinders tentacle development. The authors delve into the molecular mechanisms underlying these observations, examining the impact of different treatments on the expression of genes crucial for organizer and tentacle emergence. Notably, they identify Kayak as a target of Notch signaling, which is upregulated following DAPT treatment. Furthermore, RNAi against Kayak results in the overexpression of Wnt3, suggesting an inhibitory role of Kayak on Wnt3 expression. This inhibitory effect is corroborated using the Fos/Jun inhibitor T5224. In their investigation, the authors compare the effects of Notch and Wnt signaling inhibition using a polyp species, Craspedacusta, which possesses an organizer but lacks tentacles. They find that while Notch inhibition does not affect this polyp, inhibition of canonical Wnt signaling does.

      Major comments:

      • In the abstract, the authors assert that their findings "indicate competing pathways for hypostome and regeneration." However, the nature of this competition and its resolution is not adequately elucidated within the manuscript. The term "competition" lacks context and clarity, leaving the reader without a clear understanding of what pathways are competing, for what, and how this competition is resolved during regeneration. Furthermore, this concept is not further explored or referenced throughout the remainder of the manuscript, leaving it somewhat disconnected from the main body of the research.

      It is recommended that the authors either revise the statement in the abstract to provide more clarity on the competing pathways and their implications for regeneration, or alternatively, if the authors believe there is sufficient evidence to support the claim of competing pathways, they should expand upon this point within the main body of the manuscript. Additional argumentation and evidence would be necessary to substantiate such a claim and provide a deeper understanding of the mechanisms underlying regeneration in Hydra.<br /> - The abstract makes a significant assertion regarding the mechanism by which Notch signaling impacts the expression of HyWnt3, suggesting that it operates by inhibiting HyKayak-mediated repression of HyWnt3 rather than directly activating transcription at the HyWnt3 promoter. This claim is central to the goals outlined in the study, which aim to elucidate the functioning of Notch signaling in HyWnt3 expression. To bolster this assertion, it would be prudent for the authors to conduct experiments demonstrating the mediating role of Kayak. Specifically, demonstrating that downregulation of Kayak through RNAi can rescue the DAPT-mediated downregulation of Wnt3 would provide strong support for the authors' claim. Additionally, while not strictly necessary, it would be beneficial to investigate whether chemical inhibition of Wnt can rescue the phenotype resulting from Kayak RNAi. Conducting and analyzing such experiments within a 2-3-month revision period should be feasible given that the authors already possess all necessary materials and have developed the required methods. These additional experiments would not only strengthen the evidence supporting the authors' claim but also provide further insights into the regulatory mechanisms at play in Notch signaling and HyWnt3 expression. - The usage of the term "lateral inhibition" in the title and abstract of the manuscript carries specific implications, as it is commonly associated with distinct mechanisms in the context of Notch signaling and reaction-diffusion systems. Notably, in the Notch signaling context, lateral inhibition typically refers to the amplification of small differences between neighboring cells through direct interactions, facilitated by the limitations of Notch signaling to immediate neighbors. Conversely, in reaction-diffusion systems, such as the Gierer-Meinhardt model, lateral inhibition describes long-range inhibition associated with pattern formation.

      Given this discrepancy, it is crucial for the authors to clarify their interpretation of "lateral inhibition" to avoid ambiguity and ensure accurate understanding. If they are referring to Notch-specific lateral inhibition, they should provide evidence of adjacent localization of Notch and Delta cells to support their argument. Alternatively, if they are invoking the concept of long-range inhibition described by the Gierer-Meinhardt model, they must explain how a membrane-tethered ligand like Notch can exert effects beyond one cell diameter from the signaling center.

      Regardless of the interpretation chosen by the authors, addressing this clarification will have significant implications for the subsequent treatment of their arguments. Depending on their chosen interpretation, experimental demonstrations may be necessary to substantiate their claims, which could be laborious and time-consuming. However, such demonstrations are essential for establishing the validity of the term "lateral inhibition" as used in the title and abstract of the manuscript. - The utilization of Craspedacusta as a comparative model in the argumentation of the manuscript appears somewhat unclear. The authors posit that Notch is essential for organizer emergence in Hydra, while Wnt is not necessary, as indicated by the observed effects of iCRT14 beta-catenin/TCF inhibition. However, in Craspedacusta, which lacks tentacles but possesses an organizer, one might anticipate a conserved requirement for organizer formation but not tentacle development. Therefore, it would be reasonable to expect that Craspedacusta would still form an organizer under iCRT14 treatment but would not depend on Notch signaling, as the necessity to separate tentacle formation from organizer formation is absent.

      The authors' observation that Craspedacusta fails to form an organizer under iCRT14 treatment partially aligns with these expectations. However, the complexity of the results suggests a need for a deeper understanding of the involvement of different pathways in Craspedacusta. Before applying inhibitors, it would be crucial to elucidate the spatiotemporal differences in the expression of relevant Wnt and Notch pathway components between Hydra and Craspedacusta. This knowledge would provide valuable insights into the specific roles of these pathways in organizer formation and tentacle development in both species, helping to clarify the observed differences in response to iCRT14 treatment. Additionally, considering the possibility of differential sensitivity to iCRT14 (see comment below) between Hydra and Craspedacusta would be essential for accurately interpreting the results and drawing meaningful conclusions regarding the involvement of Notch and Wnt signaling pathways in these processes.

      Minor comments

      • The concentration-dependent effects of iCRT14 on beta-catenin signaling, as demonstrated by Gufler et al. 2018, suggest that the efficacy of inhibition may vary depending on the concentration used. Specifically, Gufler et al. found that a concentration of 10µM was sufficient for efficient inhibition of beta-catenin signaling. However, in the current study, the authors utilized a concentration of 5µM of iCRT14. Given the central role of the observed effects, particularly the persistence of Wnt3 expression, in the argumentation of the manuscript, it is plausible that these effects could be attributed to partial inhibition resulting from the lower concentration of iCRT14 used in the study. To address this potential limitation, the authors could consider conducting a quick examination of the effects of 10µM iCRT14 or utilizing other inhibitors of beta-catenin/TCF interaction, such as iCRT3. By comparing the effects of different concentrations or alternative inhibitors, the authors could ascertain whether the observed effects are indeed attributable to partial inhibition from 5µM iCRT14, or if they persist despite higher concentrations or alternative inhibitors. This additional experimentation would provide valuable insights into the specificity and efficacy of the inhibition and strengthen the validity of the conclusions drawn regarding the role of beta-catenin signaling in the observed phenomena.
      • The use of Generalized Additive Models (GAMs) in Figures 3 and 4 to present the time series qPCR results may introduce some challenges in interpretation due to the potential for distortion of values at specific time points based on neighboring ones. Given the relatively low time resolution of the data, this approach could lead to a distorted depiction of the temporal dynamics. For instance, in Figure 3B, where Wnt3 peaks at 10 hours, the absence of measurements between 8 and 24 hours introduces uncertainty regarding the accuracy and reliability of this peak at 10 hours.

      To address these concerns and enhance clarity, it may be advisable for the authors to consider presenting the data using simple boxplots instead of GAMs. Boxplots provide a more straightforward visualization of the distribution of data at each time point, allowing for a clearer interpretation of trends and fluctuations over time. This approach would mitigate the potential for distortion introduced by GAMs and provide a more accurate representation of the temporal dynamics observed in the qPCR results - The comparison of the effects of iCRT14 versus DAPT treatments would benefit from having consistent gene expression data across both treatments. However, in Figure 4A, there are fewer genes tested compared to Figure 3A, with Hes and Kayak omitted. While the authors interpretation suggests that these genes may not change after iCRT14 treatment due to their upstream position in the signaling pathways, it is essential to empirically demonstrate this relationship, as it is central to the conclusions drawn. To address this gap in the analysis, it would be valuable for the authors to provide a time series of differential expression for Hes and Kayak following iCRT14 treatment. - The analysis of the impact of chemical inhibition of Notch and Wnt signaling in Figure 7 schematic highlights changes in spatial expression patterns of the target genes. However, the interpretation of their impact primarily relies on qPCR data. As evident from Figure 7, when Notch is inhibited, it is anticipated that Kayak expression will shift from the area of the tentacles to the tip. This spatial shift in expression patterns is a critical aspect of the authors' arguments, especially considering the centrality of Kayak in their findings. Notably, similar spatial expression patterns have been demonstrated for Alx using FISH in Pan et al., available on BioRxiv. Given the importance of Kayak in the presented arguments, it is advisable to also investigate its spatial expression patterns using techniques such as FISH.

      Significance

      The paper introduces novelties to the field of regeneration and developmental biology by leveraging Craspedacusta polyp as a novel model system for investigating the evolutionary and developmental dynamics of tentacles. In doing so, it sheds new light on the intricate mechanisms underlying tentacle formation and patterning. Furthermore, the study implicates Kayak in the regulation of Wnt3, adding a fresh perspective to our understanding of the molecular pathways governing Hydra regeneration. Notably, the results of the research challenge the prevailing notion of autoregulation of Wnt3, which has long been considered fundamental to organizer formation in Hydra. While these findings offer intriguing insights, further investigation will be crucial to conclusively ascertain the validity of this assertion.

      Despite the clarity of the data presented, the interpretation and integration of these findings in the manuscript are lacking. The narrative at times feels disjointed, with different storylines loosely connected. While the findings are intriguing and merit publication, a substantial revision of the manuscript is necessary to provide a more coherent and illuminating interpretation of the results.

      The implications of this research extend beyond the specific confines of Craspedacusta polyp and Hydra biology. It holds significant relevance for both the Hydra biology community and the broader field of Notch signaling research. By highlighting the pivotal role of Notch signaling in regeneration and patterning within Hydra, the study enriches our comprehension of this model organism and its evolutionary adaptations. Moreover, it provides a valuable lens through which the evolution of Notch signaling cascades can be examined. This interdisciplinary approach underscores the interconnectedness of diverse biological systems and underscores the importance of exploring novel model organisms to unravel the complexities of evolution and development.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The study investigates the role of Notch and beta-catenin signaling in coordinating head regeneration in Hydra. It combines gene expression dynamics, inhibitor treatments, and comparisons with Craspedacusta polyps to propose a lateral inhibition model for Notch function during Hydra head regeneration, mediating between two pattern-forming systems.

      Three main concerns arise from this work:

      1. Lack of spatial expression data: The study proposes a model based on pattern-forming systems but falls short of providing direct spatial expression data for the genes under consideration in both control and treated scenarios. This gap weakens the empirical support for the proposed model.
      2. Clarity and relevance of Craspedacusta comparisons: The section discussing the regeneration in Craspedacusta polyps appears somewhat disjointed from the main narrative, with its contribution to the overarching story of Hydra regeneration remaining unclear.
      3. Accessibility of the text: The study's presentation, including its title, abstract, and main text, presents challenges in terms of clarity and accessibility, making it difficult for readers to follow and understand the research's scope, methodologies, and conclusions.

      Significance

      In conclusion, while the study aims to advance our understanding of the complex signaling pathways governing Hydra head regeneration, it necessitates significant revisions. Enhancing the empirical evidence through detailed spatial patterning data, clarifying the comparative analysis with Craspedacusta polyps, and refining the narrative to improve accessibility are critical steps needed to solidify the study's contributions to the field.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Steichele et al tackle a long standing question about the precise role of Notch signalling during Hydra head regeneration. They compare the inhibition of Wnt signalling and Notch signalling by pharmacological inhibition. The authors show that inhibition of Wnt signalling blocks tentacle formation but not formation of the hypostome, wnt3 expression or organizer activity. The authors further attempt to understand this in a comparative sense using Craspedacusta. In addition, the authors propose HyKayak as a potential repressor of Wnt3 expression downstream of Notch signalling. there are howerver a number of major and minor problems with the study which must be addressed before it is suitable for publication as outlined below:

      Major:

      • The introduction is lacking a full description of what is known about transcriptional changes during Hydra regeneration and in particular the role of Wnt signalling in this process. Of note the authors do not cite several important studies from recent years including (but not limited to):

      https://doi.org/10.1073/pnas.2204122119

      https://doi.org/10.1186/s13072-020-00364-6

      https://doi.org/10.1101/587147

      https://doi.org/10.7554/eLife.60562

      This problem is further compounded later when the authors do not cite/discuss work which has performed the same or similar analyses to their own. The authors should endeavor to give a more comprehensive background knowledge. - The authors do not cite or reference at all the study by Cazet et al. which used iCRT14 along with RNAseq and ATACseq to probe the role of Wnt signaling during early regeneration. This is a major issue. Although I appreciate that the authors have done much longer time courses and that their data therefore add something to our understanding it will still be important to discuss here. For example, the authors show that Wnt3 is activated normally in iCRT14 animals. Is this also seen in the RNAseq from Cazet et al., - The visualizations used in Figure 3 are quite difficult to interpret and do to in all cases match descriptions in the text. The way the same type data is displayed in figure 5 si much nices.It is also better to treat the same types of data in the same manner consistently throught the paper. For Hes, for example, the authors state that there is a reduction although the data shows that this is very small and taking into account the 95% confidence interval does not seem to be significant. If this is the case then the positive control is not working in this experiment. This would be much clear if individual time points were compared like in figure 5 and statistical tests shown. The authors then state that Alx is not affected but there is actually a larger effect than what they deemed significant for Hes ( the axes are notably different between these two and I think a more consistent axis would make the genes more comparable). Similarly, Gsc is described as being not affected at 8 hours but it appears again to change more that the positive control Hes. Given this I would call into question the validity of this dataset and/or the interpretation by the authors. A more thorough analysis including taking better into account statistical significance would go along way to increasing confidence in this data. - The same issues in interpretation described for Figure 3 also apply to figure 4. The authors state that Wnt7 is affected less than Wnt1 and 3 but this is not evident in the figure and no comparative analysis is performed to confirm this. The same for Wnt 11 and 9/10c where what the authors description is very difficult to see in the figure. Spt5 is apparently upregulated, but this is not discussed. Again the axes are ntably different making it even more difficult to compare between samples. - In their description of figure 4 the authors completely omit to discuss the Cazet et al dataset which has the exact same early timepoints for iCRT14 treatment. This must be discussed and compared and any difference noted. - End of page 11: The authors propose a model thereby the role of Notch in Wnt3 expression may be due to the presence of a repressor. However, I don't see any putative evidence at that stage. The authors also do not cite relevant work from both Cazet et al. and Tursch et al which show that Wnt3 is likely upregulated by bZIP TFs. In both these cases the authors show evidence of bZIP TF binding sites in the Wnt3 promoter along with other analyses. This is very relevant to the model presented by the authors here and must be discussed and compared. In particular the authors put forward HyKayak as an inhibitor of Wnt3 and this should be discussed along with the previous work. - On page 12 the authors conclude based on gene expression in inhibitor treatment that there is a change in complex composition of the two transcription factors. This is something which would require biochemical evidence and I therefore suggest they remove this entirely. - The authors use experiments in Craspedacusta to test their hypothesis of the role of Wnt and Notch signaling in Hydra. There is, in my opinion, an incorrect logic here. Regardless of the outcome, the roles of Wnt and Notch could conceivably be different in the two species and therefore testing hypothesis from one is not possible in the other. The authors should reframe their discussion of this to be more of a comparative framework. Moreover, the results do not necessarily indicate what the authors say. In Hydra Notch signaling is required to form the hypostome/mouth and this is not the case in Craspedacusta while Wnt signaling is required. The authors do not cite an important study from another Hydrozoan Hydractinia (Gahan at al.,2017). In that study the authors show that DAPT inhibits tentacles during regeneration but that the hypostome (or at least the arrangement of neurons and cnidocytes around the mouth) forms normally. This would indicate that in Hydractinia the process of head formation is different to in Hydra and would be congruent with what is shown here in Craspedacusta. This should be more thoroughly discussed, and all relevant literature cited. - From reading the manuscript I do not fully understand the model the authors put forward. It is unclear what "coordinating two independent pattern forming systems" really means. It might be beneficial to make a schematic illustration of the model and how it goes wrong in both sets of inhibitor treatments.

      Minor:

      • The abstract could be rewritten to have more of an introduction to the problem rather than jumping directly into results. It would also be beneficial if the abstract followed the logic of the paper.
      • In Figure 3 and 4 it is not clear why they are divided into A and B. It appears that the categorization of genes into different groups lacks a clear rationale .This seems totally unnecessary. In addition, the order in which the genes are described in the text does not match what is seen in the figure making it confusing to follow.
      • In Figure 5 the authors use two different types of charts and I would stick with one. B is much better as it shows the individual data points as well as other information. I would use this throughout including in Figure 3 and 4.
      • Figure S3 is missing a description of panel C.
      • In figure S3 it is not clear why the inhibitor was removed and not kept on throughout the experiment. Please discuss.
      • Figure S4 has no A or B in the figure, only in the legend.

      Significance

      Although some of the authors data appear to be novel I find the study makes only minor progress on the questions. In particular the authors do not properly cite the relevant literature and to put their manuscript into the correct context. The new model proposed by the authors is based entirely on qPCR data which is not thoroughly analyzed and are not strong enough in the absence of information about the spatial expression the genes they discuss. The proposal of HyKayak as a negative regulator of Wnt3 is interesting but the authors do not provide any solid direct evidence for this (ChIP, EMSA etc) and it is somewhat in disagreement with other models of bZIP function in the literature (which again are not discussed).

      The manuscript is of limited general interest. It has a number of interesting observations which would be of interest to the Hydra community and the broader cnidarian community. The study lacks contextualization within a broader framework, whether it be in the context of regeneration or Wnt/Notch signaling. This limitation may narrow the overall interest in it.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      I have mixed feelings regarding this manuscript. On the one hand, the authors did an impressive amount of work. On the other hand, the manuscript seems overly descriptive (writing should be more concise) without a clear message or hypothesis that is cohesive to all the presented evidence. Below, I will outline my concerns.

      We appreciate the comment about missing a cohesive presentation. We worked extensively to improve that in the revised manuscript.

      Reviewer #1- first part

      1. I am not an expert in the field of viral biology and immunology. I wonder how well the IFN treatment mimics the cellular response to infection (yet without the virus). Also, how good is ruxolitinib at blocking the IFN response ? I would appreciate it if you could explain both with one or two sentences and provide the necessary references.

      The reviewer is correct that we cannot claim that interferon treatment mimics exactly the cellular response. However, the expression of interferon-stimulated genes (ISGs) is a major arm of the antiviral response to HCMV (c.f. doi:10.3390/v10090447, doi:10.2217/fvl-2018-0189). In addition, Ruxolitinib is a potent and selective Janus kinase 1 and 2 inhibitor (doi:10.1021/ol900350k), and we have shown in the past that it very effectively reduces the expression of many ISGs (doi: 10.1038/s41590-018-0275-z). Since ISGs constitute a major part of the host response to HCMV infection, the fact that their expression leads to minor changes in the tRNA pool strongly suggests that it is mainly the virus (as opposed to the host cell) that mediates the changes seen in the tRNA pools during HCMV infection. In the revised version, these claims were amended, and relevant references were added (pages 5, lines 132-136).

      (MAJOR) Can these two treatments really allow the effects of host response and viral infection to be separated? OR in other words, are these two effects really orthogonal? In my opinion, they are NOT. Fig. 1E seems to support my opinion, as the changes seen for the "IFN" sample relative to the "uninfected" sample (referred to as "changes-A" below), are parallel to the changes seen for the "24hpi + ruxo" sample relative to the "24hpi" sample ("changes-B"). More specifically, changes-A represent the host response, as argued by the author, whereas changes-B represent the elimination of the host response (due to ruxo, conditioned on the virus-driven effect). If the virus-driven effect and the host response could really be separated, one would expect changes-A and changes-B are more or less opposite. However, they appeared to be parallel, suggesting that uninfected versus infected conditions can have totally different (even opposite) host responses. More importantly, if one cannot separate the host response from virus-driven effects, the conclusion of "tRNA changes are driven by virus, not host response" is then unfounded.

      This is an important point to clarify. Changes-A indeed represent the effect of the host antiviral response on the tRNA pool. Changes-B, however, represent a mix of two effects. 1: counteracting the effect of the host antiviral response on the tRNA pool, which we show is a minor effect, and 2: The enhanced effect of the virus, since ruxolitinib, by inhibiting the host antiviral response, enhances the viral infection. It may indeed be that both the virus and the host antiviral effects are in the same direction. However, it is clear that the antiviral effect is minor. Thus, it is likely that the second effect of ruxolitinib (i.e., allowing enhanced viral infection) is the more substantial one. Therefore, it seems as though the viral effect and the elimination of the host effect are in the same direction. This point was clarified in the revised version (page 6, lines 145-146).

      Even if we let go of this previous point and accept that these results indeed offer some support for the notion that the virus-driven effect are the main contributor to the shifts in tRNA pool, the support is at best moderate. A big gap here is "how?" I suggest the authors should at least give some insight on how virus can do that in Discussion (and mention it with one sentence in Results).

      We certainly welcome the challenge, which we now meet in the revision. In short, here, transcription regulation of tRNAs, mainly upon viral infection, is poorly studied. Unlike other herpesviruses, HCMV does not cause a host shut-off of the host transcripts. Upon HCMV infection, the tRNA transcription machinery is upregulated significantly, which probably contributes to the upregulation in pre-tRNA (doi.org/10.1016/j.semcdb.2023.01.011). However, it is still unknown what the viral factors are that promote upregulation in the tRNA transcription machinery. We now relate to this point in the results (page 6, lines 147-148) and discuss the known effects of viral infection of tRNA expression in the discussion section (page 15, lines 447-451).

      The authors compared the HCMV codon usage to the proliferation and differentiation signatures of human cells. But these two signatures are not compared with measured tRNA expression. It might shed some light on the general characteristics of tRNA pool shifts due to infection (towards a proliferation-like or differentiation-like signature). This fits in the general topic of virus-host interaction and might give more evidence for the point that HMCV is adapted to a differentiation signature (as it drives the host into that state).

      We performed the analysis suggested by the reviewer. We found that the tRNA pool of uninfected HFF cells correlated to the same extent with proliferation codon usage (r=0.29, p-value=0.029) and differentiation codon usage (r=0.26, p-value=0.05). Similar correlations to the proliferation and differentiation signature were found when analyzing the tRNA pool 72h post-infection (proliferation r=0.33, p-value=0.011, differentiation r=0.28, p-value=0.034). This result suggests no general shift in the tRNA pool towards a specific codon usage signature.

      How is the dashed box in Fig3A/B chosen?

      We determined the dashed lines based on the most prominent groups of transcripts best adapted to proliferation or differentiation codon usage signatures. Figure S3A clearly shows the two groups without viral genes. We emphasize this point in the legend of Figure S3A (page 36, lines 1157).

      The tAI values shown in Fig3C-E are extremely low (compared to other reports I am aware of). Does this mean that the adaptation of viral codon usage to human cell supply is actually very weak? This is in opposition to the major claims made in this section.

      We acknowledge that the tAI values presented here are lower than typically presented. However, this is due to how tAI was calculated rather than the potential weak adaptation between viral genes and tRNA supply. Specifically, unlike previous works that estimate tRNA availability based on tRNA gene copy number, here we calculated tAI using tRNA sequencing (in order to capture the dynamics in the tRNA pool during infection). Indeed, the value of tAI calculated by tRNA read counts is lower than tAI calculated by tRNA copy number. This is due to the skewed distribution of tRNA read counts (some tRNAs are highly expressed, and others are lowly expressed), while tRNA copy number is distributed more evenly. Thus, due to the mathematical nature of the tAI (computing geometric rather than arithmetic average of tRNA availability), the skewed distribution observed in the data results in lower tAI values. When computing tAI based on gene copy number, we get higher tAI values (0.3 on average). Nevertheless, as all tAI calculations here were done similarly, the comparisons between gene groups or genes are valid.

      I believe that the part about SARS-CoV-2 could be made more concise. It is sufficient to mention that results may differ from those obtained with HCMV in one paragraph.

      The section on SARS-Cov-2 is now made rather succinct. This virus is mainly given as a comparison to the primary virus studied in this paper - HCMV.

      Line 299 on page 11 - I do not believe codon usage between different viruses can be directly compared, let alone reaching such a conclusion. Some viruses have low CAI or tAI to humans, but they have co-evolved with humans for a long time. Furthermore, there are viruses that infect multiple hosts, but their CAI for a host with which they have long co-evolved is higher while their CAI for a host that is relatively new is lower.

      We agree with the reviewer that a direct link between co-evolution time and tAI may not always exist. Indeed, other factors might explain the observation that SARS-CoV-2 genes are less adapted than HCMV genes. These may include effective population sizes and mutation rates that vary substantially. We, therefore, removed this conclusion from the manuscript.

      (MAJOR) A more general comment is that there is a difference between tRNA expression and the abundance of translation-ready tRNA. The process of charging tRNA with amino acids may take a long time. It is the abundance of the charged-tRNA (the ternary complex of aminoacylated tRNA and EF-Tu-GTP) that is of biological importance. In this regard, the use of tRNA expression falls short.

      The reviewer raises a valid point. Indeed, our tRNA sequencing protocol measures both charged and uncharged tRNAs that constitute the cell's mature tRNA pool. Compared to previous studies that focus on the transcription process of tRNAs in viral-infection models by sequencing the pre-tRNAs, here we look at the mature tRNA pool that accounts for both transcription and post-transcription processes. Therefore, we changed the use of "tRNA expression" to "mature-tRNA levels" and "highly" or "lowly-abundant tRNAs" rather than “highly” or “lowly expressed tRNAs” in the manuscript. We note, however, that although limited in the ability to differentiate between charged and uncharged tRNAs, the tRNA sequencing protocol used here is commonly used and validated as a state-of-the-art protocol in tRNA sequencing (10.1016/j.molcel.2021.01.028, 10.1038/s41467-020-17879-x, etc.), mainly because it addresses the level of "ready-to-use" tRNA.

      Reviewer #1- second part

      1. (MAJOR) Prior to the actual competition assay in the first high-throughput screen (cell competition assay), the authors applied two days of antibiotic selection and two days of recovery. This could result in a serious problem of false negatives or drop outs. Specifically, an sgRNA targeting an essential gene with high efficiency would kill the cells, leaving no (or a small number of) cells in the ancestor population at the beginning of the competition process. A sgRNA's enrichment in competing populations cannot be reliably estimated in such situations. I am not certain that the FDR used in Figure 5B is sufficient to address this issue. Please clarify whether it could. Providing raw counts for competing and ancestor populations would also be helpful.

      As customary in CRISPR screens, the step of lentiviral transduction and antibiotic selection is necessary to ensure that only CRISPR-edited cells are left in the population. Indeed, essential genes like housekeeping genes are probably removed from the competing population relatively quickly, which might result in their dropouts. We could have lost some tRNA hits in the cell growth CRISPR screen (Figure 5B-C) because of their overall essentiality for cell growth. The MAGcK tool we used, the state-of-the-art in the field, filters out sgRNAs with low read counts to be able to calculate false discovery rates. Indeed, we identified 15 tRNAs that were depleted from the competing cells. We believe that our procedure minimizes the concern of dropouts. tRNA dropout in the HCMV infection CRISPR screen (Figure 6B-C) can also happen, which means our screen underestimates the essentiality of tRNAs to HCMV infection. However, this concern does not affect the significance of the hits we did find. We acknowledge this inherent difficulty in CRISPR screens and will provide the raw read counts of all samples upon full submission. We emphasize, though, that while valid, this concern applies to essentially any CRISPR screen that is commonplace in genomics these days.

      It is also highly questionable to me the nearly negligible effects of tRNA modification enzymes. This may be explained by the point above. Indeed, the dots of tRNA modification enzymes in general appear to have higher FDR (lower y values) when compared to red dots with similar enrichment levels.

      This is a valid point. We found a lack of essentiality of tRNA modification enzymes in both screens. We analyzed additional CRISPR screens and compared the effect of tRNA modification enzyme knockouts relative to the restriction and dependency factors we used in the library. The tested screens included 34 knockout CRISPR screens we downloaded from the BioGRID ORCS database that have similar parameters to our screen. Namely, they all test cell proliferation in a time-course manner, using a pooled sgRNA library and using the MAGeCK tool for data analysis. Overall, the screens use different human cell lines and diverse sgRNA libraries. Although potentially surprising, we found that the lack of essentiality of tRNA modification enzymes was also observed in the analyzed CRISPR screens (Figure S5B and on page 11, lines 322-330, and on page 18, lines 539-541). One potential reason is if some of these enzymes were "backed up" by others, which we mentioned. Another explanation is that most tRNA modification enzymes are indeed not essential for growth and for viral infection (now described in the Discussion, page 18, lines 544-545). Alternatively, dropouts can explain this result, as suggested by the reviewer. To examine the likelihood of the dropout option, we examined the average raw read count of the tRNA modification enzyme in the ancestor samples. We compared it to that of other sub-groups. We found that raw read counts of the tRNA modification enzymes are not different than other sub-groups in the CRISPR library. Thus, the dropout issue cannot explain our screens' lack of essentiality of tRNA modification enzymes.

      The screen based on IE2-GFP labeled HCMV measures a phenotype that is very difficult to interpret. Particularly, I am not sure if GFP2 and GFP3 are good controls for comparing GFP4 (GFP1 might be better). Various factors can affect GFP levels, including, but not limited to, dilution caused by a rapidly dividing host cell, unhealthy translational machinery resulting from infection or microenvironment. My point is supported by some observations in Fig6B. For example, SEC61B, a restriction factor for HCMV infection, is enriched in the GFP2 group, contrary to expectations. It is necessary for the authors to prove with firm evidence that their choice of GFP signal thresholds is appropriate.

      We acknowledge the concern. Specifically, the translation of the GFP gene itself could be affected by the tRNA manipulation done. To account for this potential concern, we tested the codon usage of the eGFP gene (which is the GFP version we used in the system) and compared it with tRNA essentiality, as determined by the cell growth CRISPR screen. We report this in the revised manuscript (page 13, lines 390-392, and added Figure S6D). We found that GFP does not tend to significantly use codons that correspond to essential or less essential tRNAs. The same lack of correlation was also found for the tRNA essentiality upon HCMV infection (not shown).

      More generally, we show that GFP intensity does correlate with viral genome copies (Figure S6A). Also, from mRNA-seq data of temporal HCMV infection (10.1016/j.celrep.2022.110653), IE2 (UL122) shows a dynamic expression- high expression pick in early infection, then a decline in expression level followed by a gradual increase.

      Altogether, we believe that the IE2-GFP level provides a good estimation for viral load.

      Regarding SEC61B, which served as a control in our screen – the referee is rightly asking why it behaves oppositely from what's expected, given that this was supposed to be a restriction factor of HCMV infection. We returned to the literature on the essentiality of this gene upon HCMV infection. In Weissman's paper (10.1038/384432a0), which was the reference for choosing control genes in our system, this gene was targeted through two different CRISPR technologies, once with CRISPR knockout and once with CRISPRi. Interestingly, only upon CRISPRi did this gene prove to be a restriction factor (i.e., improved infection upon reduction of the gene). We comment on this peculiar fact in the revised manuscript (page 13, lines 370-374). However, we note that the rest of our positive and negative controls deliver the expected results – increasing or reducing infection as expected from their role, thus lending considerable support to our experimental system. It is possible, especially in light of our screen, and since other positive and negative controls behave as expected, that the status of the SEC61B gene as a "restriction factor" of viral infection needs to be reconsidered, as we now suggest.

      I would appreciate more information regarding why restriction factors of cell growth have a high GFP2/GFP4. Intuitively, a KO of restriction factors of cell growth should result in better growth and higher GFP, thus leading to enrichment in GFP4, not GFP2.

      The reviewer raises an interesting question (although not at the heart of this work, as sgRNAs for the cell growth restriction factor mainly aim to serve as controls for the CRISPR screen). HCMV has a complex interaction with the cellular cell cycle. Specifically, it establishes a unique G1/S arrest that is both stimulatory and inhibitory since, on the one hand, it serves the virus to arrest the cell cycle, a critical step for viral genome replication. On the other hand, the virus needs many of the resources that serve cell growth. Both p53 and CDKN1A are important regulators at this stage; therefore, their interaction with the virus may indeed be complex. For example, p53 is upregulated by a viral infection. However, it is sequestered in the viral replication compartments, and its transcriptional are down-regulated, but its absence harms viral propagation (doi: 10.1128/mBio.02934-21, doi: 10.1128/jvi.72.3.2033-2039.1998, doi: 10.1128/jvi.00505-06). Therefore, it is not surprising that genes related to cell growth and cell cycle have complex effects on HCMV infection. We mention the essentiality of p53 for HCMV infection in the results (page 14, line 404).

      Line 404 "nonetheless"

      We appreciate the reviewer for noticing the typo. We corrected it.

      Reviewer #1 (Significance (Required)):

      The relation between human tRNA supply and viral translation is a topic of profound biological and biomedical importance. In this study, the authors used HCMV infection as the primary model to investigate this question. Results fall into two major parts: (i) changes in the tRNA pool during viral infection, and (ii) the impact of tRNA-related gene KO on viral infection.

      We appreciate the detailed report. We addressed the major points raised in the revised manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this study by Aharon-Hefetz et al., the researchers examined changes in tRNA pools during virus infections. The translation machinery plays a crucial role in virus replication. Consequently, host cells have developed sensors and effectors within this compartment to counteract viral mechanisms. The translation apparatus serves as a pivotal point in the virus-host conflict. Therefore, investigating alterations in the translation machinery during infections is vital for gaining a comprehensive understanding of the infection process. This study offers a thorough and high-quality analysis of data in a relevant cell culture system involving two different viruses. By conducting tRNA sequencing, the researchers studied the human tRNA pool following infections with human Cytomegalovirus (HCMV) and SARS-CoV-2. Changes in tRNA expression induced by HCMV were mainly driven by the virus infection itself, with minimal impact from the cellular immune response. Interestingly, specific tRNA post-transcriptional modifications seemed to influence stability and were subject to manipulation by HCMV. Conversely, SARS-CoV-2 did not lead to significant alterations in tRNA expression or post-transcriptional modifications. Moreover, a systematic CRISPR screen targeting human tRNA genes and modification enzymes allowed the identification of specific tRNAs and enzymes that either enhanced or reduced HCMV infectivity and cellular growth. This information enabled them to control the development of HCMV-specific tRNA modifications, highlighting the importance of these tRNA epitranscriptome modifications in virus replication. The authors concluded that the observed differences between the viruses are consistent with HCMV genes aligning with differentiation codon usage and SARS-CoV-2 genes reflecting proliferation codon usage. This observation's connection to the biology of HCMV and SARS-CoV-2 lies in the codon usage of structural and gene expression-related viral genes, showing a significant adaptation to host cell tRNA pools. Notably, these genes from both viruses demonstrated the highest adaptation to the tRNA pool of infected cells. The reason behind this phenomenon remains unclear. One hypothesis suggests that a high level of structural gene expression is necessary during activation. Testing this hypothesis could involve examining if hindering tRNA modifications affects virus morphogenesis. In summary, this study presents an interesting and innovative perspective on how viruses modify the translation machinery. The meticulous analysis sheds light on a central interaction point between viruses and their host cells.

      Reviewer #2 (Significance (Required)):

      In summary, this study presents an interesting and innovative perspective on how viruses modify the translation machinery. The meticulous analysis sheds light on a central interaction point between viruses and their host cells.

      We thank the reviewer for finding our work interesting, innovative, and well analyzed

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary

      Aharon-Hefetz et al. present the expression dynamics and modification signatures of tRNAs using DM-tRNA-seq in human foreskin fibroblasts or Calu3 cells during infections with two diverse viruses, HCMV and SARS-CoV2, respectively. They also use a newly designed tRNA-centric CRISPR library to screen the essentiality of tRNA and tRNA factors during HCMV-GFP infection. They find several tRNAs that are differentially expressed during HCMV infection, and most closely resemble the set of tRNAs shown to be used during cellular differentiation. Additionally, tRNA differential expression does not resemble that following interferon treatment, implying that virus modulation of tRNAs is unique to the general interferon response. They compare codon usage signatures during infection to their prior-defined sets of proliferation/differentiation tRNA genes. In their CRISPR screen, they find that different tRNAs can promote or restrict HCMV infection levels, as measured by the intensity of GFP fluorescence marker in their virus. Surprisingly, there were few tRNA modification factor hits that contributed to growth or infection.

      Reviewer #3- major comments

      1. The topic of this work is important, and the analysis performed here is assumed to be top quality, based on the previous work by the last author. The weakness with this body of work is a lack of rigor, specifically regarding validation and follow-up studies. Without these experiments, the reader lacks confidence in stated conclusions. For example: There is no validation or clue to how penetrant CRISPR is against tRNA genes. Given how duplicated some tRNA families are, it is possible that CRISPR is more effective against certain families compared to others. While this is likely an inherent caveat in all CRISPR screens, it would lend confidence in this approach to see some validation of tRNA KO by northern blot or RT-qPCR or sequencing.

      We thank the reviewer for raising this important issue. Indeed, many tRNA genes appear in multiple copies in the human genome. Yet, based on our previous work, we expect parallel editing of multiple copies using the same sgRNA. In our previous work (doi.org/10.7554/eLife.58461), we validated, based on several tRNA families, the ability of our tRNA CRISPR system to successfully target and affect tRNA expression levels. This included sequencing of the edited tRNA genes (i.e., DNA sequencing), in which we observed diverse INDEL mutations that predicted full disruption of the tRNA structure. Furthermore, we sequenced the tRNA pool of CRISPR-edited cells and found the downregulation of the targeted tRNAs to be up to 2-4-fold. This previous work provides foundations and confidence in this tRNA-CRISPR approach.

      Nevertheless, to further mitigate the reviewer's concern, we also plan to perform additional experiments in the current settings. We will choose individual tRNAs from our CRISPR screen as representatives to validate CRISPR editing. We will target each tRNA independently and test expression reduction by sequencing. We shall share the results in the full revision if granted.

      1. There is no validation that tRNA modification factor knockouts alter tRNA modification levels. Without this knowledge, the lack of essentiality cannot be confidently and fully interpreted. If the group does not validate whether individual tRNA modification factor knockouts alter modification profiles, then all possible explanations should be posited. For example, it is possible that 1) there could be major redundancy among tRNA modification enzymes, as the authors posit in the Discussion 2) tRNA modification enzymes are not essential for growth bc their activity/the modification they place is non-essential for growth, OR 3) the knockouts are not fully penetrant. I think this Discussion should be expanded to make caveats clearer. Perhaps referencing whether tRNA modification factors have been shown to be essential in other CRISPR screens would be helpful.

      Regarding the possible explanations for the lack of essentiality of tRNA modification enzymes, we agree with all three possibilities the reviewer raised. Reviewer #1 raised an additional option, in which tRNA modification enzymes are essential for HCMV infection and cell growth; thus, we cannot detect them in the screens because they drop out early in the process (before collecting the ancestor samples). We checked this possibility and found comparable read counts of sgRNAs targeting tRNA modification enzymes to that of other sub-libraries. This result suggests the drop-outs of sgRNA targeting are unlikely to happen on our screens.

      Furthermore, as the reviewer asked, we analyzed additional CRISPR screens and compared the effect of tRNA modification enzyme knockouts relative to the restriction and dependency factors we used in the library. The tested screens included 34 knockout CRISPR screens we downloaded from the BioGRID ORCS database that have similar parameters to our screen. Namely, they all test cell proliferation in a time-course manner, using a pooled sgRNA library and using the MAGeCK tool for data analysis. Overall, the screens use different human cell lines and diverse sgRNA libraries. Although potentially surprising, we found that the lack of essentiality of tRNA modification enzymes was also observed in the analyzed CRISPR screens (Figure S5B and on page 11, lines 322-330, and on page 18, lines 539-541).

      1. There is no validation that factors modulating GFP intensity in the HCMV screen actually impact virus replication. This is the point most important to this body of work. While GFP intensity does correlate to genome copies as shown by the authors, GFP read-out on a case-by-case basis could be simply due to factors required for expression/translation of GFP. Are any of the tRNA hits enriched or not represented in GFP reporter sequence? Either way, this information is informative.

      We acknowledge the concern. Specifically, the translation of the GFP gene itself could be affected by the tRNA manipulation done. To account for this potential concern, we tested the codon usage of the eGFP gene (which is the GFP version we used in the system) and compared it with tRNA essentiality, as determined by the cell growth CRISPR screen. We report this in the revised manuscript (page 13, lines 390-392, and added Figure S6D). We found that GFP does not tend to significantly use codons that correspond to essential or less essential tRNAs. The same lack of correlation was also found for the tRNA essentiality upon HCMV infection (not shown).

      Additionally, given that the hits are cross-compared ONLY to other infected (low intensitiy "GFP+") cells, and not to an uninfected population, there is no guarantee that these primarily drive HCMV infection. The top hits should be validated in HFFs, infected with HCMV, with resulting titers/viral gene expression/genome copies measured. Additionally, the reasons for not using a GFP- population as a control should be clarified.

      We agree that additional experiments on some hits may be warranted. We plan to examine for such an effect on infection using an individual gene version of the assay. In particular, we will target individually candidate tRNA genes following validation (as described previously in point 1). We will then infect the tRNA-targeted cells with HCMV and measure the effectiveness of HCMV infection using a standard titer assay.

      The reviewer also suggest comparing GFP1/2/3 to an ancestor in addition to comparing them to GFP4. Towards that we now show a GFP2 vs ancestor comparison (shown below). The results look very similar and are now added to the supplemental material of the revised manuscript (page 13, lines 385-387, Figure S6B).

      Though careful codon usage analysis for HCMV versus the human host was analyzed, it seems pertinent to analyze whether the differentially expressed tRNAs during infection correlate to either codon usage profiles. Figure 3C and S3C intend to address this point for viral gene groups; however, I would encourage the authors to expand the description of these results to make them easier to interpret, especially for those not in the tRNA field. For example, "tRNA adaptation index (tAI)" is not defined in the text, but simply referenced. For clarity, you should include a brief explanation of what this measure describes. Following, when reporting results from Figure 3, the results can then be delivered with more specific and interpretable language. These steps will ensure maximal scientific communication to the audience.

      We appreciate the reviewer's comment regarding the importance of scientific communication and making this manuscript easier to interpret, especially for readers unfamiliar with the world of tRNAs and translation efficiency. We added a description of our motivation to use tAI and the meaning of the measurement (page 9, lines 241-243). We also elaborated on the results part and made the results more interpretable (page 9, lines 245 and 249-250).

      Finally, given that changes are most visible at 72 hpi, the analysis should include expression based on this time point for comparison.

      Regarding the time point used for tAI calculation (Figure 3), we tested the tAI measured by the tRNA pool at 72hpi and got very similar results to that obtained using the tRNA pool measured at 24hpi. As 24hpi represents the pick of HCMV infection, we decided to present this analysis. In the current revised version, we also added the analysis done using the tRNA pool measured 72hpi as suggested by the reviewer (Figure S3D).

      Reviewer #3- minor comments

      1. I would recommend more care in terminology used for the CRISPR screen (Figures 5 and 6) to make the manuscript easier to digest. Labeling sgRNAs-containing cells as " Reduced Growth/Infection" or "Increased Growth/Infection" is not immediately easy to understand. For example, saying this sgRNA "increased growth" could refer to the knockdown increasing growth OR could mean that this sgRNA was enriched in cells with increased growth, which are opposing. It might be more clear to state to use depleted/enriched terminology in these figure labels. This also applies to the text, be sure to plainly describe the terminology and what it means each time you refer to the CRISPR results.

      This is a good point. Indeed, focusing on the significant enrichment of the sgRNAs, rather than their effect on growth or infection, is more straightforward. We changed the terminology in Figures 5C and 6C and the text in the current version.

      Is there actual evidence that the new tRNA sgRNA library is more effective than that used previously? State if so.

      We assume the referee refers to our previous paper on the smaller-scale library (doi.org/10.7554/eLife.58461). The addition here is that the library is much more comprehensive (the previous one targeted only 20 tRNAs). We point it out in the revised manuscript (page 17, lines 499-501).

      Fig 1A-C: The cutoff for "red" symbol distinction is not stringent enough. 1.05 would be red, but that is not convincingly upregulated. The cutoff should be at least FC>1.2.

      We thank the reviewer for bringing our intention to this point. In the current version, we changed the cutoff of absolute fold change higher than 1.2 in Figures 1A-C and S1A (also in legend).

      Need thorough description of tRNA bioinformatics and modification analysis (citing past work is not appropriate here-need to make accessible to your audience).

      Further thorough descriptions of tRNA bioinformatics and modification analysis are added in the revised version (page 6, lines 149-151, page 7, lines 178-183).

      Line 182- Result headings could be more informative, even with small adjustments. For example "Specific tRNA modifications are modulated in response to HCMV infection" is more clear and accurate, as there are only a few measurable changes in tRNA modification. Limitations of using sequencing techniques to analyze modifications (versus MS) should also be discussed.

      We changed that heading accordingly.

      We also mentioned the advantages and disadvantages of using sequencing to assess tRNA modification levels (page 7, lines 184-187).

      It is not immediately clear why the viral plot looks different in Fig S3B compared to Fig 3B.

      We thank the referee for spotting this. We employed different length cutoffs on the genes in each panel and have now fixed that in the revised manuscript.

      Line 254-255. This point is not immediately clear-please include more specific language detailing the logic leading to this conclusion.

      Indeed, the logic here was missing. The idea was that longer genes are associated with gene conservation, hence functionality. Thus, non-canonical HCMV genes that are both long and codon-optimized might have a function during HCMV infection. We added this explanation to the text (pages 8-9, lines 235-238).

      Line 408- "may be essential"-I would modify the language here. Especially given there is no true comparison with uninfected cells.

      We improved the language throughout the revised manuscript.

      There are a number of recent publications profiling tRNA expression in herpesviruses. These should be mentioned and discussed in the context of this work. I know some were included in the reference list, but the body of work as a whole, and how this work fits in and pushes the horizon, could be further emphasized. It is quite impressive that this is a conserved feature of herpesvirus infection. a. PMID: 36752632 b. PMID: 35110532 c. PMID: 34535641 d. PMID: 33986151 e. PMID: 33323507 f. PMID: 35458509

      We thank the reviewer for highlighting these works. We added a discussion item regarding tRNA expression in HCMV and other herpesviruses with the references (pages 15-16, lines 447-458)

      CoV2 Discussion point-The lack of tRNA expression regulation might have more to do with the length of the infection (6 hpi cov2- also didn't see much a change at 5hpi with hcmv). This should be proposed as a possibility.

      It is a possibility that due to the high stability of tRNAs, expression regulation of tRNAs will not affect the tRNA pool in short infection such as of SARS-CoV-2. We added this explanation in the discussion part, page 15, lines 441-442.

      Line 582. Misspelled schlafen in Discussion. (SLFN, not SFLN)

      The point is fixed in the revised manuscript.

      Reviewer #3 (Significance (Required)):

      General assessment: I found this paper exciting to read, given the dearth of knowledge regarding viral modulation of tRNA expression.

      We appreciate the reviewer's comment

      However, the work is highly descriptive, with a complete absence of follow-up or validation studies. At the very least, I would have hoped that the authors validated that viral titer (and not just GFP intensity) was impacted by some of the hits. The lack of confirmation and quality control overall diminishes confidence in the stated conclusions.

      However, I think the topic is timely, important, and that this manuscript offers tools to the community at large to learn more about viral manipulation or other drivers of tRNA regulation. Once follow-up/validation experiments are added to the work, as detailed below, this manuscript will be of broad importance and highly impactful.

      As mentioned above, we plan to add such validations to the fully revised manuscript.

      Advance: While there have been many studies suggesting tRNA regulation occurs during viral infection (these pubs should be referenced as mentioned above), this is an advance due to the fact that it begins to address whether tRNA expression changes functionally impact viral replication. This will be much more solid with follow-up experiments confirming that hits alter HCMV replication (rather than GFP intensity).

      Audience: This will be of broad interest to those with interest in virology and gene expression. The new sub-libraries of tRNA-related factors might be useful to be tested in other cell types and settings. Again, as the work stands, it is descriptive and hypothesis-stimulating, but the conclusions need validation and further support.

      We thank the referee for the encouraging words and the suggested analyses. We already implemented most of the suggestions in the current revised version and hope to add further experiments in a fully revised manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      Aharon-Hefetz et al. present the expression dynamics and modification signatures of tRNAs using DM-tRNA-seq in human foreskin fibroblasts or Calu3 cells during infections with two diverse viruses, HCMV and SARS-CoV2, respectively. They also use a newly designed tRNA-centric CRISPR library to screen the essentiality of tRNA and tRNA factors during HCMV-GFP infection. They find several tRNAs that are differentially expressed during HCMV infection, and most closely resemble the set of tRNAs shown to be used during cellular differentiation. Additionally, tRNA differential expression does not resemble that following interferon treatment, implying that virus modulation of tRNAs is unique to the general interferon response. They compare codon usage signatures during infection to their prior-defined sets of proliferation/differentiation tRNA genes. In their CRISPR screen, they find that different tRNAs can promote or restrict HCMV infection levels, as measured by the intensity of GFP fluorescence marker in their virus. Surprisingly, there were few tRNA modification factor hits that contributed to growth or infection.

      Major Comments

      1. The topic of this work is important, and the analysis performed here is assumed to be top quality, based on the previous work by the last author. The weakness with this body of work is a lack of rigor, specifically regarding validation and follow-up studies. Without these experiments, the reader lacks confidence in stated conclusions. For example:
        • a. There is no validation or clue to how penetrant CRISPR is against tRNA genes. Given how duplicated some tRNA families are, it is possible that CRISPR is more effective against certain families compared to others. While this is likely an inherent caveat in all CRISPR screens, it would lend confidence in this approach to see some validation of tRNA KO by northern blot or RT-qPCR or sequencing.
        • b. There is no validation that tRNA modification factor knockouts alter tRNA modification levels. Without this knowledge, the lack of essentiality cannot be confidently and fully interpreted. If the group does not validate whether individual tRNA modification factor knockouts alter modification profiles, then all possible explanations should be posited. For example, it is possible that 1) there could be major redundancy among tRNA modification enzymes, as the authors posit in the discussion 2) tRNA modification enzymes are not essential for growth bc their activity/the modification they place is non-essential for growth, OR 3) the knockouts are not fully penetrant. I think this discussion should be expanded to make caveats clearer. Perhaps referencing whether tRNA modification factors have been shown to be essential in other CRISPR screens would be helpful.
        • c. There is no validation that factors modulating GFP intensity in the HCMV screen actually impact virus replication. This is the point most important to this body of work. While GFP intensity does correlate to genome copies as shown by the authors, GFP read-out on a case-by-case basis could be simply due to factors required for expression/translation of GFP. Are any of the tRNA hits enriched or not represented in GFP reporter sequence? Either way, this information is informative. Additionally, given that the hits are cross-compared ONLY to other infected (low intensitiy "GFP+") cells, and not to an uninfected population, there is no guarantee that these primarily drive HCMV infection. The top hits should be validated in HFFs, infected with HCMV, with resulting titers/viral gene expression/genome copies measured. Additionally, the reasons for not using a GFP- population as a control should be clarified.
      2. Though careful codon usage analysis for HCMV versus the human host was analyzed, it seems pertinent to analyze whether the differentially expressed tRNAs during infection correlate to either codon usage profiles. Figure 3C and S3C intend to address this point for viral gene groups; however, I would encourage the authors to expand the description of these results to make them easier to interpret, especially for those not in the tRNA field. For example, "tRNA adaptation index (tAI)" is not defined in the text, but simply referenced. For clarity, you should include a brief explanation of what this measure describes. Following, when reporting results from Figure 3, the results can then be delivered with more specific and interpretable language. These steps will ensure maximal scientific communication to the audience. Finally, given that changes are most visible at 72 hpi, the analysis should include expression based on this time point for comparison.

      Minor Comments

      1. I would recommend more care in terminology used for the CRISPR screen (Figures 5 and 6) to make the manuscript easier to digest. Labeling sgRNAs-containing cells as " Reduced Growth/Infection" or "Increased Growth/Infection" is not immediately easy to understand. For example, saying this sgRNA "increased growth" could refer to the knockdown increasing growth OR could mean that this sgRNA was enriched in cells with increased growth, which are opposing. It might be more clear to state to use depleted/enriched terminology in these figure labels. This also applies to the text, be sure to plainly describe the terminology and what it means each time you refer to the CRISPR results.
      2. Is there actual evidence that the new tRNA sgRNA library is more effective than that used previously? State if so.
      3. Fig 1A-C: The cutoff for "red" symbol distinction is not stringent enough. 1.05 would be red, but that is not convincingly upregulated. The cutoff should be at least FC>1.2.
      4. Need thorough description of tRNA bioinformatics and modification analysis (citing past work is not appropriate here-need to make accessible to your audience)
      5. Line 182- Result headings could be more informative, even with small adjustments. For example "Specific tRNA modifications are modulated in response to HCMV infection" is more clear and accurate, as there are only a few measurable changes in tRNA modification. Limitations of using sequencing techniques to analyze modifications (versus MS) should also be discussed.
      6. It is not immediately clear why the viral plot looks different in Fig S3B compared to Fig 3B.
      7. Line 254-255. This point is not immediately clear-please include more specific language detailing the logic leading to this conclusion.
      8. Line 408- "may be essential"-I would modify the language here. Especially given there is no true comparison with uninfected cells.
      9. There are a number of recent publications profiling tRNA expression in herpesviruses. These should be mentioned and discussed in the context of this work. I know some were included in the reference list, but the body of work as a whole, and how this work fits in and pushes the horizon, could be further emphasized. It is quite impressive that this is a conserved feature of herpesvirus infection.
        • a. PMID: 36752632
        • b. PMID: 35110532
        • c. PMID: 34535641
        • d. PMID: 33986151
        • e. PMID: 33323507
        • f. PMID: 35458509
      10. CoV2 discussion point-The lack of tRNA expression regulation might have more to do with the length of the infection (6 hpi cov2- also didn't see much a change at 5hpi with hcmv). This should be proposed as a possibility.
      11. Line 582. Misspelled schlafen in discussion. (SLFN, not SFLN)

      Significance

      General assessment: I found this paper exciting to read, given the dearth of knowledge regarding viral modulation of tRNA expression. However, the work is highly descriptive, with a complete absence of follow-up or validation studies. At the very least, I would have hoped that the authors validated that viral titer (and not just GFP intensity) was impacted by some of the hits. The lack of confirmation and quality control overall diminishes confidence in the stated conclusions. However, I think the topic is timely, important, and that this manuscript offers tools to the community at large to learn more about viral manipulation or other drivers of tRNA regulation. Once follow-up/validation experiments are added to the work, as detailed below, this manuscript will be of broad importance and highly impactful.

      Advance: While there have been many studies suggesting tRNA regulation occurs during viral infection (these pubs should be referenced as mentioned above), this is an advance due to the fact that it begins to address whether tRNA expression changes functionally impact viral replication. This will be much more solid with follow-up experiments confirming that hits alter HCMV replication (rather than GFP intensity).

      Audience: This will be of broad interest to those with interest in virology and gene expression. The new sub-libraries of tRNA-related factors might be useful to be tested in other cell types and settings. Again, as the work stands, it is descriptive and hypothesis-stimulating, but the conclusions need validation and further support.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this study by Aharon-Hefetz et al., the researchers examined changes in tRNA pools during virus infections. The translation machinery plays a crucial role in virus replication. Consequently, host cells have developed sensors and effectors within this compartment to counteract viral mechanisms. The translation apparatus serves as a pivotal point in the virus-host conflict. Therefore, investigating alterations in the translation machinery during infections is vital for gaining a comprehensive understanding of the infection process.

      This study offers a thorough and high-quality analysis of data in a relevant cell culture system involving two different viruses. By conducting tRNA sequencing, the researchers studied the human tRNA pool following infections with human Cytomegalovirus (HCMV) and SARS-CoV-2. Changes in tRNA expression induced by HCMV were mainly driven by the virus infection itself, with minimal impact from the cellular immune response. Interestingly, specific tRNA post-transcriptional modifications seemed to influence stability and were subject to manipulation by HCMV. Conversely, SARS-CoV-2 did not lead to significant alterations in tRNA expression or post-transcriptional modifications.

      Moreover, a systematic CRISPR screen targeting human tRNA genes and modification enzymes allowed the identification of specific tRNAs and enzymes that either enhanced or reduced HCMV infectivity and cellular growth. This information enabled them to control the development of HCMV-specific tRNA modifications, highlighting the importance of these tRNA epitranscriptome modifications in virus replication. The authors concluded that the observed differences between the viruses are consistent with HCMV genes aligning with differentiation codon usage and SARS-CoV-2 genes reflecting proliferation codon usage. This observation's connection to the biology of HCMV and SARS-CoV-2 lies in the codon usage of structural and gene expression-related viral genes, showing a significant adaptation to host cell tRNA pools. Notably, these genes from both viruses demonstrated the highest adaptation to the tRNA pool of infected cells. The reason behind this phenomenon remains unclear. One hypothesis suggests that a high level of structural gene expression is necessary during activation. Testing this hypothesis could involve examining if hindering tRNA modifications affects virus morphogenesis. In summary, this study presents an interesting and innovative perspective on how viruses modify the translation machinery. The meticulous analysis sheds light on a central interaction point between viruses and their host cells.

      Significance

      In summary, this study presents an interesting and innovative perspective on how viruses modify the translation machinery. The meticulous analysis sheds light on a central interaction point between viruses and their host cells.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      I have mixed feelings regarding this manuscript. On the one hand, the authors did an impressive amount of work. On the other hand, the manuscript seems overly descriptive (writing should be more concise) without a clear message or hypothesis that is cohesive to all the presented evidence. Below, I will outline my concerns.

      Regarding the first part.

      1. I am not an expert in the field of viral biology and immunology. I wonder how well the IFN treatment mimics the cellular response to infection (yet without the virus). Also, how good is ruxolitinib at blocking the IFN response ? I would appreciate it if you could explain both with one or two sentences and provide the necessary references.
      2. (MAJOR) Can these two treatments really allow the effects of host response and viral infection to be separated? OR in other words, are these two effects really orthogonal? In my opinion, they are NOT. Fig. 1E seems to support my opinion, as the changes seen for the "IFN" sample relative to the "uninfected" sample (referred to as "changes-A" below), are parallel to the changes seen for the "24hpi + ruxo" sample relative to the "24hpi" sample ("changes-B"). More specifically, changes-A represent the host response, as argued by the author, whereas changes-B represent the elimination of the host response (due to ruxo, conditioned on the virus-driven effect). If the virus-driven effect and the host response could really be separated, one would expect changes-A and changes-B are more or less opposite. However, they appeared to be parallel, suggesting that uninfected versus infected conditions can have totally different (even opposite) host responses. More importantly, if one cannot separate the host response from virus-driven effects, the conclusion of "tRNA changes are driven by virus, not host response" is then unfounded.3. Even if we let go of this previous point and accept that these results indeed offer some support for the notion that the virus-driven effect are the main contributor to the shifts in tRNA pool, the support is at best moderate. A big gap here is "how?" I suggest the authors should at least give some insight on how virus can do that in Discussion (and mention it with one sentence in Results).
      3. The authors compared the HCMV codon usage to the proliferation and differentiation signatures of human cells. But these two signatures are not compared with measured tRNA expression. It might shed some light on the general characteristics of tRNA pool shifts due to infection (towards a proliferation-like or differentiation-like signature). This fits in the general topic of virus-host interaction and might give more evidence for the point that HMCV is adapted to a differentiation signature (as it drives the host into that state).
      4. How is the dashed box in Fig3A/B chosen?
      5. The tAI values shown in Fig3C-E are extremely low (compared to other reports I am aware of). Does this mean that the adaptation of viral codon usage to human cell supply is actually very weak? This is in opposition to the major claims made in this section.
      6. I believe that the part about SARS-CoV-2 could be made more concise. It is sufficient to mention that results may differ from those obtained with HCMV in one paragraph.
      7. Line 299 on page 11 - I do not believe codon usage between different viruses can be directly compared, let alone reaching such a conclusion. Some viruses have low CAI or tAI to humans, but they have co-evolved with humans for a long time. Furthermore, there are viruses that infect multiple hosts, but their CAI for a host with which they have long co-evolved is higher while their CAI for a host that is relatively new is lower.
      8. (MAJOR) A more general comment is that there is a difference between tRNA expression and the abundance of translation-ready tRNA. The process of charging tRNA with amino acids may take a long time. It is the abundance of the charged-tRNA (the ternary complex of aminoacylated tRNA and EF-Tu-GTP) that is of biological importance. In this regard, the use of tRNA expression falls short.

      Regarding the second part,

      1. (MAJOR) Prior to the actual competition assay in the first high-throughput screen (cell competition assay), the authors applied two days of antibiotic selection and two days of recovery. This could result in a serious problem of false negatives or drop outs. Specifically, an sgRNA targeting an essential gene with high efficiency would kill the cells, leaving no (or a small number of) cells in the ancestor population at the beginning of the competition process. A sgRNA's enrichment in competing populations cannot be reliably estimated in such situations. I am not certain that the FDR used in Figure 5B is sufficient to address this issue. Please clarify whether it could. Providing raw counts for competing and ancestor populations would also be helpful.
      2. It is also highly questionable to me the nearly negligible effects of tRNA modification enzymes. This may be explained by the point above. Indeed, the dots of tRNA modification enzymes in general appear to have higher FDR (lower y values) when compared to red dots with similar enrichment levels.
      3. The screen based on IE2-GFP labeled HCMV measures a phenotype that is very difficult to interpret. Particularly, I am not sure if GFP2 and GFP3 are good controls for comparing GFP4 (GFP1 might be better). Various factors can affect GFP levels, including, but not limited to, dilution caused by a rapidly dividing host cell, unhealthy translational machinery resulting from infection or microenvironment. My point is supported by some observations in Fig6B. For example, SEC61B, a restriction factor for HCMV infection, is enriched in the GFP2 group, contrary to expectations. It is necessary for the authors to prove with firm evidence that their choice of GFP signal thresholds is appropriate.
      4. I would appreciate more information regarding why restriction factors of cell growth have a high GFP2/GFP4. Intuitively, a KO of restriction factors of cell growth should result in better growth and higher GFP, thus leading to enrichment in GFP4, not GFP2.
      5. Line 404 "nonetheless"

      Significance

      The relation between human tRNA supply and viral translation is a topic of profound biological and biomedical importance. In this study, the authors used HCMV infection as the primary model to investigate this question. Results fall into two major parts: (i) changes in the tRNA pool during viral infection, and (ii) the impact of tRNA-related gene KO on viral infection.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Jeong et al. describes effects of neuronal signalling on collective behavior by measuring social distance (SD) which is used as a measure for social network behavior. Authors screened for a panel of inbred DGRP lines and compared the SD due to prior experience of group or single culturing when flies are recorded in a 55 mm diameter petri-dish. The screen uncovered 3 short Sd and three long-SD lines, and subsequent experiments showed differences in various behaviors such as recovery from injury, search for food and SD. Using RNA-seq from heads of flies they implicate Dsk signalling and show neuronal architecture and activity differences between grouped and isolated male flies. They implicate Dsk signalling in recovery from injury affecting SD but it was dispensable for grouped vs. isolated flies. I have suggestions to support the claims made, analysis and interpretation of the data and improve the clarity of writing. See my specific comments below.

      Major comments:

      1. For recording social behaviour in flies, arenas with sloped walls have been extensively used called 'fly bowl' (Simon et al. 2010 doi:10.1371/journal.pone.0008793; Robie et al., 2017, doi: 10.1016/j.cell.2017.06.032), or 'flyworld' (Liu et al., 2018 doi:10.1371/journal.pcbi.1006410). Such geometry ensures that flies don't walk on the side of the arena, and don't occlude each other. However, in the screen carried out in this manuscript, a petri-dish of 5.5 cm diameter filled with agar was used to record social network formation. Given the propensity of flies to walk on the walls of such circular arena, it will be difficult to know if the long and short SD behavior resulting from propensity to form clusters is an artefact of the assay condition used. It would be important to test the SNB and SD of at least the 6 selected short and long SD lines in arena with sloped walls to rule out this possibility.

      To minimize complications from side-wall walking or z-stacking of individual flies, the circular arena used in our study was filled with agar, making a space of 5.5 cm in diameter x ~0.15 cm in height (see below). In fact, the surface tension of the agar solution led to a rounded interface between the agar bed and the side wall that could prevent side-wall walking and partially mimic the sloped wall effects. We included representative video clips for SNB recording in the revised manuscript to further justify our experimental conditions. We also revised our method text accordingly and cited the relevant reference.

      Methods section would require additional details about the SNB assay for instance, the height of the agar bed and the effective height in which interactions was recorded is not mentioned.

      As described in our response to reviewer #1, major comment #1 above, we revised our method text accordingly.

      Figure 2C and 2D results in larvae seems to contradict previous studies that have shown that isolated flies eat more as adults (Li et al. Nature, 2021) and Dsk-RNAi increases feeding in larvae and adults (Soderberg et al., 2012). It might be due to unique characteristic of DGRP lines used and would be helpful to discuss this.

      We reason that high-digging activity in a group of individual larvae can increase the accessibility to "solid" food, thereby promoting their food intake over 12-h during development. However, food consumption rates and their regulation can vary depending on developmental stages or feeding conditions (e.g., larvae vs. adults; liquid vs. solid food; long-term vs. short-term) (https://pubmed.ncbi.nlm.nih.gov/30914005/; https://pubmed.ncbi.nlm.nih.gov/24937262/). It is thus not fair to align our results directly to those observed under very different biological/experimental contexts. For instance, Li et al. measured the amount of liquid food consumption in individually isolated adults from group culture vs. transient social isolation (i.e., 1-week isolation after eclosion). Soderberg et al. assessed the 15-minute feeding activities of larvae or adults on solid food but did not compare the feeding activity between grouped vs. isolated individuals. Therefore, it is not conclusive whether the DSK-depletion phenotypes are relevant to social experience or whether DSK signaling controls short-term feeding per se. Accordingly, we believe our observations do not necessarily contradict previous studies or indicate characteristics unique to the DGRP lines used in our study.

      Rutabaga mutants for Fig S3 are directly compared with CS flies in the maze assay and it appears from methods that these lines were not isogenized, this can significantly impact the results. Similarly for some of the subsequent Dsk experiments it appears that lines were not isogenized (see below). These experiments would either need to be repeated of this caveat needs to be explicitly mentioned to avoid misinterpretation of the data.

      Since genetic backgrounds could substantially contribute to mutant phenotypes, we mentioned the caveat in our revised manuscript. As the reviewer suggested above, testing isogenized lines could be one option to confirm the genetic effects. Our study took an alternative approach where the observed phenotypes were validated by independent genetic models. For instance, the importance of DSK signaling in injury-induced SNB plasticity was validated by genomic deletions of DSK and DSK receptor genes, as well as by transgenic RNAi (Fig. 7B and 7C). In the revised manuscript, we examined additional mutant alleles of rutabaga (Fig. S6, rut[1] and rut[2080]), CCKLR-17D1 (Fig. S15, CCKLR-17D1[delta1] and CCKLR-17D1[delta2]), and CCKLR-17D3 genes (Fig. S15, CCKLR-17D3[delta1] and CCKLR-17D3[delta2]) to substantiate our original findings.

      For Figure 3 describing RNA-seq data additional analysis would be helpful. Gene expression from isolated and grouped flies have been studied earlier by microarray and RNA-seq methods (Wang et al., PNAS 2008; Agrawal et al., JEB 2020; Li et al. Nature, 2021). Data from these studies should be compared with to see if there are common patterns of gene expression between long and short SD flies vs. group and isolated flies.

      According to the reviewer's suggestion, we compared our DEG analysis with those reported in the previous studies. Genes upregulated in socially deprived flies overlapped substantially between our data and the published ones. However, the number of genes commonly upregulated in grouped cultures was very limited in the pairwise comparisons, and Dsk was the only gene upregulated across DEG analyses. Also, DEGs between the short vs. long SD lines barely overlapped with those between grouped vs. isolated flies across independent studies. We speculate that Drosophila has evolved a genetic reprogram where social isolation robustly induces the expression of select genes regardless of genetic backgrounds (i.e., DGRP lines in our study vs. Canton-S in the previous studies), whereas diverse genetic pathways shape the baseline SD traits. We revised our text accordingly and included these new analyses in the revised manuscript (Fig. S10 and Dataset S3).

      GEO accession number and the analyzed list of DEGs should be provided as supplementary information.

      As described in our original manuscript, we submitted our raw data to the European Nucleotide Archive (ENA accession number PRJEB61423). Since ENA and GEO share their data, uploading our data redundantly onto the GEO should not be necessary. Our original manuscript also included all the DEG lists as supplementary tables (Datasets S1-S3).

      Figure 3E & F are not referred to in the main text, also there is no description of how the data was generated. Is this based on published data from Mackay lab about DGRP lines, if so, aggression experiments were not convincing in those studies and have been shown to not recapitulate 'real' aggression by other labs for several of the DGRP lines tested (Chowdhury et al., 2021, doi: 10.1038/s42003-020-01617-6).

      The main text of our original manuscript actually referred to Fig. 3E and 3F. We revised our figure legend to indicate the resources of raw DGRP data and clarified the method for the correlation comparisons. Since we employed the published aggression data from the Mackay lab study (https://pubmed.ncbi.nlm.nih.gov/26100892), we experimentally validated that the long-SD lines indeed show more lunges (i.e., a well-established indicator of aggression behaviors) than the short-SD lines in our revised manuscript (Fig. S11).

      Dsk was shown to be reduced in isolated flies by RNA-seq and play a role in aggression by an earlier study (Agrawal et al., 2020) and should be cited appropriately (line 180-181) and elsewhere.

      The paper was appropriately cited in our original/revised manuscript.

      For Fig. 4A-B, source images for other two DGRP lines should be included at least in supplementary information, if not as main figure.

      Representative confocal images for the other DGRP lines were included in the revised manuscript (Fig. 6A).

      For Fig. 5, what is the reason that uninjured flies don't show any SD phenotype? Are there any changes in their velocity? This is mentioned in passing on line 228-29 but should be properly discussed.

      Genomic deletions or transgenic manipulations of the DSK-CCKLR-17D1 pathway gave consistent effects on the injury-induced clustering but not on baseline SD or walking speed. We reason that the DSK-CCKLR-17D1 pathway is dedicated to encoding early-life social experience by enforcing DSK neuron activity and their male-specific postsynaptic signaling. We clarified our text including the genetic background issue in the revised manuscript. Please also see our response to reviewer #1, major comment #4 above.

      Trans-Tango and UAS-Denmark, SytGFP experiments were performed previously by Wu et al., 2020 and Wang et al., 2021 for Dsk, these two studies observed that P1 neurons are presynaptic and Dsk neurons are post synaptic but in Figure 4 it's not clear what are the presynaptic and post synaptic neurons. Also these studies are not cited appropriately in this section.

      The two studies expressed trans-Tango in P1 neurons (P1>trans-Tango) to demonstrate that DSK-expressing neurons are postsynaptic to P1 neurons. They further visualized some overlaps between axon terminals of P1 neurons (P1>sytGFP) and dendrites of DSK neurons (Dsk>DenMark). On the other hand, we expressed the trans-Tango in DSK neurons (Dsk>trans-Tango) to visualize their male-specific/social experience-dependent postsynaptic targets. We also visualized brain regions positive for both synaptic signals from Dsk>sytGFP and the postsynaptic signals from Dsk>trans-Tango. The two studies were cited in our original manuscript to discuss presynaptic partners of DSK neurons and their distinct roles in animal behaviors. We further cited the two studies in this result section of our revised manuscript.

      Minor comments:

      1. Line no. 286: please mention about the relative humidity and light & dark cycle conditions and when experiments were conducted (ZT).

      Flies were reared at 40-50% humidity under 12-h light: 12-h dark cycles. Behavioral experiments were primarily conducted between ZT4 and ZT8. We revised the method text accordingly.

      Line no. 311: How many days old flies were used (isolated and group housed) for the behavior and transcriptomic studies?

      We revised the method text to better describe our experimental conditions.

      Line no. 349: for RNA extraction please mention how many fly heads were used and ZT for collection.

      Flies were harvested at ZT4-6, and total RNAs were extracted from 35 fly heads. We revised the method text accordingly.

      Line no. 358: Italicize "Drosophila melanogaster".

      Corrected.

      Reviewer #1 (Significance (Required)):

      This manuscript will be of interest to neuroscientists studying Drosophila social behaviors. The manuscript asks interesting questions and authors have done extensive set of experiments but the progress appears incremental given the current state of the field, especially for the later part of the manuscript. Some of the interpretation would also require additional data to bolster the claims made. Finally, the findings from this study could be better discussed in the context of what it is already known.

      We believe our revised text and additional data in the revised manuscript clarify the reviewer concerns and better support our original findings.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The article explores the social network behavior (SNB) of Drosophila, focusing on individual social distance (SD) within groups over time. A systemic analysis revealed that short SD is associated with long developmental time, low food intake, and hypoactivity. Group culturing compensates for developmental inferiority in short social distance individuals. Social interactions during early development positively impact adult physiology and adaptive social plasticity. Transcriptome analyses show genetic diversity for SD traits. The neuropeptide Drosulfakinin (DSK) signaling mediates social network behavior plasticity via receptor CCKLR-17D1, particularly in males, suggesting a dedicated neural mechanism encoding early-life experiences to adaptively transform group properties. The research suggests that animals have developed neural mechanisms to encode early-life experiences. It offers insights into the genetic foundation and adaptability of social behavior in Drosophila, shedding light on the neural processes involved in social memory and the adaptive behaviors of groups. These findings have broader implications for understanding similar neural mechanisms governing social memory and group behaviors in other species.

      Major concerns:

      Major 1. In Figure 2H, the latency to 75% arrival of short-SD isolated fruit flies (no matter with or without pioneers) is close to that of group fruit flies with pioneers while the latency to 75% arrival of long-SD isolated fruit flies (no matter with or without pioneers) is close to that of group fruit flies without pioneers; How to explain the difference in latency to 75% array between long-SD and short-SD isolated fruit flies? It seems that only in the long-SD fruit flies from the grouped experience, the absence of pioneers will increase the time it takes to reach the target food in the maze.

      Social deprivation effects on pioneer-free group foraging somewhat varied across the long SD lines (Fig. 4B and S5, blue). We reason that the hyperactivity of individual long-SD flies facilitates their food-seeking behaviors in the maze, weakening the pioneer effects or even overriding the group property. Nonetheless, our statistical analyses validated that 1) prior social experience did not significantly affect the group performance of 16 naive flies in the maze assay (the only exception was DGRP707, a long-SD line that showed longer latency in group-cultured naive flies than in socially isolated ones), 2) the presence of pioneers significantly shortened the latency in both short- and long-SD lines, and 3) the pioneer effects were evident only in group-cultured flies. We accordingly revised our result text to better elucidate our conclusion.

      Major 2. In Figure 3C, there are two up-regulated genes in the Drosophila group that overlap in short-SD and long-SD strains. Apart from Dsk, what is the other gene? In addition, for isolated fruit flies, both short-SD and long-SD lines have more gene expression upregulated. How to explain this phenomenon? Can you briefly explore the reasons for their upregulation and instead of involvement in SNB plasticity, what kind of physiological functions may they have?

      The other gene commonly upregulated in group-cultured DGRP flies was Arc1 (Activity-regulated cytoskeleton-associated protein), implicated in synaptic plasticity and fat metabolism (https://flybase.org/reports/FBgn0033926.htm). Arc1 downregulation upon social isolation could be relevant to the weak postsynaptic signaling of DSK neurons (Fig. 6D and 6E) or be a part of the metabolic reprogramming (Fig. 5D; also see below). Nonetheless, we focused on the neuropeptide DSK, given its unique expression in the brain and relevance to other social behaviors (e.g., mating, aggression). In fact, Dsk was the only overlapping gene that was downregulated upon social isolation across independent studies (Fig. S10A).

      As the reviewer pointed out, social isolation upregulated many genes, including those involved in metabolism. Our revised manuscript additionally showed that upregulated but not downregulated genes upon social isolation were substantially conserved across genetic backgrounds or independent DEG studies (Fig. 5C and S10A). We speculate that Drosophila has evolved a genetic reprogram where social isolation elevates metabolic gene expression to adaptively induce a metabolic shift for energy storage and fitness. We revised our text accordingly.

      Major 3. In lines 219-223, the genomic deletion by mutant or depletion by RNA interference emphasizes the role of neuropeptides DSK and its receptor CCKLR-17D1 in injury-induced clustering behaviors. How about the effect of neuropeptides overexpression? Do they confer injury-induced social interactions to isolated male flies. Meanwhile, in line 238, the transgenic excitation of CCKLR-17D1 neurons emphasizes the function of neuronal synaptic transmission in the pathway. Indeed, both neuropeptide expression and neuronal synaptic connections may be involved in the regulation of injury-induced clustering behaviors. It is recommended to separate the discussion of protein expression and the respective regulatory modes at the neuronal circuit level.

      We could not test DSK overexpression effects on injury-induced clustering in socially isolated males since we failed to validate DSK overexpression from a relevant transgenic line (https://flybase.org/reports/FBal0184043.htm). Instead, we provided additional data in the revised manuscript that independent genomic deletions of the CCKLR-17D1 locus (Fig. S15) or transgenic silencing of the synaptic transmission in CCKLR-17D1 neurons (Fig. S16) suppressed the injury-induced clustering in group-cultured male flies. According to the reviewer's suggestion, we modified our text to better distinguish between the effects of gene/protein expression vs. relevant neuron activities on social behavior plasticity.

      Major 4. Since a significant portion of the work in the first half of this paper is focused on elucidating two types of social distance in SNB, is there any difference in the regulation of social network plasticity by Dsk signaling pathway in the short-SD and long-SD lines?

      As the reviewer suggested, it will be informative to determine if Dsk signaling for social behavior plasticity is differentially regulated in short- vs. long-SD lines. One technical issue is that genetic factors shaping their SD traits still need to be defined. So, we are limited to performing standard genetic/transgenic experiments using the DGRP lines while retaining their SD phenotypes. Accordingly, our current approach was to compare DSK expression in short- vs. long-SD lines under grouped- vs. isolated-culture conditions. Future studies should address the review comment above.

      Minor ones:

      Minor 1. There is a color difference between the data spots and the figure legends in Figure 2H.

      Corrected.

      Minor 2. The anatomical sample images in Figure 4 and Figure 5 require scale bars.

      We added scale bars to Fig. 6 and 7 in the revised manuscript.

      Minor 3. The "grouped" and "grp" in Figures 3B-3F can be unified as "grp", while the "isolated" and "iso" can be unified as "iso". So that the male and female symbols in Figure 3F will not have any deviation in the mark.

      We unified the labels throughout the revised manuscript according to the reviewer's suggestion.

      Minor 4. The difference in Denmark signals of each group of neurons under the condition of injury should also be compared in Figure 4C.

      The DenMark signals were also compared between control and injury conditions (Fig. S12).

      Minor 5. What is the effect of inactivating CCKLR-17D1 or CCKLR-17D3 by shibire on injury-induced clustering in group-cultured adults in Figure 5E? (This relates to major comment 3)

      We actually employed a tetanus toxin light chain (TNT) to block synaptic transmission in CCKLR-17D1 neurons and found that the transgenic manipulation of CCKLR-17D1 neuron activity suppressed injury-induced clustering in group-cultured males (Fig. S16). Since 1) our additional data using independent deletion alleles further excluded the possible implication of CCKLR-17D3 in the SNB plasticity (Fig. S15) and 2) a transgenic Gal4 knock-in for the CCKLR-17D-3 locus is not available, we focused on the CCKLR-17D1 experiments in our current study and wished to leave more detailed circuit analyses for future studies.

      Reviewer #2 (Significance (Required)):

      General assessment:

      The strengths of this work is that the authors have identified specific lines with short social distance or long social distance by conducting extensive screening experiments. By transcriptome analyses and gene ontology (GO) analyses they revealed genes up or down regulation in the social experience. They have also narrowed down to the DSK signaling involved in the social experience encoding process. However, the study's limitation lies in the lack of clarity regarding the DSK signaling pathway. The mechanisms through which social experiences affect neuronal activity and synaptic connections remain unclear. Further research on upstream and downstream pathways could enhance understanding. Although the article proposes injury-induced clustering behaviors, the key sensory pathways involved in social network behavior plasticity during early social experiences are not well-defined. Conducting sensory deprivation experiments could elucidate sensory involvement. Overall, the study's strengths lie in its comprehensive approach, large sample size, and translational potential. To enhance future research, investigating the complexity of neural mechanisms and expanding the exploration of regulating pathways could be beneficial. Additionally, exploring the ecological relevance of the findings could deepen our understanding of social behavior in natural environments.

      Our current work provides a neuroanatomical basis for early-life social memory and experience-dependent plasticity of social-interaction behaviors. We believe future studies will build up the mechanical details for social experience-dependent DSK expression, DSK neuron activity, and behavioral outputs. Regarding the key sensory pathways, we examined injury-induced SNB plasticity of distinct sensory mutants (e.g., olfactory, visual, auditory, etc.) and our revised manuscript provided additional data that norpA-dependent visual sensing might play a crucial role in this process (Fig. S9), consistent with the previous finding that vision is required for larval clustering behaviors in Drosophila (https://pubmed.ncbi.nlm.nih.gov/28918946/).

      Advance:

      Compared to previous studies such as Heiko Dankert et al.'s publication in 2009 in Nature Methods and Assa Bentzur et al.'s publication in 2020 in Current Biology, which also investigated the impact of early life experiences on male social behavior and examined various aspects of social network construction, this study employs a systematic analysis of social network behavior (SNB) in Drosophila, integrating genetic, physiological, and behavioral assessments. The authors conducted detailed and systematic analyses through transcriptome and gene ontology (GO) analyses, including the visualization of gene expression heatmaps, volcano plots, and overlapping analysis of differentially expressed genes (DEGs) between grouped and isolated conditions. Additionally, this research delved into the regulatory pathway of DSK signaling in male-specific SNB plasticity, with a particular focus on the DSK to CCKLR-17D1 signaling, which encodes early social experiences. The research provides valuable insights into the genetic basis and adaptability of social behavior in Drosophila. Moreover, it illuminates the neural mechanisms that underlie social memory and the ability of groups to adapt across different species.

      Audience:

      Researchers conducting basic research in genetics, neuroscience, behavioral biology, and evolutionary biology, particularly those focused on understanding social behavior and its underlying genetic and neural mechanisms, will find this study highly relevant. Additionally, researchers studying social cognition, social memory, and group dynamics in various species may also be interested in these findings.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Jeong et al. investigate the influence of genetic factors and early-life social experience on social network behaviors in adult Drosophila. Utilizing isogenic DGRP lines, the study correlates social distances with key developmental and physiological traits-developmental time, digging activity, and food intake. The findings suggest that adult flies with shorter social distance -indicating closer proximity to each other-face developmental disadvantages that are offset by the benefits of social grouping. The authors argue for an evolutionary advantage in such social behaviors, suggesting they help compensate for individual developmental deficits. The study further identifies the Dsk signaling pathway as a key mediator of social network behavior plasticity in male flies, particularly under challenging conditions like mechanical injury.

      The study undertakes a broad range of behavioral and neurogenetic approaches, demonstrating an extensive scope of research efforts. Despite its ambitious scope, the manuscript lacks a clear rationale and cohesive flow among its sections. The numerous experiments do not merge into a unified narrative, leaving the reader questioning the reasoning and progression behind the experimental choices. The manuscript needs a clearer structure, well-defined hypotheses, and more detailed methodological descriptions. Greater emphasis on novelty and better integration with existing literature are also needed. The lack of control experiments and adequate statistical analysis weakens some conclusions.

      Major comments

      1. The authors show that flies in short SD lines reduce their activity over time, leading to the formation of social clusters (Fig. 1B). This clustering could potentially be attributed to reduced activity rather than active social preferences. It would be informative to test whether these SD flies exhibit similar social behaviors when placed in a larger arena, to test if the clustering persists under varied environmental conditions.

      Short-SD flies did not reduce their moving speed over time when we placed a single fly in the original arena and assessed its locomotor behavior individually (Fig. S2). Thus, it is likely that the reduced activity in a group of short-SD flies is an effect of clustering over time but not necessarily the cause. We also confirmed that short and long SD lines retain their clustering property even in a larger arena (8.5 cm in diameter) (Fig. S3). We included these new data in our revised manuscript to better demonstrate active social preferences in the DGRP lines.

      In lines 78-79, the authors claim that "short-SD flies gradually reduced SD over time and stayed in the cluster." However, the study established SD clustering by only analyzing behavior during the last quarter of a 10-minute window, assigning a single data point to each fly and taking the average for group SD. Yet, a single value cannot demonstrate whether initially formed clusters remained stable-unchanged-over time. To strengthen this point, the authors could investigate dynamic changes in SD over a longer period to demonstrate stability, or alternatively, adjust the language to better convey the findings. Additionally, including a time scale in Fig. 1B would enhance the clarity of these findings.

      We traced dynamic changes in SD and walking speed of representative DGRP lines over the 10-minute window (Fig. 2A and 2B) and modified our text accordingly in the revised manuscript. We also included a time scale in Fig. 1B and relevant figures.

      The statistical analysis presented in Figure 2C-D raises concerns. It appears that feeding and digging efficiency in both SD type lines benefit from socialization, suggesting that the effects attributed to SD might stem from the overall digging and feeding activity of each line. Therefore, it is crucial to integrate both social distance (short vs. long) and socialization (grouped vs. isolated) into the analysis using methods that allow for the assessment of confounding effects (interaction), such as two-way ANOVA or regression, depending on the data. This would help authors to clarify whether isolation reduces feeding overall (both line types) and determine if this reduction is more pronounced in short-SD lines. Additionally, it is counterintuitive that lines with more larvae per cluster show worse digging efficiency when previous studies, such as Dombrovski et al. (2017), have shown that larger groups of larvae typically exhibit better digging efficiency. This discrepancy highlights the need for a thorough re-evaluation of the data and assumptions regarding group dynamics and their impact on resource access.

      As the reviewer suggested, we employed ordinary or aligned ranks transformation 2-way ANOVA (depending on the normality and equal variance of a given dataset) to determine if social distance and socialization cooperatively contribute to developmental phenotypes. Our new analyses confirmed significant interaction effects of social distance and socialization on most developmental phenotypes tested (i.e., larval digging activity, developmental time, %male progeny, and %eclosion success). These results convincingly support that short-SD larvae benefit more from socialization than long-SD larvae to compensate for the inferior phenotypes in isolated individuals. We speculate that the feeding amount of isolated long-SD individuals may be saturating for normal development (i.e., developmental time, %male progeny), possibly explaining the lack of interaction effects on food intake while displaying developmental inferiorities only in isolated short-SD individuals. We reason that grouped long-SD flies should not necessarily display poorer digging activity than grouped short-SD flies since isolated long-SD individuals displayed much higher digging activity than isolated short-SD individuals. Consistent with the previous finding, both SD lines showed better digging efficiency when grouped than isolated. We included these new analyses in the revised manuscript and modified our text accordingly. To clarify any statistical issues, we included a summary of all our statistical analyses performed in the revised manuscript (Dataset S5).

      The choice to use the percentage of male progeny as a measure of developmental success is confusing, especially without an explanation for why it is favored over measures like overall progeny survival rates. As with digging and feeding, the statistical analysis should include an examination of potential interaction effects to fully assess how social conditions impact developmental outcomes.

      The percentage of male progeny was one of the most evident developmental phenotypes on which social distance and socialization showed significant interaction effects. In the revised manuscript, we further included the percentage of eclosed flies as a measure for the overall progeny survival rate (Fig. 3F) and performed 2-way analyses to validate the significant interaction effects of social distance and socialization on various larval/developmental phenotypes. Please see our response to reviewer #3, major comment #3 above.

      The rationale for using physical injury to induce SNB in the study is not clearly explained, raising concerns about the potential impact of injury on overall locomotion. Before employing such a method in sociality experiments, it is crucial to demonstrate that the injury does not affect locomotion. Additionally, the study's methodologies for transitioning between grouped and isolated cultures (present only in Fig. 2I and not in the methods section), as well as the specific methods used to measure social distance (SD) in isolated flies, are not sufficiently detailed. This lack of clarity complicates the evaluation of the study's conclusions.

      To determine if the long-SD lines express their social behaviors selectively (e.g., upon physiological challenges), we introduced physical injury to the SNB analysis. There was a positive correlation between locomotion activity and SD trait among DGRP lines (i.e., DGRP lines with low walking speed and centroid velocity exhibited short-SD phenotypes in general) (Fig. 1D). This observation thus prompted us to hypothesize that modest injury may reduce locomotor activity in individuals, facilitate their interactions in a group, and shorten the overall SD. The mechanical injury actually shortened SD in both the short- and long-SD lines (Fig. 4D and S7A). Under the same experimental condition, mechanical injury reduced walking speed and centroid velocity only in the long-SD lines (Fig. S8), whereas social isolation blunted the injury effects (Fig. 4D, S7A, and S8). We reason that our injury condition does not severely impair general locomotion per se to abolish or overestimate SNB, but low activity in grouped long-SD flies is likely a consequence of their injury-induced clustering. We clarified our original text for the rationale and included the new data in the revised manuscript (Fig. S8). We also revised the method text for the transitions between grouped and isolated cultures, as well as for measuring SD in isolated flies.

      Lines 104-106 "The clustering property of short-SD lines may have evolved as a compensation mechanism for their developmental inferiority in individuals". To support this claim, the authors should assess the significance of interactions terms as stated earlier.

      Please see our responses to the reviewer's relevant comments above (reviewer #3, major comments #3 and #4).

      In Fig. 4, the authors conclude that Drosulfakinin (DSK) signaling encodes early-life experiences for SNB plasticity. It is crucial for the authors to differentiate whether changes in feeding behavior are directly due to DSK or if they are secondary effects resulting from altered social interactions mediated by DSK.

      Previous studies demonstrated that DSK is a satiety-signaling molecule whose expression is elevated upon feeding to suppress food intake (https://pubmed.ncbi.nlm.nih.gov/34398892/; https://pubmed.ncbi.nlm.nih.gov/32314736/; https://pubmed.ncbi.nlm.nih.gov/25187989/; https://pubmed.ncbi.nlm.nih.gov/22969751/). Under our experimental context, social isolation downregulated DSK expression and DSK neuron activity, whereas isolated larvae rather reduced their food intake. It is thus unlikely that changes in the feeding behavior of isolated larvae directly implicate DSK-dependent satiety signaling. We discussed this issue in our revised manuscript.

      In Fig 4, the data show that DSK peptide is significantly increased in cell bodies in grouped long DS lines when compared with grouped short DS lines (Fig. 4B). However, no changes are reported at the level of DSK projection levels when comparing these groups. Can the authors clarify this?

      The SD-trait effects on DSK levels were evident in cell bodies but not in DSK neuron projections. We reason that axonal transport or processing of the neuropeptide was limiting under the group-culture condition. These observations might also be relevant to our conclusion that Dsk is not crucial for shaping the SD traits per se. We revised our text accordingly.

      Additionally, the data show that DSK activity is reduced by isolation in both types of SD. To clarify if this effect is driven by isolation only, and not type of line (short vs long SD line), the interaction term should be tested. Furthermore, it is not clear what lines are used in live imaging (e.g., Fig 4C-F).

      We detected no significant interaction effects of SD type and social isolation on DSK expression (Fig. 6B). Live-brain imaging of the GCaMP-expressing DSK neurons was performed using a transgenic line (i.e., Dsk-Gal4>UAS-GCaMP) in a wild-type background. Since genetic factors shaping the SD traits were not defined in each DGRP line, we could not combine the transgenes with DGRP backgrounds while retaining their respective SD phenotypes (please also see our response to reviewer #2 major comment #4 above). Nonetheless, the GCaMP imaging demonstrates that 1) either injury or social isolation alone significantly affects DSK neuron activity, but 2) the two conditions act independently on the GCaMP signals (i.e., no significant interaction effects). We clarified it in our revised manuscript and displayed each genotype used in our imaging experiments.

      In the 'Male-specific DSK-CCKLR-17D1 signalling mediates SNB plasticity' section (line 217), the analysis should include an interaction term to account for the possible confounding effects of isolation and injury on SD. This would aid in determining whether the impacts of social isolation and injury on DSK signalling and SNB plasticity are independent of each other or if they interact in significant ways, as stated by the authors.

      Throughout our revised manuscript, we performed 2-way analyses to validate the interaction effects of isolation and injury on SD and support our conclusion. We also included a summary of all our statistical analyses performed in the revised manuscript (Dataset S5).

      Minor comments

      1. The introduction section would also benefit from major rewriting to clearly indicate the research gap and hypothesis tested. The introduction section would also benefit from major rewriting to clearly indicate the research gap and hypothesis tested. The manuscript would benefit from a more thorough integration of previous studies related to Drosophila social behavior (e.g., Blumstein, D. T. et al., 2010; Schneider, J., Dickinson, M. H., & Levine, J. D., 2012; Simon, A. F. et al., 2012; Ramdya, P. et al., 2015). While the current references are adequate, a more detailed discussion of how this study builds upon and diverges from existing literature would be beneficial.

      The introduction of our original manuscript starts from previous findings on Drosophila social behaviors and clearly indicates what remains elusive, thereby defining our biological questions. We further explain why we focus on SD among other social network measures published previously and outline our approaches for new findings in this study (i.e., the principles of social network behavior and its plasticity). Since our original text was written in a concise manner, we revised the introduction to give a more detailed description of what earlier studies have discovered according to the reviewer suggestion.

      A better description of methods, especially behavioral approaches, could vastly help in understanding the results. Clarifying the methodologiy, particularly the behavioral approaches, would greatly enhance the understanding of the results. Also, the method for quantifying the total number of larvae per vial is unclear, particularly whether variations in larval density were considered. This is crucial, as different densities could affect the available sensory cues necessary for larval aggregation, such as vision (e.g. Dombrovski et al. Curr Biol. 2019). Better descriptions of the results and inclusion of exact statistical analyses used in support of the claims are also needed.

      We revised our method text to better describe our experimental conditions. We further described how we controlled larval density to prepare group-cultured larvae and adults for analyzing larval behaviors and developmental phenotypes. Finally, we included a summary of all our statistical analyses performed in the revised manuscript (Dataset S5).

      Some terms and descriptions in the manuscript are somewhat ambiguous, such as "social memory" and "adaptive social plasticity" and should be better defined.

      We better defined the two key short terms in the introduction of our revised manuscript.

      Line 86-89: "Social interactions compensate for developmental inferiority in short-SD larvae Why do flies display SNB? One clue comes from the previous observation that Drosophila larvae collectively dig culture media and improve food accessibility, possibly facilitating their constitutive feeding during early development..." - This paragraph could be moved to the introduction section.

      As we reorganized the introduction in our revised manuscript, we feel it should be fine to leave the paragraph above in the original context.

      In lines 77-78, the manuscript mentions that the locomotion trajectories of individual flies confirm certain characteristics but fails to provide an analysis of individual locomotion metrics, e.g., tortuosity, distance walked, etc. The authors should add quantitative analysis to support claims about trajectories or alternatively rephrase the sentence to remove any claims about the trajectories of flies.

      As the reviewer suggested, we added quantitative analyses of cumulative walking distances over time and total walking distances in individual flies to our revised manuscript (Fig. 2D and S1C).

      After screening 175 strains, three short and long SD lines were selected. It would be good if justification for the authors' choice were included, as the selected lines were not the ones with the longest or shortest SD as seen in Fig. 1C.

      We ranked individual DGRP lines for each of the two group properties (i.e., SD and centroid velocity) and then selected the top and bottom three lines based on their average ranking. We included this rationale in our revised manuscript.

      Other comments:

      • Line 72-73: What correlation was performed? This should be included in the results/methods section.

      As described in the figure legend of our original manuscript, the significance of the correlation was determined by Spearman correlation analysis. We further included the method description in the results and methods section of our revised manuscript.

      • Line 113: Change "pre-trained colleagues" to "pre-trained flies".

      Changed.

      • Lines 321 and 325: Use "3D" instead of "2D" as three dimensions are given?

      Corrected.

      • Ensure all figures are correctly scaled and aligned.

      We revised our figures to avoid any of these issues.

      • Video: Including short videos for each behavioral test (e.g., feeding) would help in understanding it.

      We included representative video files for SNB and aggression assays in the revised manuscript (Video S1-S12).

      • Fig. 4 should include control neurons that do not change with social grouping; authors should also show ROI.

      We included new data for control neurons (Fig. S13, Pdf-Gal4>UAS-GCaMP7f) and also showed ROI for quantification in our revised manuscript (Fig. 6C and S13).

      • Line 27, 86: Change "inferiority" to "disadvantage".

      We feel inferiority fits better in the context of our overall study.

      Reviewer #3 (Significance (Required)):

      This study extends existing knowledge by linking specific genetic pathways to behavioral outcomes in a well-established model system, providing new insights into the genetic and neural basis of social behavior. The use of DGRP lines to dissect the impact of genetic variation on behavior is particularly valuable. The identification of the Dsk signaling pathway as a mediator of these behaviors under stress is interesting. However, the study would benefit from more in-depth statistical analysis and expanded experimental designs to solidify the conclusions. It should also more clearly highlight the novelty of its findings and better integrate them with the current literature on Dsk signaling and social behaviors.

      My expertise is in behavioral neuroscience. The insights from this study promise to deepen our understanding of the genetic and neural mechanisms behind social behaviors. The potential implications of this research are likely to extend well beyond Drosophila, influencing studies across various species.

      Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      Summary:

      The manuscript investigates the role of Drosulfakinin (Dsk) signaling in Drosophila social network behavior (SNB) and its plasticity based on early-life experiences. The study employs a systematic analysis using 175 inbred strains to link short social distance (SD) with developmental time, food intake, and activity levels. Key findings suggest that social interactions during development compensate for individual developmental inferiority and that early-life social experience is necessary for adaptive social behaviors in adults. The genetic basis of SNB is further explored through transcriptome analyses, implicating Dsk and one of its receptors in mediating these behaviors.

      Major comments:

      Are the key conclusions convincing?

      The key conclusions are well-supported by the data presented. The association between early-life social interactions and adult social behaviors is convincingly demonstrated through multiple experimental setups.

      We appreciate the reviewer's positive feedback on our rigorous approaches and key conclusions.

      Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      Some claims, particularly those regarding the evolutionary implications of Dsk signaling and its conservation across species, might benefit from being presented as hypotheses or speculations rather than definitive conclusions. This would align with the current evidence while acknowledging the need for further investigation.

      As the reviewer suggested, we toned down our claims on the evolutionary implications of Dsk signaling.

      Would additional experiments be essential to support the claims of the paper?

      Measure aggression and male-male courtship behavior in the 6 DGRP lines to examine whether SD correlates with these behaviors.

      Aggression but not male-male courtship behaviors correlated with SD phenotypes in the 6 DGRP lines. We included the new data in our revised manuscript (Fig. S4 and S11). Please also see our response to a relevant reviewer comment above (reviewer #1, major comment #7).

      Include behavior results of flies tested in Figures 4C and D.

      We included the behavior data in our revised manuscript (Fig. S12A).

      Repeat the CCKLR-17D1 experiments shown in Figures 5 F and G for CCKLR-17D3 to provide extra evidence that CCKLR-17D1 mediates DSK's effects on SNB.

      We employed a transgenic Gal4 knock-in for the CCKLR-17D1 locus to specifically manipulate the activity of CCKLR-17D1-expressing neurons. However, a Gal4 knock-in line for the CCKLR-17D3 locus was not available for pairwise comparison. We instead provide extra evidence for CCKLR-17D1 function in SNB plasticity by showing that 1) independent genomic deletions of CCKLR-17D1 but not CCKLR-17D3 suppressed injury-induced clustering in group-cultured males (Fig. S15) and 2) blocking of synaptic transmission in CCKLR-17D1 neurons phenocopied CCKLR-17D1 deletion (Fig. S16). Please also see our response to a relevant reviewer comment above (reviewer #2, minor comment #5)

      Are the suggested experiments realistic in terms of time and resources?

      These experiments are realistic and feasible within typical research timelines. These might require a few months and moderate funding.

      According to the reviewer suggestions, we included new pieces of data in our revised manuscript to address the reviewer concerns and further support our conclusions.

      Are the data and the methods presented in such a way that they can be reproduced?

      The methods section is detailed, providing sufficient information for replication.

      We appreciate the reviewer's positive feedback on our method description.

      Are the experiments adequately replicated and statistical analysis adequate?

      The experiments appear to be adequately replicated, and the statistical analyses are generally appropriate. However, ensuring consistent selection of statistical methods can further support the evidence presented (Figures 2C and D).

      We performed more appropriate statistical analyses in the revised manuscript (e.g., 2-way ANOVA of the data presented in our original Fig. 2C and 2D) and included a summary of all the statistical analyses in the revised manuscript (Dataset S5). Please also see our responses to the reviewer comments above (reviewer #3, major comments #3 and #10; reviewer #3, minor comment #2).

      Minor comments:

      Specific experimental issues that are easily addressable:

      Ensure clarity in the presentation of figures and legends. Some figures could benefit from more detailed legends explaining all aspects of the data shown.

      We revised our figures and figure legends to address this issue and improve clarity.

      Are prior studies referenced appropriately?

      The manuscript references prior studies appropriately, providing a solid context for the current research.

      We appreciate the reviewer comment.

      Are the text and figures clear and accurate?

      The text is clear, but some figures, particularly those with complex data, could be more informative with additional annotations.

      We revised our figures and figure legends to address this issue.

      The larvae pictures in Figure 2A should be replaced with ones with higher resolution with drawn larval contours.

      We replaced the larval pictures with higher resolution and indicated individual larvae with arrows in the revised manuscript (Fig. 3A).

      Scale bars are missing in most of the images shown.

      We added scale bars to our revised figures.

      Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Consider providing a graphical abstract summarizing the key findings. This would aid readers in quickly grasping the main conclusions. Additionally, breaking down complex figures into simpler, more focused panels might improve readability.

      As the reviewer suggested, we split complex figures into simpler ones to improve the readability of our revised manuscript and data. We also provided a graphical abstract summarizing our findings (Fig. 8).

      Reviewer #4 (Significance (Required)):

      Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      This study provides significant conceptual advances in understanding the genetic and neurobiological basis of social behavior in Drosophila. By linking early-life social experiences to adult social behaviors, it highlights the importance of developmental context in shaping adult phenotypes.

      Place the work in the context of the existing literature (provide references, where appropriate).

      The work builds on previous studies on Drosophila social behavior and neurogenetics. It extends the current understanding by integrating developmental and adult behaviors with genetic and molecular analyses. References to foundational works in Drosophila social behavior and recent studies on neuropeptide signaling are well-placed.

      State what audience might be interested in and influenced by the reported findings.

      Researchers in the fields of neurogenetics, behavioral ecology, developmental biology, and evolutionary biology will find this work particularly relevant. It also has implications for those studying social behavior across species, including mammals.

      Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Expertise: Neurogenetics, Behavioral Neuroscience, Drosophila Genetics, Social Behavior, Bioinformatics. I have sufficient expertise to evaluate the genetic, behavioral, and transcriptomics aspects of the study. Specific details on the imaging studies might require additional expert evaluation.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      Summary:

      The manuscript investigates the role of Drosulfakinin (Dsk) signaling in Drosophila social network behavior (SNB) and its plasticity based on early-life experiences. The study employs a systematic analysis using 175 inbred strains to link short social distance (SD) with developmental time, food intake, and activity levels. Key findings suggest that social interactions during development compensate for individual developmental inferiority and that early-life social experience is necessary for adaptive social behaviors in adults. The genetic basis of SNB is further explored through transcriptome analyses, implicating Dsk and one of its receptorS in mediating these behaviors.

      Major comments:

      Are the key conclusions convincing?

      The key conclusions are well-supported by the data presented. The association between early-life social interactions and adult social behaviors is convincingly demonstrated through multiple experimental setups. Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? Some claims, particularly those regarding the evolutionary implications of Dsk signaling and its conservation across species, might benefit from being presented as hypotheses or speculations rather than definitive conclusions. This would align with the current evidence while acknowledging the need for further investigation.

      Would additional experiments be essential to support the claims of the paper?

      Measure aggression and male-male courtship behavior in the 6 DGRP lines to examine whether SD correlates with these behaviors. Include behavior results of flies tested in Figures 4C and D. Repeat the CCKLR-17D1 experiments shown in Figures 5 F and G for CCKLR-17D3 to provide extra evidence that CCKLR-17D1 mediates DSK's effects on SNB.

      Are the suggested experiments realistic in terms of time and resources?

      These experiments are realistic and feasible within typical research timelines. These might require a few months and moderate funding.

      Are the data and the methods presented in such a way that they can be reproduced?

      The methods section is detailed, providing sufficient information for replication.

      Are the experiments adequately replicated and statistical analysis adequate?

      The experiments appear to be adequately replicated, and the statistical analyses are generally appropriate. However, ensuring consistent selection of statistical methods can further support the evidence presented (Figures 2C and D).

      Minor comments:

      Specific experimental issues that are easily addressable:

      Ensure clarity in the presentation of figures and legends. Some figures could benefit from more detailed legends explaining all aspects of the data shown.

      Are prior studies referenced appropriately?

      The manuscript references prior studies appropriately, providing a solid context for the current research.

      Are the text and figures clear and accurate?

      The text is clear, but some figures, particularly those with complex data, could be more informative with additional annotations. The larvae pictures in Figure 2A should be replaced with ones with higher resolution with drawn larval contours. Scale bars are missing in most of the images shown.

      Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Consider providing a graphical abstract summarizing the key findings. This would aid readers in quickly grasping the main conclusions. Additionally, breaking down complex figures into simpler, more focused panels might improve readability.

      Significance

      Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      This study provides significant conceptual advances in understanding the genetic and neurobiological basis of social behavior in Drosophila. By linking early-life social experiences to adult social behaviors, it highlights the importance of developmental context in shaping adult phenotypes.

      Place the work in the context of the existing literature (provide references, where appropriate).

      The work builds on previous studies on Drosophila social behavior and neurogenetics. It extends the current understanding by integrating developmental and adult behaviors with genetic and molecular analyses. References to foundational works in Drosophila social behavior and recent studies on neuropeptide signaling are well-placed.

      State what audience might be interested in and influenced by the reported findings.

      Researchers in the fields of neurogenetics, behavioral ecology, developmental biology, and evolutionary biology will find this work particularly relevant. It also has implications for those studying social behavior across species, including mammals.

      Define your field of expertise with a few keywords to help the authors contextualize your point of view.

      Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Expertise: Neurogenetics, Behavioral Neuroscience, Drosophila Genetics, Social Behavior, Bioinformatics. I have sufficient expertise to evaluate the genetic, behavioral, and transcriptomics aspects of the study. Specific details on the imaging studies might require additional expert evaluation.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Jeong et al. investigate the influence of genetic factors and early-life social experience on social network behaviors in adult Drosophila. Utilizing isogenic DGRP lines, the study correlates social distances with key developmental and physiological traits-developmental time, digging activity, and food intake. The findings suggest that adult flies with shorter social distance -indicating closer proximity to each other-face developmental disadvantages that are offset by the benefits of social grouping. The authors argue for an evolutionary advantage in such social behaviors, suggesting they help compensate for individual developmental deficits. The study further identifies the Dsk signaling pathway as a key mediator of social network behavior plasticity in male flies, particularly under challenging conditions like mechanical injury.

      The study undertakes a broad range of behavioral and neurogenetic approaches, demonstrating an extensive scope of research efforts. Despite its ambitious scope, the manuscript lacks a clear rationale and cohesive flow among its sections. The numerous experiments do not merge into a unified narrative, leaving the reader questioning the reasoning and progression behind the experimental choices. The manuscript needs a clearer structure, well-defined hypotheses, and more detailed methodological descriptions. Greater emphasis on novelty and better integration with existing literature are also needed. The lack of control experiments and adequate statistical analysis weakens some conclusions.

      Major comments

      1. The authors show that flies in short SD lines reduce their activity over time, leading to the formation of social clusters (Fig. 1B). This clustering could potentially be attributed to reduced activity rather than active social preferences. It would be informative to test whether these SD flies exhibit similar social behaviors when placed in a larger arena, to test if the clustering persists under varied environmental conditions.
      2. In lines 78-79, the authors claim that "short-SD flies gradually reduced SD over time and stayed in the cluster." However, the study established SD clustering by only analyzing behavior during the last quarter of a 10-minute window, assigning a single data point to each fly and taking the average for group SD. Yet, a single value cannot demonstrate whether initially formed clusters remained stable-unchanged-over time. To strengthen this point, the authors could investigate dynamic changes in SD over a longer period to demonstrate stability, or alternatively, adjust the language to better convey the findings. Additionally, including a time scale in Fig. 1B would enhance the clarity of these findings.
      3. The statistical analysis presented in Figure 2C-D raises concerns. It appears that feeding and digging efficiency in both SD type lines benefit from socialization, suggesting that the effects attributed to SD might stem from the overall digging and feeding activity of each line. Therefore, it is crucial to integrate both social distance (short vs. long) and socialization (grouped vs. isolated) into the analysis using methods that allow for the assessment of confounding effects (interaction), such as two-way ANOVA or regression, depending on the data. This would help authors to clarify whether isolation reduces feeding overall (both line types) and determine if this reduction is more pronounced in short-SD lines. Additionally, it is counterintuitive that lines with more larvae per cluster show worse digging efficiency when previous studies, such as Dombrovski et al. (2017), have shown that larger groups of larvae typically exhibit better digging efficiency. This discrepancy highlights the need for a thorough re-evaluation of the data and assumptions regarding group dynamics and their impact on resource access.
      4. The choice to use the percentage of male progeny as a measure of developmental success is confusing, especially without an explanation for why it is favored over measures like overall progeny survival rates. As with digging and feeding, the statistical analysis should include an examination of potential interaction effects to fully assess how social conditions impact developmental outcomes.
      5. The rationale for using physical injury to induce SNB in the study is not clearly explained, raising concerns about the potential impact of injury on overall locomotion. Before employing such a method in sociality experiments, it is crucial to demonstrate that the injury does not affect locomotion. Additionally, the study's methodologies for transitioning between grouped and isolated cultures (present only in Fig. 2I and not in the methods section), as well as the specific methods used to measure social distance (SD) in isolated flies, are not sufficiently detailed. This lack of clarity complicates the evaluation of the study's conclusions.
      6. Lines 104-106 "The clustering property of short-SD lines may have evolved as a compensation mechanism for their developmental inferiority in individuals". To support this claim, the authors should assess the significance of interactions terms as stated earlier.
      7. In Fig. 4, the authors conclude that Drosulfakinin (DSK) signaling encodes early-life experiences for SNB plasticity. It is crucial for the authors to differentiate whether changes in feeding behavior are directly due to DSK or if they are secondary effects resulting from altered social interactions mediated by DSK.
      8. In Fig 4, the data show that DSK peptide is significantly increased in cell bodies in grouped long DS lines when compared with grouped short DS lines (Fig. 4B). However, no changes are reported at the level of DSK projection levels when comparing these groups. Can the authors clarify this?
      9. Additionally, the data show that DSK activity is reduced by isolation in both types of SD. To clarify if this effect is driven by isolation only, and not type of line (short vs long SD line), the interaction term should be tested. Furthermore, it is not clear what lines are used in live imaging (e.g., Fig 4C-F).
      10. In the 'Male-specific DSK-CCKLR-17D1 signalling mediates SNB plasticity' section (line 217), the analysis should include an interaction term to account for the possible confounding effects of isolation and injury on SD. This would aid in determining whether the impacts of social isolation and injury on DSK signalling and SNB plasticity are independent of each other or if they interact in significant ways, as stated by the authors.

      Minor comments

      1. The introduction section would also benefit from major rewriting to clearly indicate the research gap and hypothesis tested. The introduction section would also benefit from major rewriting to clearly indicate the research gap and hypothesis tested. The manuscript would benefit from a more thorough integration of previous studies related to Drosophila social behavior (e.g., Blumstein, D. T. et al., 2010; Schneider, J., Dickinson, M. H., & Levine, J. D., 2012; Simon, A. F. et al., 2012; Ramdya, P. et al., 2015). While the current references are adequate, a more detailed discussion of how this study builds upon and diverges from existing literature would be beneficial.
      2. A better description of methods, especially behavioral approaches, could vastly help in understanding the results. Clarifying the methodologiy, particularly the behavioral approaches, would greatly enhance the understanding of the results. Also, the method for quantifying the total number of larvae per vial is unclear, particularly whether variations in larval density were considered. This is crucial, as different densities could affect the available sensory cues necessary for larval aggregation, such as vision (e.g. Dombrovski et al. Curr Biol. 2019). Better descriptions of the results and inclusion of exact statistical analyses used in support of the claims are also needed.
      3. Some terms and descriptions in the manuscript are somewhat ambiguous, such as "social memory" and "adaptive social plasticity" and should be better defined.
      4. Line 86-89: "Social interactions compensate for developmental inferiority in short-SD larvae Why do flies display SNB? One clue comes from the previous observation that Drosophila larvae collectively dig culture media and improve food accessibility, possibly facilitating their constitutive feeding during early development..." - This paragraph could be moved to the introduction section.
      5. In lines 77-78, the manuscript mentions that the locomotion trajectories of individual flies confirm certain characteristics but fails to provide an analysis of individual locomotion metrics, e.g., tortuosity, distance walked, etc. The authors should add quantitative analysis to support claims about trajectories or alternatively rephrase the sentence to remove any claims about the trajectories of flies.
      6. After screening 175 strains, three short and long SD lines were selected. It would be good if justification for the authors' choice were included, as the selected lines were not the ones with the longest or shortest SD as seen in Fig. 1C.

      Other comments:

      • Line 72-73: What correlation was performed? This should be included in the results/methods section.
      • Line 113: Change "pre-trained colleagues" to "pre-trained flies".
      • Lines 321 and 325: Use "3D" instead of "2D" as three dimensions are given?
      • Ensure all figures are correctly scaled and aligned.
      • Video: Including short videos for each behavioral test (e.g., feeding) would help in understanding it.
      • Fig. 4 should include control neurons that do not change with social grouping; authors should also show ROI.
      • Line 27, 86: Change "inferiority" to "disadvantage".

      Significance

      This study extends existing knowledge by linking specific genetic pathways to behavioral outcomes in a well-established model system, providing new insights into the genetic and neural basis of social behavior. The use of DGRP lines to dissect the impact of genetic variation on behavior is particularly valuable. The identification of the Dsk signaling pathway as a mediator of these behaviors under stress is interesting. However, the study would benefit from more in-depth statistical analysis and expanded experimental designs to solidify the conclusions. It should also more clearly highlight the novelty of its findings and better integrate them with the current literature on Dsk signaling and social behaviors.

      My expertise is in behavioral neuroscience. The insights from this study promise to deepen our understanding of the genetic and neural mechanisms behind social behaviors. The potential implications of this research are likely to extend well beyond Drosophila, influencing studies across various species.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The article explores the social network behavior (SNB) of Drosophila, focusing on individual social distance (SD) within groups over time. A systemic analysis revealed that short SD is associated with long developmental time, low food intake, and hypoactivity. Group culturing compensates for developmental inferiority in short social distance individuals. Social interactions during early development positively impact adult physiology and adaptive social plasticity. Transcriptome analyses show genetic diversity for SD traits. The neuropeptide Drosulfakinin (DSK) signaling mediates social network behavior plasticity via receptor CCKLR-17D1, particularly in males, suggesting a dedicated neural mechanism encoding early-life experiences to adaptively transform group properties. The research suggests that animals have developed neural mechanisms to encode early-life experiences. It offers insights into the genetic foundation and adaptability of social behavior in Drosophila, shedding light on the neural processes involved in social memory and the adaptive behaviors of groups. These findings have broader implications for understanding similar neural mechanisms governing social memory and group behaviors in other species.

      Major concerns:

      Major 1. In Figure 2H, the latency to 75% arrival of short-SD isolated fruit flies (no matter with or without pioneers) is close to that of group fruit flies with pioneers while the latency to 75% arrival of long-SD isolated fruit flies (no matter with or without pioneers) is close to that of group fruit flies without pioneers; How to explain the difference in latency to 75% array between long-SD and short-SD isolated fruit flies? It seems that only in the long-SD fruit flies from the grouped experience, the absence of pioneers will increase the time it takes to reach the target food in the maze.

      Major 2. In Figure 3C, there are two up-regulated genes in the Drosophila group that overlap in short-SD and long-SD strains. Apart from Dsk, what is the other gene? In addition, for isolated fruit flies, both short-SD and long-SD lines have more gene expression upregulated. How to explain this phenomenon? Can you briefly explore the reasons for their upregulation and instead of involvement in SNB plasticity, what kind of physiological functions may they have?

      Major 3. In lines 219-223, the genomic deletion by mutant or depletion by RNA interference emphasizes the role of neuropeptides DSK and its receptor CCKLR-17D1 in injury-induced clustering behaviors. How about the effect of neuropeptides overexpression? Do they confer injury-induced social interactions to isolated male flies. Meanwhile, in line 238, the transgenic excitation of CCKLR-17D1 neurons emphasizes the function of neuronal synaptic transmission in the pathway. Indeed, both neuropeptide expression and neuronal synaptic connections may be involved in the regulation of injury-induced clustering behaviors. It is recommended to separate the discussion of protein expression and the respective regulatory modes at the neuronal circuit level.

      Major 4. Since a significant portion of the work in the first half of this paper is focused on elucidating two types of social distance in SNB, is there any difference in the regulation of social network plasticity by Dsk signaling pathway in the short-SD and long-SD lines?

      Minor ones:

      Minor 1. There is a color difference between the data spots and the figure legends in Figure 2H.

      Minor 2. The anatomical sample images in Figure 4 and Figure 5 require scale bars.

      Minor 3. The "grouped" and "grp" in Figures 3B-3F can be unified as "grp", while the "isolated" and "iso" can be unified as "iso". So that the male and female symbols in Figure 3F will not have any deviation in the mark.

      Minor 4. The difference in Denmark signals of each group of neurons under the condition of injury should also be compared in Figure 4C.

      Minor 5. What is the effect of inactivating CCKLR-17D1 or CCKLR-17D3 by shibire on injury-induced clustering in group-cultured adults in Figure 5E? (This relates to major comment 3)

      Significance

      General assessment:

      The strengths of this work is that the authors have identified specific lines with short social distance or long social distance by conducting extensive screening experiments. By transcriptome analyses and gene ontology (GO) analyses they revealed genes up or down regulation in the social experience. They have also narrowed down to the DSK signaling involved in the social experience encoding process. However, the study's limitation lies in the lack of clarity regarding the DSK signaling pathway. The mechanisms through which social experiences affect neuronal activity and synaptic connections remain unclear. Further research on upstream and downstream pathways could enhance understanding. Although the article proposes injury-induced clustering behaviors, the key sensory pathways involved in social network behavior plasticity during early social experiences are not well-defined. Conducting sensory deprivation experiments could elucidate sensory involvement. Overall, the study's strengths lie in its comprehensive approach, large sample size, and translational potential. To enhance future research, investigating the complexity of neural mechanisms and expanding the exploration of regulating pathways could be beneficial. Additionally, exploring the ecological relevance of the findings could deepen our understanding of social behavior in natural environments.

      Advance:

      Compared to previous studies such as Heiko Dankert et al.'s publication in 2009 in Nature Methods and Assa Bentzur et al.'s publication in 2020 in Current Biology, which also investigated the impact of early life experiences on male social behavior and examined various aspects of social network construction, this study employs a systematic analysis of social network behavior (SNB) in Drosophila, integrating genetic, physiological, and behavioral assessments. The authors conducted detailed and systematic analyses through transcriptome and gene ontology (GO) analyses, including the visualization of gene expression heatmaps, volcano plots, and overlapping analysis of differentially expressed genes (DEGs) between grouped and isolated conditions. Additionally, this research delved into the regulatory pathway of DSK signaling in male-specific SNB plasticity, with a particular focus on the DSK to CCKLR-17D1 signaling, which encodes early social experiences. The research provides valuable insights into the genetic basis and adaptability of social behavior in Drosophila. Moreover, it illuminates the neural mechanisms that underlie social memory and the ability of groups to adapt across different species.

      Audience:

      Researchers conducting basic research in genetics, neuroscience, behavioral biology, and evolutionary biology, particularly those focused on understanding social behavior and its underlying genetic and neural mechanisms, will find this study highly relevant. Additionally, researchers studying social cognition, social memory, and group dynamics in various species may also be interested in these findings.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Jeong et al. describes effects of neuronal signalling on collective behavior by measuring social distance (SD) which is used as a measure for social network behavior. Authors screened for a panel of inbred DGRP lines and compared the SD due to prior experience of group or single culturing when flies are recorded in a 55 mm diameter petri-dish. The screen uncovered 3 short Sd and three long-SD lines, and subsequent experiments showed differences in various behaviors such as recovery from injury, search for food and SD. Using RNA-seq from heads of flies they implicate Dsk signalling and show neuronal architecture and activity differences between grouped and isolated male flies. They implicate Dsk signalling in recovery from injury affecting SD but it was dispensable for grouped vs. isolated flies. I have suggestions to support the claims made, analysis and interpretation of the data and improve the clarity of writing. See my specific comments below.

      Major comments:

      1. For recording social behaviour in flies, arenas with sloped walls have been extensively used called 'fly bowl' (Simon et al. 2010 doi:10.1371/journal.pone.0008793; Robie et al., 2017, doi: 10.1016/j.cell.2017.06.032), or 'flyworld' (Liu et al., 2018 doi:10.1371/journal.pcbi.1006410). Such geometry ensures that flies don't walk on the side of the arena, and don't occlude each other. However, in the screen carried out in this manuscript, a petri-dish of 5.5 cm diameter filled with agar was used to record social network formation. Given the propensity of flies to walk on the walls of such circular arena, it will be difficult to know if the long and short SD behavior resulting from propensity to form clusters is an artefact of the assay condition used. It would be important to test the SNB and SD of at least the 6 selected short and long SD lines in arena with sloped walls to rule out this possibility.
      2. Methods section would require additional details about the SNB assay for instance, the height of the agar bed and the effective height in which interactions was recorded is not mentioned.
      3. Figure 2C and 2D results in larvae seems to contradict previous studies that have shown that isolated flies eat more as adults (Li et al. Nature, 2021) and Dsk-RNAi increases feeding in larvae and adults (Soderberg et al., 2012). It might be due to unique characteristic of DGRP lines used and would be helpful to discuss this.
      4. Rutabaga mutants for Fig S3 are directly compared with CS flies in the maze assay and it appears from methods that these lines were not isogenized, this can significantly impact the results. Similarly for some of the subsequent Dsk experiments it appears that lines were not isogenized (see below). These experiments would either need to be repeated of this caveat needs to be explicitly mentioned to avoid misinterpretation of the data.
      5. For Figure 3 describing RNA-seq data additional analysis would be helpful. Gene expression from isolated and grouped flies have been studied earlier by microarray and RNA-seq methods (Wang et al., PNAS 2008; Agrawal et al., JEB 2020; Li et al. Nature, 2021). Data from these studies should be compared with to see if there are common patterns of gene expression between long and short SD flies vs. group and isolated flies.
      6. GEO accession number and the analyzed list of DEGs should be provided as supplementary information.
      7. Figure 3E & F are not referred to in the main text, also there is no description of how the data was generated. Is this based on published data from Mackay lab about DGRP lines, if so, aggression experiments were not convincing in those studies and have been shown to not recapitulate 'real' aggression by other labs for several of the DGRP lines tested (Chowdhury et al., 2021, doi: 10.1038/s42003-020-01617-6).
      8. Dsk was shown to be reduced in isolated flies by RNA-seq and play a role in aggression by an earlier study (Agrawal et al., 2020) and should be cited appropriately (line 180-181) and elsewhere.
      9. For Fig. 4A-B, source images for other two DGRP lines should be included at least in supplementary information, if not as main figure.
      10. For Fig. 5, what is the reason that uninjured flies don't show any SD phenotype? Are there any changes in their velocity? This is mentioned in passing on line 228-29 but should be properly discussed.
      11. Trans-Tango and UAS-Denmark, SytGFP experiments were performed previously by Wu et al., 2020 and Wang et al., 2021 for Dsk, these two studies observed that P1 neurons are presynaptic and Dsk neurons are post synaptic but in Figure 4 it's not clear what are the presynaptic and post synaptic neurons. Also these studies are not cited appropriately in this section.

      Minor comments:

      1. Line no. 286: please mention about the relative humidity and light & dark cycle conditions and when experiments were conducted (ZT).
      2. Line no. 311: How many days old flies were used (isolated and group housed) for the behavior and transcriptomic studies?
      3. Line no. 349: for RNA extraction please mention how many fly heads were used and ZT for collection.
      4. Line no. 358: Italicize "Drosophila melanogaster".

      Significance

      This manuscript will be of interest to neuroscientists studying Drosophila social behaviors. The manuscript asks interesting questions and authors have done extensive set of experiments but the progress appears incremental given the current state of the field, especially for the later part of the manuscript. Some of the interpretation would also require additional data to bolster the claims made. Finally, the findings from this study could be better discussed in the context of what it is already known.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank all reviewers for their constructive criticism and suggestions. We have addressed all the points as detailed below. We also added an experiment that strengthens the connection between replication stress and GSF2 and suggests a role of GSF2 in recovery from the DNA replication checkpoint arrest (Fig. 4g).

      Reviewer #1 (Evidence, reproducibility and clarity)

      Summary

      The manuscript by the Khmelinskii group reports that they have successfully constructed two conditional degron libraries of budding yeast for almost all proteins. For this purpose, the authors employed an improved auxin-inducible degron (AID2). Initially, they constructed yeast libraries by fusing HaloTag to the N- or C-terminus of proteins and found that C-terminal tagging is less likely to affect the location and function of proteins (Fig. 1). Based on this finding, the authors fused mNG-AID*-3Myc or AID*-3Myc (AID-v1 or AID-v2 library, respectively) to more than 5600 proteins and found that 4079 proteins were significantly depleted when cells were treated with 5-Ph-IAA (Fig. 2). A fitness defect was observed for over 60% of essential proteins, indicating the target depletion showed the expected phenotype in many cases (Fig. 3). Finally, the authors screened proteins required for maintaining viability in the presence of MMS, CPT and HU, and identified common proteins involved in DNA repair (such as RAD52 epistasis proteins) and other proteins specific for MMS, CPT or HU resistance (Fig. 4). Furthermore, the authors revealed that an ER membrane protein, Gsf2, is required for HU resistance, which was not found in previous studies with the YKO library because gsf2∆ cells in the YKP library had acquired a suppressor mutation (Fig. 4e).

      Major comments

      1 - In Figure S2a, the authors initially checked the growth of yeast cells expressing OsTIR1(F74G) under the GAL1 promoter, saying that "expression of OsTir1(F74G) from the strong galactose-inducible GAL1 promoter had a negligible impact on yeast fitness (page 3)". To me, the OsTIR1(G74G) expressing cells showed slightly slower growth compared to the control cells. Moreover, the cells expressing it under the very strong GPD promoter showed apparent slow growth, suggesting that OsTIR1(F74G) overexpression caused a side effect. The authors should carefully evaluate the cells with GAL1-OsTIR1(F74G).

      Indeed high levels of OsTir1(F74G) impaired growth, at least in the strain background used in our experiments. Expression from the strongest promoter we tested (GPD) resulted in an obvious fitness defect, whereas conditional expression from the strong GAL1 promoter had a small impact on fitness and expression from the weaker CYC1 and ADH1 promoters did not affect fitness (Fig. S2a). Despite the small fitness impairment, we decided to use the GAL1-OsTIR1(F74G) construct for the AID libraries for two reasons: the conditional nature of this promoter is likely to limit adaptation to expression of OsTir1(F74G) and the high expression levels of OsTir1(F74G) are less likely to limit degradation of AID-tagged proteins. We added this explanation to the Results section.

      As suggested by the reviewer, we quantitatively evaluated the fitness impact of the GAL1-OsTIR1(F74G) construct. Using the colony size data of the AID-v1 library (grown on galactose medium with 1 µM 5-Ph-IAA, Fig. 2c), we compared colony sizes of OsTIR1– and OsTIR1+ strains for non-essential ORFs. As degradation of non-essential proteins is not expected to affect fitness, the difference in colony size between OsTIR1– and OsTIR1+ strains can be attributed to OsTir1 expression. On average, the presence of the OsTIR1 construct reduced colony size by 7% (median fitness of OsTIR1+ strains relative to OsTIR1– strains of 0.93 ± 0.06, n = 4698 non-essential ORFs). We performed the same comparison for strains that did not exhibited OsTIR1-dependent protein degradation. In this set of strains, the presence of the OsTIR1 construct also reduced colony size by 7% (median fitness of OsTIR1+ strains relative to OsTIR1– strains of 0.93 ± 0.05, n = 624 ORFs in the “not affected” group in Fig. 2d). We added this information to Fig. S3a.

      2 - Given the possibility that OsTIR1(F74G) overexpression might cause a growth problem, it is not appropriate to compare OsTIR1+ and OsTIR1- conditions for evaluating growth fitness (Fig. 2). As shown in Fig. S4b, it is more appropriate to compare the +/- 5-Ph-IAA conditions. Additionally, the 5-Ph-IAA concentration used in this study was not clearly mentioned in the method section and figure legends.

      The two approaches, comparison of OsTIR1– and OsTIR1+ strains grown on galactose with 5-Ph-IAA (as was done for the AID-v1 library) and comparison of galactose ± 5-Ph-IAA conditions (as was done for the AID-v2 library), have advantages and disadvantages but should yield similar results. The technical noise (due to spatial effects on the screen plates) is lower for the comparison of OsTIR1– and OsTIR1+ strains, as the two strains for each ORF can be grown next to each other on the same plate (Fig. 2c). Furthermore, corrections of spatial effects are more precise with this layout as the frequency of fitness defects per plate is lower. On the other hand, comparison of galactose ± 5-Ph-IAA conditions implicitly corrects for the fitness impact of the GAL1-OsTIR1(F74G) construct, as the fitness distribution of each condition is normalized to the median of that condition, but this fitness impact of OsTir1 cannot be determine from the screen results.

      We now explicitly corrected the colony size data of the AID-v1 library for the fitness impact of OsTir1 expression (quantified in the previous point) and updated all the analyses and results shown in Fig. 3, Fig. S3b-e and Fig. S4a. The correction was performed using the multiplicative model, whereby the fitness impacts of OsTir1 expression and degradation of the AID-tagged protein are independent. Overall, our observations and conclusions stand unchanged with the corrected data.

      Finally, the 5-Ph-IAA concentration (1 µM) used in all experiments is now indicated in the figure legends and the Methods section.

      3 - The authors found that fitness defects were observed for over 60% of essential proteins (Fig. 3). In other words, depletion of the remaining 40% was not enough to induce growth defects. The authors should discuss how the current AID library can be improved to achieve better target depletion. Previous literature reported various possibilities, such as using a tandem degron tag and combining AID with the Tet promoter system (PMID 25181302, 26081484). Although optional, it would be wonderful if the authors would generate an improved library.

      Following the reviewer’s suggestion, we added the following statement to the discussion:

      “In the future, the libraries could be potentially improved with N-terminal tagging of ORFs that currently exhibit incomplete or no degradation of AID-tagged proteins or using multiple copies of the AID* tag to enhance protein degradation (Kubota et al, 2013; Nishimura & Kanemaki, 2014).”

      Minor comments

      4 - 5-Ph-IAA is not auxin because it does not induce the auxin responses in plants (PMID 29355850). Therefore, the authors should be careful when they refer to 5-Ph-IAA and should not call it auxin.

      We corrected this and now refer to 5-Ph-IAA explicitly throughout the manuscript.

      5 - The availability of the HaloTag and AID libraries should be indicated.

      We added the following statement to the Methods section: “All strains, plasmids and libraries are available upon request.”

      6 - Page 3: "Finally, the extent of AID-dependent degradation varied with protein abundance, in that highly expressed proteins were more likely to be only partially degraded compared to lowly expressed ones (Fig. 2e, Fig. S2e)". Fig. S2e should be Fig. S2d, shouldn't it?

      We corrected this mistake.

      Reviewer #1 (Significance):

      This paper is technically robust and well-conducted. It presents a comprehensive study showcasing the effectiveness of the conditional degron library. The HaloTag libraries will also be useful. The yeast libraries presented in this study will be invaluable for future screenings and studies across all aspects of yeast biology.

      Reviewer #2 (Evidence, reproducibility and clarity):

      In this study, the authors reported the development and characterization of two AID-tagged strain libraries for the model organism S. cerevisiae. The libraries are based on the latest AID technology, AID2. One library contains a fluorescent protein fused to the AID, whereas the other library does not have the fluorescent protein, thus offering better compatibility with imaging-based screens. The authors show that AID-dependent protein degradation can be achieved for most of the library strains, and a growth phenotype was induced for a high fraction of essential genes. Genetic screens for DNA damage-sensitive mutants showcased the applicability of the libraries.

      I only have the following minor comments and suggestions for the authors to consider.

      Point 1, Page 3

      "Optimized tagging of proteins with these N-terminal localization signals likely also contributes to the lack of correlation between differential fitness defects and occurrence of terminal localization signals (Fig. S1f, Table S2)."

      Is this because the genes that cannot tolerate C-terminal addition are already depleted in the C-SWAT library? In the C-SWAT library, a 15-amino-acid linker L3 is added to the C-terminus.

      That is certainly a possibility. During construction of SWAT library, tagging with N-SWAT and C-SWAT acceptor modules failed for 251 and 353 ORFs, respectively (Weill et al. 2018, Meurer et al. 2018). However, these ORFs are not enriched in N- or C-terminal localization signals, respectively (4.6% ORFs with C-terminal signals in C-SWAT library vs 3.3% among failed C-SWAT strains; 12.3% ORFs with N-terminal signals in N-SWAT library vs 2.0% among failed N-SWAT strains).

      The most significant trend in the data is enrichment of ribosomal subunits in both sets of failed strains: 3.9% and 16.3% of the genes mapped to the GO term “ribosome” in the N-SWAT library and the set of failed N-SWAT strains, respectively; 3.6% and 15.9% of the genes in the C-SWAT library and the set of failed C-SWAT strains, respectively. This is consistent with what was reported by Weill et al. for failed N-SWAT strains.

      Point 2, Page 3

      "Expression of OsTir1(F74G) from the strong galactose-inducible GAL1 promoter had a negligible impact on yeast fitness (Fig. S2a)."

      I wonder why the authors chose to use an inducible promoter to express OsTir1(F74G). In other studies, for example Snyder et al. 2019, OsTir1 has been expressed from a constitutive promoter.

      Despite the small fitness impairment, we decided to use the GAL1-OsTIR1(F74G) construct for the AID libraries for two reasons: the conditional nature of this promoter is likely to limit adaptation to expression of OsTir1(F74G) and the high expression levels of OsTir1(F74G) are less likely to limit degradation of AID-tagged proteins. We added this explanation to the Results section.

      Please see our response to reviewer 1, points 1 and 2.

      Point 3, Page 3

      "A similar frequency was previously observed with a set of AID alleles constructed for 758 essential ORFs using the original AID system (Snyder et al, 2019). However, over a third of these alleles exhibited fitness defects even in the absence of auxin, which were further compounded by off-target effects of auxin, highlighting the advantages of the AID2 system."

      Snyder et al. 2019 used a TAP-AID-6FLAG tag. The fitness defect in the absence of auxin may not necessarily be due to the AID part of the tag, as TAP tagging is known to compromise the functions of some genes.

      We corrected our statement as follows:

      “A similar frequency was previously observed with a set of AID alleles constructed for 758 essential ORFs using the original AID system (Snyder et al, 2019). However, over a third of these alleles exhibited fitness defects even in the absence of auxin, which were further compounded by off-target effects of auxin.”

      Point 4, Page 3

      "Interestingly, complete degradation of 33% of essential proteins did not result in a fitness defect. It is possible that in some cases partial degradation results in low protein levels that are below the detection limit of our assay but are sufficient for viability."

      Are these "33% of essential proteins" enriched with genes with low expression levels? I guess genes with low expression levels are more likely to fall below the detection limit even when partially depleted. Are there extreme examples where a highly expressed essential gene does not exhibit a fitness defect when the protein product is no longer detectable?

      We performed the analysis suggest by the reviewer, and observed no difference in pre-degradation protein levels between essential & degraded proteins with and without a fitness defect (now shown in Fig. S3b). This also showed that there are indeed several essential proteins with high pre-degradation proteins levels and without a fitness defects upon degradation to below our detection limit: Pgi1, Nhp2, Smt3, Gus1, Dys1, Sis1, Fas2 and Rpo26 (in the abundance bin 4 in Fig. S2f).

      In addition, we considered the nature of the essential genes in these two groups. Namely, we compared the frequency of core essential genes, which are always required for viability, and conditional essential genes, which vary in essentiality depending on the genetic background or environment (Bosch-Guiteras & van Leeuwen, 2022). Interestingly, the set of essential and degraded proteins without an accompanying fitness defect was enriched in conditional essential genes defined by two independent measures: essentiality across S. cerevisiae natural isolates (Peter et al, 2018) or with bypass suppression interactions in a laboratory strain (van Leeuwen et al, 2020) (Fig. S3c, odds ratio = 1.6, p-value = 0.04 in a Fisher’s exact test and odds ratio = 1.7, p-value = 0.02, respectively). This suggests that conditional essentiality could explain the observed lack of fitness defects upon degradation of some essential proteins.

      We added this analysis to the Results section.

      Reviewer #2 (Significance):

      This study generated highly valuable resources for functional genomic studies.

      Reviewer #3 (Evidence, reproducibility and clarity):

      Summary: In this manuscript, the authors construct and analyze a genome-wide collection of AID-tagged S. cerevisiae strains. The manuscript is clearly written and the analysis appears to be thorough. This collection will be quite useful to the yeast community. There are some issues to address, listed below.

      1. page 1, abstract - "...with protein abundance and tag accessibility as limiting factors." It's not clear what the authors mean by protein abundance as a limiting factor. Are they referring to the protein level pre-depletion? Please clarify.

      That is correct. We clarified this statement as follows:

      “Almost 90% of AID-tagged proteins were degraded in the presence of the auxin analog 5-Ph-IAA, with initial protein abundance and tag accessibility as limiting factors.”

      1. page 1, second paragraph of the Intro, end of the paragraph - There are publications prior to Van Leeuwen et al. 2016 that describe suppressors lurking in the deletion set. Here are two that should be cited: Hughes et al. 2000, DOI: 10.1038/77116 and Teng et al. 2013, DOI: 10.1016/j.molcel.2013.09.026.

      We added the references pointed out by the review.

      1. The goal of this work, stated in the first sentence of results, is to construct genome-wide AID libraries. Yet, to test whether N-terminal or C-terminal tagging is better, the authors used a Halo tag. Those results showed that, for the Halo tag, C-terminal tagging was less likely to impair function. Why weren't these tests done with the same AID tag used to build the libraries in the next section? What is the evidence that the results for a Halo tag will be the same as for an AID tag? While hard to find it documented in publications, there is a lot of anecdotal evidence that the type of tag can make a big difference, as well as its location. While this section will be of interest to those using Halo tags, it's not clear how it relates to the rest of the paper, especially given the careful characterization of the AID library in the next section.

      We chose the Halo tag due its size (33 kDa), similar to many commonly used fluorescent protein tags and to the mNG-AID*-3myc tag in the AID-v1 library, and lack of evidence for a dominant negative effect on the tagged proteins. This is now stated in the Results section.

      We agree that further work is needed to understand how the type of tag, its size and biophysical properties, and the linker between the tag and the protein of interest affect protein localization and function across the proteome. This is now stated in the Results section.

      1. Throughout this manuscript, including the Tables, in cases where the protein is no longer detected, please do not describe this as "complete degradation." Instead, please use "not detectable." This is clearly the case for essential proteins that are no longer detected but that still grow, so it is very likely the case for many or all of the others. If the authors have any understanding of the sensitivity of their fluorescence assay, then that would be helpful to know. For example, they could add a control, taking a known amount of a fluorescent protein and analyzing known dilutions to assay the level of detection.

      We appreciate the reviewer’s suggestion. We decided against “not detectable” instead of “complete degradation” to avoid confusion with proteins that are not detectable pre-degradation. Nevertheless, we replaced “complete degradation” with “degradation” and added the following explanation to the Results section:

      “Out of 5079 proteins detected in OsTIR1– strains, 4455 (~88%) were significantly depleted in OsTIR1+ strains (Fig. 2d, Table S3). 3981 proteins could not be detected specifically in the OsTIR1+ background. Hereafter, we will refer to these proteins as degraded, although it is likely that at least in some cases degradation is not complete but the remainder is below the detection limit of our plate reader assay. Nevertheless, 474 proteins were unequivocally degraded only partially, as they were detectable in the OsTIR1+ background but at reduced levels compared to the OsTIR1– background (Fig. 2d).”

      To estimate the detection limit of the colony fluorescence assay, we correlated the background-corrected mNG intensities in OsTIR1– strains with absolute levels (in molecules per cell) of 1167 proteins determined by Lawless et al. (PMID 26750110). Based on a linear fit, the threshold above which proteins are considered “detected” in our analysis, mNG/bkg(OsTIR1–) > 1.2, corresponds to 200 molecules per cell (95% confidence interval 18 to 2187 molecules per cell). We added this information to the Results section and Fig. S2c.

      This detection limit is in line with our results, where low abundance proteins such as the centromeric histone Cse4/CENP-A (with two Cse4 molecules per centromere adding to 64 molecules per cell, Aravamudhan et al. PMID: 23623551 and several times that amount elsewhere in the cell, Collins et al. PMID: 15530401) can be detected in the colony assay (Table S3).

      1. Fig. S2a and page 3, first full paragraph - The authors wrote that expression of OsTir1(F74G) from the strong GAL1 promoter had a negligible impact on growth. However, the figure shows that there is an obvious effect on growth after 48 hours of incubation, with much smaller colonies. This defect is much less obvious after 72 hours. This difference suggests that the growth effect would have been even more obvious at 24 hours. I think that the text should be modified to indicate this effect.

      We now quantified the fitness impact of the GAL1-OsTIR1(F74G) construct and rephrased this part of the manuscript. In addition, we corrected the AID-v1 library screen results for the fitness impact of the GAL1-OsTIR1(F74G) construct and updated all figures and tables. Please see our response to reviewer 1, points 1 and 2.

      1. One of the main justifications for the construction of the AID library is to allow assays for essential genes. Yet that was not a feature of the screen for DNA damage response factors. Were any essential genes identified in those screens? It would be of great interest to identify lower levels of 5-Ph-IAA that only mildly affect growth of essential genes and then to repeat the screens.

      58 out of the combined 165 potential resistance factors identified in the three screens are essential genes. We added this information to the Results section and essential genes are now indicated in Fig. S5c.

      We now show that chemical-genetic interactions for both essential and non-essential genes can be reproduced in spot tests using the MMS screen as an example (Fig. S5d). We also show that additional essential hits can be identified at lower concentrations of 5-Ph-IAA, which allow determining chemical-genetic interactions for strains that otherwise exhibit no growth in 1 μM 5-Ph-IAA (Fig. S5e). As the screens serve as a demonstration of possible uses of the AID libraries, we consider additional exhaustive screening for DNA damage response factors beyond the scope of this manuscript.

      1. A big advantage of AID depletion over deletions is the ability to look at strains very shortly after loss of the protein of interest. In many cases in the literature, experiments are done after one or two hours after depletion. Yet in this work, there are no data presented on how effective depletion is in the short term versus after a long period of growth (24 hours). It would be a strong addition to the manuscript to include a time course for at least a subset of the proteins to look at the loss of signal over time, either by fluorescence or by Western blots.

      We performed time courses of protein depletion with immunoblotting for 12 strains (4 proteins from the “degraded”, “partially degraded” and “not affected” groups each). The results in Fig. S2e show that “degraded” proteins are depleted to below the detection limit within 60min of 5-Ph-IAA addition, “partially degraded” proteins are depleted less or exhibit a degradation-resistant pool, and the levels of “not affected” proteins remain stable over time, consistent with their classification based on mNG fluorescence in the colony assay. We added this information to the Results section.

      Reviewer #3 (Significance):

      The library will be of use to the yeast community.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary: In this manuscript, the authors construct and analyze a genome-wide collection of AID-tagged S. cerevisiae strains. The manuscript is clearly written and the analysis appears to be thorough. This collection will be quite useful to the yeast community. There are some issues to address, listed below.

      1. page 1, abstract - "...with protein abundance and tag accessibility as limiting factors." It's not clear what the authors mean by protein abundance as a limiting factor. Are they referring to the protein level pre-depletion? Please clarify.
      2. page 1, second paragraph of the Intro, end of the paragraph - There are publications prior to Van Leeuwen et al. 2016 that describe suppressors lurking in the deletion set. Here are two that should be cited: Hughes et al. 2000, DOI: 10.1038/77116 and Teng et al. 2013, DOI: 10.1038/77116 .
      3. The goal of this work, stated in the first sentence of results, is to construct genome-wide AID libraries. Yet, to test whether N-terminal or C-terminal tagging is better, the authors used a Halo tag. Those results showed that, for the Halo tag, C-terminal tagging was less likely to impair function. Why weren't these tests done with the same AID tag used to build the libraries in the next section? What is the evidence that the results for a Halo tag will be the same as for an AID tag? While hard to find it documented in publications, there is a lot of anecdotal evidence that the type of tag can make a big difference, as well as its location. While this section will be of interest to those using Halo tags, it's not clear how it relates to the rest of the paper, especially given the careful characterization of the AID library in the next section.
      4. Throughout this manuscript, including the Tables, in cases where the protein is no longer detected, please do not describe this as "complete degradation." Instead, please use "not detectable." This is clearly the case for essential proteins that are no longer detected but that still grow, so it is very likely the case for many or all of the others. If the authors have any understanding of the sensitivity of their fluorescence assay, then that would be helpful to know. For example, they could add a control, taking a known amount of a fluorescent protein and analyzing known dilutions to assay the level of detection.
      5. Fig. S2a and page 3, first full paragraph - The authors wrote that expression of OsTir1(F74G) from the strong GAL1 promoter had a negligible impact on growth. However, the figure shows that there is an obvious effect on growth after 48 hours of incubation, with much smaller colonies. This defect is much less obvious after 72 hours. This difference suggests that the growth effect would have been even more obvious at 24 hours. I think that the text should be modified to indicate this effect.
      6. One of the main justifications for the construction of the AID library is to allow assays for essential genes. Yet that was not a feature of the screen for DNA damage response factors. Were any essential genes identified in those screens? It would be of great interest to identify lower levels of 5-Ph-IAA that only mildly affect growth of essential genes and then to repeat the screens.
      7. A big advantage of AID depletion over deletions is the ability to look at strains very shortly after loss of the protein of interest. In many cases in the literature, experiments are done after one or two hours after depletion. Yet in this work, there are no data presented on how effective depletion is in the short term versus after a long period of growth (24 hours). It would be a strong addition to the manuscript to include a time course for at least a subset of the proteins to look at the loss of signal over time, either by fluorescence or by Western blots.

      Significance

      The library will be of use to the yeast community.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this study, the authors reported the development and characterization of two AID-tagged strain libraries for the model organism S. cerevisiae. The libraries are based on the latest AID technology, AID2. One library contains a fluorescent protein fused to the AID, whereas the other library does not have the fluorescent protein, thus offering better compatibility with imaging-based screens. The authors show that AID-dependent protein degradation can be achieved for most of the library strains, and a growth phenotype was induced for a high fraction of essential genes. Genetic screens for DNA damage-sensitive mutants showcased the applicability of the libraries.

      I only have the following minor comments and suggestions for the authors to consider.

      Point 1

      Page 3

      "Optimized tagging of proteins with these N-terminal localization signals likely also contributes to the lack of correlation between differential fitness defects and occurrence of terminal localization signals (Fig. S1f, Table S2). " Is this because the genes that cannot tolerate C-terminal addition are already depleted in the C-SWAT library? In the C-SWAT library, a 15-amino-acid linker L3 is added to the C-terminus.

      Point 2

      Page 3

      "Expression of OsTir1(F74G) from the strong galactose-inducible GAL1 promoter had a negligible impact on yeast fitness (Fig. S2a)." I wonder why the authors chose to use an inducible promoter to express OsTir1(F74G). In other studies, for example Snyder et al. 2019, OsTir1 has been expressed from a constitutive promoter.

      Point 3

      Page 3

      "A similar frequency was previously observed with a set of AID alleles constructed for 758 essential ORFs using the original AID system (Snyder et al, 2019). However, over a third of these alleles exhibited fitness defects even in the absence of auxin, which were further compounded by off-target effects of auxin, highlighting the advantages of the AID2 system." Snyder et al. 2019 used a TAP-AID-6FLAG tag. The fitness defect in the absence of auxin may not necessarily be due to the AID part of the tag, as TAP tagging is known to compromise the functions of some genes.

      Point 4

      Page 3

      "Interestingly, complete degradation of 33% of essential proteins did not result in a fitness defect. It is possible that in some cases partial degradation results in low protein levels that are below the detection limit of our assay but are sufficient for viability." Are these "33% of essential proteins" enriched with genes with low expression levels? I guess genes with low expression levels are more likely to fall below the detection limit even when partially depleted. Are there extreme examples where a highly expressed essential gene does not exhibit a fitness defect when the protein product is no longer detectable?

      Significance

      This study generated highly valuable resources for functional genomic studies.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      The manuscript by the Khmelinskii group reports that they have successfully constructed two conditional degron libraries of budding yeast for almost all proteins. For this purpose, the authors employed an improved auxin-inducible degron (AID2). Initially, they constructed yeast libraries by fusing HaloTag to the N- or C-terminus of proteins and found that C-terminal tagging is less likely to affect the location and function of proteins (Fig. 1). Based on this finding, the authors fused mNG-AID-3Myc or AID-3Myc (AID-v1 or AID-v2 library, respectively) to more than 5600 proteins and found that 4079 proteins were significantly depleted when cells were treated with 5-Ph-IAA (Fig. 2). A fitness defect was observed for over 60% of essential proteins, indicating the target depletion showed the expected phenotype in many cases (Fig. 3). Finally, the authors screened proteins required for maintaining viability in the presence of MMS, CPT and HU, and identified common proteins involved in DNA repair (such as RAD52 epistasis proteins) and other proteins specific for MMS, CPT or HU resistance (Fig. 4). Furthermore, the authors revealed that an ER membrane protein, Gsf2, is required for HU resistance, which was not found in previous studies with the YKO library because gsf2∆ cells in the YKP library had aquired a suppressor mutation (Fig. 4e).

      Major comments

      • In Figure S2a, the authors initially checked the growth of yeast cells expressing OsTIR1(F74G) under the GAL1 promoter, saying that "expression of OsTir1(F74G) from the strong galactose-inducible GAL1 promoter had a negligible impact on yeast fitness (page 3)". To me, the OsTIR1(G74G) expressing cells showed slightly slower growth compared to the control cells. Moreover, the cells expressing it under the very strong GPD promoter showed apparent slow growth, suggesting that OstIR1(F74G) overexpression caused a side effect. The authors should carefully evaluate the cells with GAL1-OsTIR1(F74G).
      • Given the possibility that OsTIR1(F74G) overexpression might cause a growth problem, it is not appropriate to compare OsTIR1+ and OsTIR1- conditions for evaluating growth fitness (Fig. 2). As shown in Fig. S4b, it is more appropriate to compare the +/- 5-Ph-IAA conditions. Additionally, the 5-Ph-IAA concentration used in this study was not clearly mentioned in the method section and figure legends.
      • The authors found that fitness defects were observed for over 60% of essential proteins (Fig. 3). In other words, depletion of the remaining 40% was not enough to induce growth defects. The authors should discuss how the current AID library can be improved to achieve better target depletion. Previous literature reported various possibilities, such as using a tandem degron tag and combining AID with the Tet promoter system (PMID 25181302, 26081484). Although optional, it would be wonderful if the authors would generate an improved library.

      Minor comments

      • 5-Ph-IAA is not auxin because it does not induce the auxin responses in plants (PMID 29355850). Therefore, the authors should be careful when they refer to 5-Ph-IAA and should not call it auxin.
      • The availability of the HaloTag and AID libraries should be indicated.
      • Page 3: "Finally, the extent of AID-dependent degradation varied with protein abundance, in that highly expressed proteins were more likely to be only partially degraded compared to lowly expressed ones (Fig. 2e, Fig. S2e)". Fig. S2e should be Fig. S2d, shouldn't it?

      Significance

      This paper is technically robust and well-conducted. It presents a comprehensive study showcasing the effectiveness of the conditional degron library. The HaloTag libraries will also be useful. The yeast libraries presented in this study will be invaluable for future screenings and studies across all aspects of yeast biology.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to Reviewer #1

      Major comments:

      1. * The connection between vB12 and MMA is weak, and the attempt to connect these pieces to PPI seems somehow forced. For instance, the authors do not convincingly demonstrate that MMA causes the PPI deficit. Furthermore, vB12 may rescue PPI independently of MMA. The authors should be more transparent about the lack of connection or causality between changes in metabolism and behavior.* We appreciate the reviewer's comment and acknowledge that we have not demonstrated causal relationships between increased MMA, PPI deficits in Tbx1+/- mice and their rescue by vB12. They are associations.

      In the revised manuscript, we have clarified this in the Discussion, para.4, by adding the following phrase. “The results of our study do not prove a causal relationship between elevated brain MMA and PPI impairment, nor do they tell us whether rescue of the PPI impairment by vB12 occurs by reducing MMA”.

      Regarding the comment of a weak connection between vitamin B12 and MMA, we respectfully disagree.

      The biochemistry underlying the connection is outlined clearly in the Introduction, page 4, para. 2.

      Patients with vitamin B12 deficiency typically exhibit elevated levels of MMA and administration of vitamin B12 (cobalamin) helps to normalize MMA values (Robert & Brown, 2003). Furthermore, several animal models with genetic alterations in the vitamin B12 pathway exhibit high levels of MMA. For instance, mice lacking the cobalamin transporter have increased MMA (Bernard et al., 2018). Additionally, mice lacking the mutase (Mut), which requires vitamin B12 as a cofactor for the conversion of methylmalonyl CoA to succinyl-CoA for entry into the Krebs cycle, demonstrate elevated levels of MMA and are unresponsive to vitamin B12 treatment (Peters et al. 2006). In the revised manuscript, we have cited these references (Introduction, page 4, para. 2).

      Throughout the manuscript, an important control is missing: WT+ vB12 group. Data from this group should be added to Figures 1, 3, 4, 6, and Supplemental Figures 3 and 4 to show the effect of vB12 on WT mice.

      All of the experiments reported in the original manuscript included this control group although it was not always included in the data analysis and therefore in the figures, as observed by the reviewer, In the revised manuscript all of the relevant figures and tables now include these data.

      In Figures 1C and 3, data from the respective WT controls in the Df1 and Tbx1 cohorts should be shown.

      • *

      The wild type (WT) animals serve as the control group for both Tbx1+/- and Df/+ mice because they were littermates, obtained from matings between Df1/+ and Tbx1+/- mice. This has been clarified in the Materials and Methods, and in a new cartoon which has been added to Supplementary Figure 1 (1C) showing all of the animal groups used for the various studies (NMR, transcriptomics, behavior).

      The Supplemental Table S1 includes 17 WT controls for Tbx1+/- and 6 WT controls for Df1/+, but Figure 1C includes only one group of 11 WT controls. For which group were those 11 WT controls?

      Here are several examples of inconsistency in the data: "For this, we first performed a preliminary metabolome analysis using isolated whole brains of male and female Tbx1+/- (n= 18) and WT (n= 10) mice between one and two months of age. A set of metabolites was quantified in brain extracts by liquid chromatography tandem mass spectrometry (LC-MS/MS) (Supplementary Table 1)." Again, the number of mice the authors note in the manuscript does not match that shown in Supplementary Table 1 (Tbx1+/- (n=14) and WT (N=17). "We analyzed whole brain tissue isolated from Df1/+ and WT (control) mice (n = 5 per genotype)." Again, the numbers of mice do not match those in Supplementary Table 1, which notes 6 Df1/+ mice and 6 WT mice.

      We apologize for these errors and inconsistencies in the text and tables, all of which have been corrected in the revised manuscript. In addition, we have added the aforementioned cartoon (Supplementary Figure 1C) and we have improved the presentation of the data (genotypes and treatments) in Supplementary Tables 1 and 2. We hope that these changes provide the expected clarity to the data.

      MMA is the only metabolite that is similarly changed between Tbx1+/- and Df1/+ brains. This is an interesting observation. However, there is no other overlap in metabolic changes between these two mutants. This is a concern that requires clarification

      We appreciate the reviewer's comment. The observation is not altogether surprising considering that the Df1 deletion includes at least 9 genes involved in metabolic pathways (cited refs (Maynard, 2008;Meechan, 2011; Devaraju and Zakharenko, 2017)) any of which might counteract or compensate for changes caused by Tbx1 mutation alone. In addition, heterozygosity for other genes in the deleted region (Df1 encompasses over 20 genes) might affect metabolic processes indirectly. In the revised manuscript we have added the following phrase to the Discussion, para.1 “Thus, even though the two mutants are genetically and metabolically very different, in Df1/+ mice, the MMA phenotype is not affected by heterozygosity of other genes in the deletion.

      The authors mention that MMA is not changed in pre-term Tbx1+/- embryos, but no data are provided. What about MMA levels in Df1/+ embryos?

      In pre-term Tbx1+/- and WT embryos MMA was undetectable. This is now stated in the final paragraph of the first section of the Results.

      We did not measure MMA in Df1/+ embryos. It was not a goal of the study to compare the metabolome of these two genetically very different mutants. The MMA data on Df1/+ mice are presented because they show the potential relevance of this phenotype for the human disease, and they justify the use of the single gene (Tbx1+/-) mutants for studies into metabolism-related disease mechanisms. See also response to point 12

      * * In some cases, the differences in metabolites (e.g., glutamine, glutamate, phosphoethanolamine, taurine, leucine, myo-inositol, and niacinamide) between the WT and Tbx1+/- mice is very minimal (Supplemental Figure 2). The y-axis scale should start at 0.

      We have changed the y-axis settings where necessary

      The vB12 is administered via two different regimens: 1) every 3 days for 28 days, at 4-8 weeks of age for metabolic measurements, and 2) twice a week for 2 months for PPI behavioral testing. Is there any reason the authors chose different protocols?

      We apologize for the confusion, which was due to an oversight in the Materials and Methods section that has been corrected. The weekly injection regimen was the same for mice used in the behavioral and metabolic studies, but the treatment time was shorter for the metabolic studies, for practical reasons beyond our control; mice received vB12 injections twice a week, beginning at 4 wks of age and continuing until 8 wks or 12 wks of age for metabolic and behavioral studies respectively.

      * * The authors should add the following references to the study: Long et al., Neurogenetics (2006), which shows no change in PPI in Tbx1+/- mice. This discrepancy compared with the current study results and those of Paylor et al., Proc Natl Acad Sci U S A (2006) should be discussed.

      We have not cited the study by Long et al. because there are no obvious reasons for the discrepancy (age, mouse strain, sex) that could be discussed. Beyond this of course we cannot comment on data generated by another research group. Nevertheless, the presence of the PPI deficit in Tbx1+/- mice has been confirmed in two different Tbx1 alleles Tbx1 lacZ/+ and Tbx1ΔE5/+, by two different investigators, Dr. Richard Paylor using Tbx1 lacZ/+ mice (Paylor et al. 2001) and Dr. Elvira De Leonibus (co-author of this manuscript) using Tbx1ΔE5/+ mice, in two different countries (USA and Italy) in a rederived colony of mice.

      • Figure 6B is a concern. The PPI decrease in the Tbx1+/- group appears to be driven by results from 3-4 mice. First, are those data statistical outliers? *

      With all due respect, this is not the case. Eight Tbx1+/- mice, i.e., >50% of those tested had PPI values below the minimum observed in WT mice. The behavioural data were checked for the presence of outliers in each group using the Grubbs test, which yielded negative results. Our finding of PPI deficits in Tbx1+/- mice are in line with previously published data in Tbx1+/- and other animal models of 22q11.2 microdeletion (Paylor et al., 2006; Paylor and Lindsay, 2006; Stark et al., 2008), as well as in humans (Sobin et al., 2005).

      Second, experiments in the same mice would be more informative. Do PBS-treated mutants recover PPI if they are treated with vB12 and vice versa? If the authors are concerned about the age difference, they also should include age-dependent effects on PPI.

      We decline to perform the proposed experiment for the reasons described in section 4 of the Revision Plan

      • *

      Because vB12 treatment completely rescued the MMA level in Df1/+ mice (Figure 3), the authors should include a figure showing PPI test results in Df1/+ mice.

      Vitamin B12 treatment fully rescued the MMA phenotype in both mutants (Figure 3). Whether it rescues the PPI defect in Df1/+ mice is not important for this study. We used Df1/+ mice as an entry point, in order to give validity to the pursuit of the MMA phenotype in the single gene mutant (Tbx1+/-), in which we expect that it will be easier to find disease mechanisms. For this reason, we focused our attention on identifying metabolic alterations in adult Tbx1+/- mice.

      See also response to point 7.

      *Figure 1A and B table: Did the authors mean Log2FC instead of FC? The authors also should present the *

      *source data by adding supplemental tables that include raw data and normalized conversion, etc., as described in the multivariate statistical data analysis of the LC-MS/MS data. *

      • *

      The new Figures 1A, Figure 3 and the accompanying tables now state Log2FC. New Supplementary Table 1 presents the raw data that were normalized on the basis of the amount of protein in the samples, described and referenced in the Materials and Methods

      "...we identified a new metabolic phenotype that was associated with reduced sensorimotor gating deficits in Tbx1+/- mice". Although the authors showed the PPI rescue by treating Tbx1+/- mice with vB12, that result alone does not prove the association of metabolic phenotype with sensorimotor-gating deficit; other supporting data are needed.

      • *

      This is perhaps a question of semantics; by associated we mean that the two phenotypes, metabolic alterations and reduced PPI were observed together

      The authors stated, "Results showed that there were very few differentially expressed genes in Tbx1+/- vs WT brains, (n=22 out of 14535 expressed genes (Fig. 5 and Supplementary Tab. 2)". However, they described how 3 differentially expressed genes are involved in mitochondrial activity in the Discussion. The authors should describe those 3 genes and their relation to the metabolic change.

      The results that the reviewer refers to have changed in the revised manuscript due to the inclusion of the control group WT +vB12 in the data analysis. The transcriptome analysis revealed that vB12 had a stronger impact than genotype, and as a consequence, the statistical analysis of all groups did not highlight minor differences between the two genotypes.

      Figure 5B: The authors claimed that they detected similar transcription profiles between WT+vB12 vs. Tbx1+/-+vB12, comparing Tbx1+/-+PBS vs. Tbx1+/-+vB12. This is based on 947 genes being downregulated and 834 being upregulated, which is not appropriate. The authors should normalize those data to the numbers of genes upregulated and downregulated in WT+PBS vs. WT+vB12 respective groups.

      We said that we detected similar transcription profiles in PBS-treated WT and Tbx1+/- brains; a WT+vB12 group was not present. The latter group is included in revised manuscript and the data reanalyzed comparing all groups.

      See also response to points 2 and 15.

      Minor comments

      1. Supplementary Table S1 shows the identical MMA concentration "0.2" for 6 controls. Is this correct? This was an error that has been corrected; the value is 0.00 (not detectable).

      * Remove the callout for Figure 1C at the end of the second paragraph in Results*.

      This figure is no longer present

      *There are multiple typos throughout the manuscript. *

      Here are several examples:

      1. * Fig1B graph- Df/+ => Df1/+* Figure changed in revised manuscript

      2. "Together, the hydrophilic and lipophilic results revealed a group of 6 compounds that characterized the brain metabolic differences between Tbx1+/- and WT mice (Figure 2B, 2C)". Figure 2A should be included also. Corrected

      3. "In support of this notion, is the finding that...(remove) Removed

      4. Remove double periods: "The pathways found are depicted in Figure 2C' which reports the impact of each pathway versus p values.." Corrected

      5. Panel labels in all figures are misplaced. Panel labels are aligned correctly

      We have performed a spelling and grammar check on the text

      "In support of this notion... at least nine orthologs are involved in mitochondrial metabolism". What are those 9 mitochondrial genes? Kolar et al., Schizophr Bull (2023) indicates that there are 8 mitochondrial genes within the 22q11.2 locus. The authors need to list these genes.

      This reference, which is a review, has been cited in the Introduction, para.3 along with the genes.

      The review presents nine mitochondrial genes which the authors divide into two groups, 1) Genes expressed in mitochondria (SLC25A1, TXNRD2, MRPL40, PRODH, and COMT) and 2) Genes that have been shown to have an impact on mitochondrial function (TANGO2, ZDHHC8, UFD1L, and DGCR8). In the abstract they mention only eight genes, the ninth gene COMT is mentioned in the text.

      Reviewer #2 (Significance (Required)):

      *The manuscript titled "Tbx1 haploinsufficiency causes brain metabolic and behavioral anomalies in adult mice which are corrected by vitamin B12 treatment" by Caterini et al. presents a comparative metabolomics study in the brains of mouse models carrying a heterozygous mutation in the transcription factor Tbx1. This mutation is contrasted with a chromosomal deficiency encompassing Tbx1, among other gene loci, known as Df1/+, which serves as a mouse model for 22q11 microdeletion syndrome. The primary and most significant finding of the study is that Tbx1 heterozygosity alone induces broad metabolomic alterations in the entire brain parenchyma, despite Tbx1 expression being confined to vascular endothelial cells. The authors leverage this observation to investigate the effects of dietary supplementation with vitamin B12, which alters the metabolome in a manner interpreted by the authors as rescuing or reversing the Tbx1 heterozygosity phenotype. This study holds promise for understanding the individual gene contributions to the penetrant behavioral phenotypes observed in Df1/+ and 22q11 affected subjects. This potential arises from the clear and consequential metabolic phenotypes described, notably the accumulation of methylmalonic acid.

      However, despite the intriguing metabolic phenotypes, there are significant issues hindering incontrovertible conclusions. *

      Response to Reviewer #2

      Major comments

      1. Despite the intriguing metabolic phenotypes, there are significant issues hindering incontrovertible conclusions. Chief among these problems is the experimental design's nature, where the effects of genotype and a pharmacological intervention, vitamin B12, are assessed. The current design overlooks the effects of vitamin B12 on wild-type animals in metabolic and behavioral measures, thus precluding the attribution of the effects of vitamin B12 to a rescue. See response to Reviewer 1 (point 2) who made the same criticism. This group is now included in the data analysis of the relevant experiments.

      *An alternative explanation, consistent with the measurements, is that vitamin B12 modifies metabolites and transcripts irrespective of genotype. A suggestion of this possibility is the observed effect of B12 lowering glutamate levels in Tbx1 mutant tissue below those in wild-type brain tissue (Fig. 4C). *

      This might be true for some metabolites. Indeed, we found 5 metabolites that responded similarly to vB12 in both WT and Tbx1 +/- mice. In contrast, three metabolites responded to vB12 in both WT and Tbx1+/- mice, but the response was more pronounced in Tbx1+/- mice. Finally, a group of eight metabolites was altered exclusively in Tbx1+/- mice after vB12 treatment, including inosine, glutamate and short-chain fatty acids (SCFAs), Figure 4 and Supplementary Figure 6. Thus, overall, our data suggest that with only a few exceptions, the metabolic response to vB12 treatment is genotype-dependent.

      • *

      This experimental design issue is exacerbated by the multitude of analytes measured by metabolomics, all collectively assumed to change as part of a common genotype-B12 interaction mechanism. This interpretation is feasible only if none of the analytes were to respond to B12 in wild-type animals.

      • *

      As specified above, the response to vB12 was genotype-dependent. The inclusion of the vB12-treated WT dataset should address this point.

      A second major issue arises from the assertion that Tbx1 is exclusively expressed in mouse brain endothelial cells and not in brain parenchyma. A significant unresolved question is how a gene expressed solely in endothelial cells can alter the brain parenchyma metabolome and transcriptome. This issue remains unaddressed and is not sufficiently discussed. If this assertion holds true, then the observations bear great importance in understanding how Df1/+ causes brain phenotypes and, by extension, in human 22q11.

      There are quite a lot of published data from the mouse demonstrating the brain endothelial-specific expression of Tbx1 and the lack of expression in other brain cell types. These include studies using reporter genes (Paylor, 2006; Cioffi, 2014), Tbx1Cre based cell fate mapping (Cioffi, 2014, Cioffi, 2022) and single cell whole genome transcriptions (Ximerakis et al., 2019); (https://portals.broadinstitute.org/single_cell/study/aging-mouse-brain). All are cited in the manuscript.

      HOW the mutation of Tbx1 alters the brain metabolome and transcriptome will be the object of future studies, Currently, we do not have any data. At the reviewer’s request, we have extended the discussion of this point in the revised manuscript (Discussion, para. 4).

      In this vein, the authors should consider that Tbx1 is not expressed in brain endothelial cells in humans and is minimally expressed in fetal astrocytes (see https://brainrnaseq.org/).

      https://brainrnaseq.org provides a tool to evaluate the gene expression in the fetal brain. The sequencing was performed on fetal human brain tissue after elective pregnancy termination (4wks-9wks, it is not clear). Our analysis focuses on adult mice, which may contribute to observed differences.

      Moreover, Yi et al. (2010) generated a gene expression atlas of human embryogenesis spanning from 4 to 9 weeks of gestational age, revealing downregulation of TBX1 during this timeframe. Conversely, in the normal adult human brain, TBX1 expression is identified in endothelial cells, as indicated in "The Human Brain Cell Atlas v1.0" presented for visualization and data mining through the Chan Zuckerberg Initiative’s CellxGene application, referring to the atlas ontology in Ding et al. (2016). * 3. A third major concern pertains to the general poor quality of the figures. Many figures appear to be directly exported from the software used for data analysis without proper curation. They are inadequately labeled, lack color codes to clarify differences (e.g., volcano plots), feature lettering fonts that are difficult to discern, and have lettering panels placed in awkward positions. Fig. 1 would benefit by the addition of a pathway diagram showing which metabolites are changing. Figure tables/spreadsheets either have sheets labeled in Italian or are empty. Collectively, the manuscript needs more careful data curation and presentation*.

      Many of the figures and tables have been modified with respect to the original manuscript and issues of clarity and quality have been improved where necessary.

      Other points for consideration are listed below. • The abstract results section does not mention the Df1 mutants at all, and overall the description of the results should be improved

      Corrected • The abstract would benefit from defining vB12 before using the abbreviation

      Corrected • The section of the Results describing MMA accumulation in the brain would benefit from

      • *explaining the choice of 1 month of age for terminal experiments and the choice to use whole brains (are there particularly brain regions suspected to be affected?), * The majority of animals were 2 months of age at sacrifice (age and sex of individual animals are indicated in Supplementary Table 1). Young adult mice were the object of the study for the reason described in the first paragraph of the Results section, namely “Human studies of brain metabolism have mainly been conducted on children and adolescent patients. Therefore, in order to determine whether similar anomalies were present in the mouse models, we performed our studies on young mice between 1 and 2 months of age (Dutta et al., 2016)”.

      This is also the age at which the behavioural phenotype has been demonstrated (Paylor et al., 2006), and therefore could, potentially be rescued by vB12 treatment. We do not have regional information pertaining to the adult brain.

      2) describing any sex effects for Tbx1 mutants (and clarifying what data points for Tbx1 animals correspond to which sex), and 3) including what sex was used for Df1 experiments.

      In preliminary experiment we analyzed males and females’ mice, before electing to use only males. To obtained reliable information about the impact of gender on metabolism and transcription we would have to use much larger numbers of animals. In Supplementary Table 1 pertaining to males and females are now indicated.

      • The authors demonstrate that vB12 rescues PPI but use no other behavioral paradigms. It is possible that these mutations and/or vB12 could be impacting anxiety-like behaviors or other behavioral phenotypes. By only including PPI, the authors limit the interpretation of the "rescue" of this phenotype by vB12. * Reduced PPI was the only behavioral anomaly identified in Tbx1+/- mice that were subjected to a standard battery of behavioral tests (Paylor et al., 2006). *

      * *Reduced PPI was the only behavioral anomaly identified in Tbx1+/- mice that were subjected to a standard battery of behavioral tests (Paylor et al., 2006).

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This is a terrific paper looking at influences of Tbx1 heterozygosity on metabolic phenotypes in mice. A weakness is that the locus on effects of B12 is totally unclear--could be neurovascular or even peripheral, but correcting this weakness might include study of Tbx1 conditional mutants, beyond the scope of this study.

      • *

      Reviewer #3 (Significance (Required)):* good significance

      Only two minor suggestions. 'We selected to study primarily Tbx1 single gene mutants because it is the primary candidate disease gene". What is the basis for this statement? Mouse +/- mutants in Mrpl40, Txnrd2, ProhD, and probably others have shown brain phenotypes.*

      • *

      The basis for TBX1 being considered as the primary candidate disease gene is the finding of TBX1 point mutations in patients who have the full spectrum of clinical phenotypes associated with 22q11.2 deletion syndrome without the chromosomal deletion, namely, congenital heart defects, immune defects, facial dysmorphism, learning defects and developmental delay. Similarly, in the mouse, Tbx1 mutation recapitulates the phenotype observed in multigene deletion mutants, such as Df1/+ mice.

      We do not say (or think) that heterozygosity of other genes from del22q11.2 does not contribute to the disease, but mutations of other genes have not been found in individuals with a 22q11.2DS phenotype but without the chromosomal deletion.

      In the discussion, the authors could close the loop on low glutamine could result in lower gaba in inhibitory interneurons, and its correction with B12 could restore gaba levels.

      Discussion, para. 3. We thank the reviewer for comment. However, the GABA concentration is not altered in Tbx1 haploinsufficient brains; it is only upregulated by Vitamin B12. Therefore, this assumption may be very speculative. Due to differences in the release and reabsorption rates of the three compounds (glutamine, glutamate, and GABA), correctly evaluating the glutamine-glutamate cycle requires separating astrocytes from neurons. We have only discussed the upregulation of glutamate and the GABA response to Vitamin B12, which may counteract the excess of glutamate.

      1. __4__Description of analyses that authors prefer not to carry out Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

      Reviewer 1, point 11. The PPI decrease in the Tbx1+/- group appears to be driven by results from 3-4 mice. First, are those data statistical outliers?

      Second, experiments in the same mice would be more informative. Do PBS-treated mutants recover PPI if they are treated with vB12 and vice versa? If the authors are concerned about the age difference, they also should include age-dependent effects on PPI.

      We are unable to perform this experiment because, as stated in the manuscript, the mice were sacrificed at the end of the experiment and the brains preserved for histological analysis (not part of this study). The generation of mice for new experiments would take about one year. With all due respect, we do not believe that the data that would be obtained are sufficiently important to justify, ethically and economically, this work.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This is a terrific paper looking at influences of Tbx1 heterozygosity on metabolic phenotypes in mice. A weakness is that the locus on effects of B12 is totally unclear--could be neurovascular or even peripheral, but correcting this weakness might include study of Tbx1 conditional mutants, beyond the scope of this study.

      Only two minor suggestions.

      • 'We selected to study primarily Tbx1 single gene mutants because it is the primary candidate disease gene". What is the basis for this statement? Mouse +/- mutants in Mrpl40, Txnrd2, ProhD, and probably others have shown brain phenotypes.

      • In the discussion, the authors could close the loop on low glutamine could result in lower gaba in inhibitory interneurons, and its correction with B12 could restore gaba levels.

      Significance

      good significance

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript titled "Tbx1 haploinsufficiency causes brain metabolic and behavioral anomalies in adult mice which are corrected by vitamin B12 treatment" by Caterini et al. presents a comparative metabolomics study in the brains of mouse models carrying a heterozygous mutation in the transcription factor Tbx1. This mutation is contrasted with a chromosomal deficiency encompassing Tbx1, among other gene loci, known as Df1/+, which serves as a mouse model for 22q11 microdeletion syndrome. The primary and most significant finding of the study is that Tbx1 heterozygosity alone induces broad metabolomic alterations in the entire brain parenchyma, despite Tbx1 expression being confined to vascular endothelial cells. The authors leverage this observation to investigate the effects of dietary supplementation with vitamin B12, which alters the metabolome in a manner interpreted by the authors as rescuing or reversing the Tbx1 heterozygosity phenotype. This study holds promise for understanding the individual gene contributions to the penetrant behavioral phenotypes observed in Df1/+ and 22q11 affected subjects. This potential arises from the clear and consequential metabolic phenotypes described, notably the accumulation of methylmalonic acid.

      • However, despite the intriguing metabolic phenotypes, there are significant issues hindering incontrovertible conclusions. Chief among these problems is the experimental design's nature, where the effects of genotype and a pharmacological intervention, vitamin B12, are assessed. The current design overlooks the effects of vitamin B12 on wild-type animals in metabolic and behavioral measures, thus precluding the attribution of the effects of vitamin B12 to a rescue. An alternative explanation, consistent with the measurements, is that vitamin B12 modifies metabolites and transcripts irrespective of genotype. A suggestion of this possibility is the observed effect of B12 lowering glutamate levels in Tbx1 mutant tissue below those in wild-type brain tissue (Fig. 4C). This experimental design issue is exacerbated by the multitude of analytes measured by metabolomics, all collectively assumed to change as part of a common genotype-B12 interaction mechanism. This interpretation is feasible only if none of the analytes were to respond to B12 in wild-type animals. Given that this experimental design recurs across a substantial portion of the paper, it may constitute a factor that undermines the strength of the conclusions.

      • A second major issue arises from the assertion that Tbx1 is exclusively expressed in mouse brain endothelial cells and not in brain parenchyma. A significant unresolved question is how a gene expressed solely in endothelial cells can alter the brain parenchyma metabolome and transcriptome. This issue remains unaddressed and is not sufficiently discussed. If this assertion holds true, then the observations bear great importance in understanding how Df1/+ causes brain phenotypes and, by extension, in human 22q11. In this vein, the authors should consider that Tbx1 is not expressed in brain endothelial cells in humans and is minimally expressed in fetal astrocytes (see https://brainrnaseq.org/).

      • A third major concern pertains to the general poor quality of the figures. Many figures appear to be directly exported from the software used for data analysis without proper curation. They are inadequately labeled, lack color codes to clarify differences (e.g., volcano plots), feature lettering fonts that are difficult to discern, and have lettering panels placed in awkward positions. Fig. 1 would benefit by the addition of a pathway diagram showing which metabolites are changing. Figure tables/spreadsheets either have sheets labeled in Italian or are empty. Collectively, the manuscript needs more careful data curation and presentation.

      Other points for consideration are listed below:

      1) The abstract results section does not mention the Df1 mutants at all, and overall the description of the results should be improved

      2) The abstract would benefit from defining vB12 before using the abbreviation

      3) The section of the Results describing MMA accumulation in the brain would benefit from 1) explaining the choice of 1 month of age for terminal experiments and the choice to use whole brains (are there particularly brain regions suspected to be affected?), 2) describing any sex effects for Tbx1 mutants (and clarifying what data points for Tbx1 animals correspond to which sex), and 3) including what sex was used for Df1 experiments.

      4) The authors demonstrate that vB12 rescues PPI but use no other behavioral paradigms. It is possible that these mutations and/or vB12 could be impacting anxiety-like behaviors or other behavioral phenotypes. By only including PPI, the authors limit the interpretation of the "rescue" of this phenotype by vB12.

      Significance

      The manuscript titled "Tbx1 haploinsufficiency causes brain metabolic and behavioral anomalies in adult mice which are corrected by vitamin B12 treatment" by Caterini et al. presents a comparative metabolomics study in the brains of mouse models carrying a heterozygous mutation in the transcription factor Tbx1. This mutation is contrasted with a chromosomal deficiency encompassing Tbx1, among other gene loci, known as Df1/+, which serves as a mouse model for 22q11 microdeletion syndrome. The primary and most significant finding of the study is that Tbx1 heterozygosity alone induces broad metabolomic alterations in the entire brain parenchyma, despite Tbx1 expression being confined to vascular endothelial cells. The authors leverage this observation to investigate the effects of dietary supplementation with vitamin B12, which alters the metabolome in a manner interpreted by the authors as rescuing or reversing the Tbx1 heterozygosity phenotype. This study holds promise for understanding the individual gene contributions to the penetrant behavioral phenotypes observed in Df1/+ and 22q11 affected subjects. This potential arises from the clear and consequential metabolic phenotypes described, notably the accumulation of methylmalonic acid.

      However, despite the intriguing metabolic phenotypes, there are significant issues hindering incontrovertible conclusions.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Using mass spectrometry, nuclear magnetic resonance spectroscopy, and transcriptomics analyses, Caterino et al. identified metabolic changes in the brains of young adult Df1/+ mouse models of 22q11.2 deletion syndrome (22q11DS). They observed similar changes in Tbx1+/- mice, as Tbx1 is one of the genes encoded inside the 22q11.2 microdeletion. Among the metabolites identified, methylmalonic acid (MMA) showed the most prominent increase. Systemic vitamin B12 (vB12) administration rescued the MMA increase and had strong effects on the brain metabolomes and transcriptomes in Tbx1+/- and Df1/+ mice. Finally, the pre-pulse inhibition (PPI) deficit observed in Tbx1+/- mice was partially rescued by vB12 treatment. As such, this study provides a description of metabolic changes caused by Tbx1 haploinsufficiency and suggests vB12 as a therapeutic avenue in 22q11DS. Although intriguing, this study suffers from several shortcomings that should be corrected or clarified before publication.

      Major comments:

      1. The connection between vB12 and MMA is weak, and the attempt to connect these pieces to PPI seems somehow forced. For instance, the authors do not convincingly demonstrate that MMA causes the PPI deficit. Furthermore, vB12 may rescue PPI independently of MMA. The authors should be more transparent about the lack of connection or causality between changes in metabolism and behavior.

      2. Throughout the manuscript, an important control is missing: WT+ vB12 group. Data from this group should be added to Figures 1, 3, 4, 6, and Supplemental Figures 3 and 4 to show the effect of vB12 on WT mice.

      3. In Figures 1C and 3, data from the respective WT controls in the Df1 and Tbx1 cohorts should be shown.

      4. The Supplemental Table S1 includes 17 WT controls for Tbx1+/- and 6 WT controls for Df1/+, but Figure 1C includes only one group of 11 WT controls. For which group were those 11 WT controls?

      5. Here are several examples of inconsistency in the data: "For this, we first performed a preliminary metabolome analysis using isolated whole brains of male and female Tbx1+/- (n= 18) and WT (n= 10) mice between one and two months of age. A set of metabolites was quantified in brain extracts by liquid chromatography tandem mass spectrometry (LC-MS/MS) (Supplementary Table 1)." Again, the number of mice the authors note in the manuscript does not match that shown in Supplementary Table 1 (Tbx1+/- (n=14) and WT (N=17). "We analyzed whole brain tissue isolated from Df1/+ and WT (control) mice (n = 5 per genotype)." Again, the numbers of mice do not match those in Supplementary Table 1, which notes 6 Df1/+ mice and 6 WT mice.

      6. MMA is the only metabolite that is similarly changed between Tbx1+/- and Df1/+ brains. This is an interesting observation. However, there is no other overlap in metabolic changes between these two mutants. This is a concern that requires clarification.

      7. The authors mention that MMA is not changed in pre-term Tbx1+/- embryos, but no data are provided. What about MMA levels in Df1/+ embryos?

      8. In some cases, the differences in metabolites (e.g., glutamine, glutamate, phosphoethanolamine, taurine, leucine, myo-inositol, and niacinamide) between the WT and Tbx1+/- mice is very minimal (Supplemental Figure 2). The y-axis scale should start at 0.

      9. The vB12 is administered via two different regimens: 1) every 3 days for 28 days, at 4-8 weeks of age for metabolic measurements, and 2) twice a week for 2 months for PPI behavioral testing. Is there any reason the authors chose different protocols?

      10. The authors should add the following references to the study: Long et al., Neurogenetics (2006), which shows no change in PPI in Tbx1+/- mice. This discrepancy compared with the current study results and those of Paylor et al., Proc Natl Acad Sci U S A (2006) should be discussed.

      11. Figure 6B is a concern. The PPI decrease in the Tbx1+/- group appears to be driven by results from 3-4 mice. First, are those data statistical outliers? Second, experiments in the same mice would be more informative. Do PBS-treated mutants recover PPI if they are treated with vB12 and vice versa? If the authors are concerned about the age difference, they also should include age-dependent effects on PPI.

      12. Because vB12 treatment completely rescued the MMA level in Df1/+ mice (Figure 3), the authors should include a figure showing PPI test results in Df1/+ mice.

      13. Figure 1A and B table: Did the authors mean Log2FC instead of FC? The authors also should present the source data by adding supplemental tables that include raw data and normalized conversion, etc., as described in the multivariate statistical data analysis of the LC-MS/MS data.

      14. "...we identified a new metabolic phenotype that was associated with reduced sensorimotor gating deficits in Tbx1+/- mice". Although the authors showed the PPI rescue by treating Tbx1+/- mice with vB12, that result alone does not prove the association of metabolic phenotype with sensorimotor-gating deficit; other supporting data are needed.

      15. The authors stated, "Results showed that there were very few differentially expressed genes in Tbx1+/- vs WT brains, (n=22 out of 14535 expressed genes (Fig. 5 and Supplementary Tab. 2)". However, they described how 3 differentially expressed genes are involved in mitochondrial activity in the Discussion. The authors should describe those 3 genes and their relation to the metabolic change.

      16. Figure 5B: The authors claimed that they detected similar transcription profiles between WT+vB12 vs. Tbx1+/-+vB12, comparing Tbx1+/-+PBS vs. Tbx1+/-+vB12. This is based on 947 genes being downregulated and 834 being upregulated, which is not appropriate. The authors should normalize those data to the numbers of genes upregulated and downregulated in WT+PBS vs. WT+vB12 respective groups.

      Minor comments:

      1. Supplementary Table S1 shows the identical MMA concentration "0.2" for 6 controls. Is this correct?

      2. Remove the callout for Figure 1C at the end of the second paragraph in Results.

      3. There are multiple typos throughout the manuscript. Here are several examples:

      a) Fig1B graph- Df/+ => Df1/+

      b) "Together, the hydrophilic and lipophilic results revealed a group of 6 compounds that characterized the brain metabolic differences between Tbx1+/- and WT mice (Figure 2B, 2C)". Figure 2A should be included also.

      c) "In support of this notion, is the finding that...(remove)

      d) Remove double periods: "The pathways found are depicted in Figure 2C' which reports the impact of each pathway versus p values.."

      e) Panel labels in all figures are misplaced.

      f) "In support of this notion... at least nine orthologs are involved in mitochondrial metabolism". What are those 9 mitochondrial genes? Kolar et al., Schizophr Bull (2023) indicates that there are 8 mitochondrial genes within the 22q11.2 locus. The authors need to list these genes.

      Significance

      Caterino et al. provide the evidence of a change in brain metabolism associated with 22q11.2 deletion syndrome (22q11DS), which is a major risk factor for neuropsychiatric disease, especially schizophrenia. The authors attributed these changes to haploinsufficiency of Tbx1, which is located inside the 22q11.2 genomic locus, as Tbx1+/- mice and 22q11DS models (Df1/+ mice) demonstrated similar metabolic changes. Among these changes, the authors detected an abnormally high accumulation of methylmalonic acid (MMA), a toxic metabolite that negatively affects mitochondrial activity and glutamate uptake. The authors also showed that systemic vitamin B12 (vB12) administration in Tbx1+/- mice and 22q11DS mice returned MMA to its normal levels in the brains of both strains. Finally, administration of vB12 partially rescued a behavioral phenotype in Tbx1+/- mice. Specifically, vB12 improved pre-pulse inhibition (PPI) of the acoustic startle in these mutants. This study may potentially motivate a new direction of research toward elucidating the Tbx1-dependent metabolic mechanism and its connection to abnormal behavioral phenotypes in 22q11DS.

      • This is the first thorough characterization of metabolic changes in the brain of mouse models of 22q11DS. This work may advance our understanding of the mechanism of this disease and schizophrenia in general. Previous studies mostly focused on cell-autonomous mechanisms in 22q11DS, but because Tbx1 is not expressed in the adult brain, this study strongly argues that non-cell-autonomous mechanisms are involved in the pathogenicity of 22q11DS. This description is the strongest part of the manuscript. However, as it stands, this study does not causally connect the metabolite abnormalities with aberrant behavior in 22q11DS models. The rescue of metabolic changes by vB12 is interesting, but the interpretation of the results is a bit speculative. There are also several missing controls and other limitations that preclude this reviewer from a more enthusiastic assessment of this manuscript. The manuscript has the potential to be much stronger if the authors add more experiments and analyze their data in a more appropriate manner.

      • This manuscript describes metabolic abnormalities in 22q11DS models that are driven by the haploinsufficiency of the Tbx1 gene, which is located within the 22q11DS-related genomic region. This is an incremental advance in the 22q11DS field. The authors identified multiple metabolic abnormalities that would be of interest to researchers in the fields of psychiatry and translational and basic neuroscience. They also showed that some metabolic abnormalities are improved by vB12 treatment. They argue that vB12 improves PPI, the abnormality of which is associated with many psychiatric diseases, including schizophrenia. If confirmed, this will be a conceptual advance in the translational neuroscience field. However, making such a statement, exciting as it may be, would be speculative if based solely on the data presented in this manuscript..

      • This reviewer has expertise in basic and translational neuroscience with focus on neurodevelopmental psychiatric diseases.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer1

      I recommend combining Figures 1 and 3 either before or after the results shown in Figure 2, so the reader's expectation for quantification is immediately satisfied.

      Response:

      Thank you for your suggestion. In the revised manuscript, images of GFP::POP-1 in compound mutants are moved to Figure 3. The schematic diagram of the gonad (previously Fig. 1A) and GFP::POP-1 images in wild type are kept in Figure 1, as they are described in Introduction.

      Major comments:

      Delving into the figure legend of Fig. 3 and the normalization procedure described in the Methods "Quantification of POP-1 asymmetry in the Z1 and Z4 division" raised concerns. The method therein described is overly complicated but also neglects background subtraction. My first question about this method: what range of distances between daughters is measured in Z? These distances are not discussed in absolute terms, and this is important for our understanding of how much correction for tissue depth might be necessary, as L1s are very thin.

      To check my understanding, the authors use as a control a nuclear-localized GFP driven in the somatic gonad precursors in otherwise wild-type worms by the sys-1 promoter. They observe that the regression on a log scale of anterior:posterior (and vice versa) Z1 and Z4 daughter fluorescence over the distance between the daughters in the Z plane is fit by y = −0.034x + 0.0148, which is practically a slope of 0 and an intercept of 0. This means that they observed an ~1:1 ratio (as log(1)=0) of fluorescence in the anterior and posterior daughters of otherwise wild-type worms, at least across the range of very small X values of relevant distances between daughters (again, the relevant range of distances really matters and should be presented), making the normalization seem unnecessary.

      Response:

      Normalization is essential to compare POP-1 signals between daughter cells since the signal intensities depend on the depth of cells. Depth differences between SGP daughter cells range from 0 to 7.5 micrometers. For example, when we input the maximum difference (7.5) into our correction equation y = −0.034x + 0.0148 (the logarithmically transformed linear regression equation), we get:

      y = −0.034 * 7.5 + 0.0148 = -0.2402

      To interpret this on the original scale, we apply the inverse logarithmic transformation:

      10^(-0.2402) ≈0.575

      This result indicates that even if GFP::POP-1 expression is the same in both cells, the depth difference alone can cause approximately a 1.74-fold (1/0. 575) difference in fluorescence intensity.

      Similarly, if we use a median value of 3.5 micrometers as the depth difference, we get: y = -0.1042. After the inverse logarithmic transformation, this corresponds to a 0.787 or 1.27 (1/0.787) fold difference in fluorescence intensity.

      Without normalization, we risk misinterpreting such differences in expression levels when in reality the expression is the same. Conversely, actual differences in GFP::POP-1 signal could be masked or overestimated due to the depth effect.

      In the revised manuscript, examples of depth differences between SGP daughters are shown in Fig. 2S which is added in response to the comment of reviewer 2, asking images of lin-17 mom-5 animals.

      In the revised manuscript, we explained the depth effects in the legend of Fig. 3 as follows.

      "Since SGP daughter cells are often present at distinct focal planes, we normalized the depth effects on fluorescence intensities (see Materials and Methods for details) for the quantification shown in (B). The images in (A) and (C) are from animals with SGP daughters at similar depths."

      Then based on this regression and 95% CI, the authors predict values that reflect true equivalence of fluorescence of POP-1::GFP in the two SGP daughters, compare the observed values to these predictions, and ultimately display in violin plots these differences of observed and expected. Correct?

      Response:

      Yes, your understanding is correct.

      Is this complicated treatment the only way to detect differences in polarity of anterior and posterior daughters of Z1 and Z4? What happens if the authors measure GFP::POP-1 and calculate the following?

      Z1p(MGV - background control from same focal plane)


      Z1a(MGV - background control from same focal plane)

      If this straightforward analysis shows asymmetric signal in the control that is made symmetrical or reversed in the mutants, the hypothesis would seem to be supported with a much more straightforward method. Samples could be analyzed separately in two bins by worm body position, which affects which cell is superficial in the sample. As it is, the Figure 3 Y axis label is hard to interpret without reading the methods at length, diminishing its impact.

      Response:

      Thank you for the suggestion. Your suggested calculation would be simple if we could assume that control signals (sys-1p::GFP::NLS or sys-1p::GFP::POP-1 in the same wild-type cell) on the same focal plane are the same among animals. However, since there are apparent variations in expression levels among individuals, your suggested method is not appropriate for evaluating differences in sys-1p::GFP::POP-1 intensities between the SGP daughter cells of the same animal.

      Missing control: The sys-1 promoter-driven NLS-tagged fluorescent protein as a control to compare to the GFP::POP-1 is analyzed only in the wild-type, and apparently not in the mutants under consideration. Phillips et al. (2007) show that sys-1p transcriptional activity is equivalent between the SGP daughters in wild-type worms, but neither those results nor the method of normalizing to a sys-1p::GFP::NLS signal in this paper address the question of whether sys-1 promoter activity is equivalent in these cells in mutants upstream in the Wnt pathway. If the current method of normalization is to be used, it seems important to normalize to the sys-1p::GFP::NLS regression in each mutant background.

      Response:

      Thank you for your suggestion. We used sys-1p::GFP::NLS as a control to normalize depth effects, which should be the same across all genotypes because the GFP molecules in SGPs should be equally distributed between SGP daughter cells, not because sys-1 promoter activities are similar among them. Since SGP daughters divide within a short time (about 2 hours), it is likely that the fluorescence of newly synthesized GFP (maturation time of about 1 hour) in SGP daughters is neglectable compared to GFP inherited from the SGP cells. Similarly, sys-1p::GFP::POP-1 signals in SGP daughters reflect the distribution of GFP::POP-1 from SGPs rather than the transcriptional activities of the sys-1 promoter in the daughter cells. sys-1p::GFP::POP-1 or sys-1p::GFP::SYS-1 has been widely used to evaluate polarity of asymmetric divisions in a number of studies, none of which consider transcriptional differences of the sys-1 promoter in the daughter cells.

      1. How was lin-17(mn589) generated? if this is the first report of this allele, full information on what the lesion is and how it was derived should to be reported.

      Response:

      Thank you for your question regarding the lin-17(mn589) allele. We would like to point out that the information about this allele is provided in the Methods section of the original manuscript as follows.

      "lin-17(mn589) (gifted by Mike Herman) carries a mutation in the seventh cysteine residue of the CRD domain (C104Y). mn589 exhibits 47% Psa phenotype (indicating T cell polarity defects)."

      The methods section lacks a description of how the mes-1 experiments were done, in terms of timing, duration, and temperature; mes-1(bn7) is a temperature sensitive allele.

      Response:

      Thank you for pointing out the lack of detailed methodology for the mes-1 experiments. The germless phenotype of mes-1 mutants is partial even at high temperatures. We have not performed temperature shifts to observe the phenotype. As per your suggestion, we added the following text to the Strains section:

      "mes-1(bn7) is a temperature-sensitive allele with higher penetrance of the germless phenotype at 25{degree sign}C than at 15{degree sign}C, and was grown at 22.5{degree sign}C. The germless phenotype of mes-1(bn7) was observed by the absence of the mex-5::GFP::PH signal through direct observation of epifluorescence."

      Minor comments

      1. The paper lacks a discussion of precedent in the literature for Wnt-independent Frizzled activity; this is a major finding that is being undersold in the current version of the manuscript.

      Response:

      Thank you very much for appreciating out finding. We have added the following paragraph to the Discussion section:

      "Wnt-independent functions of Frizzled receptors

      We have shown that lin-17/Fzd functions in a Wnt-independent manner to control SGP polarity, since the missing DTC phenotype of lin-17; cwn-2 and lin-17 mom-5 was completely rescued by ΔCRD-LIN-17. In addition, SGP polarity is normal in the quintuple Wnt mutant that has mutations in all the Wnt genes (Yamamoto et al., 2011). In seam cells, Wnt receptors including LIN-17/Fzd and MOM-5/Fzd appear to have Wnt-independent functions for cell polarization, as seam cells are still mostly polarized in the quintuple Wnt mutants, while they are strongly unpolarized in the triple receptor mutants (lin-17 mom-5 cam-1/Ror) (Yamamoto et al., 2011). In Drosophila, Fz/Fzd has been primarily considered to function Wnt-independently to coordinate planar cell polarity (PCP) between neighboring cells (Lawrence et al., 2007), though Fz function can still be regulated by Wnt, as PCP orientation can be directed by ectopically expressed Wnt proteins (Wu et al., 2013).

                In Drosophila, Fz regulates PCP by interacting with other PCP components including Van Gogh (Vang). In C. elegans, we found that vang-1/Vang does not appear to function with LIN-17/Fz, since most vang-1 single mutants and cwn-1 cwn-2 vang-1 triple mutants have two gonadal arms (215/216 and 58/58, respectively). As Fz interacts with Disheveled (DSH) in Drosophila PCP regulation, in C. elegans, the Disheveled homologs DSH-2 and MIG-5 regulate SGP polarity (Phillips et al., 2007). Therefore, LIN-17 might regulate the DSH homologs in a Wnt-independent manner. "
      

      Added Reference:

      1. Lawrence PA, Struhl G, Casal J. (2007). Planar cell polarity: one or two pathways? Nat Rev Genet. 8, 555-563.
      2. Wu, J., Roman, A.C., Carvajal-Gonzalez, J.M., & Mlodzik, M. (2013). Wg and Wnt4 provide long-range directional input to planar cell polarity orientation in Drosophila. Nature Cell Biology, 15(9), 1045-1055.

      Important: I think "Fig. 6 Germ cell independent migration of germ cells" title is a typo; should be "Germ cell independent migration of DTCs"

      Response:

      Thank you for pointing out the typo. We corrected it in the revised manuscript.

      This is a very important experiment! I think a greater description of the mes-1 phenotype would be helpful, since loss of germline was not 100% penetrant in mes-1(bn7) hermaphrodites in Strome et al., 1995. The legend says "Germless mes-1 phenotype was confirmed by the absence of the mex-5::GFP::PH signal in the gonad." Consider adding a few sentences to the results (or methods, from which the mes-1 experiments are currently missing) describing that only mes-1 animals that lacked germline fluorescence were analyzed for DTC migration.

      Response:

      Thank you for providing the context. To address the concerns, we made the following changes to our manuscript:

      1. In the Results section, we revised the sentence "We found that 84% of DTCs (n = 90) in germless mes-1 animals..." to "Among mes-1 animals that lack germ cells, we found 84% of DTCs (n = 90)...".
      2. We also modified the sentence "We noticed that some germless mes-1 animals..." to "We noticed that some mes-1 animals that lack germ cells...".

      Please correct "secreting the Notch ligand LAG-2" this is a membrane-bound, not secreted ligand

      Response:

      Thank you for your comment. In the revised manuscript, we modified the relevant sentence in the Introduction section as follows:

      "Firstly, DTCs function as niche cells for germline stem cells, inhibiting their entry into meiosis by expressing the Notch ligand LAG-2 (Henderson et al., 1994)."

      Fig 1. The qualitative loss of polarity would be better depicted with

      a grayscale image instead of green-on-black.

      Response:

      Thank you for your suggestion. The GFP::POP-1 images are raw images of the green channel of the confocal microscopy. We believe that SGP polarity is clearly depicted by them.

      Fig. 3 the presentation of these violin plots is confusing. The central text that reads "normal polarity, loss of polarity, reversed polarity" with arrows looks like a second Y axis label attached to the Z4 plot. I recommend rearranging. Consider shading the top, bottom, and central regions and explaining the meaning of the shading in the legend.

      Response:

      Thank you for your suggestions regarding the presentation of Figure 3. In response to your feedback, we have made the following modifications:

      First, we moved the text and arrows from the center to the right side of the figure, creating a clearer layout. As you recommended, we applied shading to the top, bottom, and central regions of the violin plots. Additionally, to explain the meaning of the shading, we added a new explanation to the figure legend. Specifically, we included the following text:

      "Values within the 95% CI (between the red lines; light green regions) indicate symmetric localization. Values below the lower red line (light blue regions) indicate reversed localization, while values above the upper red line (light red regions) indicate normal localization."

      We applied the same modification to Supplemental Fig. 1.

      Reviewer 2

      Major comments

      1- Are the effects of combining the different Wnts with the lin-17 allele specific to the n3091 allele? It would be important to test another allele, for example the sy277 allele has a similar phenotype and is available at CGC. A null would be even better if it is viable. Alternatively, lin-17(RNAi) could instead be used if efficient enough. This is important since the n3091 allele could differentially alter the binding to the various Wnts, resulting in their distinct phenotypes in that background. However, these distinct phenotypes may not be relevant in a wild-type context.

      Response:

      Thank you for your insightful comment. The lin-17(n3091) allele contains a nonsense mutation at the 35th codon, located between the second and third cysteine residues in the CRD domain (Wnt binding domain) (Sawa et al 1996). Therefore, it is highly unlikely that the N-terminal protein of 34 amino acids produced in lin-17(n3091) can bind to Wnts. In the revised manuscript, we added the missing-DTC phenotype of lin-17(n671) cwn-2 animals, which show a similar phenotype to lin-17(n3091) cwn-2. n671 is a reference allele in WormBase and has a nonsense mutation. Although sy277 has a deletion in the N-terminal region, its phenotype is weaker than that of n3091 and n671 (Sawa et al 1996).

      In the revised manuscript, we described lin-17(n671) cwn-2, in the Table 1, Table S1 and added the following sentence.

      "We observed a similar phenotype in lin-17(n671); cwn-2 double mutants, confirming that this genetic interaction is not allele-specific."

      2- In the lin-17; mom-5 double mutant which lacks DTCs, are Z1 and Z4 there but they do not express DTC markers, or are they never born? A lineage analysis should be presented. Also, are Z2 and Z3 still there on their own? Please show images.

      Response:

      Thank you for your comments. We quantified sys-1p::GFP::POP-1 signals in Z1 and Z4 daughter cells of lin-17 mom-5 and have not observed any animals lacking Z1, Z4 or germ cells. In the revised manuscript, as Fig. S2, we added images of sys-1p::GFP::POP-1 localizations in SGP daughters, along with germ cells in lin-17 mom-5 as well as in lin-17 cwn-1 egl-20 cwn-2, both of which were not shown in the original manuscript. In response to Reviewer 1's comment, we also included examples of depth effects on fluorescence intensities in Fig. S2 with images of different focal planes.

      Fig. S2 is quoted it at the end of the following sentence.

      "Then, we quantified the ratios (on a logarithmic scale) of sys-1p::GFP::POP-1 signal intensities proximal to distal daughter cells in various genotypes (Fig. 3A and Fig. S2)."

      The loss of polarity phenotype of lin-17 mom-5 has been described in Phillips et al. We missed to cite this in the original manuscript. We added the citation in the revised manuscript.

      "These asymmetries were strongly disrupted and weakly affected in lin-17 mom-5 double and lin-17 single mutants, respectively, as described previously (Phillips et al., 2007; Siegfried et al., 2004)."

      Minor comments

      1- The term "mirror-symmetry" is redundant. Consider using "symmetry"

      or "symmetrical polarity".

      Response:

      As noted in the cross-comment by Reviewer 1, we believe that "mirror-symmetry" is the appropriate term.

      We think that "symmetry" implies the same lineage, whereas the relationship between the Z1 and Z4 lineages is not. "Mirror symmetry" was also used in Herman & Horvitz (1994) to describe the defect in the F lineage in lin-44/Wnt mutants as follows.

      "we observed division patterns that were mirror symmetric to those of the wild type (Fig. 2). One plausible explanation is that the polarity of the first asymmetric cell division was reversed, causing the polarities of all subsequent asymmetric cell divisions also to be reversed."

      2- "... they are permissively pushed distally by germ cells while proliferating" is confusing as it is unclear what proliferating cell you are referring to - germ cells or the DTC? proliferating? sense. Replace by: "they are pushed distally by proliferating germ cells"

      Response:

      Thank you for your helpful comment. We agree with your suggestion and modify the sentence as follows:

      Original: "... they are permissively pushed distally by germ cells while proliferating" Revised: "... they are pushed distally by proliferating germ cells"

      3- Fig. 2 is cited in the text before Fig. 1.

      Response:

      Thank you for pointing this out. Figure1 is mentioned in the Introduction before Figure 2 is referenced in the Result section in the original manuscript. We think the reviewer might be confused, as the POP-1 localization defect was shown in Figure 1. In response to the reviewer 1's comment, we moved the POP-1 localization images of the compound mutants to Figure 3. In addition, we noticed that in the original manuscript, Figure 1B was mentioned before Figure 1A in the Introduction. Therefore, we have modified the sentences in the Introduction.

      The original sentence was:

      "In the gonad, at the L1 stage, somatic gonadal precursor cells (SGPs), Z1 and Z4 have LH and HL polarity, respectively (Siegfried et al., 2004) (Fig. 1B). This mirror-symmetric polarity creates their mirror-symmetric lineages producing distal tip cells (DTCs) from the distal daughters (Z1.a and Z4.p) (Fig. 1B)."

      The revised sentence now reads:

      "In the gonad, at the L1 stage, somatic gonadal precursor cells (SGPs), Z1 and Z4 have LH and HL polarity, respectively, creating their mirror-symmetric lineages producing distal tip cells (DTCs) from the distal daughters (Z1.a and Z4.p) (Siegfried et al., 2004) (Fig. 1A and 1B)."

      4- The results also suggest that MOM-5/Frizzled might be the receptor for Wnts regulating DTC production, as lin-17 mom-5 double mutants completely lack DTCs." Table 1 results rather suggest that lin-17 and mom-5 are the two frizzled receptor involved in DTC specification and that they are largely redundant.

      Response:

      As the reviewer noted, lin-17 and mom-5 function redundantly in DTC specification (SGP polarization). However, their functions are clearly different in terms of genetic interactions with Wnt genes (e.g. lin-17 cwn-2 but not mom-5 cwn-2 show the DTC-missing phenotype). We propose that MOM-5 but not LIN-17 functions as a receptor for Wnts.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This is an interesting paper where authors study how DTC specification and migration are genetically controlled. The results show a complex interaction between the various Wnt components with partially redundant roles for the ligands and receptors. This helps understanding how the two DTCs are specified and symmetrically polarized in vivo. In addition, authors convincingly show that DTCs can significantly migrate in the absence of any push by GSC proliferation.

      Major

      1. Are the effects of combining the different Wnts with the lin-17 allele specific to the n3091 allele? It would be important to test another allele, for example the sy277 allele has a similar phenotype and is available at CGC. A null would be even better if it is viable. Alternatively, lin-17(RNAi) could instead be used if efficient enough. This is important since the n3091 allele could differentially alter the binding to the various Wnts, resulting in their distinct phenotypes in that background. However, these distinct phenotypes may not be relevant in a wild-type context.
      2. In the lin-17; mom-5 double mutant which lacks DTCs, are Z1 and Z4 there but they do not express DTC markers, or are they never born? A lineage analysis should be presented. Also, are Z2 and Z3 still there on their own? Please show images.

      Minor

      1. The term "mirror-symmetry" is redundant. Consider using "symmetry" or "symmetrical polarity".
      2. "... they are permissively pushed distally by germ cells while proliferating" is confusing as it is unclear what proliferating cell you are referring to - germ cells or the DTC? proliferating? sense. Replace by: "they are pushed distally by proliferating germ cells"
      3. Fig. 2 is cited in the text before Fig. 1.
      4. "The results also suggest that MOM-5/Frizzled might be the receptor for Wnts regulating DTC production, as lin-17 mom-5 double mutants completely lack DTCs." Table 1 results rather suggest that lin-17 and mom-5 are the two frizzled receptor involved in DTC specification and that they are largely redundant.

      Referees cross-commenting

      Dear reviewer #1, with all due respect, I do do understand your point as the Fig. 1 in Bowerman shows lineages that are left-right "asymmetrical", not "non-mirror symmetrical"? In the paper we are reviewing, the Z1-Z4 lineages are anterior-posterior symmetrical. But this is a minor issue and we can wait to see what the authors are going to reply...

      Significance

      The paper is interesting but in its current form, the genetics results around DTC specification are somewhat complex and difficult to interpret. The results linked to DTC migration are easier to take home.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      This manuscript illuminates the molecular regulation of mirror-image-symmetry in the C. elegans hermaphrodite somatic gonad precursor cells, reporting that a diversity of Wnt ligands and a Wnt receptor with Wnt-independent activity together polarize the two progenitors of the somatic gonad in opposite ways. The Wnt ligand genes act nonredundantly with respect to the polarity of the SGPs.

      The conclusions arise from the careful interpretation of compound genetic mutants in the Wnt pathway, including the surprising finding of a Wnt-independent effect of lin-17/Frizzled on cell polarity that is bolstered by a new allele and construct of lin-17 that affect specifically its Wnt-interacting domain.

      Major comments

      While I find the genetic analyses compelling, I think the presentation of the data undermines the strength of the quantitative analysis of expression. Recommendations follow.

      Genetic analyses: Major strength. Complex compound mutants are presented with ample sample sizes and careful interpretation of genetic interactions, presented clearly in tables that demonstrate large sample sizes. The new allele of lin-17 and delta-CRD LIN-17 construct reveal the surprising Wnt-independence of this Frizzled-type receptor. While of course I am interested in this Wnt-independent role, it is not necessary for the current manuscript to determine its mechanism.

      Statistical analysis: When I first saw Fig. 1 B and C and read the associated results sections, I thought the qualitative polarity results reported in Fig. 1 would benefit from quantification, including at a minimum the sample size and penetrance of the phenotypes reported in Fig1B and 1C; it would be even better to quantify expression levels in the daughter cells. Then I discovered this appears to be what is reported in Figure 3. I think the presentation of these data are confusing. I recommend combining Figures 1 and 3 either before or after the results shown in Figure 2, so the reader's expectation for quantification is immediately satisfied.

      Delving into the figure legend of Fig. 3 and the normalization procedure described in the Methods "Quantification of POP-1 asymmetry in the Z1 and Z4 division" raised concerns. The method therein described is overly complicated but also neglects background subtraction. My first question about this method: what range of distances between daughters is measured in Z? These distances are not discussed in absolute terms, and this is important for our understanding of how much correction for tissue depth might be necessary, as L1s are very thin.

      To check my understanding, the authors use as a control a nuclear-localized GFP driven in the somatic gonad precursors in otherwise wild-type worms by the sys-1 promoter. They observe that the regression on a log scale of anterior:posterior (and vice versa) Z1 and Z4 daughter fluorescence over the distance between the daughters in the Z plane is fit by y = −0.034x + 0.0148, which is practically a slope of 0 and an intercept of 0. This means that they observed an ~1:1 ratio (as log(1)=0) of fluorescence in the anterior and posterior daughters of otherwise wild-type worms, at least across the range of very small X values of relevant distances between daughters (again, the relevant range of distances really matters and should be presented), making the normalization seem unnecessary.

      Then based on this regression and 95% CI, the authors predict values that reflect true equivalence of fluorescence of POP-1::GFP in the two SGP daughters, compare the observed values to these predictions, and ultimately display in violin plots these differences of observed and expected. Correct?

      Question: Is this complicated treatment the only way to detect differences in polarity of anterior and posterior daughters of Z1 and Z4? What happens if the authors measure GFP::POP-1 and calculate the following?

      Z1p(MGV - background control from same focal plane)

      Z1a(MGV - background control from same focal plane)

      If this straightforward analysis shows asymmetric signal in the control that is made symmetrical or reversed in the mutants, the hypothesis would seem to be supported with a much more straightforward method. Samples could be analyzed separately in two bins by worm body position, which affects which cell is superficial in the sample. As it is, the Figure 3 Y axis label is hard to interpret without reading the methods at length, diminishing its impact.

      Missing control: The sys-1 promoter-driven NLS-tagged fluorescent protein as a control to compare to the GFP::POP-1 is analyzed only in the wild-type, and apparently not in the mutants under consideration. Phillips et al. (2007) show that sys-1p transcriptional activity is equivalent between the SGP daughters in wild-type worms, but neither those results nor the method of normalizing to a sys-1p::GFP::NLS signal in this paper address the question of whether sys-1 promoter activity is equivalent in these cells in mutants upstream in the Wnt pathway. If the current method of normalization is to be used, it seems important to normalize to the sys-1p::GFP::NLS regression in each mutant background.

      Missing methods: How was lin-17(mn589) generated? if this is the first report of this allele, full information on what the lesion is and how it was derived should to be reported.

      The methods section lacks a description of how the mes-1 experiments were done, in terms of timing, duration, and temperature; mes-1(bn7) is a temperature sensitive allele.

      Minor comments

      The paper lacks a discussion of precedent in the literature for Wnt-independent Frizzled activity; this is a major finding that is being undersold in the current version of the manuscript.

      Important: I think "Fig. 6 Germ cell independent migration of germ cells" title is a typo; should be "Germ cell independent migration of DTCs"

      This is a very important experiment! I think a greater description of the mes-1 phenotype would be helpful, since loss of germline was not 100% penetrant in mes-1(bn7) hermaphrodites in Strome et al., 1995. The legend says "Germless mes-1 phenotype was confirmed by the absence of the mex-5::GFP::PH signal in the gonad." Consider adding a few sentences to the results (or methods, from which the mes-1 experiments are currently missing) describing that only mes-1 animals that lacked germline fluorescence were analyzed for DTC migration.

      Please correct "secreting the Notch ligand LAG-2" this is a membrane-bound, not secreted ligand

      Fig 1. The qualitative loss of polarity would be better depicted with a grayscale image instead of green-on-black.

      Fig. 3 the presentation of these violin plots is confusing. The central text that reads "normal polarity, loss of polarity, reversed polarity" with arrows looks like a second Y axis label attached to the Z4 plot. I recommend rearranging. Consider shading the top, bottom, and central regions and explaining the meaning of the shading in the legend.

      Referees cross-commenting

      I have no comments to add to the points raised by Reviewer 2, other than to share my opinion that "mirror symmetry" is helpful terminology, as there are other possible developmental symmetries other than the "HL LH" mirror symmetry seen in the SGPs. For an example of a non-mirror developmental symmetry, see Fig. 1 of this dispatch by Bowerman 2006 https://www.sciencedirect.com/science/article/pii/S0960982206024481

      Significance

      Strengths: Rigorous genetics

      Limitations: Data presentation, expression level analysis, communication of results.

      Advance: Wnt signaling has highly conserved, ancient roles in the earliest polarity events in animal embryos, so learning how the Wnt signaling pathway dictates cell polarity, including a new Wnt-independent role for a major receptor, has broad implications for understanding cell polarity via this pathway in animal development. Another major advance is the finding that a conserved developmental signaling pathway with members that are redundant in certain contexts nonetheless elicits specific responses from certain cells during development, thus generating a diversity cell types.

      Audience: These findings will be of interest to a broad audience interested in animal development, symmetry-breaking, complexity, and the Wnt-signaling pathway.

      Additionally, a worm-specific audience will be interested in the finding that the DTCs are capable of germline-independent migration, as the new paradigm of Agarwal et al. ascribes DTC propulsion to germline pushing forces. How the DTC migrates is a longstanding question in our field.

      Reviewer: I study developmental genetics and cell biology in the C. elegans hermaphrodite gonad. I have sufficient expertise to evaluate all of the experiments presented here. In my opinion, this is a highly valuable developmental genetics study with significant but easily addressable flaws in presentation.

  5. Aug 2024
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      'The authors do not wish to provide a response at this time.'

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      DNA double-strand breaks are harmful to cells. The authors used CRISPR-Cas9 to create DSBs in repetitive elements (Ty transposons) in the Saccharomyces cerevisiae genome. This method builds on previous ones that used the HO endonuclease or prokaryotic restriction enzymes. They used Cas9-based DSBs to assess the role of Tel1 (ATR) in DSB sensing, comparing it with the DSB-generating xenobiotic zeocin. The first part of the paper is fine, but it lacks information about the yield. The system is not as efficient as it is claimed to be, and it is slow. The second part of the paper is not well connected to the first half, and several controls are needed to make it sound.

      Major comments:

      1. A significant portion of the manuscript posits that the CRISPR-Cas9 system is capable of creating as many DSBs as the theoretical number of Ty targets. Nevertheless, it is clear that this is not the case. The authors can determine the number of breaks per construction. A plot of the frequency of average DSBs in chromosomes IV and III appears to be feasible based on the Southern blots presented, although this could give an underestimate, given that some broken chromosomes may become trapped in the well during DSB processing. However, the percentage of zero DSBs for a given chromosome is readily quantifiable, allowing the frequency of DSBs per cell to be determined through straightforward calculations. Once this has been achieved, the authors should revisit their assessments of Rfa1/Rad52/Tel1 foci formation and number.

      2. OPTIONAL: It seems plausible that a saturation of DSBs per cell may occur when the actual number of DSBs is plotted against the theoretical maximum. In such a scenario, the introduction of additional Cas9 molecules could enhance the efficiency of DSB generation. This could be achieved by increasing the number of copies of the CAS9 gene.

      3. The majority of the discussion chapter is dedicated to the second part of the paper (Tel1), yet no mention is made of the subject I have just commented on. Similarly, a comparison should be made between the Cas9-based system and previous methodologies for creating single and multiple DSBs (HO, I-SceI, restriction enzymes, radiation, radiomimetic drugs, etc.) in terms of efficiency, time to DSB, cell response, etc. It is surprising that the Cas9 system takes so long to generate DSBs and that the cell cycle profile indicates very little arrest in G2/M. This should also be discussed.

      4. OPTIONAL: Since the lack of a strong G2/M arrest is intriguing, it would be good to do a Western blot of Rad53 to learn more about Cas9-based DSB sensing and the DNA damage checkpoint. Comparing it to both the well-established HO system (even a single HO cut) and radiation/radiomimetics would be ideal.

      5. OPTIONAL: Given the lack of the G2/M arrest when Cas9 is expressed in asynchronous cultures, the authors may try to synchronize cells in G1 and G2/M before Cas9 is induced. This could help them find out if the G1 and G2/M peaks in Figure 2A are caused by DSBs leading to both a G1 and a G2/M arrest. Alternatively, they could film the cells after Cas9 is added and check microcolony formation.

      6. The timeframes of Rad52 foci in Figure 5 indicate an anomalous pattern, which raises concerns about the validity of the experiment setup. The selected cells did not change their morphology throughout the 140-minute observation period. The unbudded cell remained unbudded, and the bud did not grow in six out of seven cells. For cells with small buds (cells 2 to 5), the expectation was that the bud would grow until the cell either became a dumbbell (indicative of a G2/M arrest) or divided its nucleus. Furthermore, no re-budding events were observed. Is this pattern real? Why is it so? Once this issue is addressed, could the duration of the Rad52 foci be quantified? Was a single z-plane taken? If so, some Rad52 foci that appear and/or disappear could reflect their migration to an on-focus or out-of-focus plane.

      7. OPTIONAL: It would be beneficial to conduct a double labeling with known DSB factors that coexist with Tel1, or are downstream of it, in order to enhance the data shown in Figures 6 and 7. This would also address some of the queries raised about Tel1 clustering and location on the nuclear periphery.

      8. In the experiment with the condensin mutant smc2-8, a temperature shift from 24 to 37 ºC is carried out, which represents a significant physiological change. Therefore, it is essential to include a parallel control with a WT strain to rule out the possibility that the unclustering of Tel1 foci is due to the temperature shift.

      9. The sentence in the Discussion about condensin's role in maintaining Tel1 clustering is misleading. It could be interpreted as suggesting that condensin actively gathers Tel1 foci after DSBs (lines 650-651), when in fact condensin's function is simply to maintain Ty element clustering prior to DSBs, as the authors themselves cite in the text.

      10. Taking into account the importance of Cas9/sgRNA plasmid constructions, PFGE and Southern blot in this work, the author should make the effort to describe them all in much more detail in M&M.

      Minor comments:

      1. In the Figure 7 legend, panel A is missing, and the text for the other panels is consequently misplaced.

      2. The functional verification of the yEGFP-Tel1 construction (Fig. 6A & B) would be better presented as a supplementary figure. The same rationale applies to Figure 8.

      3. Please include G2/M in the abbreviations.

      4. Please clarify what is meant by "5h40". Is this 5 hours and 40 minutes? If so, please use alternative nomenclature.

      5. Some sections of the text appear superfluous. For example, the definitions of mean, SEM and SD, and the rationale for choosing SEM over SD (lines 211 to 215), as well as the information about the purpose of PFGE in a figure legend (line 322).

      Referees cross-commenting

      After reading the comments of the other reviewers, I agree with them, and some of them raise the same concerns as I do.

      Significance

      General assessment: The study addresses the generation of multiple DSBs by Cas9 when an increasing number of targets are incorporated for cutting. The strategy to create an increasing dose of DSBs based on the different Ty elements is an innovative approach, although it is constrained by the nature of the target. The assessment of DSBs by PFGE plus Southern blot is well planned, although there is potential to exploit the obtained results further to assess the actual vs theoretical DSBs, saturation effect, etc. They then sought to use their dose-dependent system to examine the role of Tel1 in the DSB response, comparing Cas9 with zeocin. However, a comparison between the two strategies is challenging without prior knowledge of the number of DSBs present in each treatment. Overall, the study represents a promising effort to use Cas9 for the generation of multiple DSBs in yeast. Nevertheless, the system is constrained for further mechanistic studies on the DSB response due to its slow kinetics, low yields, and lack of expected DNA damage responses, particularly the G2/M checkpoint.

      Advance: Despite such a dose-response assessment for the Cas9-based DSBs have not been performed in yeast, similar studies with sequence-specific DSBs have been done before in Lorraine Symington's lab (and others). Perhaps the Cas9-based system is simpler than the one that relies on multiple insertions of the HO cutting sequence, but it appears neither simpler than the one based on the expression of restriction enzymes nor more efficient. The Cas9 system has several limitations that make it less suitable for studying how cells react against multiple DSBs. It is slow, probably saturates after a few DSBs, and does not render a full DNA damage response. However, there is still value in understanding and making predictions about this important gene-editing method.

      Audience: This paper is intended for a specialized audience, including those involved in basic research on DSB sensing and repair, particularly in yeast.

      Field of expertise of the reviewer: Cell and molecular biology of S. cerevisiae, with a particular interest in the DNA damage response.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary.

      The authors have developed a system to generate a variable number of double-strand breaks (DSBs) - specifically 1, 15, or 59 - in the genome of Saccharomyces cerevisiae in a galactose-inducible manner. This system relies on the galactose-inducible expression of Cas9 nuclease and the use of guide RNAs (gRNAs) targeting different classes of Ty elements. The efficiency of DSB induction was assessed through several methods, including monitoring cell viability in media containing galactose versus glucose, pulsed-field gel electrophoresis (PFGE), PFGE combined with Southern blotting using probes specific to different chromosomes, and the formation of foci by repair and checkpoint proteins. Specifically, the system was utilized to examine foci of the apical checkpoint kinase Tel1 and the repair factors RPA and Rad52.

      From their experiments, the authors conclude that the system is capable of inducing DSBs in a controlled and dose-dependent manner, and that the DSBs are indeed induced at the chromosomal loci where the Ty elements reside.From microscopy experiments, the authors conclude that some DSBs are more persistent, while many are repaired via homologous recombination (HR). New DSBs can form as a result of the continuous expression of Cas9. Finally, Tel1 appears to form multiple foci after DSB induction and these foci localize to the nuclear periphery.

      Major comments.

      In my opinion, some experiments lack appropriate quantification, particularly those evaluating the dose-response effect and the efficiency of DSB induction. Additionally, some conclusions appear to be overestimated in relation to the experimental data. Further experiments and more detailed analyses would be beneficial to fully support the claims and conclusions presented. The data and methods are presented in a manner that should allow for reproducibility. The experiments are adequately replicated, but the statistical analyses are lacking in most of the figures. For example, in Figures 4B, 4C, and 6D, it would be helpful to indicate when the increase in cells containing repair foci is significant.

      Additional experiments and modifications suggested:

      • Figure 1B: Quantify the effect of DSB induction on cell viability with a dose-response curve. No differences are observed between the 0.5% gal and 1% gal conditions. The greatest effect on viability occurs in 2% gal and in the absence of sucrose, which might be due to the absence of sucrose. This should be addressed or discussed.
      • Figure 1B and Figure 2: A quantification of galactose-induced Cas9 expression in the presence of different doses of galactose or over time, respectively, is a necessary control (mRNA or protein expression).
      • Figure 2A: From the cell cycle analysis using FACS, the authors suggest that Cas9 induction causes checkpoint activation. This can be easily confirmed by using more direct methods for checkpoint activation, such as Rad53 phosphorylation. In addition, checkpoint activation should be monitored in synchronized cells released in galactose in order to evaluate whether Cas9-induced DSBs in Ty elements can trigger checkpoint in the first cell cycle or require more time. This reflects the timing of DSB induction.
      • Figure 3: In addition to the restriction analysis, kinetics of DSB formation at specific Ty loci by classical Southern blot or qPCR is, in my opinion, necessary to demonstrate the effectiveness and efficiency of DSB induction over time, especially in relation to Cas9 expression.
      • Figure 5: In this case as well, I believe that a quantitative analysis of the number of cells in the different conditions illustrated in the figure is necessary to understand the dynamics of repair and the formation of new DSBs. Some conclusions are somewhat strong, particularly the correlation between cell cycle phase and repair kinetics (long-lasting Rad52 foci versus short-lived Rad52 foci) and would require quantitative and statistical analysis. In addition, it is not clear to me why cells in which the Rad52 focus disappears do not proceed through the cell cycle (e.g., cells represented in rows 8 and 9 of the figure).
      • Figure 7E and F: It would be interesting to see if the number of Rad52 foci per cell changes in the smc2-8 mutant to understand if RAD52 is required to form the repair center or if this depends on the clustering of Ty elements at the nuclear periphery.

      Minor comments.

      The authors have cited relevant literature to support their methodology, findings, and conclusions. The text and figures are clear. The descriptions in the text are precise and well-articulated, making the data easy to understand. The figures are well-designed and clearly labeled.

      Specific suggestions:

      • Figure 2C: Please change the order of the panel: place chromosome III at the top and chromosome IV at the bottom, as described in the text.
      • Figure 4: The dose-response effect is not evident when monitoring repair foci. It appears that the proportion of cells showing RPA and Rad52 foci is generally low, especially after the formation of 59 DSBs. This is particularly concerning given that the strain in which 59 DSBs are induced already has 20% of cells with RPA foci at time = 0. The authors attribute the lower presence of foci to improved repair caused by the clustering of multiple lesions, but could it simply be due to lower Cas9 efficiency when there are many target sites?
      • Related to Figure 4:The statement on line 400: "the proportion of nuclei displaying Rfa1 foci was consistently double than that of nuclei bearing Rad52 foci, probably reflecting the increased residence time of resected filaments in comparison with the process of homology search" is somewhat strong, considering that the proteins are labeled with different fluorophores, which might experience different rates of photobleaching.

      I believe the proposed experiments can be completed in about 6 months. The request to monitor the kinetics of DSB formation at specific Ty elements in more detail might take more time if the PCR or Southern blot techniques need to be optimized. However, the authors may have other methods in mind that are more familiar to them for evaluating these kinetics, which could be equally valid.

      Referees cross-commenting

      I have carefully read the comments on the manuscript from the other reviewers. I noticed that many of our opinions coincide, and I am convinced that all the requests are appropriate.

      Significance

      My field of expertise centers on checkpoint regulation and homologous recombination in Saccharomyces cerevisiae. I have extensively utilized the galactose-inducible HO endonuclease system for inducing DSBs. I believe that developing additional systems to induce one or more localized DSBs in specific genomic regions is crucial for addressing unresolved questions regarding DSB response. An ideal system would also operate independently of galactose. Based on my experience, an effective system for DSB induction should induce breaks rapidly and simultaneously to produce innovative and reproducible results. From the data presented, I am uncertain whether the system developed in this work meets these criteria.

      The development of a Cas9-based system capable of forming multiple DSBs could be an important tool for studying the DSB response, although I have doubts about how much it will truly enhance our understanding of damage repair. Other systems, cited by the authors, have been previously developed to produce multiple DSBs with different nucleases and using TY elements. Although the Ty-HO system to induce 1, 7, or 10 DSBs was developed in L. Symington's laboratory in 2004, the use of this system has been very limited, likely because it is not easy to monitor what happens to individual DSBs and because the efficiency of simultaneously inducing multiple DSBs may decrease with the increase in the number of target sites. I am concerned that the tool developed in this research article may have limited applicability, being relevant primarily to a small niche within the scientific community focused on DSB response, and thus might generate only a narrow interest. However, the latter part of the paper, which addresses the localization of Tel1, seems promising, despite being preliminary. Additionally, the development of a fluorescent variant of Tel1 is intriguing. This new variant appears to retain the protein's functions and forms well-visible foci within the cell, which seem to be brighter and more intense compared to those obtained with the variant previously developed by M. Lisby and R. Rothstein.

      Strengths:

      • The study presents an innovative system to induce a variable number of double-strand breaks (DSBs) in the genome of Saccharomyces cerevisiae using a galactose-inducible Cas9 and specific gRNAs for Ty elements.
      • The authors use a variety of methods to evaluate DSB induction, including cell viability assays, PFGE, Southern blotting, and foci formation of repair and checkpoint proteins. This comprehensive approach provides sufficient evidence that DSBs are induced.
      • The latter part of the article, focusing on the formation and localization of Tel1 foci, is particularly important. It sheds new light on the functions of Tel1 and its role in the DSB response. This finding is a significant preliminary indication that warrants further development, as the authors suggest in the discussion. I find the development of a tool to effectively visualize Tel1, which is a low-abundance protein in the cell, to be innovative and important for the community working in DSB repair and checkpoint.

      Limitations:

      • Some experiments lack appropriate quantification. For instance, a dose-response curve quantifying the effect of DSB induction on cell viability is missing. Additionally, quantification of galactose-induced Cas9 expression over time (mRNA or protein) is necessary.
      • Statistical analyses are lacking in most figures. It is important to indicate when increases in cells containing repair foci are significant, particularly in Figures 4B, 4C, and 6D.
      • The suggestion of checkpoint activation from cell cycle analysis using FACS should be validated with more direct methods, such as Rad53 phosphorylation, and monitored in synchronized cells to evaluate the timing of checkpoint activation.
      • Further analysis is needed to demonstrate the effectiveness and efficiency of DSB induction over time, especially in relation to Cas9 expression. Monitoring the kinetics of DSB formation at specific Ty loci by classical Southern blot or qPCR would be beneficial.
    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript entitled "A CRISPR-Cas9-based system for the dose-dependent study of DNA double-strand breaks sensing and repair" by Coiffard and colleagues reports on a novel system to monitor the response to multiple DSBs in yeast by targetting Cas9 nuclease to Ty elements. The data is clearly presented, but there are several technical concerns and questions the conclusions of the manuscript. First of all, it is not clear how many DSB cells contain at any given time. The kinetics of DSB formation appears to be slow (hours) relative to the formation and progression of DNA repair foci (minutes). Tel1 foci are found primarily at the nuclear pheriphery, but given the slow kinetics and low number of foci, it is unclear whether the peripheral localization of Tel1 represents physiologically relevant repair and aberrant events where repair failed. Unfortunately, there is not assessment of the outcome of repair. Is the repair error-free or do the cells accummulate indels and/or structural changes at the Cas9 cleavage sites? Without this information, the assay will be of limited use for the scientific community.

      Other major issues:

      1. The introduction should reflect on the chemical structure of DSB ends and the fact that Cas9 remains bound to DNA after cleavage, which may delay repair.
      2. Different Ty elements may be cut with different kinetics. The authors should compare their system to (Gnügge and Symington 2020).
      3. The introduction should reflect on the physiological relevance of 15/59 DSBs.
      4. What is the copy number of the Cas9 and gRNA plasmids? Could cell-to-cell variation in copy number explain the variation in the number of Tel1.
      5. Do the authors observe mutations caused by leaky expression of gRNAs in the uninduced state?
      6. Line 413: I don't think the data in figure 4 completely warrants the conclusion the "DSBs induced by Cas9 engage into HR in a dose-dependent manner". More foci with 15 DSBs than with 59 DSBs? Interference or other explanations? I think this discrepancy warrants additional discussion.
      7. What is observed with a 1h pulse of Cas9 induction followed by glucose? Can the assay monitor the kinetics of repair?
      8. The interpretation of foci in figure 5 is difficult to follow given that only 1 focus is observed while many DSBs are induced. Without further experimentation, these speculations should be moved to the Discussion.
      9. Tel1 foci were always observed at the nuclear periphery (figure 6C). This information should be quantified and compared to Rad52, given that Rad52 foci have previously also been observed at the nuclear periphery for persistent DNA lesions (Whalen and Freudenreich, 2020; Nagai et al., 2008; Lisby et al., 2010). Do these foci perhaps reflect a small subset of Cas9-induced DSBs that are not repair?
      10. Line 699: the notion that Tel1 senses DSBs is novel and does not fit well with the literature given that Tel1 is recruited to foci downstream of Mre11 (Lisby et al., 2004). Unless the authors can provide additional evidence, I suggest to rather write that Tel1 is an early transducer of the DNA damage response.

      Minor suggestions:

      1. The manuscript could benefit from correction of English grammar.
      2. Figure 7F: please also include cells with 1 focus in the graph.
      3. Figure 7: the labels A-F do not follow the legend.
      4. Figure 3, legend: it should be stated in the legend, how long Cas9 was induced in this experiment.
      5. Line 673: it could be noted that in mammalian cells, many more foci are observed, which is probably due to the larger size of the nucleus.
      6. The conclusion that Tel1 behaves exceptionally in terms of the number of foci that are formed is perhaps an overstatement, since only three proteins were analyzed and the occurrence of 8 foci was rare (1:1000 cells).
      7. Line 685: the word "form" indicates that the DSB are already at the nuclear periphery, when bound by Tel1. How do the authors exclude that Tel1 foci form all over the nucleus and then subsequently relocalize to the nuclear periphery? Time-lapse microscopy would be able to reveal where the Tel1 foci form.

      Referees cross-commenting

      I have read and agree with the comments of the other reviewers.

      Significance

      The manuscript entitled "A CRISPR-Cas9-based system for the dose-dependent study of DNA double-strand breaks sensing and repair" by Coiffard and colleagues reports on a novel system to monitor the response to multiple DSBs in yeast by targetting Cas9 nuclease to Ty elements. The data is clearly presented, but there are several technical concerns and questions the conclusions of the manuscript. First of all, it is not clear how many DSB cells contain at any given time. The kinetics of DSB formation appears to be slow (hours) relative to the formation and progression of DNA repair foci (minutes). Tel1 foci are found primarily at the nuclear pheriphery, but given the slow kinetics and low number of foci, it is unclear whether the peripheral localization of Tel1 represents physiologically relevant repair and aberrant events where repair failed. Unfortunately, there is not assessment of the outcome of repair. Is the repair error-free or do the cells accummulate indels and/or structural changes at the Cas9 cleavage sites? Without this information, the assay will be of limited use for the scientific community.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Protein kinases play an important role in regulating cell division and the dynamics of the actomyosin ring both in yeast and mammalian cells. While this raises the possibility that regulated protein dephosphorylation may also affects the division dynamics, few experimental studies have been reported. In the current study, Chrupcala and Moseley screen 9 non-essential protein phosphatases and show that the conserved protein phosphatase PP2A-B56 regulates division plane positioning in fission yeast. The authors show that Par1 localizes to the division site and that cells lacking Par1 have impaired levels of regulators of actomyosin ring positioning, including the anillin-like protein Mid1. Interestingly, restoring wildtype Mid1 levels partially suppresses the division defects observed in the par1∆ mutant.

      The study is well-executed with attention to detail and careful phenotype quantification and analyses. The conclusions of the study are largely supported by experiments to yield the novel insights. This reviewer commends the authors on reporting the negative results regarding Mid1-3xSLiM mutant (Results section) and failure to detect physical interaction between Par1 and Mid1 (Discussion section). The authors provide an interesting discussion. Furthermore, they describe very relevant additional experiments but they restrict their execution to future studies that complement the current manuscript

      Major concern

      The authors show that par1∆ cells have impaired levels of Mid1, Cdc15 and Rga7, yet propose that Mid1 reduction is the principal cause of cytokinetic defects in par1∆ cells. Even though increase in Mid1 levels rescues par1∆ defects, it is possible that increased Mid1 levels suppress another primary defect (e.g. Rga7 increased levels). Thus, it would be interesting to perform the following two experiments: 1. The authors in the discussion say "we found that cytokinesis defects arise from decreased Mid1 levels" but this is not formally shown other than in par1∆ cells. Thus I would suggest monitor cytokinesis in cells with Mid1 levels directly reduced to levels comparable to those observed in par1∆ cells (as quantified in Fig 3B). Monitoring the effects of reduced Mid1 levels on Rga7 and Cdc15 would also be interesting. 2. Monitoring the levels of Rga7 and Cdc15 in par1∆ cells rescued by the second copy Mid1.

      Minor concern

      Even though definitive evidence for Par1 mechanism of Mid1 regulation might be difficult to obtain, the authors may choose to strengthen their work on the role of dephosphorylation at the actomyosin ring. For example, using the GFP-GFPnanobody pairing to force interaction between Mid1 and Par2 in par1∆ cells may provide support for dephosphorylation playing a more direct role in actomyosin ring positioning.

      Referees cross-commenting

      Consultation regarding Review 1:

      Reviewer 3 shares the interest in points 1,3,5.

      Relating to point 2: Not sure what the reviewer specifically wishes here.

      Relating to point 4: Extensive analyses of many cytokinetic proteins in par1∆ cells, and importance of their levels for cytokinesis, would be interesting but perhaps beyond the scope of this study. I believe it would suffice to monitor Cdc15, Bgs1 and Rga7 in the 2xMid1 rescue that authors performed.

      Consultation regarding Review 2: Reviewer 3 shares the interest in major points 1 (though authors do leave open the possibility that PP2A acts indirectly),3,4,5. Regarding point 2: This might be difficult to ascertain directly and instead it might suffice to show that Mid1 levels reduced to those observed in par1∆ phenocopy the par1 mutant's division defects.

      Significance

      The study is well-executed with novel findings on regulation of the cell division and the physiological roles of phosphatases. The study will benefit cell polarity and cell division research fields as well as researchers interested in roles of protein phosphatases.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this study, the authors investigates the functional relationship of Mid1, which plays a central role in the positioning of the contractile ring (CAR) in fission yeast, with protein phosphatases. Previous studies have shown that Mid1 is phosphorylated and modified by several kinases, but its relationship with phosphatases is not well understood except of Clp1. Therefore, the authors examined whether phosphorylase gene disruption strains were able to divide symmetrically. The authors found that cells lacking par1, which encodes the regulatory subunit of PP2A, frequently divided asymmetrically. Furthermore, they found that the par1 deletion mutant showed reduced protein levels of Mid1 in the cells. Furthermore, they showed that increasing the expression level of Mid1 can partially compensate for the mitotic abnormalities of the par1 strain. Since Par1 localizes to the division site, the authors also investigated the possibility of binding with Mid1, focusing on its SLiM motif. However, at this time, it is unclear whether direct binding of Par1 to Mid1 is necessary for Mid1 to exert its cellular functions, since no conspicuous cytokinesis abnormalities were observed in Mid1 with the mutation in the SLiM motif.

      The focus of the study to be interesting and the clarity of the overall argument of the manuscript to be almost adequate. However, the authors should investigate or mention the following points. Also, a more appropriate and convincing way of presenting experimental results is required.

      Major comments:

      1. Does Par1 (PP2A) work in the dephosphorylation of Mid1? Looking at the band pattern of the western blot of Mid1 in Fig. 3E, there are several bands. This band shift probably indicates phosphorylation of Mid1, but is there any difference in the band shift between the wild-type strain and the par1 gene disrupted strain? Mid1 is phosphorylated in a cell cycle-dependent manner. So, it would be interesting to examine the cell cycle-dependent phosphorylation pattern using synchronous culture.
      2. In Fig. 5A, par1- and ppa2-deficient strains show little nuclear localization of Mid1 in interphase cells. If little or no Mid1 enters the nucleus, could it be possible that the mitotic abnormality is not due to a reduction in Mid1 protein levels, but is directly due to an effect on the shuttling of Mid1 between the cell nucleus and cell surface?
      3. The photographs showing cellular localization of proteins, such as Figs. 3, 4, 5, 6B, and 6C, are not convincing. The authors should show a typical picture of the cell population as supplemental figures, rather than trimming a single cell.
      4. Is the Mid1-deficient strains impaired in the localization of Par1 to the division plane? As shown in Fig. 3A, the fluorescence of Cdc15 and Rga7 is predominantly enhanced in the par1 deletion strain (Fig. S2 also shows a protein with a significant difference in localization). The authors should consider these points and make more comprehensive discussion.
      5. Fig. 3D and part of Fig. 5B also use the same photograph. This raises the suspicion that these data were taken only once. If so, the authors should do replicated experiments to assure the authenticity of the data. In addition, perhaps it is a careless mistake, but this way of presenting data should not be allowed unless specifically mentioned in the manuscript.

      Minor point.

      Typos. p. 11 Mid1-13my levels,.

      Lack of uniformity in description p. 13 Methyl-2-benzimidazole and p. 15 methyl-2-benzimidazole In the figure of the paper Mid1-mNeonGreen and mid1-mNeonGreen

      Significance

      The importance of this study is that it deepens our knowledge of Mid1 protein of fission yeast, an important model organism for the study of cytokinesis. Although the possibility that Par1 is involved in the maintenance of Mid1 protein levels has been demonstrated, the molecular mechanism has not been clarified. It might be expected that anilline, which is similar to Mid1 in animal cells, might have a similar relationship to protein phosphatases, but there is no evidence for this. Therefore, at this moment, the effect of these findings may be limited to the fission yeast research community.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The study by Chrupcala and Moseley explores the role of the protein phosphatase PP2A-B56 in cytokinesis, particularly focusing on its regulatory subunit Par1 and its interaction with Mid1 in fission yeast (S. pombe). The authors report that in cells lacking Par1 (par1d), the contractile rings are mispositioned, and the levels of multiple cytokinetic proteins, including Mid1, are altered, leading to defects in cytokinesis. They suggest that the reduction in Mid1 levels is responsible for the observed cytokinetic defects and show that restoring Mid1 levels can partially rescue these defects. The study further demonstrates that Mid1 shuttling between the nucleus and cell membrane is impaired in par1d mutants. Thus, par1d seems to cause a range of defects, from altered protein levels to specifically impairing Mid1 functions. However, the study does not demonstrate a direct interaction between Par1 and Mid1, leaving the exact mechanism by which Par1 regulates Mid1 levels unclear.

      The manuscript is well-written and investigates the lesser-studied role of protein phosphatases in cytokinesis, which is of general interest. However, a major criticism is the lack of evidence supporting a direct role of PP2A-B56 and its regulatory subunit Par1 in cytokinesis. It is possible that the phenotype observed in par1d cells results from a global impairment of protein regulation rather than a specific effect on Mid1. Indirect factors, such as changes in cell size/shape, could also produce similar phenotypes.

      To strengthen the manuscript, the authors should consider the following points:

      1. Changes in cell size/shape can influence Mid1 levels and nucleus positioning. Many par1d cells shown in the manuscript have bulgy morphologies. The manuscript should address whether cell size/morphology differences contribute to the observed phenotypes.
      2. The quantification of nucleus positioning needs further consideration, as the positioning of the nucleus has been shown to be a function of cell size, etc.
      3. The manuscript proposes that Par1 acts by reducing Mid1 levels and that restoring Mid1 levels can rescue the defects. However, given the known mechanisms of Mid1 localization, it is unclear why a reduction in Mid1 levels alone would lead to cytokinetic defects. To establish that the defects are due to Mid1 reduction, the authors should demonstrate that lowering Mid1 levels (e.g., using a low-expression promoter) phenocopies the cytokinesis defects observed in par1d cells.
      4. As shown in par1d cells, other cytokinesis proteins have impaired levels. It should be investigated whether changes in the levels of other proteins (e.g., Cdc15, Rga7, Myp2, etc.) in par1d cells contribute to the cytokinetic defects. In general, it is not discussed how the par1d mutant can lead to differential levels of many cytokinesis proteins. Does the reduction of Mid1 explain this shift? It is conceivable that changes in the levels of other proteins could also produce similar cytokinesis defects.
      5. The manuscript would benefit from additional functional rescue experiments. For example, expressing a phosphatase-dead version of Par1 in par1d cells could help determine if its phosphatase activity is necessary for cytokinesis.

      Significance

      The study by Chrupcala and Moseley explores the role of the protein phosphatase PP2A-B56 in cytokinesis, particularly focusing on its regulatory subunit Par1 and its interaction with Mid1 in fission yeast (S. pombe). The authors report that in cells lacking Par1 (par1d), the contractile rings are mispositioned, and the levels of multiple cytokinetic proteins, including Mid1, are altered, leading to defects in cytokinesis. They suggest that the reduction in Mid1 levels is responsible for the observed cytokinetic defects and show that restoring Mid1 levels can partially rescue these defects. The study further demonstrates that Mid1 shuttling between the nucleus and cell membrane is impaired in par1d mutants. Thus, par1d seems to cause a range of defects, from altered protein levels to specifically impairing Mid1 functions. However, the study does not demonstrate a direct interaction between Par1 and Mid1, leaving the exact mechanism by which Par1 regulates Mid1 levels unclear.

      The manuscript is well-written and investigates the lesser-studied role of protein phosphatases in cytokinesis, which is of general interest. However, a major criticism is the lack of evidence supporting a direct role of PP2A-B56 and its regulatory subunit Par1 in cytokinesis. It is possible that the phenotype observed in par1d cells results from a global impairment of protein regulation rather than a specific effect on Mid1. Indirect factors, such as changes in cell size/shape, could also produce similar phenotypes.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to Reviewers

      We would like to thank all the reviewers for their thorough reading and helpful comments. Below, please find our point-by-point response. The reviewer comments received through ReviewCommons have not been altered except for formatting.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The authors extended the existing recombination-induced tag exchange (RITE) technology to show that they can image a subset of NPCs, improving signal-to-noise ratios for live cell imaging in yeast, and to track the stability or dynamics of specific nuclear pore proteins across multiple cell divisions. Further, the authors use this technology to show that the nuclear basket proteins Mlp1, Mlp2 and Pml39 are stably associated with "old NPCs" through multiple cell cycles. The authors show that the presence of Mlp1 in these "old NPCs" correlates with exclusion of Mlp1-positive NPCs from the nucleolar territory. A surprising result is that basket-less NPCs can be excluded from the non-nucleolar region, an observation that correlates with the presence of Nup2 on the NPC regardless of maturation state of the NPC. In support of the proposal that retention of NPCs via Mlp1 and Nup2 in non-nucleolar regions, simulation data is presented to suggest that basket-less NPCs diffuse faster in the plane of the nuclear envelope.

      However, there are some points that do need addressing:

      Major Points 1. Taking into account that the Nup2 result in Figure 4B forms the basis for one half of the proposed model in Figure 6 regarding the exclusion of NPCs from the nucleolar region of the NE, there is a relatively small amount of data in support of this finding and this proposed model. For example, the only data for Nup2 in the manuscript is a column chart in Figure 4B with no supporting fluorescence microscopy examples for any Nup2 deletion. Further, the Nup60 deletion mutant will have zero basket-containing NPCs, whereas the Nup2 deletion will be a mixture of basket-containing and basket-less NPCs. The only support for the localization of basket-containing NPCs in the Nup2 deletion mutant is through a reference "Since Mlp1-positive NPCs remain excluded from the nucleolar territory in nup2Δ cells (Galy et al., 2004), the homogenous distribution observed in this mutant must be caused predominantly by the redistribution of Mlp-negative NPCs into the nucleolar territory."

      As suggested by the reviewer, we have added fluorescence microscopy examples for the Nup2 deletion to new Figure 4D. In addition, we have added data on Nup1 as suggested by reviewer 3. Since we observed a significant effect on nucleolar NPC density also upon depletion of Nup1 (new Figure 4A), we have overall revised the text and model to now reflect the shared role of Nup1 and Nup2.

      We have also localized Mlp1-GFP in a nup2Δ background as well as in the Nup60ΔC background where Nup2 can no longer bind to the NPC. In both strains, Mlp1-containing NPCs remain excluded from the nucleolus as now shown in the new Figure 4E. Although we also observed partial Mlp1 mislocalization to a nuclear focus in the nup2Δ strain, such mislocalization was only minimal in the strain with the Nup2-binding domain in Nup60 deleted (nup60ΔC), supporting our conclusion that Nup2 contributes to nucleolar exclusion of NPCs independent of Mlp1. Similarly, Mlp1-positive NPCs remained excluded from the nucleolar territory in cells depleted of Nup1 (new Figure 4B).

      1. The authors could consider utilizing this opportunity to discuss their technological innovations in the context of the prior work of Onischenko et al., 2020. This work is referenced for the statement "RITE can be used to distinguish between old and new NPCs" Page 2, Line 43. However, it is not referenced for the statement "We constructed a RITE-cassette that allows the switch from a GFP-labelled protein to a new protein that is not fluorescently labelled (RITE(GFP-to-dark))" despite Onischenko et al., 2020 having already constructed a RITE-cassette for the GFP-to-dark transition. The authors could consider taking this opportunity to instead focus on their innovative approach to apply this technology to decrease the number of fluorescently-tagged NPCs by dilution across multiple cell divisions and to interpret this finding as a measure of the stability of nuclear pore proteins within the broader NPC.

      We apologize for this imprecise citation. We have modified the text to indicate that our RITE cassette was previously used in two publications. It now reads: "We used a RITE-cassette that allows the switch from a GFP-labelled protein to a new protein that is not fluorescently labelled (RITE(GFP-to-dark)) (Onischenko et al., 2020, Kralt et al., 2022)." Together with additional changes to the text throughout, we hope that our new manuscript version more clearly highlights the innovation of our approach relative to previous use cases.

      1. The authors could also consider taking this opportunity to discuss their results in the context of the Saccharomyces cerevisiae nuclear pore complex structures published e.g. in Kim et al., 2018, Akey et al., 2022, Akey et al., 2023 in which the arrangement of proteins in the nuclear basket is presented, and also work from the Kohler lab (Mészáros et al., 2015) on how the basket proteins are anchored to the NPC. There is additional literature that also might help provide some perspective to the findings in the current manuscript, such as the observation that a lesser amount of Mlp2 to Mlp1 observed is consistent with prior work (e.g. Kim et al., 2018) and that intranuclear Mlp1 foci are also formed after Mlp1 overexpression (Strambio-de-Castillia et al., 1999).

      Following the reviewer's suggestion, we extended our discussion of basket Nup stoichiometry and organization in the discussion section including most of the citations mentioned as well as the recent articles on the nuclear basket structure and organization (Stankunas & Köhler 2024 1038/s41556-024-01484-x, Singh et al. 2024 10.1016/j.cell.2024.07.020)

      Minor Points 1. What is the "lag time" of the doRITE switching? Do the authors believe that it is comparable to the approximate 1-hour timeframe following beta-estradiol induction as shown previously in Chen et al. Nucleic Acids Research, Volume 28, Issue 24, 15 December 2000, Page e108, https://doi.org/10.1093/nar/28.24.e108

      We thank the reviewer for suggesting we analyze the kinetics of RITE switching. We carried out quantitative real-time PCR on genomic DNA and found that the half-time of switching is below 20 min. The majority of the population is switched after 1 hour, similar to the results in Chen et al. This data is now included in Supplemental Figure 1A.

      1. The authors could consider a brief explanation of radial position (um) for the benefit of the reader, in Figures 1E (right panel) and 2B (right panel), perhaps using a diagram to make it easier to understand the X-axis (um).

      To address this, we have now included a diagram and refer to it in the figure legend and the text.

      1. In Figure 1G, would the authors consider changing the vertical axis title and the figure legend wording from "mean number of NPCs per cell" to "mean labeled NPC # per cell" to reflect that what is being characterized are the remaining GFP-bearing NPCs over time?

      Thank you for spotting this inaccuracy. We have changed the label to "mean # of labeled NPCs per cell".

      1. In Figure 2C, the magenta-labeled protein in the micrographs is not described in the figure or the legend.

      A description has been added in figure and legend.

      1. In Figure S2A, there is an arrow indicating a Nup159 focus, but this is not described in the figure legend, as is done in Figure 2C.

      A description has been added to the legend.

      1. In Figure S3C, the figure legend does not match the figure. Was this supposed to be designed like Figure 3C and is missing part of the figure? Or is the legend a typographical error?

      We apologize for this error and thank the reviewer for spotting it. The legend has been corrected (now Figure S4B).

      1. In Figure S4B, the spontaneously recombined RITE (GFP-to-dark) Nup133-V5 appears in the western blot as equally abundant to pre-recombined Nup133-V5-GFP. In the figure legend, this is explained as cells grown in synthetic media without selection to eliminate cells that have lost their resistance marker from the population. In Cheng et al. Nucleic Acids Res. 2000 Dec 15; 28(24): e108, Cre-EBD was not active in the absence of B-estradiol, despite galactose-induced Cre-EBD overexpression. Would the authors be able to comment further on the Cre-Lox RITE system in the manuscript?

      We note that also in the cited publication, cells are grown in the presence of selection to select (as stated in this publication) "against pre-excision events that occur because of low but measurable basal expression of the recombinase". Although the authors report that spontaneous recombination is reduced with the b-estradiol inducible system (compared to pGAL expression control of the recombinase only), they show negligible spontaneous recombination only within a two-hour time window. Indeed, we also observe low levels of uninduced recombination on a short timeframe, but occasional events can become significant in longer incubation times (e.g. overnight growth) in the absence of selection. It should be noted that in our system, Cre expression is continuously high (TDH3-promoter) and not controlled by an inducible GAL promoter. We have added the information about the promoter controlling Cre-expression in the methods section.

      1. In Figure 6, the authors may want to consider inverting the flow of the cartoon model to start from the wild type condition and apply the deletion mutations at each step to "arrive" at the mutant conditions, rather than starting with mutant conditions and "adding back" proteins.

      Following the suggestions of this reviewer as well as reviewer 3, we have modified our model to smore clearly represent the contributions of the different basket components.

      Reviewer #1 (Significance (Required)):

      Recent work has drawn attention to the fact that not all NPCs are structurally or functionally the same, even within a single cell. In this light, the work here from Zsok et al. is an important demonstration of the kind of methodologies that can shed light on the stability and functions of different subpopulations of NPCs. Altogether, these data are used to support an interesting and topical model for Nup2 and nuclear-basket driven retention of NPCs in non-nucleolar regions of the nuclear envelope.

      We thank the reviewer for this positive assessment of our work.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this study, Zsok et al. develop innovative methods to examine the dynamics of individual nuclear pore complexes (NPCs) at the nuclear envelope of budding yeast. The underlying premise is that with the emergence of biochemically distinct NPCs that co-exist in the same cell, there is a need to develop tools to functionally isolate and study them. For example, there is a pool of NPCs that lack the nuclear basket over the nucleolus. Although the nature of this exclusion has been investigated in the past, the authors take advantage of a modification of recombination induced tag exchange (RITE), the slow turnover of scaffold nups, the closed mitosis of budding yeast, and extensive high quality time lapse microscopy to ultimately monitor the dynamics of individual NPCs over the nucleolus. By leveraging genetic knockout approaches and auxin-induced degradation with sophisticated quantitative and rigorous analyses, the authors conclude that there may be two mechanisms dependent on nuclear basket proteins that impact nucleolar exclusion. They also incorporate some computational simulations to help support their conclusions. Overall, the data are of the highest quality and are rigorously quantified, the manuscript is well written, accessible, and scholarly - the conclusions are thus on solid footing.

      We thank the reviewer for this assessment.

      Reviewer #2 (Significance (Required)):

      I have no concerns about the data or the conclusions in this manuscript. However, the significance is not overly clear as there is no major conceptual advance put forward, nor is there any new function suggested for the NPCs over nucleoli. As NPCs are immobile in metazoans, the significance may also be limited to a specialized audience.

      We respectfully disagree with this assessment. First, our work demonstrates the use of a novel approach in the application of RITE that can be useful for other researchers in the field of NPC biology and beyond. For example, doRITE could be applied to study the properties of aged NPCs, an area of considerable interest due to links between the NPC and age-related neurodegenerative diseases.

      Second, we characterize the interaction between conserved nuclear components, the NPC, the nucleolus and chromatin. While the specific architecture of the nucleus varies between species, many of these interactions are conserved. For example, Nup2's homologue Nup50 also interacts with chromatin in other systems, including mammalian cells, and thus may contribute to regulating the interplay between the nuclear basket and adjoining chromatin. This adds to our understanding of the multiple pathways and interactions that contribute to nuclear organization. Therefore, although the depletion of NPCs from the nucleolar territory in budding yeast may not be of direct importance, understanding the relationships between NPCs and their environment provide insight about nuclear organization throughout different eukaryotic lineages.

      In the revised manuscript, we attempt to better highlight and discuss these aspects.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The manuscript of Zsok et al. describes the role of nuclear basket proteins in the distribution and mobility of nuclear pore complexes in budding yeast. In particular, the authors showed that the doRITE approach can be used for the analysis of stable and dynamically associated NUPs. Moreover, it can distinguish individual NUPs and follow the inheritance of individual NPCs from mother to daughter cells. The author's findings highlight that Mlp1, Mlp2, and Pml39 are stably associated with the nuclear pore; deletion of Mlp1-Mlp2 and Nup60 leads to the higher NPC density in the nucleolar territory; and NPCs exhibit increased mobility in the absence of the nuclear basket components.

      The manuscript contains most figures supporting the data, and data supports the conclusions. However, authors need to include better explanations for figures in the text and figure legends. Lack of detailed explanation can pose challenges for non-experts. In addition, the authors jump over figures and shuffle them through the manuscript, which disrupts the flow and coherence of the manuscript.

      We thank the reviewer for pointing this out. In response to the detailed comments given below, we have moved some figures and added more explicit explanations to the text to improve the flow and make it easier to follow. In addition, we have modified the figure legends throughout the manuscript to make them more accessible to the reader.

      Major comments: - The nuclear basket contains Nup1, Nup2, Nup60, Mlp1, and Mlp2 in yeast. Nup60 works as a seed for Mlp1/Mlp2 and Nup2 recruitment and plays a key role in the assembly of nuclear pore basket scaffold (PMID: 35148185). Logically, the authors focused primarily on Nup60 in the current manuscript. However, NUP153 has another ortholog of yeast - Nup1, which has not been studied in this work. I recommend adjusting the title of the manuscript to: Nup60 and Mlp1/Mlp2 regulate the distribution and mobility of nuclear pore complexes in budding yeast. I also suggest discussing why work on Nup1 was not included/performed in the manuscript.

      We thank the reviewer for suggesting we should test the role of Nup1. Although we had originally not considered it, since we were focusing on the interactors of Mlp1/2, we found that indeed Nup1 also contributes to nucleolar exclusion. We have therefore changed the title to "Nuclear basket proteins regulate the distribution and mobility of nuclear pore complexes in budding yeast".

      • Figure 2B: I suggest choosing a more representative image for Pml39. It looks not like a stable component but rather dynamic as NUP60 or Gle1 based on figure showed in Figure 2B.

      We thank the reviewer for pointing out this poor choice of panel. We selected a panel for the 14h timepoint that more clearly shows that individual foci can still be seen for Pml39 after this time. Due to its lower copy number, the foci are dimmer for Pml39 than the other stable Nups. Nevertheless, at both the 11 and 14 h timepoint, clear dots can be detected for Pml39, while e.g. Nup116 in the same figure exhibits a more distributed signal and the signal for Nup60 and Gle1 is no longer visible.

      • Depletion of AID-tagged proteins needs to be supported by Western blot analysis with protein-specific antibodies, and PCR results should be included in supplementary data to demonstrate the homozygosity of the strains.

      The correct genomic tagging of the depleted proteins by AID was confirmed by PCR. We include this PCR analysis for the reviewer below. Since we are working with haploid yeast cells, all strains only carry a single copy of the genes. Unfortunately, we do not have protein-specific antibodies against the depleted proteins. However, other phenotypes support the successful depletion of the protein: Mlp1-mislocalization upon Nup60 depletion, reduced transcript production in Pol II depletion (characterized previously: PMID: 31753862, PMID: 36220102), growth defect upon Nup1 depletion.

      • Figure 5B: Snapshots of images from the movie are required. There are no images, only quantifications.

      We have replaced the supplemental movie with a movie showing the detection by Trackmate as well as overlaid tracks. As requested, a snapshot of this movie was inserted in figure 5B. We have also moved the example tracks from the supplement to the main figure. Furthermore, we will deposit the tracking dataset in the ETH Research Collection to make it available to the community.

      Description of figure legends is more technical than supporting/explaining the figure. For example, below my suggestions for Figure 1D. Please, consider more detailed explanation for other figures. (D) Left: Schematic of the RITE cassette. NUP of interest is tagged with V5 tag and eGFP fluorescent protein where LoxP sites flank eGFP. Before the beta-estradiol-induced recombination, the old NPCs are marked with eGFP signal, whereas new NPCs lack an eGFP signal after the recombination. ORF: open reading frame; V5: V5-tag; loxP: loxP recombination site; eGFP: enhanced green fluorescent protein. Right: doRITE assay schematic of stable or dynamic Nup behavior over cell divisions in yeast after the recombination.

      We have modified the figure legends throughout the manuscript to make them more explanatory and helpful for the reader.

      In addition, I recommend highlighting the result in the title of the figures. Please, re-consider titles for Figure S3.

      We have split this figure to better group related results. The new figures S4 and S5 are entitled: " A RITE(dark-to-GFP) cassette to visualize newly assembled NPC. " and "Mlp1 truncations localize predominantly to non-nucleolar NPCs."

      Minor: P.1 Line 31. Extra period symbol before the "(Figure 1A)".

      Fixed

      P.2 Line 10. Inconsistent writing of PML39 and MLP1. Both genes are capitalized. The same for P.4 Line 16. In some cases all letters are capitalized in other only the first one.

      We are following the official yeast gene nomenclature by spelling gene names in italicized capitals and protein names with only the first letter capitalized. We are sorry that this can be confusing for readers more familiar with other model systems.

      P.2 Line 18-22. The sentence is too long and hard to read. I recommend splitting it into two sentences.

      We agree and have fixed this.

      P.2-3 Line 46-47. The sentence is unclear. Suggestion: We expected that successive cell divisions would dilute the signal of labelled and stably associated with the NPC nucleoporins. By contrast, ...

      We have modified the sentence to read: "When tagging a Nup that stably associates with the NPC, we expected that successive cell divisions would dilute labelled NPCs by inheritance to both mother and daughter cells leading to a low density of labelled NPCs. By contrast,..."

      P.4 Line 17-21. Please, consider adding extra information and clarifying lines 19-21. For example, in Line 19 Figure 2B you can add that the reader needs to compare row 1 and row 4.

      Thank you, we have fixed this as suggested.

      P. 5 Line 15. When a number begins a sentence, that number should always be spelled out. You can pe-phrase the sentence to avoid it. Also, I recommend adding an explanation/hypothesis of why new NPCs are less frequently detected in nucleolar territory.

      We have formatted the text. Interestingly, new NPCs are more frequently detected in the nucleolar territory than old NPCs. We have reformulated this section to make it clearer, also in response to the next comment.

      P.5 Line 17-22. I recommend re-phrasing these two sentences. Logically, it is clear that Mlp1/Mlp2 loss mimics "old NPCs" to look more like "new NPCs", and for that reason, they are more frequently included in the nucleolar territory, but it is not clear when you read these two sentences from the first time.

      We have reformulated this section to make it clearer.

      P6. Line 16. No figure supporting data on graph (Figure 3B).

      We have added fluorescent images of the nup2Δ strain to the figure (new Figure 4D).

      P.7 Line 10-13. The sentence is unclear.

      We have shortened the sentence and moved part of the content to the discussion in the next paragraph.

      P.13,14 etc. If 0h timepoint has been used for normalization, why is it present on the graph?

      The 0h timepoint is shown for comparison and to illustrate the standard deviation in the data.

      P.15. Line 32-33. There is no image here. Potentially wrong description of the figure.

      Thank you for spotting this. This was fixed (new Figure S4B).

      Figures: - Inconsistent labeling of figures. For example, Fig.1, Fig.1S, Figure 2 etc.

      Thank you, this has been corrected.

      • Inconsistent labeling of figures. For example, Fig.1 G "mean number of NPCs per cell" - no capitalization of the first letter. Fig.1S "Fraction in population" is capitalized. In general, titles of axis should be capitalized.

      Thank you for spotting this. This was fixed.

      Suggestions for Figure 1D and Figure 6 are attached as a separate file.

      We thank the reviewer for their suggestions to improve these figures. We have taken their recommendation and revised the figures accordingly (see also response to reviewer 1, minor point 8).

      Reviewer #3 (Significance (Required)):

      Zsok et al. used the recombination-induced tag exchange (RITE) approach, which is an interesting and powerful method to follow individual NUPs over time with respect to their localization and abundance. This approach has been used before in PMID: 36515990 to distinguish pre-existing and newly synthesized Nup2 populations and has been extended to other basket NUPs in this work. Using this method, the authors support the earlier data on basket nucleoporins and highlight new insights on how basket nucleoporins regulate NPCs distribution and mobility. Overall, the manuscript provides new details on the stability of nucleoporins in yeast and how these data align with the mass spectrometry and FRAP data performed earlier in other studies. The limitation of this study is the absence of data on Nup1. It was unclear why these data were not present. Additional data can be included on the dynamics of Pml39, for example, using the FRAP method. The dynamic of Pml39 at the pore was shown only using the doRITE method.

      As suggested, we have tested the role of Nup1 (see above).

      Unfortunately, we are not able to provide orthologous data for the dynamics of Pml39. As we discuss in the manuscript, FRAP is not suitable for the analysis of the dynamics of most nucleoporins in yeast due to the high lateral mobility of NPCs in the nuclear envelope and has previously generated misleading results for Mlp1. Furthermore, the low expression levels of Pml39 will make it difficult to obtain reliable FRAP curves for this protein. We therefore do not think that adding FRAP experiments with Pml39 will provide valuable insight.

      However, in addition to the Pml39 doRITE result itself, our observation that the Pml39-dependent pool of Mlp1 exhibits stable association with the NPC supports the interpretation of Pml39 as a stable protein as well.

      In general, this study represents a unique research study of basic research on nuclear pore proteins that will be of general interest to the nuclear transport field.

      Field of expertise: nuclear-cytoplasmic transport, nuclear pore, inducible protein degradation. I do not have sufficient expertise in ExTrack.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript of Zsok et al. describes the role of nuclear basket proteins in the distribution and mobility of nuclear pore complexes in budding yeast. In particular, the authors showed that the doRITE approach can be used for the analysis of stable and dynamically associated NUPs. Moreover, it can distinguish individual NUPs and follow the inheritance of individual NPCs from mother to daughter cells. The author's findings highlight that Mlp1, Mlp2, and Pml39 are stably associated with the nuclear pore; deletion of Mlp1-Mlp2 and Nup60 leads to the higher NPC density in the nucleolar territory; and NPCs exhibit increased mobility in the absence of the nuclear basket components.

      The manuscript contains most figures supporting the data, and data supports the conclusions. However, authors need to include better explanations for figures in the text and figure legends. Lack of detailed explanation can pose challenges for non-experts. In addition, the authors jump over figures and shuffle them through the manuscript, which disrupts the flow and coherence of the manuscript.

      Major comments:

      • The nuclear basket contains Nup1, Nup2, Nup60, Mlp1, and Mlp2 in yeast. Nup60 works as a seed for Mlp1/Mlp2 and Nup2 recruitment and plays a key role in the assembly of nuclear pore basket scaffold (PMID: 35148185). Logically, the authors focused primarily on Nup60 in the current manuscript. However, NUP153 has another ortholog of yeast - Nup1, which has not been studied in this work. I recommend adjusting the title of the manuscript to: Nup60 and Mlp1/Mlp2 regulate the distribution and mobility of nuclear pore complexes in budding yeast. I also suggest discussing why work on Nup1 was not included/performed in the manuscript.
      • Figure 2B: I suggest choosing a more representative image for Pml39. It looks not like a stable component but rather dynamic as NUP60 or Gle1 based on figure showed in Figure 2B.
      • Depletion of AID-tagged proteins needs to be supported by Western blot analysis with protein-specific antibodies, and PCR results should be included in supplementary data to demonstrate the homozygosity of the strains.
      • Figure 5B: Snapshots of images from the movie are required. There are no images, only quantifications.
      • Description of figure legends is more technical than supporting/explaining the figure. For example, below my suggestions for Figure 1D. Please, consider more detailed explanation for other figures. (D) Left: Schematic of the RITE cassette. NUP of interest is tagged with V5 tag and eGFP fluorescent protein where LoxP sites flank eGFP. Before the beta-estradiol-induced recombination, the old NPCs are marked with eGFP signal, whereas new NPCs lack an eGFP signal after the recombination. ORF: open reading frame; V5: V5-tag; loxP: loxP recombination site; eGFP: enhanced green fluorescent protein. Right: doRITE assay schematic of stable or dynamic Nup behavior over cell divisions in yeast after the recombination.

      In addition, I recommend highlighting the result in the title of the figures. Please, re-consider titles for Figure S3.

      Minor:

      P.1 Line 31. Extra period symbol before the "(Figure 1A)".

      P.2 Line 10. Inconsistent writing of PML39 and MLP1. Both genes are capitalized. The same for P.4 Line 16. In some cases all letters are capitalized in other only the first one.

      P.2 Line 18-22. The sentence is too long and hard to read. I recommend splitting it into two sentences.

      P.2-3 Line 46-47. The sentence is unclear. Suggestion: We expected that successive cell divisions would dilute the signal of labelled and stably associated with the NPC nucleoporins. By contrast, ...

      P.4 Line 17-21. Please, consider adding extra information and clarifying lines 19-21. For example, in Line 19 Figure 2B you can add that the reader needs to compare row 1 and row 4.

      P. 5 Line 15. When a number begins a sentence, that number should always be spelled out. You can pe-phrase the sentence to avoid it. Also, I recommend adding an explanation/hypothesis of why new NPCs are less frequently detected in nucleolar territory.

      P.5 Line 17-22. I recommend re-phrasing these two sentences. Logically, it is clear that Mlp1/Mlp2 loss mimics "old NPCs" to look more like "new NPCs", and for that reason, they are more frequently included in the nucleolar territory, but it is not clear when you read these two sentences from the first time.

      P6. Line 16. No figure supporting data on graph (Figure 3B).

      P.7 Line 10-13. The sentence is unclear.

      P.13,14 etc. If 0h timepoint has been used for normalization, why is it present on the graph?

      P.15. Line 32-33. There is no image here. Potentially wrong description of the figure.

      Figures:

      • Inconsistent labeling of figures. For example, Fig.1, Fig.1S, Figure 2 etc.
      • Inconsistent labeling of figures. For example, Fig.1 G "mean number of NPCs per cell" - no capitalization of the first letter. Fig.1S "Fraction in population" is capitalized. In general, titles of axis should be capitalized.

      Suggestions for Figure 1D and Figure 6 are attached as a separate file.

      Significance

      Zsok et al. used the recombination-induced tag exchange (RITE) approach, which is an interesting and powerful method to follow individual NUPs over time with respect to their localization and abundance. This approach has been used before in PMID: 36515990 to distinguish pre-existing and newly synthesized Nup2 populations and has been extended to other basket NUPs in this work. Using this method, the authors support the earlier data on basket nucleoporins and highlight new insights on how basket nucleoporins regulate NPCs distribution and mobility. Overall, the manuscript provides new details on the stability of nucleoporins in yeast and how these data align with the mass spectrometry and FRAP data performed earlier in other studies. The limitation of this study is the absence of data on Nup1. It was unclear why these data were not present. Additional data can be included on the dynamics of Pml39, for example, using the FRAP method. The dynamic of Pml39 at the pore was shown only using the doRITE method.

      In general, this study represents a unique research study of basic research on nuclear pore proteins that will be of general interest to the nuclear transport field.

      Field of expertise: nuclear-cytoplasmic transport, nuclear pore, inducible protein degradation. I do not have sufficient expertise in ExTrack.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this study, Zsok et al. develop innovative methods to examine the dynamics of individual nuclear pore complexes (NPCs) at the nuclear envelope of budding yeast. The underlying premise is that with the emergence of biochemically distinct NPCs that co-exist in the same cell, there is a need to develop tools to functionally isolate and study them. For example, there is a pool of NPCs that lack the nuclear basket over the nucleolus. Although the nature of this exclusion has been investigated in the past, the authors take advantage of a modification of recombination induced tag exchange (RITE), the slow turnover of scaffold nups, the closed mitosis of budding yeast, and extensive high quality time lapse microscopy to ultimately monitor the dynamics of individual NPCs over the nucleolus. By leveraging genetic knockout approaches and auxin-induced degradation with sophisticated quantitative and rigorous analyses, the authors conclude that there may be two mechanisms dependent on nuclear basket proteins that impact nucleolar exclusion. They also incorporate some computational simulations to help support their conclusions. Overall, the data are of the highest quality and are rigorously quantified, the manuscript is well written, accessible, and scholarly - the conclusions are thus on solid footing.

      Significance

      I have no concerns about the data or the conclusions in this manuscript. However, the significance is not overly clear as there is no major conceptual advance put forward, nor is there any new function suggested for the NPCs over nucleoli. As NPCs are immobile in metazoans, the significance may also be limited to a specialized audience.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors extended the existing recombination-induced tag exchange (RITE) technology to show that they can image a subset of NPCs, improving signal-to-noise ratios for live cell imaging in yeast, and to track the stability or dynamics of specific nuclear pore proteins across multiple cell divisions. Further, the authors use this technology to show that the nuclear basket proteins Mlp1, Mlp2 and Pml39 are stably associated with "old NPCs" through multiple cell cycles. The authors show that the presence of Mlp1 in these "old NPCs" correlates with exclusion of Mlp1-positive NPCs from the nucleolar territory. A surprising result is that basket-less NPCs can be excluded from the non-nucleolar region, an observation that correlates with the presence of Nup2 on the NPC regardless of maturation state of the NPC. In support of the proposal that retention of NPCs via Mlp1 and Nup2 in non-nucleolar regions, simulation data is presented to suggest that basket-less NPCs diffuse faster in the plane of the nuclear envelope.

      However, there are some points that do need addressing:

      Major Points

      1. Taking into account that the Nup2 result in Figure 4B forms the basis for one half of the proposed model in Figure 6 regarding the exclusion of NPCs from the nucleolar region of the NE, there is a relatively small amount of data in support of this finding and this proposed model. For example, the only data for Nup2 in the manuscript is a column chart in Figure 4B with no supporting fluorescence microscopy examples for any Nup2 deletion. Further, the Nup60 deletion mutant will have zero basket-containing NPCs, whereas the Nup2 deletion will be a mixture of basket-containing and basket-less NPCs. The only support for the localization of basket-containing NPCs in the Nup2 deletion mutant is through a reference "Since Mlp1-positive NPCs remain excluded from the nucleolar territory in nup2Δ cells (Galy et al., 2004), the homogenous distribution observed in this mutant must be caused predominantly by the redistribution of Mlp-negative NPCs into the nucleolar territory."
      2. The authors could consider utilizing this opportunity to discuss their technological innovations in the context of the prior work of Onischenko et al., 2020. This work is referenced for the statement "RITE can be used to distinguish between old and new NPCs" Page 2, Line 43. However, it is not referenced for the statement "We constructed a RITE-cassette that allows the switch from a GFP-labelled protein to a new protein that is not fluorescently labelled (RITE(GFP-to-dark))" despite Onischenko et al., 2020 having already constructed a RITE-cassette for the GFP-to-dark transition. The authors could consider taking this opportunity to instead focus on their innovative approach to apply this technology to decrease the number of fluorescently-tagged NPCs by dilution across multiple cell divisions and to interpret this finding as a measure of the stability of nuclear pore proteins within the broader NPC.
      3. The authors could also consider taking this opportunity to discuss their results in the context of the Saccharomyces cerevisiae nuclear pore complex structures published e.g. in Kim et al., 2018, Akey et al., 2022, Akey et al., 2023 in which the arrangement of proteins in the nuclear basket is presented, and also work from the Kohler lab (Mészáros et al., 2015) on how the basket proteins are anchored to the NPC. There is additional literature that also might help provide some perspective to the findings in the current manuscript, such as the observation that a lesser amount of Mlp2 to Mlp1 observed is consistent with prior work (e.g. Kim et al., 2018) and that intranuclear Mlp1 foci are also formed after Mlp1 overexpression (Strambio-de-Castillia et al., 1999).

      Minor Points

      1. What is the "lag time" of the doRITE switching? Do the authors believe that it is comparable to the approximate 1-hour timeframe following beta-estradiol induction as shown previously in Chen et al. Nucleic Acids Research, Volume 28, Issue 24, 15 December 2000, Page e108, https://doi.org/10.1093/nar/28.24.e108
      2. The authors could consider a brief explanation of radial position (um) for the benefit of the reader, in Figures 1E (right panel) and 2B (right panel), perhaps using a diagram to make it easier to understand the X-axis (um).
      3. In Figure 1G, would the authors consider changing the vertical axis title and the figure legend wording from "mean number of NPCs per cell" to "mean labeled NPC # per cell" to reflect that what is being characterized are the remaining GFP-bearing NPCs over time?
      4. In Figure 2C, the magenta-labeled protein in the micrographs is not described in the figure or the legend.
      5. In Figure S2A, there is an arrow indicating a Nup159 focus, but this is not described in the figure legend, as is done in Figure 2C.
      6. In Figure S3C, the figure legend does not match the figure. Was this supposed to be designed like Figure 3C and is missing part of the figure? Or is the legend a typographical error?
      7. In Figure S4B, the spontaneously recombined RITE (GFP-to-dark) Nup133-V5 appears in the western blot as equally abundant to pre-recombined Nup133-V5-GFP. In the figure legend, this is explained as cells grown in synthetic media without selection to eliminate cells that have lost their resistance marker from the population. In Cheng et al. Nucleic Acids Res. 2000 Dec 15; 28(24): e108, Cre-EBD was not active in the absence of B-estradiol, despite galactose-induced Cre-EBD overexpression. Would the authors be able to comment further on the Cre-Lox RITE system in the manuscript?
      8. In Figure 6, the authors may want to consider inverting the flow of the cartoon model to start from the wild type condition and apply the deletion mutations at each step to "arrive" at the mutant conditions, rather than starting with mutant conditions and "adding back" proteins.

      Significance

      Recent work has drawn attention to the fact that not all NPCs are structurally or functionally the same, even within a single cell. In this light, the work here from Zsok et al. is an important demonstration of the kind of methodologies that can shed light on the stability and functions of different subpopulations of NPCs. Altogether, these data are used to support an interesting and topical model for Nup2 and nuclear-basket driven retention of NPCs in non-nucleolar regions of the nuclear envelope.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Evidence, reproducibility, and clarity

      Singh et al. analyze the expression and putative contribution of TEs in CD4+ T cells in HIV elite controllers. Through re-analysis of existing datasets, the authors describe broad differences in expression of TEs in ECs through analysis of RNA-seq and ATAC-seq data and come up with convincing examples where differentially-expressed innate immune genes correlate with increased accessibility of proximal TEs. Overall, the authors' conclusions are appropriately measured, though the manuscript text should be re-organized for clarity and a few further analyses are needed to support the main message of the paper.

      Major comments

      The manuscript would benefit from a re-organization of the figures to focus on TEs - in particular, Fig 1B, Fig 2, and Fig 3 reproduce known transcriptional differences between ECs and HCs and serve as quality controls for the authors' computational analysis. Conversely, Supplementary Fig 6 contains very interesting data on KZNF expression and should be included in the main figures.

      Authors: Thank you for the suggestion. We agree that Figure S6 should be featured more prominently in the manuscript. Accordingly, we have now incorporated it into the main text as Figure 6. The TE-KZNF correlation plots, previously Figure 5C, have been relocated to this new figure to provide a cohesive presentation of all KZNF-related data within the same figure.

      We’ve chosen to keep Figures 1B, 2, and 3 in their original places. We contend that they provide a foundational view of transcriptional variances in gene expression between patient groups, encompassing both previously identified and novel DEGs, which we believe warrants their placement in the main text. Furthermore, they serve as robust quality control measures for subsequent TE-centric transcriptional analyses. Given that there is no limitation in the number of figures in Genome Biology articles, we think it’s adequate to retain them as main figures.

      It remains unclear whether differences in TE expression described are specific to ECs or to EC-like CD4+ T cell states. As there are plenty of datasets available that compare the transcriptome of naïve, activated, exhausted, and regulatory CD4+ T cells, the authors should compare the TE expression patterns observed in ECs to activated CD4+ T cells, particularly those with a Th1 and cytotoxic phenotype analogous to those observed in ECs, from healthy donors.

      Authors: We thank the reviewer for this constructive suggestion to further study the foundations of HIV-1 elite control. In our initial study, we demonstrate that PBMCs from elite controllers (ECs) exhibit a heightened proportion of activated CD4+ T cells compared to PBMCs of healthy controls (HCs) and a heightened proportion of macrophages, naïve CD4+ T cells, and NK cells compared to PBMCs of treatment-naïve viremic progressors (VPs) (Figure 2D). Additionally, through clustering analysis of deconvoluted CD4+ T cell samples from elite controllers, we ascertain that the clustering pattern is not predicated on the CD4+ T cell subtype (Figure 3B). To further explore the reviewer’s inquiry, we compared the TE expression profile of ECs with that of unstimulated and stimulated CD4+ T cell subsets from HCs (data source: PMID 31570894), integrated into the revised manuscript as Figure S3B.

      “Unsupervised clustering of these samples shows that the TE expression pattern of ECs is most similar to that of Th2 progenitor cells, which are associated with HIV-1-specific adaptive immune responses (61). Still, we observed that, for the majority of families, TE expression was higher on average in all EC CD4+ T cell subsets than in CD4+ T cell subsets from HCs, regardless of stimulation (Figure S3B). While a subset of TE families exhibited an expression pattern in ECs similar to that of activated CD4+ T cells of HCs (e.g., high expression of L1s and THE1B), multiple TE families appear to be upregulated in an EC-specific way (e.g., LTR12C and LTR7). Together, these findings underscore the unique immune cell composition, transcriptome, and retrotranscriptome of ECs.” [pg.13-14, L226-235]

      While these observations are interesting, pursuing this question further falls beyond the scope of our study, as we note in the Discussion of the revised manuscript. We believe the reviewer’s inquiry pertains to a distinct research question, namely whether the potential for elite control of HIV-1 infection manifests as a detectable phenotype pre-infection within healthy CD4 T cell subsets (i.e., EC-like CD4+ T cell states) or is a unique phenotype that emerges solely after HIV-1 infection.

      “Another outstanding question is whether the gene and TE signatures revealed by our analysis of ECs exist in the general population independent of HIV-1 infection or if they are driven by the initial infection. While this inquiry is beyond the scope of this study, we have presented here evidence of common TE signatures between EC CD4+ T cells and Th2 progenitors from HCs (Figure S3B) and established that ECs possess a unique CD4+ T cell retrotranscriptome with potential implications for natural HIV-1 control. Future studies designed to assess elite control prediction should explore whether these TE profiles can serve as predictive variables for whether an individual displays enhanced viral control.” [pg. 38, L663-671]

      Therefore, while we appreciate the reviewer's suggestion and offer the addition of these preliminary findings, we believe that further investigation would be better suited for future studies specifically designed to address that question. Our manuscript aims to provide insight into the retrotranscriptome dynamics in ECs and their potential implications for natural HIV-1 control.

      In Fig 1, the authors demonstrate differential expression of both innate immune genes and TEs, but the link between the two is unclear. Is there any enrichment in differential expression for TEs located proximal to innate immune genes? This type of analysis should be possible using the authors' own software to map TE expression to specific genomic loci.

      __Authors: __Thank you for this excellent question. To answer this inquiry, we used the paired ATAC-seq and RNA-seq datasets for from ECs and HCs (used in Figures 1 and 4) to produce a new list of TE-gene pairs on which we could perform gene set enrichment analysis, the results of which have been integrated into the revised manuscript as Figure 4A.

      “We used paired ATAC-seq – which measures chromatin accessibility – and RNA-seq datasets for ECs (n=4) and HCs (n=4) to create a list of TE-gene pairs where the TE locus and gene show increased accessibility and expression, respectively, in ECs compared to HCs (Table S7, see Methods for details). These loci and genes were paired based on proximity, with a maximum distance of 10kb between the TE locus and the gene’s transcription start site, to increase the likelihood of a direct cis-regulatory influence of the TE over the nearby gene. Subsequent gene set enrichment analysis revealed that these genes were predominantly involved in cellular activation, cytokine production, and immune response regulation (Figure 4A). The enrichment for differential accessibility of TE loci near genes involved in these pathways suggests that the distinct TE landscape observed in ECs may contribute significantly to a unique immune regulome in these individuals.” [pg. 21, L357-368]

      Thus, we conclude that yes, there is an enrichment for immune-related genes with higher expression in ECs, proximal to differentially accessible TEs. We highlight six of these TE-gene pairs in Figure 4B-C. While we have high confidence in our analyses, future experimental validation is needed to confirm these regulatory relationships.

      Optional: In Fig 3, the authors cluster CD4+ T cells based on transcriptomic profiles. It would be interesting to re-cluster these samples based on TE expression alone, given the differences in TE expression described in Fig 5.

      __Authors: __Thank you for the suggestion. We agree that it would be valuable to assess how the EC clustering is altered when considering TE expression alone, as opposed to combining gene and TE family expression. To address this, we used the same graph-based k-nearest neighbors method to re-cluster the EC CD4+ T cell RNA-seq samples based only on locus-level TE expression, integrated into the revised manuscript as Figure S7.

      “To further explore locus-level expression patterns, we re-clustered the same EC samples (n=128) using only locus-level TE expression. This again resolved four EC clusters (Figure S7A), which interestingly appeared even more distinct than those identified by gene and TE family expression (Figure 3A). The TE locus-based clusters (TL-Cs) aligned well with the gene and TE family clusters (GT-Cs), with an average 70% overlap in samples between each GT-C and its corresponding TL-C (Figure S7B), indicating high consistency (Table S8). The remaining 30% of samples that shifted between clusters did so consistently within individuals, not cohorts, maintaining heterogeneous TL-C compositions similar to the GT-Cs (Figures S7C & S5A). An exception to this heterogeneity was TL-C4, comprising 22 samples from GT-C1 that were almost entirely from the CD4+ T cell subsets of only four participants in the Jiang cohort (Figure S7C, Table S8). No other samples from the Jiang cohort shifted to this cluster from other GT-Cs, suggesting that these patterns reflect individual variation rather than cohort bias. Like the GT-Cs, each TL-C included samples from all five CD4+ T cell subsets and was largely heterogeneous (Figure S7C). Notably, TL-C2 mirrored corresponding GT-C3 in its overrepresentation of EM and TM cells, while TL-C1 uniquely showed an overrepresentation of naïve CD4+ T cells. Beyond sample composition, each TL-C was characterized by a unique pattern of expressed TE loci (Figure S7D). These signatures were heterogeneous across families, with subsets of variable loci from one TE family marking separate clusters (Figure S7E), some of which did not reach the threshold of significance in earlier analyses when analyzed at the family-level, like SVA-D. Many families maintained their cluster-specific signatures, like THE1B (a marker of GT-C2), for which the majority of variable loci were found in corresponding TL-C1. However, some TE families, like the L1s that marked GT-C1, showed more heterogeneous signatures with variable loci marking multiple TL-Cs. These findings underscore the need for future locus-level investigations with high-depth sequencing to fully capture the complexity of TE expression.” [pg. 27-28, L462-488]

      We believe these findings not only validate the distinct clustering patterns observed but also highlight the potential of locus-level TE analysis to reveal additional layers of retrotranscriptomic diversity in EC CD4+ T cells.

      Significance

      The manuscript by Singh et al. describes for the first time the role of TEs in HIV elite controllers, suggesting that TEs may be co-opted for cis-regulatory function. This study builds off prior work demonstrating that HIV-infected CD4+ T cells activate LTR elements that may regulate the expression of interferon-inducible genes, demonstrating that ECs show further upregulation of innate immune genes. While these findings will need to be experimentally validated, this study constitutes a useful resource and adds to the growing body of evidence implicating TEs in cis-regulatory control of immune genes. This study will be of interest to basic scientists interested in genetic mechanisms of HIV control, and if further developed may comprise a useful source of biomarkers to predict viral kinetics in HIV-infected individuals. My expertise is in immunology, TE biology, and viral infection.

      Authors: We greatly appreciate this positive evaluation of our manuscript and recognition of its significance in uncovering novel evidence of TE co-option for immune regulatory function in HIV-1 elite control, as well as the suggestion of promising avenues for future research in this field.

      Reviewer #2

      Evidence, reproducibility and clarity

      The authors have re-analyzed published RNA-Seq data from CD4 T cells isolated from HIV elite controllers and reference cohorts, including HIV negative persons, viremic progressors and ART-treated persons. Their main finding is that in some of their comparisons, EC have higher levels of interferon-stimulated genes (ISG), paired with distinct expression patterns of transposable elements. The authors suggest that expression of transposable elements may induce altered expression of ISG, presumably due to immune recognition of TE. They also suggest that reduced expression of KZNF genes, which encode for transcription factors that can suppress TE, may be responsible for enhanced expression of TE. I have the following comments:

      1. All data included in this manuscript derive from previously published data. A new dataset, specifically designed to focus on a high-resolution analysis of TE expression, would be better suited to address the proposed questions.

      Authors: We agree that a new dataset tailored specifically for high-resolution analysis of TE expression would be optimal for addressing the proposed inquiries, and we emphasize this point in the Discussion of the revised manuscript.

      “We found that distinct sets of innate immunity genes and restriction factors are upregulated in different EC clusters even in the absence of active viremia, suggesting that elevated basal expression of these factors plays a previously underappreciated role in the EC phenotype. Further studies will be necessary to cement this idea and would especially benefit from the integration of single-cell omics to dissect TE regulation and clustering in deconvoluted CD4+ T cells of ECs. We also acknowledge that our study is limited by the small number of EC individuals with available omics data, which likely limited our ability to identify significant relationships between transcriptome clustering and available participant metadata (Figure S5). While the rarity of ECs in the seropositive population makes it challenging to study this phenotype, the transcriptomic heterogeneity revealed by our analyses underscores the need for surveying larger and more diverse EC cohorts.” [pg. 37-38, L651-662]

      Regrettably, we do not have access to elite controller samples (which are exceedingly rare), and as such the addition of a novel dataset was not feasible within the scope of this revision. Nevertheless, we assert that the publicly available sequencing data analyzed here is robust and suitable for locus- and family-level TE analysis. All sequencing runs were paired-end and of high depth, ensuring proper alignment to and high coverage of TEs at a locus-specific resolution. Additionally, we use in-house pipelines curated for TE analysis, to optimize the accuracy and quantity of TE-assigned reads (see Methods and our GitHub Repository for more details).

      Authors: We agree that a new dataset tailored specifically for high-resolution analysis of TE expression would be optimal for addressing the proposed inquiries, and we emphasize this point in the Discussion of the revised manuscript.

      “We found that distinct sets of innate immunity genes and restriction factors are upregulated in different EC clusters even in the absence of active viremia, suggesting that elevated basal expression of these factors plays a previously underappreciated role in the EC phenotype. Further studies will be necessary to cement this idea and would especially benefit from the integration of single-cell omics to dissect TE regulation and clustering in deconvoluted CD4+ T cells of ECs. We also acknowledge that our study is limited by the small number of EC individuals with available omics data, which likely limited our ability to identify significant relationships between transcriptome clustering and available participant metadata (Figure S5). While the rarity of ECs in the seropositive population makes it challenging to study this phenotype, the transcriptomic heterogeneity revealed by our analyses underscores the need for surveying larger and more diverse EC cohorts.” [pg. 37-38, L651-662]

      Regrettably, we do not have access to elite controller samples (which are exceedingly rare), and as such the addition of a novel dataset was not feasible within the scope of this revision. Nevertheless, we assert that the publicly available sequencing data analyzed here is robust and suitable for locus- and family-level TE analysis. All sequencing runs were paired-end and of high depth, ensuring proper alignment to and high coverage of TEs at a locus-specific resolution. Additionally, we use in-house pipelines curated for TE analysis, to optimize the accuracy and quantity of TE-assigned reads (see Methods and our GitHub Repository for more details).

      1. As the authors acknowledge, the described investigations are exploratory, and do not allow to draw firm conclusions. Mechanistic experiments are recommended to address the authors' hypotheses.

      Authors: We agree and have duly acknowledged throughout the Discussion the exploratory nature of our investigations and the need for future mechanistic experiments to validate our model. Below are passages from the revised manuscript which we’ve added to emphasize these points.

      “These findings underscore the need for future locus-level investigations with high-depth sequencing to fully capture the complexity of TE expression.” [pg. 28, L486-488]

      “Each step in the model will require experimental work to be validated. First and foremost, it will be important to confirm that the TEs exhibiting increased transcript levels and accessibility in ECs are indeed boosting the innate immune response and control of HIV-1 in these individuals.” [pg. 34, L583-586]

      “CRISPR-Cas9 editing was used in cell lines to demonstrate that a subset of MER41 elements function as enhancers driving the interferon-inducibility of several innate immune genes. However, the specific MER41 loci we identified here as differentially active in ECs have not been tested experimentally for enhancer activity. Thus, further work is warranted to confirm the regulatory function of these loci under the control of STAT1 or other immune TFs, as well as other TE families identified as targets of immune-related TFs (Figure S8).” [pg. 35, L594-600]

      “Overall, our results reinforce the concept that TEs are important players in the human antiviral response (25,93) and uncover specific candidate elements for boosting cellular defenses against HIV-1 in ECs. We acknowledge that these associations are drawn from correlative patterns and manipulative experiments are needed to infer causality between chromatin changes at these TEs and increased expression of nearby immunity genes.” [pg. 36, L618-623]

      “Further work is needed to validate TE-KZNF regulatory interactions in T cells, probe their connection to epigenetic variation at individual TE loci, and explore their repercussions on gene expression variation in CD4+ T cells, with and without HIV-1 infection.” [pg. 40, L715-718]

      Thus, while we appreciate and agree with the suggestion of experimental validation, we contend that these experiments fall beyond the scope of the present study, which is a computational investigation providing insight into the EC retrotranscriptome and its potential implications for natural HIV-1 control.

      1. An important limitation is that virological data of EC are not considered. For example, I believe it is a lot more likely that the upregulation of ISG in EC relates to ongoing low-level viral replication. The authors could analyze cell-associated HIV RNA and DNA levels and determine how they associate with ISG expression.

      Authors: Thank you for bringing up this important consideration. It's worth noting that the public datasets used in our study reported undetectable viremia in the EC volunteers (PMIDs 30964004, 29269040, 32848246, 27453467). Nonetheless, we sought to address this limitation and explore the potential association between ISG expression and viremia as recommended by the reviewer. These analyses were integrated into the revised manuscript as Figure S6.

      “To exclude the possibility that these gene expression signatures in ECs are associated with viremia, we quantified HIV-1 transcript levels in deconvoluted CD4+ T cell RNA-seq samples from ECs and ART-treated PLWH for comparison. In the original studies, all samples were reported to have undetected viremia by blood tests (9,37-39). Consistent with this, we found that the vast majority of the EC and ART samples taken from PBMCs exhibited very low HIV-1 transcript levels, with TPM values generally below 1. However, in samples originating from the lymph nodes of EC individuals (n = 22) (37), we detected HIV-1 expression in some subsets (Figure S6A&B). In agreement with the corresponding study (37), we found elevated HIV-1 transcript levels in germinal center and non-germinal center T follicular helper cells (GC Tfh & nGC Tfh, not included in our clustering analyses) -- and to a lesser extent in T effector memory (EM) cells (Figure S6A, average TPM This added analysis confirms that the increased expression of ISGs in ECs is not correlated with virological transcription and is therefore likely not to be driven by viremia.

      1. KZNF genes seem downregulated in EC. Can the authors propose a reason/mechanism for that?

      Authors: There is the possibility that KZNF regulatory loops are the cause of their transcriptional downregulation, which has been documented in embryogenesis (PMID 31006620) and cancer (PMID 33087347). We’ve incorporated this hypothesis into the Discussion as an additional consideration for the reader.

      “These observations suggest that interindividual variation in KZNF expression in CD4+ T cells could explain why certain TEs are variably expressed and accessible across ECs. But what are the mechanisms underlying variation in ZNF expression? It is possible that TE-KZNF regulatory loops are involved, in which a copy of the TE family targeted by a KZNF is inserted near and regulates the KZNF gene, thereby introducing a negative feedback loop. This phenomenon has been documented in prior studies of KZNF activity in embryogenesis (51) and cancer (115).” [pg. 39-40, L705-711]

      While we believe this is a viable hypothesis, it requires further experimentation to confirm the existence of this phenomenon and its impacts in the context of immune cells.

      Significance

      Overall, I think this is an interesting manuscript that proposes distinct and potentially important mechanisms that may contribute to immune control of HIV. My suggestions to improve the manuscript are complex and cannot be easily addressed through experimental work. I believe a possible option would be to publish the present manuscript without my proposed modifications but highlight the weaknesses of the current paper more clearly; mechanistic studies could then be deferred to a future study.

      Authors: We appreciate the reviewer's positive assessment of our manuscript and their recognition of its significance in elucidating novel TE-derived mechanisms that may contribute to natural HIV-1 control. We agree that mechanistic studies are required to test our predictions. As the reviewer suggests, these would be complex experiments that we feel fall beyond the scope of this study. With the additions detailed above in response to the reviewer’s point #2, we believe that we have clearly highlighted the limitations of our work and emphasized the need for future experimentation to validate our findings.

      Reviewer #3

      Evidence, reproducibility, and clarity

      Summary: This manuscript presents an analysis of published gene expression (RNA-seq and ATAC-seq) data from a couple of cohorts of HIV-infected elite controllers (EC), as compared to uninfected controls, (HC), virological progressors (VP). The authors report that HIV elite controllers may exhibit 4 distinct patterns of TE (and gene) expression and suggest that TE expression may drive some form of antiviral gene expression. Further, they show that heterogeneous TE expression may be determined by differential KZNF gene activity among the different clusters of elite controllers. These results are very interesting, even though the conclusions are very preliminary. It presents intriguing correlations between expression of certain TE groups of LINES and HERVs, and the clustering into 4 gene expression groups in EC and is a novel finding. That said, correlation is not causation, and the authors need to be more cautious in presenting their highly preliminary model in Figure 6.

      Authors: We are grateful for the reviewer's insightful assessment of our manuscript, acknowledging the novelty and interest of our findings regarding TE expression patterns in HIV-1 elite controllers. We also appreciate their constructive feedback regarding the cautious interpretation of preliminary conclusions. In the revised manuscript, we have underscored the exploratory nature of our investigations and the need for future mechanistic experiments to validate our model.

      “These findings underscore the need for future locus-level investigations with high-depth sequencing to fully capture the complexity of TE expression.” [pg. 28, L486-488]

      “Each step in the model will require experimental work to be validated. First and foremost, it will be important to confirm that the TEs exhibiting increased transcript levels and accessibility in ECs are indeed boosting the innate immune response and control of HIV-1 in these individuals.” [pg. 34, L583-586]

      “CRISPR-Cas9 editing was used in cell lines to demonstrate that a subset of MER41 elements function as enhancers driving the interferon-inducibility of several innate immune genes. However, the specific MER41 loci we identified here as differentially active in ECs have not been tested experimentally for enhancer activity. Thus, further work is warranted to confirm the regulatory function of these loci under the control of STAT1 or other immune TFs, as well as other TE families identified as targets of immune-related TFs (Figure S8).” [pg. 35, L594-600]

      “Overall, our results reinforce the concept that TEs are important players in the human antiviral response (25,93) and uncover specific candidate elements for boosting cellular defenses against HIV-1 in ECs. We acknowledge that these associations are drawn from correlative patterns and manipulative experiments are needed to infer causality between chromatin changes at these TEs and increased expression of nearby immunity genes.” [pg. 36, L618-623]

      “Further work is needed to validate TE-KZNF regulatory interactions in T cells, probe their connection to epigenetic variation at individual TE loci, and explore their repercussions on gene expression variation in CD4+ T cells, with and without HIV-1 infection.” [pg. 40, L715-718]

      We hope these passages provide sufficient caution and clarity in the presentation of our scientific inquiry.

      Major comments:

      Overall, although preliminary, as the authors note, the results are interesting and worthy of follow-up. At this point, however, a number of issues arise that need further clarification and analysis before I would consider this study complete.

      First, the analyses shown in Figures 3-5 based on data from studies on EC of CD4 cells are apparently motivated by the differential TE expression in total PBMCs shown in Fig 1 and 2. Yet, the TE groups (please don't use taxonomic terms like "subfamily") identified in Fig 2 and Fig 4 are completely different, with no overlap. This discrepancy underscores the possibility that the differential expression observed is, at least in part, due to the differences among the groups or clusters in cell type composition, as seen in Fig 2D and 3B which, themselves, could be a consequence of HIV infection and elite control (which has been shown to involve ongoing, albeit low-level, virus replication). This issue must be addressed.

      Authors: Thank you for the suggestion. First, we’d like to clarify that the data used in Figures 1 and 2 were not both derived from PBMCs. Figures 1 and S1 examine the differential expression of TEs in EC CD4+ T cells compared to HCs and ART-treated PLWH, respectively. Figure 2 examines differential expression of TEs in EC PBMCs compared to treatment-naïve VPs. Second, regarding Figure 4B-C, the TE loci that we chose to highlight were not based on our results from the PBMC analysis in Figure 2, which is why there is no overlap in the TE families presented. Instead, we selected those TE-gene pairs based on 1) known function of the genes in immunity and/or HIV-1 restriction, 2) known contribution of the TE families to immunity, and 3) differential accessibility and expression of the TEs and genes respectively in ECs compared to HCs. Thus, Figure 4B-C represents select examples that we deemed particularly relevant to the EC phenotype. We have revised the manuscript to better explain the process of TE-gene pair identification and the rationale behind our selection for Figure 4B-C.

      “We used paired ATAC-seq – which measures chromatin accessibility – and RNA-seq datasets from the CD4+ T cells of ECs (n=4) and HCs (n=4) (39) to create a list of TE-gene pairs where the TE locus and gene show increased accessibility and expression, respectively, in ECs compared to HCs (Table S7, see Methods for details). These loci and genes were paired based on proximity, with a maximum distance of 10kb between the TE locus and the gene’s transcription start site, to increase the likelihood of a direct cis-regulatory influence of the TE over the nearby gene.” [pg. 21, L357-363)

      “In Figure 4B & 4C, we have highlighted six of the TE-gene pairs from Table S7 based on the gene’s function in HIV-1 restriction and the TE family’s known contribution to immune gene regulation.” [pg. 21, L369-371]

      Regarding cell type composition, we acknowledge that the differences observed in the proportion of immune cell subtypes may contribute to the differential expression between ECs, VPs, and HCs (Figures 2D and S3A). However, we provide evidence that cell type composition cannot be the sole driver for the clustering of deconvoluted CD4+ T cell RNA-seq samples (Figure 3B and S5D). Cell subtype alone could not explain the observed clustering of EC samples by gene and TE family expression. Clusters 1 and 2, for example, had nearly identical subtype compositions, but were clearly separated on the UMAP (Figures 3A, 3B, and S5D). We remark on this in the Results of the revised manuscript.

      “[W]e visualized the samples by cellular subtype, as identified in the original studies, to assess whether the clustering could be explained by CD4+ T cell subtype composition (Figure S5D). Clusters 1 and 2 were essentially indistinguishable in cell type composition, whereas Clusters 3 and 4 showed an overrepresentation of TM/EM and naïve/CM cell types, respectively (Figure 3B). Thus, cell subtype composition could only partially explain the clustering.” [pg. 16, L271-276]

      The EC CD4+ T cell clusters also had unique gene ontology, gene & TE expression, and TE accessibility profiles (Figures 3C, 3D, 5). Moreover, while we do not have parallel RNA- and ATAC-seq data from similarly deconvoluted CD4+ T cells of ECs like those used in the clustering analysis (PMIDs 32848246 & 27453467), the original article from which we sourced the parallel RNA- and ATAC-seq data used in Figures 1 and 4 reported that these samples are predominantly effector memory CD4+ T cells (PMID 30964004). If new deconvoluted, multi-omic datasets from ECs become available, we would be interested in further exploring the contribution of cell type composition. However, the current data indicate that it is not a major contributor to the differential TE expression identified in our analyses.

      Regarding the impact of ongoing HIV-1 replication upon the unique expression patterns in the EC participants, it's worth noting that the public datasets used in our study reported undetectable viremia in the EC volunteers (PMIDs 30964004, 29269040, 32848246, 27453467). Nonetheless, we sought to address this by quantifying HIV-1 transcription and exploring its potential association with interferon-stimulated gene (ISG) expression, a group of genes that we know would be reactive to active viremia. These analyses were integrated into the revised manuscript as Figure S6.

      “To exclude the possibility that these gene expression signatures in ECs are associated with viremia, we quantified HIV-1 transcript levels in deconvoluted CD4+ T cell RNA-seq samples from ECs and ART-treated PLWH for comparison. In the original studies, all samples were reported to have undetected viremia by blood tests (9,37-39). Consistent with this, we found that the vast majority of the EC and ART samples taken from PBMCs exhibited very low HIV-1 transcript levels, with TPM values generally below 1. However, in samples originating from the lymph nodes of EC individuals (n = 22) (37), we detected HIV-1 expression in some subsets (Figure S6A&B). In agreement with the corresponding study (37), we found elevated HIV-1 transcript levels in germinal center and non-germinal center T follicular helper cells (GC Tfh & nGC Tfh, not included in our clustering analyses) -- and to a lesser extent in T effector memory (EM) cells (Figure S6A, average TPM Based on these results, we have concluded that the differential expression of genes and TEs in the EC clusters are not a consequence of low-level viral transcription in ECs.

      Finally, a remark on TE nomenclature: The reviewer suggests that we use the term “TE groups” as opposed to taxonomic terms such as TE subfamily or TE family. We respectfully disagree. This nomenclature of TEs has been well defined (PMIDs 26612867, 26612867, 17984973) and is widely used in TE literature. Throughout the manuscript, we have conformed to the nomenclature used to annotate the human genome. One can debate the way TE families and subfamilies have been classified in Dfam (the database through which repetitive elements in the human genome have been annotated), but it is outside the scope of this study to revisit that nomenclature.

      Similarly, of the 12 DE TE groups in EC in Fig 5A, only 3 overlap with the 16 in EC Fig S1.

      Authors: This is correct, but we don’t believe it’s concerning. In Figure 5A, we are comparing the expression of TE families between separate EC clusters. In Figure S1, we are comparing the expression of TE families in ECs compared to ART-treated PLWH. These are fundamentally different comparisons and thus the differences in the identified DE-TEs between the two figures reflect the distinct biological contexts being investigated in each analysis.

      Second, the introduction points out the strongly supported association between elite control and immunogenetic determinants, most notably specific HLA-B types, but also innate immunity factors. This cries out for inclusion of these factors in the analyses of this manuscript, in the format of Figure S4, for example, but none is to be found. The relevant genotypes are likely available in the metadata in the references cited, but, if not, could be inferred from the RNA-seq data.

      Authors: Thank you for the recommendation. While our project’s primary focus is on the transcriptomic and epigenomic signatures, we agree that studying the HLA-B genotypes of all EC participants could provide valuable context for understanding the clustering of elite controllers. To explore this, we inferred the HLA-B alleles in each EC participant whose RNA-seq data was included in the clustering analysis, utilizing the arcasHLA tool (PMID: 31173059) on the total CD4+ T cell samples. We then validated these inferred HLA-B alleles against the available metadata from one of the source studies (PMID 27453467) and found that they matched for all participants. This strengthened our confidence in the accuracy of the HLA-B genotype inferences for the other samples where comprehensive HLA-B data was not provided.

      In order to assess how these protective HLA-B alleles segregated between the four EC clusters derived from gene and TE family expression, we chose to visualize three of the most common alleles associated with HIV-1 elite control: HLA-B*27:03, *57:01, and *57:03 (PMIDs 30964004, 25119688, 21051598) (Figure R1, available in the Response to Reviewers PDF).

      Our analysis revealed that these major protective alleles were not significantly overrepresented in any particular cluster. Consequently, we believe that HLA-B genotype does not have a major impact on the clustering observed in Figure 3.

      It would also be very useful to present the KZNF data in Figure 5 the same way, since, looking at Fig 5C, the correlation of high and low KZNF expression, while clearly correlated with a that of few groups of elements, with clustering into specific groups does not appear to be well supported.

      Authors: Thank you for the insightful suggestion. While the KZNF genes are included in the gene set used for the clustering analysis in Figure 3, we agree that clustering based solely on KZNF expression and displaying it as we have in Figures 3A and S5 could provide valuable insights. However, when we attempted to cluster the EC RNA-seq samples using only KZNF expression data, we were limited by the relatively low number of KZNF genes that showed sufficient variability across samples (n = 120). For robust statistical power, we require at least 200 features to reliably cluster the 128 EC CD4+ T cell samples. We believe this limitation does not diminish the relevance of KZNFs in the observed clustering patterns but rather highlights the nuanced role each KZNF plays in the regulation of the transcriptome. Each individual KZNF is responsible for the regulation of hundreds to thousands of TE loci (PMID 37730438). Thus, while a clustering approach based solely on KZNF expression may not be feasible, the integral role of KZNFs in modulating the transcriptome through TE regulation remains evident and supports their inclusion in Figure 6 of the revised manuscript.

      In general, other than the cell type composition differences, there is no presentation of evidence for any biologically important feature associated with the clusters found.

      Authors: We agree that the root cause of the transcriptomic differences between the EC clusters is hard to pin down but we do identify several distinctive features of the clusters that we believe are biologically significant. First, having extracted the lists of genes whose differential expression defined the four EC clusters, gene set enrichment analysis revealed that the clusters were functionally distinct, each characterized by a unique list of top GO terms (Figure 3C). Second, we provide evidence that KZNFs expressed in CD4+ T cells significantly bind to the candidate TE families whose expression defines each of these clusters (Figure 6D) and have significantly decreased expression in ECs compared to VPs (Figure 6C). This is corroborated by pairwise correlation analysis that revealed cluster-specific anticorrelation patterns between these KZNFs and their target TEs (Figure 6A). We present this data in support of our hypothesized KZNF-based mechanism for TE co-option in viral immunity. We do not yet have data indicative of the mechanism by which KZNF expression is in turn regulated. However, we speculate that negative feedback loops may be contributing to changes in KZNF expression.

      “These observations suggest that interindividual variation in KZNF expression in CD4+ T cells could explain why certain TEs are variably expressed and accessible across ECs. But what are the mechanisms underlying variation in ZNF expression? It is possible that TE-KZNF regulatory loops are involved, in which a copy of the TE family targeted by a KZNF is inserted near and regulates the KZNF gene, thereby introducing a negative feedback loop. This phenomenon has been documented in prior studies of KZNF activity in embryogenesis (51) and cancer (115).” [pg. 39-40, L705-711]

      Overall, our study presents preliminary evidence that the four EC clusters derived from gene & TE family expression may be distinguished by complex interplay of activators (Figure S8) and repressors (Figure 6) altering the activity of infection-responsive TE families to co-opt specific elements for immune regulatory function. While not yet validated in an experimental setting, we believe these results are of biological significance.

      Third, the figures present values that have been very heavily analyzed, and it is difficult to impossible to infer what the underlying data look like. For example, with the exception of a few selected examples in Figs 4 and 5, individual provirus data are lacking. Nor can we tell how consistent the distribution of expression values within a TE group is, whether the TEs included solo LTRs (which constitute the majority of all ERVs), the possible contribution of other TFs to expression (with the exception of a brief mention of STAT1).

      Authors: We respectfully disagree that the values presented in our figures are heavily analyzed. As this manuscript represents the first investigation of TEs’ role in HIV-1 elite control, we believe the most reasonable initial approach was to compile and visualize the data at the family level, rather than at the level of individual loci, which is harder to interpret due to mapping issues, commonly low transcription, and often idiosyncratic behavior of individual loci. Nonetheless, we did not limit our analysis to full-length HERVs (proviruses) and thus retain all solo LTR data in our analyses. This was added to the Methods of the revised manuscript.

      “To facilitate comprehensive expression quantification, we curated a reference transcriptome by combining gene, TE, and HIV-1 genomic sequences. This was achieved by integrating the locus-level TE classification from RepeatMasker, the hg19 GenCode gene annotation,

      and the HXB2 reference HIV-1 annotation. For the TEs, we removed simple repeats, SINE elements, and DNA transposons, retaining LINE and HERV loci, including all solo LTRs. We also removed any loci within gene exons/UTRs. The remaining sequences were appended in fasta format, and all sequences were annotated with their respective gene, TE locus, or HIV subunit and modeled in GTF format.” [pg. 55, L869-878]

      For the sake of transparency, all relevant details on sequencing data analysis and the corresponding scripts are available in the Methods and our GitHub Repository.

      Additionally, while most of our figures make comparisons at the family level, we do visualize multiple TE loci (Figure 4C) and provide a list of putative locus-level TE-gene pairs from which those shown in Figure 4C were selected (Table S7). In our revisions, we also re-clustered the 128 EC CD4+ T cell RNA-seq samples based only on locus-level TE expression, using the same graph-based k-nearest neighbors method as in Figure 3. The results of this new analysis have been integrated into the revised manuscript as Figure S7.

      “To further explore locus-level expression patterns, we re-clustered the same EC samples (n=128) using only locus-level TE expression. This again resolved four EC clusters (Figure S7A), which interestingly appeared even more distinct than those identified by gene and TE family expression (Figure 3A). The TE locus-based clusters (TL-Cs) aligned well with the gene and TE family clusters (GT-Cs), with an average 70% overlap in samples between each GT-C and its corresponding TL-C (Figure S7B), indicating high consistency (Table S8). The remaining 30% of samples that shifted between clusters did so consistently within individuals, not cohorts, maintaining heterogeneous TL-C compositions similar to the GT-Cs (Figures S7C & S5A). An exception to this heterogeneity was TL-C4, comprising 22 samples from GT-C1 that were almost entirely from the CD4+ T cell subsets of only four participants in the Jiang cohort (Figure S7C, Table S8). No other samples from the Jiang cohort shifted to this cluster from other GT-Cs, suggesting that these patterns reflect individual variation rather than cohort bias. Like the GT-Cs, each TL-C included samples from all five CD4+ T cell subsets and was largely heterogeneous (Figure S7C). Notably, TL-C2 mirrored corresponding GT-C3 in its overrepresentation of EM and TM cells, while TL-C1 uniquely showed an overrepresentation of naïve CD4+ T cells. Beyond sample composition, each TL-C was characterized by a unique pattern of expressed TE loci (Figure S7D). These signatures were heterogeneous across families, with subsets of variable loci from one TE family marking separate clusters (Figure S7E), some of which did not reach the threshold of significance in earlier analyses when analyzed at the family-level, like SVA-D. Many families maintained their cluster-specific signatures, like THE1B (a marker of GT-C2), for which the majority of variable loci were found in corresponding TL-C1. However, some TE families, like the L1s that marked GT-C1, showed more heterogeneous signatures with variable loci marking multiple TL-Cs. These findings underscore the need for future locus-level investigations with high-depth sequencing to fully capture the complexity of TE expression.” [pg. 27-28, L462-488]

      With this addition, we include significantly more data analyzed at the locus level, which we believe not only validate the distinct clustering observed in Figure 3, but also underscore the potential for locus resolution analysis to reveal additional layers of retrotranscriptomic diversity in EC CD4+ T cells.

      Finally, we agree with the reviewer that TFs other than STAT1 may contribute to the observed changes in TE expression. To investigate this, we analyzed several TFs expressed in CD4+ T cells and, for TFs enriched over TEs of interest, subsequently examined the correlation between TF and target TE expression in the deconvoluted EC CD4+ T cell samples used for the clustering. The results of this analysis have been integrated into the revised manuscript at Figure S8.

      “In addition to KZNF repressors, transcriptional activators may also drive the differential expression of specific TE families across ECs (83). To investigate this, we focused on transcription factors (TFs) expressed in CD4+ T cells and mined ChIP-seq data from the ENCODE Consortium (84) to identify TFs with binding enrichment to TE families of interest, selected for their elevated, cluster-specific expression in ECs (highlighted in Figures 4, 5, and S4). We then examined the correlation between TF and target TE expression in the deconvoluted CD4+ T cell samples from ECs used for our clustering analysis (Figure 3) (9,37). We observed several significant positive correlations between TF and TE expression across ECs (Figure S8). Thus, differential expression of immune-related TFs may also contribute to the variation in TE expression and cis-regulatory activity across ECs, in tandem with the repressive activities of KZNFs.” [pg. 30, L517-527]

      This evidence supports the reviewer’s suggestion that other TFs may be contributing to the unique EC retrotranscriptome we profile in this study. These added analyses, mimicking those conducted for KZNFs in Figure 6B & 6D, demonstrate that transcriptional activators may indeed play a crucial role in shaping the TE landscape in ECs.

      Other issues

      Figure 1:

      A) Log2 fold change of what? TPM values? Needs to be specified.

      Authors: Thank you for pointing out this ambiguity. The log2-transformed fold change values plotted in Figure 1A refer to DESeq2-normalized expression. They were extracted from the results of the DESeq2 pipeline, which we applied to the raw count expression matrix (see our Methods for more details). Following your suggestion, we have clarified this point in the figure legend in the revised manuscript.

      “Total detected genes and TE loci are plotted by log2-transformed fold change of DESeq2-normalized counts (EC vs. HC).” [pg. 10, L163-164]

      We have similarly made these changes to any figure legend which was ambiguous in its description of the expression data.

      Why Bonferroni correction? Usually BH q values or other less stringent adjustments are used nowadays.

      Authors: In our analysis, we opted for the Bonferroni correction due to its well-established reliability and stringent control of the family-wise error rate when conducting multiple tests. Given the exploratory nature of our investigation and the desire to minimize the risk of false positive findings, we chose to employ this traditional correction method within our analytical pipelines.

      B,C): Z-score of what? Scaled, normalized counts? Scaled TPM values?

      Authors: Thank you again for highlighting this point of uncertainty. We now clarify this in the figure legend in the revised manuscript.

      “Heatmap displaying the expression of the top differentially expressed genes in CD4+ T cells of ECs (n=4; red bar) vs. HCs (n=5; blue bar). Relative expression levels are representative of row-wise scaled, log2-transformed expression in transcripts per million (TPM). Heatmap coloration is based on the z-score distribution from low (gold) to high (purple) expression.” [pg. 11, L167-171]

      Figure 2:

      B) The blue font color is very difficult to see.

      Authors: We have changed the blue font color to make it more easily distinguishable from the black.

      C) This heatmap should demarcate or separate genes versus TE clades. If that's not possible, then the two should be shown separately.

      Authors: We appreciate your suggestion regarding the heatmap presentation. While we understand the rationale for demarcating genes versus TE clades, we have chosen to retain the original figure layout. In this analysis, TEs were analyzed simultaneously with genes. The order in which they are shown was obtained by default clustering of the expression matrix using the hclust function. We chose to present them together and in this order to provide a comprehensive visualization of the differential expression patterns between the two groups and highlight the homogenous nature of gene and TE expression across VPs.

      L191: How many groups (NOT families) and how many total elements were examined?

      Authors: We begin with the RepeatMasker annotation of the hg19 assembly and filter out the SINE elements, DNA transposons, simple repeats, and all loci within gene exons/UTRs. These details are provided in the Methods of the revised manuscript, as was quoted above. In total, our analyses examine 1,104,828 loci from 603 TE groups (which we refer to as families). We apologize if this figure is not accurate to a separate classification of TEs into groups, rather than families. Any such method of grouping TEs is unfamiliar to us and outside of the Dfam annotation.

      L198: 2B, not C

      Authors: Thank you for catching this. The figures labelled were swapped in error and have been changed to reflect in Figure 2 to match the in-text references.

      L205: Did the expressed proviruses have STAT1 sites?

      Authors: Thank you for your question. The identification of LTR13’s increased expression in ECs compared to VPs was the result of a family level analysis which considered expression additively across the LTR13 loci in our annotation. To answer your question, we analyzed STAT1 ChIP-seq data from the ENCODE Consortium to characterize which LTR13 loci were bound by STAT1 (corroborated by motif prediction calls). We then integrated the EC RNA-seq data and found that the expressed LTR13 proviruses significantly overrepresented those with bound STAT1 sites (Figure R2, available in the Response to Reviewers PDF).

      These data suggest that STAT1 binding may play a critical role in the transcriptional regulation of LTR13 in ECs, contributing to their differential expression profile. Further exploration into the contribution of activating, immune-related TFs is explored in Figure S8 in the revised manuscript.

      L333: 10 kb is very close. Why was it chosen?

      Authors: We chose 10 kb as our cutoff for selection because it allowed for very high confidence in the TE loci’s cis-regulatory capacity over the nearby genes. For transparency, we have made this clearer in the Results text of the revised manuscript.

      “These loci and genes were paired based on proximity, with a maximum distance of 10kb between the TE locus and the gene’s transcription start site, to increase the likelihood of a direct cis-regulatory influence of the TE over the nearby gene.” [pg. 21, L360-363]

      However, if desired, a less stringent cutoff could also be used with relative confidence (e.g., 50 kb).

      L351-352: Again, correlation is not causation. How do the authors know it's not the other way around?

      Authors: The candidates that we chose to display in Figure 4 (the figure to which these lines refers) are from MER41, ERV3-16, and LTR12C. Our lab and others have shown that these specific loci or other loci in these TE families are capable of regulating neighboring genes’ expression, with specific evidence in the context of immunity (PMID Smitha, Ed, APOBEC, etc.). Based on this knowledge, we believe that it’s most likely that TE-derived regulatory sequences are the cause of the increased restriction factor expression, rather than TE accessibility being a consequence of the transcriptional activation of the neighboring genes. However, we recognize that these results are correlative, as the reviewer notes, and we emphasize this in the revised manuscript. Most notably:

      “We acknowledge that these associations are drawn from correlative patterns and manipulative experiments are needed to infer causality between chromatin changes at these TEs and increased expression of nearby immunity genes.” [pg. 36, L620-623]

      Figure 4

      B) Need to show a scale of the genome region, the orientation of both the gene and the TE, whether it is a solo LTR

      Authors: Thank you for the suggestion. Genomic scale and orientation have been added to Figure 4C. All loci visualized were solo LTRs, save for HCP5, which is a lncRNA derived from a full-length ERV3 element.

      Figure 5

      A) Would benefit from also showing HCs

      Authors: Thank you for the recommendation. The RNA-seq datasets used in this analysis do not include HC samples. Additionally, this analysis is meant to highlight differences in TE expression between the four EC clusters. Thus, we have chosen to keep Figure 5A as it appears in the original manuscript.

      C) Would be helped by showing adjusted p-values, and also should show examples of non-correlating relationships between these KZNF genes and other TEs.

      Authors: Thank you for the suggestion. All correlation analyses had adjusted p-values below 0.01, derived from corr.test in R. We’ve added this to the figure legends of Figure 6B [pg. 32, L539] and S8B [pg. 53, L835]. However, we have chosen not to integrate non-correlating examples into the revised manuscript for the sake of space.

      Figure 6

      Title: should start with "proposed model for.." or some such.

      Authors: Thank you for the suggestion. The title has been changed to “Proposed model for the interplay of KZNFs and TEs regulating proximal antiviral gene expression in elite controllers of HIV-1” in the revised manuscript [pg. 34, L580-581].

      L 537: Again, how do the alleles segregate in the clusters?

      Authors: This question has been addressed in response to an earlier comment from Reviewer #3.

      Generally, in the correlation analyses, I'd like to see adjusted p-values and examples of non-correlated results.

      Authors: Thank you for the suggestion. As mentioned above, all correlation analyses have been annotated with the adjusted p-value threshold. Additionally, below we’ve included examples of non-correlated results from two analyses. First, we show a TE-gene pair whose increased TE accessibility in HCs compared to ECs does not correlate with increased expression of the proximal gene (Figure R3, available in the Response to Reviewers PDF). Notably, this gene does not play a role in HIV-1 infection response. Here, we show that genes with proximal (Second, we show the pairwise correlation and linear regression results of L1PA6 and ZNF2 (Figure R4, available in the Response to Reviewers PDF). ZNF2 is one of the KZNFs highlighted in Figure 6 for its low expression in ECs, anticorrelated to its repressive target LTR12C. On the other hand, L1PA6 is active in ECs, with variably high expression across samples. ZNF2 ChIP-exo revealed that ZNF2 has no capacity to bind to L1PA6 loci (adj. p-value = 1; PMID 37730438). Thus, even though both genes are variable across samples, we observe no significant (anti)correlation between the two variables (rho = 0.051 & p-value = 0.866).

      While we have not integrated these results into the revised manuscript for the sake of space, we hope that the provided examples satisfactorily demonstrate the presence of non-correlated results in our analyses, further reinforcing the specificity and robustness of our significant findings.

      Significance:

      This study presents an in-depth analysis of the reverse transcriptome in Elite controllers. It will be of interest to both HIV researchers and those interested in the regulation of the human retrotranscriptome and its consequences.

      Provides an avenue for future explanation into elite controllers and TE involvement in the phenotype.

      Does a good job of placing the work in the context of existing lit, synthesizing other papers regarding TEs and immune control.

      Potential immune regulatory involvement of specific HERV clades.

      Authors: We’d like to thank the reviewer for their encouraging feedback. We’re pleased that they found our analysis of the EC retrotranscriptome to be of broad interest and appreciate their recognition of our efforts to synthesize existing literature, contextualizing our findings within the broader field. We agree that our study opens new avenues for exploring the role of TEs, particularly specific HERV clades, in not only the EC phenotype but immune regulation as a whole.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      This manuscript presents an analysis of published gene expression (RNA-seq and ATAC-seq) data from a couple of cohorts of HIV-infected elite controllers (EC), as compared to uninfected controls, (HC), virological progressors (VP). The authors report that HIV elite controllers may exhibit 4 distinct patterns of TE (and gene) expression and suggest that TE expression may drive some form of antiviral gene expression. Further, they show that heterogeneous TE expression may be determined by differential KZHF gene activity among the different clusters of elite controllers. These results are very interesting, even though the conclusions are very preliminary. It presents intriguing correlations between expression of certain TE groups of LINES and HERVs, and the clustering into 4 gene expression groups in EC and is a novel finding. That said, correlation is not causation, and the authors need to be more cautious in presenting their highly preliminary model in Figure 6.

      Major comments:

      Overall, although preliminary, as the authors note, the results are interesting and worthy of follow-up. At this point, however, a number of issues arise that need further clarification and analysis before I would consider this study complete. First, the analyses shown in Figures 3-5 based on data from studies on EC of CD4 cells are apparently motivated by the differential TE expression in total PBMCs shown in Fig 1 and 2. Yet, the TE groups (please don't use taxonomic terms like "subfamily") identified in Fig 2 and Fig 4 are completely different, with no overlap. This discrepancy underscores the possibility that the differential expression observed is, at lest in part, due to the differences among the groups or clusters in cell type composition, as seen in Fig 2D and 3B which, themselves, could be a consequence of HIV infection and elite control (which has been shown to involve ongoing, albeit low-level, virus replication). This issue must be addressed. Similarly, of the 12 DE TE groups in EC in Fig 5A, only 3 overlap with the 16 in EC Fig S1.<br /> Second, The introduction points out the strongly supported, association between elite control and immunogenetic determinants, most notably specific HLA-B types, but also innate immunity factors. This cries out for inclusion of these factors in the analyses of this manuscript, in the format of Figure S4, for example, but none is to be found. The relevant genotypes are likely available in the metadata in the references cited, but, if not, could be inferred from the RNA-seq data. It would also be very useful to present the KZNF data in Figure 5 the same way, since, looking at Fig 5C, the correlation of high and low KZNF expression, while clearly correlated with a that of few groups of elements, with clustering into specific groups does not appear to be well supported. I n general, other than the cell type composition differences, there is no presentation of evidence for any biologically important feature associated with the clusters found.<br /> Third, the figures present values that have been very heavily analyzed, and it is difficult to impossible to infer what the underlying data look like. For example, with the exception of a few selected examples in Figs 4 and 5, individual provirus data are lacking. Nor can we tell how consistent the distribution of expression values within a TE group is, whether the TEs included solo LTRs (which constitute the majority of all ERVs), the possible contribution of other TFs to expression (with the exception of a brief mention of STAT1).

      Other issues

      Figure 1: A) Log2 fold change of what? TPM values? Needs to be specified.

      Why Bonferroni correction? Usually BH q values or other less stringent adjustments are used nowadays. B,C): Z-score of what? Scaled, normalized counts? Scaled TPM values?

      Figure 2: B) The blue font color is very difficult to see C) This heatmap should demarcate or separate genes versus TE clades. If that's not possible, then the two should be shown separately.

      L191: How many groups (NOT Fam1lies) and how many total elements were examined?

      L198: 2B, not C

      L205: Did the expressed proviruses have STAT1 sites?

      L333: 10 kb is very close. Why was it chosen?

      L351-352: Again., correlation is not causation. How do the authors know it's not the other way around?

      Figure 4 Title: For "induction" Substitute "correlation"

      Panel B: Need to show a sclae of the genome region, the orientation of both the gene and the TE, whether it is a solo LTR 5 Panel A: Would benefit from also showing HCs C: Would be helped by showing adjusted p-values, and also should show examples of non-correlating relationships between these KZNF genes and other TEs. 6 Title: should start with "proposed model for.." or some such. L 537: Again, how do the alleles segregate in the clusters?

      General

      In the correlation analyses, I'd like to see adjusted p-values and examples of non-correlated results.

      Significance

      This tudy presents an in depth analysis of the reverse transcriptome in Elite controllers. It will be of interest to both HIV researchers and thos interested in the regulation of the human retrotranscriptome and its consequences

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      Provides an avenue for future explanation into elite controllers and TE involvement in the phenotype. - Place the work in the context of the existing literature (provide references, where appropriate).

      Does a good job of this, synthesizing other papers regarding TEs and immune control. - State what audience might be interested in and influenced by the reported findings.

      Potential immune regulatory involvement of specific HERV clades. - Define your field of expertise with a few keywords to help the authors contextualize your point of view.

      Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors have re-analyzed published RNA-Seq data from CD4 T cells isolated from HIV elite controllers and reference cohorts, including HIV negative persons, viremic progressors and ART-treated persons. Their main finding is that in some of their comparisons, EC have higher levels of interferon-stimulated genes (ISG), paired with distinct expression patterns of transposable elements. The authors suggest that expression of transposable elements may induce altered expression of ISG, presumably due to immune recognition of TE. They also suggest that reduced expression of KZNF genes, which encode for transcription factors that can suppress TE, may be responsible for enhanced expression of TE. I have the following comments:

      1. All data included in this manuscript derive from previously published data. A new dataset, specifically designed to focus on a high-resolution analysis of TE expression, would be better suited to address the proposed questions.
      2. As the authors acknowledge, the described investigations are exploratory, and do not allow to draw firm conclusions. Mechanistic experiments are recommended to address the authors' hypotheses.
      3. An important limitation is that virological data of EC are not considered. For example, I believe it is a lot more likely that the upregulation of ISG in EC relates to ongoing low-level viral replication. The authors could analyze cell-associated HIV RNA and DNA levels and determine how they associate with ISG expression.
      4. KZNF genes seem downregulated in EC. Can the authors propose a reason/mechanism for that?

      Significance

      Overall, I think this is an interesting manuscript that proposes a distinct and potentially important mechanisms that may contribute to immune control of HIV. My suggestions to improve the manuscript are complex and cannot be easily addressed through experimental work. I believe a possible option would be to publish the present manuscript without my proposed modifications, but highlight the weaknesses of the current paper more clearly; mechanistic studies could then be deferred to a future study.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Singh et al. analyze the expression and putative contribution of TEs in CD4+ T cells in HIV elite controllers. Through re-analysis of existing datasets, the authors describe broad differences in expression of TEs in ECs through analysis of RNAseq annd ATACseq data, and come up with convincing examples where differentially-expressed innate immune genes correlate with increased accessibility of proximal TEs. Overall, the authors' conclusions are appropriately measured, though the manuscript text should be re-organized for clarity and a few further analyses are needed to support the main message of the paper.

      Major comments: The manuscript would benefit from a re-organization of the figures to focus on TEs - in particular, Fig 1B, Fig 2, and Fig 3 reproduce known transcriptional differences between ECs and HCs and serve as quality controls for the authors' computational analysis. Conversely, Supplementary Fig 6 contains very interesting data on KZNF expression and should be included in the main figures.

      It remains unclear whether differences in TE expression described are specific to ECs or to EC-like CD4+ T cell states. As there are plenty of datasets available that compare the transcriptome of naïve, activated, exhausted, and regulatory CD4+ T cells, the authors should compare the TE expression patterns observed in ECs to activated CD4+ T cells, particularly those with a Th1 and cytotoxic phenotype analogous to those observed in ECs, from healthy donors.

      In Fig 1, the authors demonstrate differential expression of both innate immune genes and TEs, but the link between the two is unclear. Is there any enrichment in differential expression for TEs located proximal to innate immune genes? This type of analysis should be possible using the authors' own software to map TE expression to specific genomic loci.

      Optional: In Fig 3, the authors cluster CD4+ T cells based on transcriptomic profiles. It would be interesting to re-cluster these samples based on TE expression alone, given the differences in TE expression described in Fig 5.

      Significance

      The manuscript by Singh et al. describes for the first time the role of TEs in HIV elite controllers, suggesting that TEs may be co-opted for cis-regulatory function. This study builds off prior work demonstrating that HIV-infected CD4+ T cells activate LTR elements that may regulate expression of interferon-inducible genes, demonstrating that ECs show further upregulation of innate immune genes. While these findings will need to be experimentally validated, this study constitutes a useful resource and adds to the growing body of evidence implicating TEs in cis-regulatory control of immune genes. This study will be of interest to basic scientists interested in genetic mechanisms of HIV control, and if further developed may comprise a useful source of biomarkers to predict viral kinetics in HIV-infected individuals.

      My expertise is in immunology, TE biology, and viral infection

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Evidence, reproducibility and clarity

      Sanial et al. carefully analyze the use of in-gel fluorescence as an alternative to immunoblotting. The authors show that simple modifications of common protein extraction protocols can preserve (to varying extents) fluorescent proteins in their native, fluorescent states. This can be exploited in different applications for in-gel fluorescence quantification, bypassing immunoblotting. The experimental results are clear, showcasing the ease and linearity of in-gel fluorescence quantification.

      In my opinion, the trick of this approach is also potentially its main drawback, the partial denaturation conditions. I think the manuscript could be strengthened with more extensive benchmarking of the approach and further discussion of potential caveats as detailed below.

      Major points:

      1. Protein abundance in the original GFP library (and in other FP-tagged libraries constructed in the meanwhile) have been quantified using fluorescence (flow cytometry, microscopy, colony fluorescence) (Ho et al. 2018 10.1016/j.cels.2017.12.004, Weill et al. 2018 10.1038/s41592-018-0044-9, Meurer et al. 2018 10.1038/s41592-018-0045-8). This provides an opportunity to significantly strengthen the manuscript (where most of the test have been done using two abundant cytosolic proteins Bmh1 and Hxk1) if the authors could apply their approach to a representative fraction of the yeast proteome (sampling from such libraries FP-tagged proteins that differ in abundance, localization, membrane vs cytosolic/nuclear, subunits of large stable complexes vs proteins not part of complexes, etc.) and compare their quantification with previous relative abundance estimates. This information would also help future users in case protein-specific issues are identified.

      Indeed, Hxk1 and Bmh1 are quite strongly expressed (41,000 and 65,000 copies/cell, according to SGD, ____www.yeastgenome.org____). In the course of our experiments we were able to detect proteins with a much lower expression level (eg. Reg1, 4000 copies/cell). We have selected a number of proteins based on their expression level as detailed in SGD, ranging from 700 to 75000, and plan to detect the signal by IGF and compare it with published data on absolute protein quantifications for ecah protein. However, this will take a bit of time as each gene must be tagged with EGFP – we cannot use the GFP-S65T from the GFP collection which is poorly amenable to IGF because of its sensitivity to denaturation, as we show in our manuscript.

      The authors discuss several drawbacks, including the change in apparent molecular weight compared to denatured proteins; differential recognition of folded vs denatured proteins by antibodies.

      Other potentials drawbacks should be discussed. For instance, the need of additional steps post-fluorescence imaging for signal normalization against a loading control; the need of antibodies and immunoblotting to decide on the best denaturing temperature for a specific protein or FP tag; complexity of the native protein extraction protocol compared for example to alkaline lysis followed by TCA precipitation (Knop et al. 1999 PMID: 10407276).

      • Regarding the the need of additional steps post-fluorescence imaging for signal normalization against a loading control – this doesn’t take extra time for people using gels with protein stain included in the gel (eg. Stain-Free from BioRad). There are other possibilities of total protein fluorescent labeling that will be discussed. We will provide an example of this application.
      • On the need of antibodies and immunoblotting to decide on the best denaturing temperature for a specific protein or FP tag – we believe that the system could be set up for a protein of interest for which antibodies are available (as we did for Bmh1 or Hxk1), and once this is done, there is no need to do these controls anymore. We will mention this in the manuscript.
      • On the complexity of the native protein extraction protocol compared for example to alkaline lysis followed by TCA precipitation – indeed the protocol is a bit more time-consuming compared to the mentioned method, and we will mention this in the text. However, please note that people studying mammalian cells, for instance, often use this native protocol for total extracts so this is mostly a yeast-model issue. Yet, we will add this comment. Moreover, although the denatured fraction is FP- and temperature-dependent, even under the milder 30{degree sign}C conditions there is a detectable denatured fraction (Fig.s3b). This would seem to preclude the use of this approach for absolute protein quantification.

      True, but it depends a lot on the FP used. For instance, sfGFP is not denatured and could potentially be used for absolute quantification. We will comment in the text.

      Finally, any evidence that the denatured fraction would depend on the protein tagged with the FP?

      We will use several proteins used for point 1 but fused to the most sensitive FP, GFP-S65T, and do a western blot using anti-GFP antibodies to estimate the variation in native vs. denatured forms of the protein.

      Minor points: 1. In the experiments designed to test the linearity and sensitivity of the approach, an alternative approach that would not result in dilution of cell extract is to mix wild type cell extract (no GFP fusion) with extract of the GPF-tagged strain in different ratios.

      Yes, this was an alternative but it seemed that dilution was easier to control than mixing two extracts.

      Define all acronyms at first appearance. For example, DTT and LDS on page 4.

      Thank you, we will address all acronyms in the text.

      Fig.4D: the colors chosen to represent EGFP and sfGFP data make them hard to tell apart. The same comment to Fig.S6.

      Agreed, we will change the figures accordingly.

      As the temperature steps are not uniform in Figures 4 and 5, it would be more informative to indicate the exact temperate above each lane (in addition/instead of the ramp cartoon).

      Agreed, we will change the figures accordingly.

      Regarding linearity, that HRP-based quantification is not linear is expected. A fairer comparison would be to use fluorescently labeled secondary antibodies. It is also puzzling that detection with signal amplification (HRP) is less sensitive than direct quantification of the fluorescence signal from the FP tag.

      We will do a sensitivity tets (dilutions) to compare IGF with HRP-based and fluorescent-based antibody-mediated detection.

      I appreciate the workflow Figure 10. But in my opinion it is trying to show too much (protocol, troubleshooting, calls to figure panels). Perhaps it could be made clearer by separating the protocol steps/settings from the optimization/troubleshooting tips.

      Thank you, we will work on this to make the workflow clearer.

      Some of the discussion of different fluorescent proteins, and expression levels of tagged proteins, could be confounded by the different linkers used in the tagging constructs.

      Thank you for this remark. Indeed, there are various linkers on these constructs and we don’t know to which extent they contribute to the effect on protein expression level. We will comment his in the text.

      Significance

      Could be a generally useful and simple approach for in-gel quantification using fluorescent protein tags.

      __ ____Thank you for your comments and overall assesment.__


      Reviewer #2

      Evidence, reproducibility and clarity

      The present manuscript "Direct observation of fluorescent proteins in gels: a rapid cost-efficient, and quantitative alternative to immunoblotting" describes a method how to visualize bands of fluorescent protein fusions onto a common SDS-PAGE without antibody staining. It is based on ability of GFP-like fluorescent proteins (FPs) to retain their fluorescence under conditions of SDS-PAGE if step of extensive heating (boiling) of protein sample is omitted. This property of FPs is not novel; it was known for more than 20 years (for example, see Fig. 2 in Yanushevich et al. FEBS Lett. 2002 Jan 30, 511:11-4; Supporting Fig. 7 in Campbell et al. Proc Natl Acad Sci USA. 2002 Jun 11, 99:7877-82). However, the authors did perform a very accurate and robust study to quantitatively assess the behavior of several FP fusion protein in SDS-PAGE. A thorough analysis of different conditions for a variety of FPs and target proteins was done; detailed protocols were developed. A surprisingly high sensitivity of FP detection (even superior to that of standard Western blotting) was demonstrated. Considering the simplicity of the proposed approach, it appears to be the method of choice for those working with FP fusion proteins.

      Thank you for this comment. Indeed we do not claim to discover that FP remain fluorescent in mild denaturing conditions, as presented in the text. We did our best to include original publications showing precedent for this and we missed Yanushevich et al. FEBS Lett. 2002 that we will add. However the Campbell paper is cited, precisely for the Supplementary figure 7 that the reviewer mentions.

      I have only minor, discretionary comments:

      1. It is known that under conditions of SDS-PAGE without heating, FPs retain not only fluorescence but also their oligomeric state. The same can be true for proteins of interest (POIs). If so, even for monomeric FPs, the POI-FP band can potentially migrate much slower than expected because of oligomerization of the POI.

      __Thank you for this suggestion. Our data in the manuscript already show that Bmh1 and Bmh2, which are tighlty associated 14-3-3 proteins, no longer intereact in these mild denaturation conditions. In the set of proteins that we will use to answer to Reviewer #1 (point 1), we will include proteins in large complexes to assess whether this can happen. __

      It might be useful to briefly discuss a possibility to use other types of fluorescent proteins (namely, Flavin-binding FPs, bacteriophytochrome-based FPs, bilirubin-binding FP UnaG) in the same way as proposed here. In particular, biliverdin-binding near-infrared FPs (IFP, iRFP, etc.) can be detected even after fully denaturing SDS-PAGE by zinc-induced orange fluorescence of proteins carrying covalently attached bilin chromophore (Berkelman TR, Lagarias JC. Visualization of bilin-linked peptides and proteins in polyacrylamide gels. Anal Biochem. 1986, 156, 194-201; Stepanenko OV, Kuznetsova IM, Turoverov KK, Stepanenko OV. Impact of Double Covalent Binding of BV in NIR FPs on Their Spectral and Physicochemical Properties. Int J Mol Sci. 2022, 23, 7347).

      __Agreed. ____We will extend the discussion to other fluorescent approaches to visualize proteins in gels and compare them. __


      Significance

      A simple method of specific visualization of fluorescent protein fusion bands on SDS-PAGE is proposed.

      Thank you for your comments and overall assesment.

      Reviewer #3

      Evidence, reproducibility and clarity

      In this paper, Sanial et al present in-gel fluorescence detection (IGF), a method that allows the direct detection of fluorescent proteins from SDS-PAGE gels with minimal adaptation of existing protocols. The authors test a range of fluorescent proteins routinely used, especially when working with yeast, and describe their behavior in IGF. They identify heat-induced denaturation of fluorescent proteins as the main component influencing their assay and systematically test this on a selection of fluorescent proteins. Next, they compare the detection limit and the linearity of the signal between IGF and chemiluminescence, showing that IGF is not only comparable but also superior to chemiluminescence. This is particularly significant given that chemiluminescence can suffer from issues such as a limited dynamic range and limitations in accurately quantifying very low or high-abundance proteins. The authors further demonstrate the utility of IGF in co-immunoprecipitation experiments and test whether the mild denaturing conditions are compatible with proteins from other organisms. Overall, the study is well-presented and is an asset to the scientific community. I have one major and some minor comments that, in my opinion, would improve this already informative paper: Major comment 1. In all cases where there is signal quantification the authors should perform replicates to account for variability of the signal (in Fig 6, S6 and S7).

      __Agreed, we will perform triplicates for the indicated experiments. __

      Minor comments 1. The study mainly focuses on soluble protein. While the authors have tested one plasma membrane protein, the study would benefit from including more membrane proteins from different environments (e.g., cell wall, nuclear envelope, mitochondrial). This would help determine if incubation at higher temperatures is necessary to properly solubilize these proteins, in which case the experiment would need adaptation.

      Thank you for this suggestion. __In the set of proteins that we will use to answer to Reviewer #1 (point 1), we will include proteins from various subcellular locations. __

      The authors show that when fluorescent proteins are partially denatured, their migration behavior changes. One cannot exclude that in some cases, the tagged proteins themselves might also be partially resistant to denaturing at the low temperatures used for IGF. This would lead to more than one fluorescent bands. In such cases one should be careful with interpretation, especially in the context of PTMs or isoforms. Could the authors briefly discuss this?

      Thank you for this comment. __We will discuss this in the text. __

      Based on Fig 4D and 5D, some fluorescent proteins seem to have a higher signal variability between replicates than others. It would be helpful to add this information next to the behavior of the proteins in different temperatures so it would be easier to choose the fluorescent protein for specific experiments.

      __Indeed, there are variations between experiments, but it is not clear whether this inherent to the FP considered or the experiment. We will look back at the data and modify the text accordingly if pertinent. __

      The sensitivity experiment (Figure 6) is convincing and important for IP conditions, where the total protein concentration of the sample is radically decreased. Could the authors additionally test if very low abundant proteins can be detected (without any dilution of the total protein content), and compare this to chemiluminescence? This could be done either by tagging some very low abundant proteins (for example a few hundred copies per cell) or diluting the lysate in wild-type lysate to artificially reduce their concentration while maintaining the overall protein load the same.

      __We have planned an experiment in which low abundant proteins will be tagged in response to reviewer 1 (point 1) which should address this point. __

      It would be useful to address the detection of very high molecular weight proteins - or proteins that are problematic in terms of transfer during western blotting.

      Again, in the experiment planned in ____response to reviewer 1 (point 1), proteins or various MW as well as membrane proteins will be studied, which should address this point. __ __

      Significance

      The authors already discuss the strengths and limitations of their approach. The main strength of IGF is that it does not require transfer of the proteins to a membrane and also does not rely on antibody binding and (potential) chemical reactions. In addition to the fact that this is time, cost, equipment, waste and expertise effective, the sensitivity and signal linearity of IGF seems to not only compare but outperforme western blotting. There are two main limitations. First, IGF relies on the resilience to denaturing of the chosen fluorescent protein that depends, according to the authors, at least on the temperature and overall protein concentration and pH. Second, IGF relied on tagging proteins with fluorescent proteins which might affect the stability or even function of the tagged protein. As the authors mention, these factors do not diminish the value of IGF, they highlight the need for appropriate controls.

      A potential development of the technique (not at the present study) could be the compatibility of IGF with different self-labelling proteins (Halo, Snap) and fluorescent dyes.

      We have conducted experiments in which we show the applicability of IGF in combination to SNAP-tagging, that we could show if needed.

      I think IGF will benefit a rather broad range of scientists. As already mentioned by the authors, there are different applications of IGF. From checking of clones when creating strains, to comparison of protein levels in different conditions and coIP experiments.

      Thank you for your comments and overall assesment.

      Cross reviews. Reviewer 1: I agree with the assessment by Reviewer #2. Considering the comment about potential oligomerization of a protein of interest, I stand by my point about testing the method with more proteins of interest. How extensive this testing should be or whether additional discussion of possible issues would suffice is a matter of opinion. It is clear from the manuscript in it's current form that the method works and that it has caveats.

      We believed that the experiments we have planned will clarify these points.

      Reviewer 2: In general, I agree with the points raised by Reviewer #1. However, in my opinion, there is already a large body of reliable experimental results in the manuscript that are worth publishing without a new round of extensive experiments.

      Reviewer 1: Fair enough, I don't insist on the experiments in my point 1.

      We think that this is an important point that will likely be a common question for readers so we will still do our best to provide data for this point.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this paper, Sanial et al present in-gel fluorescence detection (IGF), a method that allows the direct detection of fluorescent proteins from SDS-PAGE gels with minimal adaptation of existing protocols. The authors test a range of fluorescent proteins routinely used, especially when working with yeast, and describe their behavior in IGF. They identify heat-induced denaturation of fluorescent proteins as the main component influencing their assay and systematically test this on a selection of fluorescent proteins. Next, they compare the detection limit and the linearity of the signal between IGF and chemiluminescence, showing that IGF is not only comparable but also superior to chemiluminescence. This is particularly significant given that chemiluminescence can suffer from issues such as a limited dynamic range and limitations in accurately quantifying very low or high-abundance proteins. The authors further demonstrate the utility of IGF in co-immunoprecipitation experiments and test whether the mild denaturing conditions are compatible with proteins from other organisms.

      Overall, the study is well-presented and is an asset to the scientific community. I have one major and some minor comments that, in my opinion, would improve this already informative paper:

      Major comment

      1. In all cases where there is signal quantification the authors should perform replicates to account for variability of the signal (in Fig 6, S6 and S7). Minor comments
      2. The study mainly focuses on soluble protein. While the authors have tested one plasma membrane protein, the study would benefit from including more membrane proteins from different environments (e.g., cell wall, nuclear envelope, mitochondrial). This would help determine if incubation at higher temperatures is necessary to properly solubilize these proteins, in which case the experiment would need adaptation.
      3. The authors show that when fluorescent proteins are partially denatured, their migration behavior changes. One cannot exclude that in some cases, the tagged proteins themselves might also be partially resistant to denaturing at the low temperatures used for IGF. This would lead to more than one fluorescent bands. In such cases one should be careful with interpretation, especially in the context of PTMs or isoforms. Could the authors briefly discuss this?
      4. Based on Fig 4D and 5D, some fluorescent proteins seem to have a higher signal variability between replicates than others. It would be helpful to add this information next to the behavior of the proteins in different temperatures so it would be easier to choose the fluorescent protein for specific experiments
      5. The sensitivity experiment (Figure 6) is convincing and important for IP conditions, where the total protein concentration of the sample is radically decreased. Could the authors additionally test if very low abundant proteins can be detected (without any dilution of the total protein content), and compare this to chemiluminescence? This could be done either by tagging some very low abundant proteins (for example a few hundred copies per cell) or diluting the lysate in wild-type lysate to artificially reduce their concentration while maintaining the overall protein load the same.
      6. It would be useful to address the detection of very high molecular weight proteins - or proteins that are problematic in terms of transfer during western blotting.

      Significance

      The authors already discuss the strengths and limitations of their approach. The main strength of IGF is that it does not require transfer of the proteins to a membrane and also does not rely on antibody binding and (potential) chemical reactions. In addition to the fact that this is time, cost, equipment, waste and expertise effective, the sensitivity and signal linearity of IGF seems to not only compare but outperforme western blotting. There are two main limitations. First, IGF relies on the resilience to denaturing of the chosen fluorescent protein that depends, according to the authors, at least on the temperature and overall protein concentration and pH. Second, IGF relied on tagging proteins with fluorescent proteins which might affect the stability or even function of the tagged protein. As the authors mention, these factors do not diminish the value of IGF, they highlight the need for appropriate controls.

      A potential development of the technique (not at the present study) could be the compatibility of IGF with different self-labelling proteins (Halo, Snap) and fluorescent dyes.

      I think IGF will benefit a rather broad range of scientists. As already mentioned by the authors, there are different applications of IGF. From checking of clones when creating strains, to comparison of protein levels in different conditions and coIP experiments.

      Keywords/field of expertise: yeast genetics, organelle homeostasis, biochemistry, molecular biology, cell biology, fluorescence microscopy, functional proteomics, gut microbiology

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The present manuscript "Direct observation of fluorescent proteins in gels: a rapid cost-efficient, and quantitative alternative to immunoblotting" describes a method how to visualize bands of fluorescent protein fusions onto a common SDS-PAGE without antibody staining. It is based on ability of GFP-like fluorescent proteins (FPs) to retain their fluorescence under conditions of SDS-PAGE if step of extensive heating (boiling) of protein sample is omitted. This property of FPs is not novel; it was known for more than 20 years (for example, see Fig. 2 in Yanushevich et al. FEBS Lett. 2002 Jan 30, 511:11-4; Supporting Fig. 7 in Campbell et al. Proc Natl Acad Sci USA. 2002 Jun 11, 99:7877-82). However, the authors did perform a very accurate and robust study to quantitatively assess the behavior of several FP fusion protein in SDS-PAGE. A thorough analysis of different conditions for a variety of FPs and target proteins was done; detailed protocols were developed. A surprisingly high sensitivity of FP detection (even superior to that of standard Western blotting) was demonstrated. Considering the simplicity of the proposed approach, it appears to be the method of choice for those working with FP fusion proteins.

      I have only minor, discretionary comments:

      1. It is known that under conditions of SDS-PAGE without heating, FPs retain not only fluorescence but also their oligomeric state. The same can be true for proteins of interest (POIs). If so, even for monomeric FPs, the POI-FP band can potentially migrate much slower than expected because of oligomerization of the POI.
      2. It might be useful to briefly discuss a possibility to use other types of fluorescent proteins (namely, Flavin-binding FPs, bacteriophytochrome-based FPs, bilirubin-binding FP UnaG) in the same way as proposed here. In particular, biliverdin-binding near-infrared FPs (IFP, iRFP, etc.) can be detected even after fully denaturing SDS-PAGE by zinc-induced orange fluorescence of proteins carrying covalently attached bilin chromophore (Berkelman TR, Lagarias JC. Visualization of bilin-linked peptides and proteins in polyacrylamide gels. Anal Biochem. 1986, 156, 194-201; Stepanenko OV, Kuznetsova IM, Turoverov KK, Stepanenko OV. Impact of Double Covalent Binding of BV in NIR FPs on Their Spectral and Physicochemical Properties. Int J Mol Sci. 2022, 23, 7347).

      Referee cross-commenting

      In general, I agree with the points raised by Reviewer #1. However, in my opinion, there is already a large body of reliable experimental results in the manuscript that are worth publishing without a new round of extensive experiments.

      Significance

      A simple method of specific visualization of fluorescent protein fusion bands on SDS-PAGE is proposed.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Sanial et al. carefully analyze the use of in-gel fluorescence as an alternative to immunoblotting. The authors show that simple modifications of common protein extraction protocols can preserve (to varying extents) fluorescent proteins in their native, fluorescent states. This can be exploited in different applications for in-gel fluorescence quantification, bypassing immunoblotting. The experimental results are clear, showcasing the ease and linearity of in-gel fluorescence quantification.

      In my opinion, the trick of this approach is also potentially its main drawback, the partial denaturation conditions. I think the manuscript could be strengthened with more extensive benchmarking of the approach and further discussion of potential caveats as detailed below.

      Major points:

      1. Protein abundance in the original GFP library (and in other FP-tagged libraries constructed in the meanwhile) have been quantified using fluorescence (flow cytometry, microscopy, colony fluorescence) (Ho et al. 2018 10.1016/j.cels.2017.12.004, Weill et al. 2018 10.1038/s41592-018-0044-9, Meurer et al. 2018 10.1038/s41592-018-0045-8). This provides an opportunity to significantly strengthen the manuscript (where most of the test have been done using two abundant cytosolic proteins Bmh1 and Hxk1) if the authors could apply their approach to a representative fraction of the yeast proteome (sampling from such libraries FP-tagged proteins that differ in abundance, localization, membrane vs cytosolic/nuclear, subunits of large stable complexes vs proteins not part of complexes, etc.) and compare their quantification with previous relative abundance estimates. This information would also help future users in case protein-specific issues are identified.
      2. The authors discuss several drawbacks, including the change in apparent molecular weight compared to denatured proteins; differential recognition of folded vs denatured proteins by antibodies.

      Other potentials drawbacks should be discussed. For instance, the need of additional steps post-fluorescence imaging for signal normalization against a loading control; the need of antibodies and immunoblotting to decide on the best denaturing temperature for a specific protein or FP tag; complexity of the native protein extraction protocol compared for example to alkaline lysis followed by TCA precipitation (Knop et al. 1999 PMID: 10407276).

      Moreover, although the denatured fraction is FP- and temperature-dependent, even under the milder 30{degree sign}C conditions there is a detectable denatured fraction (Fig.s3b). This would seem to preclude the use of this approach for absolute protein quantification.

      Finally, any evidence that the denatured fraction would depend on the protein tagged with the FP?

      Minor points:

      1. In the experiments designed to test the linearity and sensitivity of the approach, an alternative approach that would not result in dilution of cell extract is to mix wild type cell extract (no GFP fusion) with extract of the GPF-tagged strain in different ratios.
      2. Define all acronyms at first appearance. For example, DTT and LDS on page 4.
      3. Fig.4D: the colors chosen to represent EGFP and sfGFP data make them hard to tell apart. The same comment to Fig.S6.
      4. As the temperature steps are not uniform in Figures 4 and 5, it would be more informative to indicate the exact temperate above each lane (in addition/instead of the ramp cartoon).
      5. Regarding linearity, that HRP-based quantification is not linear is expected. A fairer comparison would be to use fluorescently labeled secondary antibodies. It is also puzzling that detection with signal amplification (HRP) is less sensitive than direct quantification of the fluorescence signal from the FP tag.
      6. I appreciate the workflow Figure 10. But in my opinion it is trying to show too much (protocol, troubleshooting, calls to figure panels). Perhaps it could be made clearer by separating the protocol steps/settings from the optimization/troubleshooting tips.
      7. Some of the discussion of different fluorescent proteins, and expression levels of tagged proteins, could be confounded by the different linkers used in the tagging constructs.

      Referee Cross-commenting

      This session contains comments from all reviewers.

      Reviewer 1: I agree with the assessment by Reviewer #2. Considering the comment about potential oligomerization of a protein of interest, I stand by my point about testing the method with more proteins of interest. How extensive this testing should be or whether additional discussion of possible issues would suffice is a matter of opinion. It is clear from the manuscript in it's current form that the method works and that it has caveats.

      Reviewer 2: In general, I agree with the points raised by Reviewer #1. However, in my opinion, there is already a large body of reliable experimental results in the manuscript that are worth publishing without a new round of extensive experiments.

      Reviewer 1: Fair enough, I don't insist on the experiments in my point 1.

      Significance

      Could be a generally useful and simple approach for in-gel quantification using fluorescent protein tags.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Thank you very much for your editorial handling of our manuscript entitled 'A conserved fungal Knr4/Smi1 protein is vital for maintaining cell wall integrity and host plant pathogenesis'. We have taken on board the reviewers' comments and thank them for their diligence and time in improving our manuscript.

      Please find our responses to each of the comments below.

      Reviewer(s)' comments

      Reviewer #1


      Major comments:


      __1.1. As a more critical comment, I find the presentation of the figures somewhat confusing, especially with the mixing of main figures, supplements to the main figures, and actual supplemental data. On top of that, the figures are not called up in the right order (e.g. Figure 4 follows 2D, while 3 comes after 4; Figure 6 comes before 5...), and some are never called up (I think) (e.g. Figure 1B, Figure 2B). __


      __Response: __The figure order has been revised according to the reviewer's suggestion, while still following eLife's formatting guidelines for naming supplementals. Thank you.

      1.2. I agree that there should be more CWI-related genes in the wheat module linked to the FgKnr4 fungal module, or, vice-versa, CW-manipulating genes in the fungal module. It would at least be good if the authors could comment further on if they find such genes, and if not, how this fits their model.


      Response: Thank you for your insightful suggestion regarding the inclusion of more CWI-related genes in the wheat module linked to the FgKnr4 fungal module F16, or vice versa. We did observe a co-regulated response between the wheat module W05 which is correlated to the FgKnr4 module F16. Namely, we observed an enrichment of oxidative stress genes including respiratory burst oxidases and two catalases (lines 304 - 313) in the correlated wheat module (W05). Early expression of these oxidative stress inducing genes likely induces the CWI pathway in the fungus, which is regulated by FgKnr4. Knr4 functions as both a regulatory protein in the CWI pathway and as a scaffolding protein across multiple pathways in S. cerevisiae (Martin-Yken et al., 2016, https://onlinelibrary.wiley.com/doi/10.1111/cmi.12618 ). Scaffolding protein-encoding genes are typically expressed earlier than the genes they regulate to enable pre-assembly with their interacting partners, ensuring that signaling pathways are ready to activate when needed. In this context, the CWI integrity MAPKs Bck1 and Mkk1 are part of module F05, which includes two chitin synthases and a glucan synthase. This module is highly expressed during the late symptomless phase. The MAPK Mgv1, found in module F13, is expressed consistently throughout the infection process, which aligns with the expectation that MAPKs are mainly post-transcriptionally regulated. Thank you for bringing our attention to this, this is now included in the discussion (lines 427 - 443) along with eigengene expression plots of all modules added to the supplementary (Figure 3 - figure supplement 1).

      To explore potential shared functions of FgKnr4 with other genes in its module, we re-analyzed the high module membership genes within module F16, which includes FgKnr4, using Knetminer (Hassani-Pak et al., 2021; https://onlinelibrary.wiley.com/doi/10.1111/pbi.13583 ). This analysis revealed that 8 out of 15 of these genes are associated with cell division and ATP binding. Four of the candidate genes are also part of a predicted protein-protein interaction subnetwork of genes within module F16, which relate to cell cycle and ATP binding. In S. cerevisiae, the absence of Knr4 results in cell division dysfunction (Martin-Yken et al., 2016, https://onlinelibrary.wiley.com/doi/10.1111/cmi.12618 ). Accordingly, we tested sensitivity of ΔFgknr4 to microtubule inhibitor benomyl (a compound commonly used to identify mutants with cell division defects; Hoyt et al., 1991 https://www.cell.com/cell/pdf/0092-8674(81)90014-3.pdf). We found that the ΔFgknr4 mutant was more susceptible to benomyl, both when grown on solid agar and in liquid culture. This data has now been added Figure 7, and referred to in lines 338-348.

      __Specific issues: __


      1.3. In the case of figure 5, I generally find it hard to follow. In the text (line 262/263), the authors state that 5C shows "eye-shaped lesions" caused by ΔFgknr4 and ΔFgtri5, but I can't see neither (5C appears to be a ΔFgknr4 complementation experiment). The figure legend also states nothing in this regard.

      __Response: __Thank you for your suggestion. We have amended the manuscript to include an additional panel that shows the dissected spikelet without its outer glumes, making the eye shaped diseased regions more visible in Figure 5.

      __1.4. Figure 5D supposedly shows 'visibly reduced fungal burden' in ΔFgknr4-infected plants, but I can't really see the fungal burden in this picture, but the infected section looks a lot thinner and more damaged than the control stem, so in a way more diseased. __


      Response: __Thank you for your insight. We have revised our conclusions based on this image to state that while ΔFgknr4 can colonise host tissue, it does so less effectively compared to the wild-type strain as we are unable to quantitatively evaluate fungal burden using image-colour thresholding due to the overlapping colours of the fungal cells and wheat tissues. Decreased host colonisation is evidenced by (i) reduced fungal hyphae proliferation, particularly in the thicker adaxial cell layer, (ii) collapsed air spaces in wheat cells, and (iii) increased polymer deposition at the wheat cell walls, indicating an enhanced defence response. __Figure 5 has been amended to include these observations in the corresponding figure legend and the resin images now include insets with detailed annotation.

      __1.5. The authors then go on to state (lines 272-273) that they analyzed the amounts of DON mycotoxin in infected tissues, but don't seem to show any data for this experiment. __

      Response: __We have amended this to now include the data in __Figure 5 - figure supplement 2B, thank you.

      Reviewer #2


      __Major issues: __


      2.1 If Knf4 is involved in the CWI pathway, what other genes involved in the CWI pathway are in this fungal module? one of the reasons for developing modules or sub-networks is to assign common function and identify new genes contributing to the function. since FgKnr4 is noted to play a role in the CWI pathways, then genes in that module should have similar functions. If WGCN does not do that, what is the purpose of this exercise?


      Response: __Thank you for raising this point regarding the role of FgKnr4 in the CWI pathway and the expectations for genes of shared function within the FgKnr4 module F16. We did observe that the module containing FgKnr4 (F16) was also correlated to a wheat module (W05) which was significantly enriched for oxidative stress genes. This pathogen-host correlated pattern led us to study module F16, which otherwise lacks significant gene ontology term enrichment, unique gene set enrichments, and contains few characterised genes. This is now highlighted in __lines 233-246. This underscores the strength of the WGCNA. By using high-resolution RNA-seq data to map modules to specific infection stages, we identified an important gene that would have otherwise been overlooked. This approach contrasts with other network analyses that often rely on the guilt-by-association principle to identify novel virulence-related genes within modules containing known virulence factors, potentially overlooking significant pathways outside the scope of prior studies. Therefore, our analysis has already benefited from several advantages of WGCNA, including the identification of key genes with high module membership that may be critical for biological processes, as well as generating a high-resolution, stage-specific co-expression map of the F. graminearum infection process in wheat. This point is now emphasised in lines 233-252. As discussed in response to reviewer 1, Knr4 functions as both a regulatory protein in the CWI pathway and as a scaffolding protein across multiple pathways in S. cerevisiae (Martin-Yken et al., 2016, https://onlinelibrary.wiley.com/doi/10.1111/cmi.12618 ) which would explain its clustering separate from the CWI pathway genes. The high module membership genes within module F16 containing FgKnr4 were re-analysed using Knetminer (Hassani-Pak et al., 2021; https://onlinelibrary.wiley.com/doi/10.1111/pbi.13583 ), which found that 8/15 of these genes were related to cell division and ATP binding. Four of the candidate genes are also part of a predicted protein-protein interaction subnetwork of genes within module F16, which relate to cell cycle and ATP binding. In S. cerevisiae, the absence Knr4 leads to dysfunction in cell division. Accordingly, we tested sensitivity of ΔFgknr4 to the microtubule inhibitor benomyl (a compound commonly used to identify mutants with cell division defects; Hoyt et al., 1991 https://www.cell.com/cell/pdf/0092-8674(81)90014-3.pdf). We found that the ΔFgknr4 mutant was more susceptible to benomyl, both when grown on solid agar and in liquid culture. This data has now been added as Figure 7 and referred to in lines 338-348.


      2.2. Due to development defects in the Fgknr1 mutant, I would not equate to as virulence factor or an effector gene.


      __Response: __We are in complete agreement with the reviewer and are not suggesting that FgKnr4 is an effector or virulence factor, we have been careful with our wording to indicate that FgKnr4 is simply necessary for full virulence and its disruption results in reduced virulence and have outlined how we believe FgKnr4 participates in a fungal signaling pathway required for infection of wheat.


      2.3. What new information is provided with WGCN modules compared with other GCN network in Fusarium (examples of GCN in Fusarium is below) ____https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5069591/ https://doi.org/10.1186/s12864-020-6596-y____ DOI: 10.1371/journal.pone.0013021. The GCN networks from Fusarium have already identified modules necessary/involved in pathogenesis.

      Response: __The 2016 New Phytologist gene regulatory network (GRN) by Guo et al. is large and comprehensive. However, only three of the eleven datasets are in planta, with just one dataset focusing on F. graminearum infection on wheat spikes. The other two in planta datasets involve barley infection and Fusarium crown rot. By combining numerous in planta and in vitro datasets, the previous GRNs lack the fine resolution needed to identify genetic relationships under specific conditions, such as the various stages of symptomatic and symptomless F. graminearum infection of mature flowering wheat plants. This limitation is highlighted in the 2016 paper itself. This network is expanded in the Guo et al., 2020 BMC genomics paper where it includes one additional in planta and nine in vitro datasets. However, the in planta dataset involves juvenile wheat coleoptile infection, which serves as an artificial model for wheat infection but is not on mature flowering wheat plants reminiscent of Fusarium Head Blight of cereals in the field. This model differs significantly in the mode of action of F. graminearum, notably DON mycotoxin is not essential for virulence in this context (Armer et al. 2024, https://pubmed.ncbi.nlm.nih.gov/38877764/ ). The Guo et al., 2020 paper still faces the same issues in terms of resolution and the inability to draw conclusions specific to the different stages of F. graminearum infection. Additionally, these GRNs use Affymetrix data, which miss over 400 genes (~ 3 % of the genome) from newer gene models. In contrast, our study addresses these limitations by analysing a meticulously sampled, stage- and tissue-specific in planta RNA-seq dataset using the latest reference annotation. Our approach provides higher resolution and insights into host transcriptomic responses during the infection process. The importance of our study in the context of these GRNs is now addressed in the introduction (__lines 85-92).


      2.4. Ideally, the WGCN should have been used identify plant targets of Fusarium pathogenicity genes. This would have provided credibility and usefulness of the WGCN. Many bioinformatic tools are available to identify virulence factors and the utility of WGCN in this regard is not viable. However, if the authors had overlapped the known virulence factors in a fungal module to a particular wheat module, the impact of the WGCN would be great. The module W12 has genes from numerous traits represented and WGCN could have been used to show novel links between Fg and wheat. For example, does tri5 mutant affect genes in other traits?

      __Response: __Thank you for your suggestions. In this study we have shown the association between the main fungal virulence factor of F. graminearum, DON mycotoxin, with wheat detoxification responses. Through this we have identified a set of tri5 responsive genes and validated this correlation in two genes belonging to the phenylalanine pathway and one transmembrane detoxification gene. Although we could validate more genes in this tri5 responsive wheat module, our paper aimed to investigate previously unstudied aspects of the F. graminearum infection process and how the fungus responded to changing conditions within the host environment. We accomplished this by characterising a gene within a fungal module that had limited annotation enrichment and few characterised genes. Tri5 on the other hand is the most extensively studied gene in F. graminearum and while the network we generated may offer new insights into tri5 responsive genes, this is beyond the scope of our current study. In addition to the tri5 co-regulated response, we have also demonstrated the coordinated response between the fungal module F16, which contains FgKnr4 that is necessary for tolerance to oxidative stress, and the wheat module W05, which is enriched for oxidative stress genes.


      While our co-expression network approach can be used to explore and validate other early downstream signaling and defense components in wheat cells, several challenges must be considered: (a) the poor quality of wheat gene calls, (b) genetic redundancy due to both homoeologous genes and large gene families, and (c) the presence of DON, which can inhibit translation and prevent many transcriptional changes from being realised within the host responses. Additionally, most plant host receptors are not transcriptionally upregulated in response to pathogen infection (most R gene studies for the NBS-LRR and exLRR-kinase classes), making their discovery through a transcriptomics approach unlikely. These points will be included in our discussion (lines 408-413), thank you.

      Specific issues

      • *

      2.5. Since tri5 mutant was used a proof of concept to link wheat/Fg modules, it would have been useful to show that TRI14, which is not involved DON biosynthesis, but involved in virulence ( https://doi.org/10.3390/applmicrobiol4020058____) impact the wheat module genes.


      Response: __Our goal was to show that wheat genes respond to the whole TRI cluster, not just individual TRI genes. Therefore, the tri5 mutant serves as a solid proof-of-concept, because TRI5 is essential for DON biosynthesis, the primary function of the TRI gene cluster, thereby representing the function of the cluster as a whole. This is now clarified in __lines 217-219. Additionally, the uncertainties surrounding other TRI mutants would complicate the question we were addressing-namely, whether a wheat module enriched in detoxification genes is responding to DON mycotoxin, as implied by shared co-expression patterns with the TRI cluster. For instance, the referenced TRI14 paper indicates that DON is produced in the same amount in vitro in a single media. Although the difference is not significant, the average DON produced is lower for the two Δtri14 transformants tested. Therefore, we cannot definitively rule out that TRI14 is involved in DON biosynthesis and extrapolate this to DON production in planta. Despite this, the suggestion is interesting, and would make a nice experiment but we believe it does not contribute to the overall aim of this study.

      2.6. Moreover, prior RNAseq studies with tri5 mutant strain on wheat would have revealed the expression of PAL and other phenylpropanoid pathway genes?

      __Response: __We agree that this would be an interesting comparison to make but unfortunately no dataset comparing in planta expression of the tri5 mutant within wheat spikes exists.

      2.7. Table S1 lists 15 candidate genes of the F16 module; however, supplementary File 1 indicates 74 genes in the same module. The basis of exclusion should be explained. The author has indicated genes with high MM was used as representative of the module. The 59 remaining genes of this module did not meet this criteria? Give examples.


      Response: __The 15 genes with the highest module membership were selected as initial candidates for further shortlisting from the 74 genes within module F16. In WGCNA, genes with high module membership (MM) (i.e. intramodular connectivity) are predicted to be central to the biological functions of the module (Langfelder and Horvath, 2008; https://bmcbioinformatics.biomedcentral.com/articles/10.1186/1471-2105-9-559 ) and continues to be a metric to identify biologically significant genes within WGCN analyses (https://bmcplantbiol.biomedcentral.com/articles/10.1186/s12870-024-05366-0 Tominello-Ramirez et al., 2024; https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9151341/ ;Zheng et al., 2022; https://www.nature.com/articles/s41598-020-80945-3 Panahi and Hejazi et al 2021). Following methods by Mateus et al. (2019) (https://academic.oup.com/ismej/article/13/5/1226/7475138 ) key genes were defined as those exhibiting elevated MM within the module, which were also strongly correlated (R > |0.70|) with modules of the partner organism (wheat). We have clarified this point in the manuscript. Thank you for the suggestion. (__Lines 253-263).

      2.____8. A list from every module that pass this criteria will be useful resource for functional characterization studies.


      __Response: __A supplementary spreadsheet has been generated which includes full lists of the top 15 genes with the highest module membership within the five fungal modules correlated to wheat modules and a summary of shared attributes among them. Thank you for this suggestion.

      2.9. Figure 3 indicates TRI genes in the module F12; your PHI base in Supp File S2 lists only TRI14. Why other TRI genes such as TRI5 not present in this File?


      Response: For clarity, the TRI genes in module F12 are TRI3, TRI4, TRI11, TRI12, and TRI14 which was stated in Table 1. TRI5 clusters with its neighboring regulatory gene TRI6 in module F11, which exhibits a similar but reduced expression pattern compared to module F12. To improve clarity on this the TRI genes in module F12 are also listed in-text in line 168 and added to Figure 4. The enrichment and correlated relationship of W12 to a cluster's expression still imply a correlated response of the wheat gene to the TRI cluster's biosynthetic product (DON), which is absent in the Δtri5 mutant.

      TRI14 and TRI12 are listed in PHI-base. TRI12 was mistakenly excluded due to an unmapped Uniprot ID, which were added separately in the spreadsheet. We will recheck all unmapped ID lists to ensure all PHI-base entries are included in the final output. Thank you for pointing out this error.


      2.10. What is purpose of listing the same gene multiple times? Example, osp24 (a single gene in Fg) is listed 13 times in F01 module.


      __Response: __This is a consequence of each entry having a separate PHI ID, which represents different interactions including inoculations on different cultivar. Cultivar and various experimental details were omitted from the spreadsheet to reduce information density, however the multiple PHI base ID's will be kept separate to make the data more user friendly when working with the PHI-base database. An explanation for this is now provided in the file's explanatory worksheet, thank you.

      Reviewer #3:


      3.1. Why only use of high confidence transcripts maize to map the reads and not the full genome like Fusarium graminearum? I have never analyzed plant transcriptome.


      __Response: __ In the wheat genome, only high-confidence gene calls are used by the global community (Choulet et al., 2023; https://link.springer.com/chapter/10.1007/978-3-031-38294-9_4 ) until a suitable and stable wheat pan-genome becomes available.

      3.2. The regular output of DESeq are TPMs, how did the authors obtain the FPKM used in the analysis?


      Response: FPKM was calculated using the GenomicFeatures package and included on GitHub to enhance accessibility for other users. However, the input for WGCNA and this study as a whole was normalised counts rather than FPKM. The FPKM analysis was done to improve interoperability of the data for future users and made available on Github. To complement this, the information regarding FPKM calculation is now included in the methods section of the revised manuscript (line 491).

      3.3. Do the authors have a Southern blot to prove the location of the insertion and number of insertions in Zymoseptoria tritici mutant and complemented strains?


      __Response: __No, but the phenotype is attributed to the presence or absence of ZtKnr4, as the mutant was successfully complemented in multiple phenotypic aspects. This satisfies Koch's postulates which is the gold standard for reverse genetics experimentation (Falkow 1988; https://www.jstor.org/stable/4454582 ).

      __3.4. Boxplots and bar graphs should have the same format. In Figures 5 B and F and supplementary figure 6.3 the authors showed the distribution of samples but it is lacking in figure 3 B and all bar graphs. __


      __Response: __Graphs have been modified to display the distribution of all samples, thank you.

      3.5. Line 247 FGRAMPH1_0T23707 should be FGRAMPH1_01T23707


      __Response: __Thank you this has now been amended.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The authors of the manuscript entitled "A conserved fungal Knr4/Smi1 protein is vital for maintaining cell wall integrity and host plant pathogenesis" used a weighted gene co-expression network to identify Fusarium graminearum genes highly expressed during early symptomless infection of wheat. Based on its sequence and previous studies, authors selected FgKnr4 from the early symptomless Fusarium modules. The characterization of knockout strains revealed a role in morphogenesis, growth, cell wall stress tolerance, and virulence in F. graminearum and the phylogenetically distant fungus Zymoseptoria tritici.

      The methods are properly described and statistical analysis are reasonable so reproducibility is possible. The RNA-seq dataset is already published and the authors provided a repository with the code used to create the co-expression network. However, I have the following questions:

      • Why only use of high confidence transcripts maize to map the reads and not the full genome like Fusarium graminearum? I have never analyzed plant transcriptome.
      • The regular output of DESeq are TPMs, how did the authors obtain the FPKM used in the analysis?
      • Do the authors have a southern blot to prove the location of the insertion and number of insertions in Zymoseptoria tritici mutant and complemented strains?
      • Boxplots and bar graphs should have the same format. In Figures 5 B and F and supplementary figure 6.3 the authors showed the distribution of samples but it is lacking in figure 3 B and all bar graphs.
      • Line 247 FGRAMPH1_0T23707 should be FGRAMPH1_01T23707

      Referees cross-commenting

      I agree with reviewer 1, the order in which the figures are called in the text is confusing. Regardless of figures 5C-D I am no expert in the field therefore I can only say they look like they have not been edited.

      I agree with reviewer 1, data of DON mycotoxin production in infected issues is need it to support statement in line 272-273.

      I agree with Reviewer 2, the criteria to exclude genes from the final selection list should be explained.

      Significance

      The study showed, once again, that a weighted gene co-expression network is a great method to identify new genes of interest regardless of the organism or condition even if not very popular in the fungal pathogen field yet. The study proved that functions identified in a WGCN module from a pathogen have their opposite in the host module. The authors go beyond the theory and demonstrate the effect of the highest expressed gene during the early symptomless stage of infection in maize and wheat fungal pathogens.

      Fungal pathogen, RNA-seq, metabolic models, metabolism, comparative genomics

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: The authors in this manuscript use "dual weighting" to identify clusters or modules of genes from the fungus F. graminearum (Fg) with coordinated expression patterns with genes in wheat modules - potentially uncover key regulators or pathways linking Fg genes with plant traits, including plant pathogenesis. As proof of concept, the authors use one of the fungal genes FgKnr4 identified in a fungal module that has strong link with the wheat module. They were able to show that this gene is likely involved in CWI pathway and affects virulence properties of the fungus

      Major comments:

      Does the WGCN provide useful framework to link fungal genes affecting plant traits? If Knf4 is involved in the CWI pathway, what other genes involved in the CWI pathway are in this fungal module? This is not forthcoming. Due to development defects in the Fgknr1 mutant, I would not equate to as virulence factor or an effector gene.

      Since tri5 mutant was used a proof of concept to link wheat/Fg modules, it would have been useful to show that TRI14, which is not involved DON biosynthesis, but involved in virulence ( https://doi.org/10.3390/applmicrobiol4020058) impact the wheat module genes. Moreover, prior RNAseq studies with tri5 mutant strain on wheat would have revealed the expression of PAL and other phenylpropanoid pathway genes?

      Table S1 lists 15 candidate genes of the F16 module; however, supplementary File 1 indicates 74 genes in the same module.

      The basis of exclusion should be explained. The author has indicated genes with high MM was used as representative of the module. The 59 remaining genes of this module did not meet this criteria? Give examples. Did similar exclusion criteria used for other modules and if so, how many genes in each module pass the criteria? For example, Did TRI5 in module F12 pass this criteria. A list from every module that pass this criteria will be useful resource for functional characterization studies.

      Minor comments:

      Figure 3 indicates TRI genes in the module F12; your PHI base in Supp File S2 lists only TRI14. Why other TRI genes such as TRI5 not present in this File? What is purpose of listing the same gene multiple times? Example, osp24 (a single gene in Fg) is listed 13 times in F01 module.

      Referees cross-commenting

      agree with both reviewers regarding clarification of Figures.

      one of the reasons for developing modules or sub-networks is to assign common function and identify new genes contributing to the function. since FgKnr4 is noted to play a role in the CWI pathways, then genes in that module should have similar functions. If WGCN does not do that, what is the purpose of this exercise?

      Significance

      What new information is provided with WGCN modules compared with other GCN network in Fusarium (examples of GCN in Fusarium is below)

      https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5069591/ https://doi.org/10.1186/s12864-020-6596-y DOI: 10.1371/journal.pone.0013021

      The GCN networks from Fusarium have already identified modules necessary/involved in pathogenesis. Ideally, the WGCN should have been used identify plant targets of Fusarium pathogenicity genes. This would have provided credibility and usefulness of the WGCN.

      Many bioinformatic tools are available to Identify virulence factors and the utility of WGCN in this regard is not viable. However, if the authors had overlapped the known virulence factors in a fungal module to a particular wheat module, the impact of the WGCN would be great. The module W12 has genes from numerous traits represented and WGCN could have been used to show novel links between Fg and wheat. For example, does tri5 mutant affect genes in other traits?

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      A public mRNA-seq dataset from Dilks et al. (2019) for wheat spikelets infected by Fusarium graminearum was used to generate a dual weighted gene co-expression network (WGCN). Since colonization of the spike by F. graminearum progresses from spikelet to spikelet, thereby forming an infection-gradient from early to late stages, quasi spatio-temporal resolution for the transcriptomic dataset can be achieved by cutting the spike into equal pieces along this gradient (in this case cuts were done at rachis internodes 1-2, 3-4, 5-6, and 7-8. The authors created co-expression networks for both, fungal and plant genes, and cross-correlated them. They identify several modules specific for each infection stage. For further analysis, the authors focus on two module pairs. (1) the wheat module 12 (W12), which correlates to Fusarium module 12 (F12), and (2) the Fusarium module 16 (F16) and the correlated wheat modules 1 and 5 (W01/W05). The W12/F12 modules were deemed of interest because they were specific to the transition from symptomless to symptomatic infection stage. Here, the authors find genes related to mycotoxin production to be upregulated in the F12 module, while the W12 is enriched in genes involved in detoxification. F16 and W01/W05 are specific to the earliest stages of infection, and thus most likely involved in fungal virulence. Here, one of the key genes identified is FgKnr4, which the authors show to be important for fungal virulence, as gene knockout leads to a premature stop of disease progression. As the authors show that FgKnr4 is involved in activating cell wall-integrity mechanisms, and may function in oxidative stress-resistance, this reduced virulence may be the result a reduced ability of the fungus to withstand plant defense mechanisms. Interestingly, knocking out an orthologue of FgKnr4 in Zymoseptoria tritici led to similarly reduced virulence of this pathogenic fungus on wheat plant.

      Comments:

      Overall, I find the WGCN analysis to be very interesting and informative, especially because of the different stages of infection. As the dataset is made public (I believe), I think that this will be a really important resource for the community. The exemplary functional analysis of the F16/W01/W05 modules via FgKnr4 is very interesting and demonstrates that novel genes involved in virulence can be identified via this approach. A similar more detailed analysis of the W12/F12 modules with a focus on detoxification mechanisms in the plant (i.e. the W12 module) would be a very interesting bonus, but as much as I would be interested in reading about it, functional gene analyses in wheat are obviously time-consuming, and it is not essential to this manuscript. As a more critical comment, I find the presentation of the figures somewhat confusing, especially with the mixing of main figures, supplements to the main figures, and actual supplemental data. On top of that, the figures are not called up in the right order (e.g. Figure 4 follows 2D, while 3 comes after 4; Figure 6 comes before 5...), and some are never called up (I think) (e.g. Figure 1B, Figure 2B). In the case of figure 5, I generally find it hard to follow. In the text (line 262/263), the authors state that 5C shows "eye-shaped lesions" caused by ΔFgknr4 and ΔFgtri5, but I can't see neither (5C appears to be a ΔFgknr4 complementation experiment). The figure legend also states nothing in this regard. Figure 5D supposedly shows 'visibly reduced fungal burden' in ΔFgknr4-infected plants, but I can't really see the fungal burden in this picture, but the infected section looks a lot thinner and more damaged than the control stem, so in a way more diseased. The authors then go on to state (lines 272-273) that they analyzed the amounts of DON mycotoxin in infected tissues, but don't seem to show any data for this experiment. In contrast to the sometimes confusing data presentation, I find the table of correlated modules (table 1) very helpful, and obviously am happy to see that all data is available in the first author's GitHub account.

      Referees cross-commenting

      just to clarify in regards to my comment on Figures 5C-D, and Reviewer #3's comment "Regardless of figures 5C-D I am no expert in the field therefore I can only say they look like they have not been edited." - I didn't want to insinuate that the images have been edited. Based on the images provided, I just can't see what the authors state is shown. So this is not about editing/manipulation - just about image quality/choice. The phenotypic descriptions by the authors are quite detailed ("eye-shaped lesions", 'visibly reduced fungal burden'...), but at least for me, the images aren't good enough to illustrate and underpin their statements. Maybe better images are needed, maybe magnifications of the exact regions showing the phenotypes? But this is simply a matter of presentation, not of editing/manipulation.

      Second, I agree that there should be more CWI-related genes in the wheat module linked to the FgKnr4 fungal module, or, vice-versa, CW-manipulating genes in the fungal module. It would at least be good if the authors could comment further on if they find such genes, and if not, how this fits their model.

      Significance

      In summary, I think that the presented WGCN analysis of mRNA-seq data with quasi-spatio-temporal resolution is a very helpful approach to identify novel fungal virulence and plant immunity genes, and with the created datasets made public, this will be an interesting and valuable resource for the community. The identification and functional analysis of FgKnr4 works as proof-of-principle. If the data presentation is improved, I believe that this will be an interesting publication.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number:

      RC-2024-02569

      Corresponding author(s): Mary O'Riordan, Teresa O'Meara

      1. General Statements

      We thank the reviewers for their positive feedback, highlighting the significance and novelty of our work, especially regarding the novel functions of IRE1a in regulating phagosome biology during infection. We also appreciate some overarching themes that were focused on by multiple reviewers, including the role of XBP1S protein and RIDD activity, which we have addressed here. We have also added additional data, made adjustments to data presentation, and added clarifying language to address concerns from Reviewer 3. We appreciate these constructive suggestions and include our planned experiments to address reviewer concerns here. Our specific responses to the reviewer comments are below.

      Specific figures used in the response to reviewers are in the attached file as they cannot be pasted here.

      2. Description of the planned revisions

      Reviewer 1:

      1) The demonstration of protein misfolding independent IRE1 activation should also be demonstrated using molecules such as TUDCA or 4PBA that should be innocuous regarding the splicing of XBP1s. It would also be interesting to evaluate the activation of the other arms of the UPR in particular through the phosphorylation of eIF2a, expression of ATF4 and cleavage of ATF6.

      We appreciate the suggestion to strengthen our data regarding protein misfolding-independent activation of IRE1 more robust. We note that canonical UPR transcriptional targets are not induced during C. albicans infection (Fig. 2G,H), suggesting that IRE1 is activated in the absence of a standard unfolded protein response. However, we agree that we can use additional chemical chaperones to assay this. To address this point, we will perform the suggested experiments in the presence or absence of TUDCA with C. albicans, LPS, thapsigargin, and tunicamycin. As 4PBA has been shown to inhibit protein synthesis, rather than promoting protein folding or preventing aggregation (PMC9741500), we will avoid using this compound for these assays.

      We will also perform western blots for ATF6 cleavage and eIF2a phosphorylation, although we note that eIF2a can be phosphorylated by multiple kinases and can be triggered by nutrient deprivation or changes in intracellular calcium, both of which occur during C. albicans infection (glucose: PMC6709535; calcium: data within this manuscript).

      3) The authors use thioflavin to evaluate the extent of protein misfolding. This type of stain can lead to artefactual results and in general it is rather safer to test several stainers (see for instance the work presented in PMC10720158)

      We thank the reviewer for this suggestion. We have previously tried Proteostat staining as an additional method to measure protein misfolding, but we found that it bound strongly to the C. albicans cell wall, which would result in a strong false positive signal that is not indicative of host protein misfolding (see below). Congo Red, an additional dye used in the listed reference, is also known to bind to C. albicans and perturbs cell wall synthesis (PMC266468), therefore we have avoided these dyes.

      However, to address this point, we will perform experiments utilizing poly-ubiquitin blotting, as in the suggested reference, as an orthogonal readout of protein misfolding during C. albicans infection or treatment with LPS, depleted zymosan, and thapsigargin.

      __Figure legend: Proteostat staining with _C. albicans_ infection. __Macrophages were infected with C. albicans, and subsequently stained with Proteostat to measure protein misfolding. Proteostat bound and displayed strong fluorescence on the C. albicans cell wall.

      6) The whole study relies on the use of IRE1deltaR to impair IRE1 signaling. The authors should validate their hypothesis with an orthogonal approach, for instance with IRE1 pharmacological inhibitors (eg MKC8866 or KIRA8).

      We consider the use of genetic perturbation of IRE1 to be a strength of this manuscript, as IRE1 inhibitors have been shown to cause off-target effects (KIRA8: PMC9600248). However, to address this point, we will attempt to replicate important phenotypes, including the effect of IRE1 on calcium flux and phagolysosome fusion, using MKC8866 and KIRA8 as representative inhibitors.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      __Reviewer 1: __

      5) The authors focus on the IRE1/XBP1s signaling arm of the UPR but do not explore RIDD activity which has been linked to several infection mechanisms and lysosomal integrity (in particular by regulating the expression of BLOS1 - see PMC9119680 and PMC6446841). The authors should definitely evaluate how RIDD is activated (or not) in their experimental systems.

      We thank the reviewer for this suggestion, as we have considered potential effects of RIDD when analyzing our RNA-seq data, and are aware of the potential links between IRE1, BLOS1 (encoded by Bloc1s1) expression, and lysosome perturbations. We now add additional figures to our supplemental data (Fig. S3C-D; also shown below) showing that established RIDD targets, including Bloc1s1 are not depleted during C. albicans infection, and also not increased in IRE1 null macrophages. We add the following text to describe these findings (lines 322-326): "Additionally, we did not observe depletion of published RIDD targets (14, 65, 66) during C. albicans infection in WT macrophages (Fig. S3C; Table S2), nor increased expression of RIDD targets in IRE1ΔR macrophages, compared to IRE1 WT macrophages (Fig. S3D; Table S1.1), suggesting minimal RIDD activity during C. albicans infection." We also note that experiments with LysoSensor (Fig. 3E) suggested lysosome biogenesis is not impaired in IRE1 null macrophages. Therefore, we expect RIDD activity has negligible effects on our reported phenotypes.

      Reviewer #1 (Significance (Required)):

      The manuscript is interesting and highlights novel aspects towards the interaction between macrophages and a pathogen, candida albicans, involving the likely selective activation of IRE1. The data are novel and experimentally sound. Several controls are however missing.

      The strengths of the study are associated with the novelty of the findings, with the links that could potentially derive from this study to connect ER biology, UPR signaling and phagosome maturation

      The main weaknesses are associated i) with the fact that the authors did not evaluate RIDD activity which has already been linked with pathogen infection and with lysosome integrity, ii) with methodological aspects, in particular regarding the demonstration of the IRE1 activation independent on protein misfolding and the sole use of a genetic variant of IRE1 to test their hypotheses

      We thank Reviewer 1 for their constructive feedback and for noting the novelty of our findings. We believe that the data we have added regarding RIDD activity and our planned experiments to address additional concerns will add additional evidence to support our findings.

      Reviewer 2:

      1. A point that should be addressed with more detail is the correlation of fungal killing with Ca2+ fluxes and Ire1α activity, given the well-known data regarding the strong ability of the axis dectin/SYK/phospholipase Cγ to induce Ca2+ transients, a response not shared by LPS signaling, and the sequential activation of mitochondrial Ca2+ uniporter (MCU), which is a critical element of fungal killing associated with the citrate-pyruvate shuttle as a NADPH source (Seegren et al., Cell Rep. 33: 108411, 2020). Incidentally, this paper is referred in ref. 46 as a preprint, although it was accessible in Cell Reports in 2020.

      This is an excellent suggestion; we have added this topic to our discussion (lines 605-608) and have corrected the citation.

      The assay of the expression of V-ATPase complex, mitochondrial calcium uniporter, and mitochondrial uptake 1 and 2 could shed light on the dependence of fungal killing on Ire1α function.

      Thank you for this suggestion - below, we plot the transcripts comprising the V-ATPase, as well as Mcu, Micu1, and Micu2. We note that these transcripts are not perturbed in IRE1 null macrophages, suggesting that the basic functions of the V-ATPase complex and mitochondrial calcium uptake are intact in IRE1 null macrophages.

      These data are in agreement with our LysoSensor assay (Fig. 3E), which suggested that lysosome biogenesis is not impaired in IRE1 null macrophages.

      While we cannot rule out a defect in mitochondrial calcium flux from our RNA-seq data, we have added discussion around this topic to our discussion, as mentioned above.

      Expression of V-ATPase subunits and mitochondrial calcium uptake genes in C. albicans-infected IRE1 null macrophages vs C. albicans-infected IRE1 WT macrophages.

      Fig. 1A should be explained with more detail to disclose the products of PstI digestion.

      Thank you for the suggestion. We have added this information to the Figure 1 legend, "RT-PCR-amplified Xbp1 cDNA was treated with PstI, which recognizes a cleavage site within the 26 base pair intron that is removed by IRE1α activity, resulting in cleavage of the unspliced isoform, specifically."

      The anti-XBP1 antibody used to construct the blots in Fig.S1A recognizes epitopes not disclosed by the manufacturers, but they have to pertain to the N-terminal peptide sequence shared by sXBP1 and uXBP1. Showing full lanes encompassing both protein isoforms would allow a better appraisal of protein expression. In connection to point 4, the use of an antibody reactive to the epitopes expressed in sXBP1 in cell lysates or, preferentially in nuclear fractions, could be most valuable to rule out the dependence of the effect of Ire1α on the trans-activating function of sXBP1.

      We have un-cropped these westerns and now show spliced and unspliced XBP1 products on a single image in Fig. S1A.

      On page 23, the mention to Fig. 5A should be changed to Fig. 5B.

      We have fixed this mis-labeling, thank you for calling this to our attention.

      Line 209. I understand gene synthesis refers to gene expression.

      We have clarified this in the text, thank you for the suggestion.

      Line 394. What is the reason to study the cytokine-signature of Candida in LPS-primed cells?

      Thank you for the question; we have added the following text (lines 413-414) to clarify that LPS is used for inflammasome priming:

      "Therefore, we tested secretion of IL-1β, TNF, and IL-6 from WT and IRE1ΔR macrophages after LPS treatment to transcriptionally prime the NLRP3 inflammasome components, followed by C. albicans infection (Fig. 5D-F)."

      Numerous studies have shown that C. albicans can trigger macrophage pyroptosis, resulting in production of pro-inflammatory cytokines like IL-1b, which can also be influenced by phagosome rupture (PMC3910967). However, this requires inflammasome transcriptional priming, and LPS is commonly used to prime macrophages for inflammasome activation in vitro. Therefore, we perform a short pre-treatment with LPS for NLRP3 inflammasome priming to subsequently measure its activation following C. albicans infection, using secreted cytokines as a readout. We also note that macrophages in vivo may not be naive and are often M1-polarized by the microbial or cytokine environment, thus inflammasome priming is likely common during in vivo infection.

      Reviewer #2 (Significance (Required)):

      This study focuses on an aspect not usually addressed in papers devoted to the UPR.

      If more data are shown as suggested, the paper could be of interest for a wider audience

      We thank the reviewer for their positive feedback about the novelty of our work and agree that the suggested experiments will bolster our data and story.


      Reviewer 3:

      Fig. 2:

      Panel A-B: same question as for Fig. 1. The variation in TG DMSO-induced splicing is huge. The effects of the treatments with CHX or Act D are smaller than the variation between experiments with TG DMSO alone. As long as that variation is not controlled for, it is impossible to draw any conclusion from the inhibitors. In this regard, it is very difficult to interpret data if they are not done in one and the same experiment.

      The variability in thapsigargin fold change over mock likely represents differences in basal Xbp1 expression. We consistently see complete Xbp1 splicing in response to thapsigargin treatment (see Fig. 1A). Additionally, we note that thapsigargin treatment is used only as a positive control, not as a physiologically relevant treatment, as it results in unmitigated ER stress that triggers cell death (PMC6986015).

      We have removed the following sentence, "Translation inhibition using cycloheximide was sufficient to alleviate Xbp1 splicing specifically in response to thapsigargin, likely by reducing the nascent protein folding burden (Fig. 2B)," since our data are plotted on separate graphs, matched to their respective controls, for appropriate comparisons.

      4. Description of analyses that authors prefer not to carry out


      Reviewer 1:

      2) Since the IRE1/XBP1 arm of the UPR is also involved in lipid biosynthesis which might be required for phagosome maturation, the authors should perform XBP1s rescues in IRE1 deficient cells to ensure that their observation is XBP1s dependent or IRE1 dependent.

      As we do not see XBP1S protein induced in wild-type macrophages at any timepoint during our C. albicans infection scheme (Fig. S1A-B), we interpret our results as being XBP1S-independent. If we were to add back XBP1S with constitutive expression, we would be overexpressing the protein relative to C. albicans infected wild-type macrophages (in which we do not see measurable XBP1S expression). Therefore, we believe these experiments would not address a physiologically-relevant scenario.

      4) The authors should evaluate in what compartment IRE1 is activated upon CA infection, does that happen in the ER or in the ER fraction fused to phagosomes?

      This is an interesting question for future exploration. In order to answer this question with existing tools, we would need to perform biochemical fractionation of infected cells to isolate an ER-phagosome contact site fraction, followed by phos-tag gel analysis of IRE1 activation in the ER fraction, compared to the ER-phagosome contact site fraction. However, a biochemical fractionation protocol to distinguish the ER fraction from ER-phagosome contact sites has not yet been developed, to our knowledge, and we believe it is outside the scope of this study to develop such a technique.

      We have added additional text regarding this intriguing question to our discussion (lines 549-553).

      Reviewer 2:


      Infection at a MOI 1 of C. albicans is a ratio of infecting agent/susceptible targets not very high for a non-soluble stimulus with limited diffusion in the culture medium. Although I recognize the difficulty of quantitating adhered cell, the mention to 80% confluence makes it more difficult the appraisal of the actual MOI. The delayed time-course of Xbp splicing under these conditions can be explained by the time required for in vitro proliferation, Candida damage, and diffusion of fungal patterns. A study with viable Candida at MOI 5 in human monocyte-derived dendritic cells, which show a robust capacity for non-opsonic phagocytosis associated with C-type lectin receptors only showed initial hypha formation after 2 hours (Rodriguez et al., J. Biol. Chem. 289, P22942-22957, 2014). Consistent with the requirement of a time lag for infecting agent to attain levels of expression consistent with a net response, 16 hours have been considered an appropriate time-course to assay sXBP1 expression following SARS-CoV2 infection (Fernandez et al., Biochim Biophys Acta Mol Basis Dis. 1870(5):167193, 2024). I wonder if a higher MOI could show a similar kinetics.

      We use lower MOI in part due to the size and ability of C. albicans to undergo extensive hyphal growth if its numbers greatly exceed the number of host cells. From our microscopy data, we can see that C. albicans spreads well throughout the culture plate (see Fig. 3A, Fig. 4A). We and others have observed considerable death of macrophage cultures after 12 hours with Candida infection, even at low MOI (PMC6709535), therefore we avoid later timepoints in these assays and all other in vitro assays in our manuscript.

      As all of our in vitro experiments are performed within an 8 hour window of infection, whether XBP1S is induced at later timepoints by C. albicans or depleted zymosan would not alter the conclusions of the rest of our results.

      sXBP1 can be present in nuclear fractions in resting cells, which suggests the involvement of post-translational modifications for the display of transcriptional activity.

      As we do not see induction of XBP1S in our lysates after C. albicans infection, it is unlikely that post-translational modification is influencing its function, although we agree post-translational modification is a likely regulatory control over XBP1S during the unfolded protein response.

      The independence of sXBP1 transcriptional activity from canonical UPR associated with misfolded protein stress is well known from the seminal paper by Martinon et al., (ref.6). Moreover, the expression of CHOP, the final effector of the PERK route, encoded by DDIT3 gene, has been found to be blunted by Candida (Rodriguez et al., J. Biol. Chem. 289, P22942-22957,2014). This is additional evidence for the recruitment of sXBP1 transcriptional activity in the absence of canonical UPR.

      As mentioned, we found that XBP1S protein is not induced during C. albicans infection at any timepoint in our experiments (Fig. S1A-B). Importantly, the work referenced by the reviewer uses RAW267.7 cells, which (as mentioned by the authors) constitutively express CHOP as a result of Abel leukemia virus infection. Based on this specific overexpression, we believe this phenotype is not comparable to our bone marrow-derived macrophages.

      Reviewer 3:

      Fig. 1:

      Panel 1C: please remove outlier in 4h timepoint. This implies that the experiment needs to be redone to reduce variation

      We have performed an outlier test on these data, which revealed that this data point is not a statistical outlier, therefore we do not feel that its removal is appropriate (see below).

      Panel 1E-H: how is the splicing efficiency determined and normalized? How to explain the big differences in splicing efficiency of Xbp1 upon LPS stimulation (appr. 4 to 6 times in E, G and H versus 30-fold in panel F). Where does this difference come from?

      Panel H, outlier needs to be removed.

      We do occasionally see differences in magnitude of Xbp1 splicing in different cell lines or experiments, especially with controls, which may be caused by differences in the basal level of Xbp1 expression, especially as Xbp1 levels have been shown to be affected by circadian rhythm in certain cell types (PMCID: PMC11214543; PMCID: PMC6959563).

      In panel H, an outlier test reveals that these are not statistical outliers, therefore we feel their removal is inappropriate as we do not wish to mask biological variation. Moreover, this graph includes two cell lines (open and closed circles), showing that our data are robust across multiple independent cell lines and are an appropriate measure of experimental replicates.

      Fig. 2:

      Panel A-B: same question as for Fig. 1. The variation in TG DMSO-induced splicing is huge. The effects of the treatments with CHX or Act D are smaller than the variation between experiments with TG DMSO alone. As long as that variation is not controlled for, it is impossible to draw any conclusion from the inhibitors. In this regard, it is very difficult to interpret data if they are not done in one and the same experiment.

      The variability in thapsigargin fold change over mock likely represents differences in basal Xbp1 expression. We consistently see complete Xbp1 splicing in response to thapsigargin treatment (see Fig. 1A). Additionally, we note that thapsigargin treatment is used only as a positive control, not as a physiologically relevant treatment, as it results in unmitigated ER stress that triggers cell death (PMC6986015).

      We have removed the following sentence, "Translation inhibition using cycloheximide was sufficient to alleviate Xbp1 splicing specifically in response to thapsigargin, likely by reducing the nascent protein folding burden (Fig. 2B), since our data are plotted on separate graphs, matched to their respective controls, for appropriate comparisons.

      Below, we plot all data together with replicate matching, although our major interpretation of these data is that C. albicans infection can trigger Xbp1 splicing with or without new gene expression, and not about the impact of the inhibitors on the control treatment thapsigargin.

      Please provide a scheme of how the experiment was performed, at what time were the inhibitors provided, at what time point the inducers? What are matched mock samples. Which mock samples were chosen since they differ from one experiment to the next? Please plot all the data for one and the same experiment in one graph so that the reader can easily compare the results of DMSO, DMSO + inducer, DMSO + inducer + inhibitor. Indicate whether the points in the graph are technical or experimental repetitions.

      -How to explain the increase in XBP1 splicing in combination with ActD? Was this due to differences in Gapdh expression? Where did the authors control for cell death? Please provide the data.

      Below is a scheme of the experimental treatments. We have now clarified in the figure legend that inhibitors (ActD and CHX) are added at the same time as experimental treatments (Mock, Ca, TG). All data included in the original submission are biological replicates, as stated in the figure legend. We have now re-written the figure legend to clearly indicate that these are biological replicates.

      All data are normalized such that the effects of the drugs are directly compared (for example, the fold change over Mock for Candida is matched to its drug treatment; Mock DMSO vs Ca DMSO and Mock ActD vs Ca ActD, or Mock CHX vs Ca CHX). Actinomycin D does inhibit new transcription, although IRE1 can cleave existing Xbp1 transcript. We now show conditions normalized to DMSO Mock in Supplemental Figure 2, which allows visualization of the effects of ActD and CHX on Xbp1-S abundance in comparison to control DMSO treatment, while also seeing the relative changes in Xbp1 splicing caused by C. albicans or thapsigargin treatment (see below).

      -Is RT-qPCR a reliable readout when actinomycinD is used? How can new genes be transcribed.

      We interpret RT-qPCR data as a readout of transcript abundance, rather than transcription. Therefore, we are not measuring new gene expression here, but whether the existing Xbp1 transcript can be cleaved by IRE1. Based on the technique, we can still measure changes in Xbp1-S abundance.

      Panel D: where is TG at 4h and 6h?

      We do not include thapsigargin at later timepoints to avoid autofluorescence from excessive cell death. We include thapsigargin as a positive control at the early 2h timepoint, but note that LPS is sufficient to increase thioflavin T intensity at the 8h timepoint.

      Panel G, why was Ddit3 included here as this is not a typical IRE1 dependent gene (rather PERK dependent). What about IRE1 specific genes such as Sec61 or Sec24a?

      We have added additional text (lines 235-240; "Finally, we measured induction of UPR-responsive genes by RT-qPCR in response to C. albicans infection, LPS and depleted zymosan treatment, or thapsigargin treatment, to further test whether IRE1α activation occurs without canonical UPR induction (Fig. 2G-H). C. albicans infection and depleted zymosan treatment did not lead to induction of UPR-responsive genes (Ddit3, Grp78, Grp94, and total Xbp1) at 4 or 6 hours.") to clarify that the purpose of this figure is to add evidence that IRE1 activation is independent of the canonical UPR response (indicating that IRE1 is likely specifically activated independently of the other UPR branches) during C. albicans infection. Therefore, the transcripts measured are canonical UPR-responsive transcripts, rather than IRE1/XBP1S targets (although some are overlapping).

      Below are RNA-seq data comparing Sec61a1, Sec61a2, and Sec24a in IRE1 null macrophages, compared to IRE1 WT macrophages. While there is less expression of Sec61a1 in IRE1 null macrophages, Sec61a2 and Sec24a are largely unaffected. These data support our finding that XBP1S protein is not induced during C. albicans infection.

      Did the authors also check for RIDD activity?

      As mentioned above in response to Reviewer 1, we now add additional figures to our supplemental data (Fig. SX; also shown below) showing that established RIDD targets are not depleted during C. albicans infection in WT macrophages, and also not increased in IRE1 null macrophages. Therefore, we expect RIDD activity has negligible effects on our reported phenotypes.

      Fig. 3:

      Panel C and D look convincing. Lamp1 is a well-known RIDD target gene (see Osorio et al., Nat Imm, 2014). Did the authors check Lamp1 expression in presence and absence of IRE1 and could RIDD explain their phenotype?

      As shown above, Lamp1 transcript expression is not strongly perturbed in IRE1 null macrophages. If RIDD activity were depleting Lamp1 transcript abundance, we would expect to see increased Lamp1 expression in IRE1 null macrophages. We also note that our experiments using LysoSensor (Fig. 3E) suggested that lysosome biogenesis is not impaired, but more specifically, lysosome recruitment to the phagosome is impaired in IRE1 null macrophages.

      Fig. 4, but especially Fig 5 and Fig 6 suffer from very bad imaging quality. Both Fig 5A and Fig 6A are completely uninterpretable. The SRB staining is all over the cells and it is totally unclear how the authors interpret this as phagosomal leakage or not. Fig. 6A is even worse and appears nothing but vague background. It is difficult to understand how the authors make graphs based on these types of images and dare to draw any conclusions.

      In Figure 4, we observe some photobleaching from frequent image acquisition, which is necessary to capture calcium flux dynamics. Image brightness across the timecourse is adjusted in the same way such that we do not attempt to hide the effects of photobleaching. However, our analyses account for photobleaching over time, and the phagosomal calcium flux is clear and quantifiable. `

      In Figure 5, the sulforhodamine B pulse-chase assay involves loading of the endosomal system with SRB, thus the cells are expected to ingest a considerable amount of SRB and it will distribute throughout the endosomal network. However, as endosomes fuse, we also observe fusion with the C. albicans-containing phagosome and SRB will surround C. albicans hyphae. Our analysis pipeline first segments C. albicans hyphae (see below) and measures SRB signal in proximity to the phagosome. Thus, we measure loss of phagosome-associated SRB over time, as C. albicans ruptures the phagosome, in hundreds of macrophages. This is a standard assay that has been previously used for this purpose (PMID: 33022213; PMID: 30131363).

      For Figure 6, we have added additional wide-field images that we believe will clarify how these images can be readily quantified (Fig. 6A, shown below). The purpose of the previous Fig. 6A (now Fig. 6B) is to demonstrate single cell examples of live and dead C. albicans using the dual-fluorescence assay, although we quantify much wider fields for sufficient numbers. We hope the amended figures provide additional clarity.

      Fig. 7 is again an example where differences in expression are mainly due to one or a few complete outliers, and it is hard to understand why the authors did not repeat these experiments to reduce the problems in variation to get proper data sets before submission.

      After performing outlier tests, we have found a total of 4 data points that are statistical outliers from all of the panels in Figure 7. These included the highest data point in each genotype in the female IL-1Ra levels (Fig. 7A, second graph), the highest data point among the male IRE1 fl/fl mice IL-1Ra levels (Fig. 7B, second graph), and the highest data point among the male TNF levels in IRE1 fl/fl + LysM-Cre mice (Fig. 7B, third graph). We have removed these data points in our updated graphs and changed the text to only point out differences in serum TNF and IL-6 levels. Moreover, our interpretation includes that serum cytokine levels are not different in male mice. However, no other data points are statistical outliers, therefore we believe their removal is inappropriate.

      While the paper started nicely and showed an interesting hypothesis (Fig. 3), the remaining part of the paper was of very poor quality and was not ready for submission.

      We thank the reviewer for the constructive feedback and believe that the addition of data and clarifications we have added will demonstrate that our data are of sufficient quality to support our conclusions.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Reviewer comments for "Non-canonical activation of IRE1a by Candida albicans infection promotes macrophage phagosomal calcium flux to enhance fungal killing"

      This paper describes a role for IRE1 in controlling Candida albicans (Ca) infection in macrophages. The authors show that Ca infection slightly induces IRE1 activity as monitored by XBP1 splicing, however this does not result in XBP1 protein expression, nor in IRE1-dependent target gene expression. The authors propose that IRE1 controls phagosomal maturation, as measured by defects in LAMP1 recruitment to Ca containing phagosomes. This would be due to a defect in calcium flux at the phagosomal level leading to an increased propensity to rupture and cytosolic escape of the pathogen. While the data are interesting and the defect in LAMP1 recruitment to the phagosome convincing, the majority of the data are difficult to interpret due to the poor quality. This concerns specifically all imaging experiments but also the ELISAs and qPCRs where differences are due to the effect of outliers rather than to the behavior of a complete population. Therefore, most experiments need to be redone and complemented with additional approaches before any firm conclusions can be drawn. Specific details and examples are provided below.

      Fig. 1:

      Panel 1C: please remove outlier in 4h timepoint. This implies that the experiment needs to be redone to reduce variation

      Panel 1E-H: how is the splicing efficiency determined and normalized? How to explain the big differences in splicing efficiency of Xbp1 upon LPS stimulation (appr. 4 to 6 times in E, G and H versus 30-fold in panel F). Where does this difference come from?

      Panel H, outlier needs to be removed.

      Fig. 2:

      Panel A-B: same question as for Fig. 1. The variation in TG DMSO-induced splicing is huge. The effects of the treatments with CHX or Act D are smaller than the variation between experiments with TG DMSO alone. As long as that variation is not controlled for, it is impossible to draw any conclusion from the inhibitors. In this regard, it is very difficult to interpret data if they are not done in one and the same experiment. Please provide a scheme of how the experiment was performed, at what time were the inhibitors provided, at what time point the inducers? What are matched mock samples. Which mock samples were chosen since they differ from one experiment to the next? Please plot all the data for one and the same experiment in one graph so that the reader can easily compare the results of DMSO, DMSO + inducer, DMSO + inducer + inhibitor. Indicate whether the points in the graph are technical or experimental repetitions.

      • How to explain the increase in XBP1 splicing in combination with ActD? Was this due to differences in Gapdh expression? Where did the authors control for cell death? Please provide the data.
      • Is RT-qPCR a reliable readout when actinomycinD is used? How can new genes be transcribed.

      Panel D: where is TG at 4h and 6h?

      Panel G, why was Ddit3 included here as this is not a typical IRE1 dependent gene (rather PERK dependent). What about IRE1 specific genes such as Sec61 or Sec24a?

      Did the authors also check for RIDD activity?

      Fig. 3:

      Panel C and D look convincing. Lamp1 is a well-known RIDD target gene (see Osorio et al., Nat Imm, 2014). Did the authors check Lamp1 expression in presence and absence of IRE1 and could RIDD explain their phenotype?

      Fig. 4, but especially Fig 5 and Fig 6 suffer from very bad imaging quality. Both Fig 5A and Fig 6A are completely uninterpretable. The SRB staining is all over the cells and it is totally unclear how the authors interpret this as phagosomal leakage or not. Fig. 6A is even worse and appears nothing but vague background. It is difficult to understand how the authors make graphs based on these types of images and dare to draw any conclusions.

      Fig. 7 is again an example where differences in expression are mainly due to one or a few complete outliers, and it is hard to understand why the authors did not repeat these experiments to reduce the problems in variation to get proper data sets before submission.

      While the paper started nicely and showed an interesting hypothesis (Fig. 3), the remaining part of the paper was of very poor quality and was not ready for submission.

      Significance

      The study presents interesting hypothesis but unfortunately the data are not of sufficient quality

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This an interesting report in a widely explored area. This makes it necessary to pigeonhole the new data provided by the study. The paper addresses two issues encompassing a scope distinct from studies focusing on the cytokine-signature and the role of sXbp1. This research singles out fungal killing and Ire1α versus sXbp1 function. More precisely, the reduction of cytokine expression in Ire1fl/fl LysMCre as compared to WT discloses an opposing function of Ire1α and its target sXbp1 in cytokine expression that requires mechanistic explanation.

      1. A point that should be addressed with more detail is the correlation of fungal killing with Ca2+ fluxes and Ire1α activity, given the well-known data regarding the strong ability of the axis dectin/SYK/phospholipase Cγ to induce Ca2+ transients, a response not shared by LPS signaling, and the sequential activation of mitochondrial Ca2+ uniporter (MCU), which is a critical element of fungal killing associated with the citrate-pyruvate shuttle as a NADPH source (Seegren et al., Cell Rep. 33: 108411, 2020). Incidentally, this paper is referred in ref. 46 as a preprint, although it was accessible in Cell Reports in 2020.
      2. The assay of the expression of V-ATPase complex, mitochondrial calcium uniporter, and mitochondrial uptake 1 and 2 could shed light on the dependence of fungal killing on Ire1α function.
      3. Infection at a MOI 1 of C. albicans is a ratio of infecting agent/susceptible targets not very high for a non-soluble stimulus with limited diffusion in the culture medium. Although I recognize the difficulty of quantitating adhered cell, the mention to 80% confluence makes it more difficult the appraisal of the actual MOI. The delayed time-course of Xbp splicing under these conditions can be explained by the time required for in vitro proliferation, Candida damage, and diffusion of fungal patterns. A study with viable Candida at MOI 5 in human monocyte-derived dendritic cells, which show a robust capacity for non-opsonic phagocytosis associated with C-type lectin receptors only showed initial hypha formation after 2 hours (Rodriguez et al., J. Biol. Chem. 289, P22942-22957, 2014). Consistent with the requirement of a time lag for infecting agent to attain levels of expression consistent with a net response, 16 hours have been considered an appropriate time-course to assay sXBP1 expression following SARS-CoV2 infection (Fernandez et al., Biochim Biophys Acta Mol Basis Dis. 1870(5):167193, 2024). I wonder if a higher MOI could show a similar kinetics.
      4. Fig. 1A should be explained with more detail to disclose the products of PstI digestion.
      5. sXBP1 can be present in nuclear fractions in resting cells, which suggests the involvement of post-translational modifications for the display of transcriptional activity.
      6. The independence of sXBP1 transcriptional activity from canonical UPR associated with misfolded protein stress is well known from the seminal paper by Martinon et al., (ref.6). Moreover, the expression of CHOP, the final effector of the PERK route, encoded by DDIT3 gene, has been found to be blunted by Candida (Rodriguez et al., J. Biol. Chem. 289, P22942-22957,2014). This is additional evidence for the recruitment of sXBP1 transcriptional activity in the absence of canonical UPR.
      7. The anti-XBP1 antibody used to construct the blots in Fig.S1A recognizes epitopes not disclosed by the manufacturers, but they have to pertain to the N-terminal peptide sequence shared by sXBP1 and uXBP1. Showing full lanes encompassing both protein isoforms would allow a better appraisal of protein expression. In connection to point 4, the use of an antibody reactive to the epitopes expressed in sXBP1 in cell lysates or, preferentially in nuclear fractions, could be most valuable to rule out the dependence of the effect of Ire1α on the trans-activating function of sXBP1.
      8. On page 23, the mention to Fig. 5A should be changed to Fig. 5B.
      9. Line 209. I understand gene synthesis refers to gene expression.
      10. Line 394. What is the reason to study the cytokine-signature of Candida in LPS-primed cells?

      Significance

      This study focuses on an aspect not usually addressed in papers devoted to the UPR. If more data are shown as suggested, the paper could be of interest for a wider audience

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors show that in macrophages, IRE1 activation (independent of improperly folded proteins) is essential to promote fungicidal activity towards candida albicans (CA) in vitro and in vivo by ensuring phagosome maturation through the preservation of calcium fluxes

      Major comments

      1. The demonstration of protein misfolding independent IRE1 activation should also be demonstrated using molecules such as TUDCA or 4PBA that should be innocuous regarding the splicing of XBP1s. It would also be interesting to evaluate the activation of the other arms of the UPR in particular through the phosphorylation of eIF2a, expression of ATF4 and cleavage of ATF6.
      2. Since the IRE1/XBP1 arm of the UPR is also involved in lipid biosynthesis which might be required for phagosome maturation, the authors should perform XBP1s rescues in IRE1 deficient cells to ensure that their observation is XBP1s dependent or IRE1 dependent.
      3. The authors use thioflavin to evaluate the extend of protein misfolding. This type of stain can lead to artefactual results and in general it is rather safer to test several stainers (see for instance the work presented in PMC10720158)
      4. The authors should evaluate in what compartment IRE1 is activated upon CA infection, does that happen in the ER or in the ER fraction fused to phagosomes?
      5. The authors focus on the IRE1/XBP1s signaling arm of the UPR but do not explore RIDD activity which has been linked to several infection mechanisms and lysosomal integrity (in particular by regulating the expression of BLOS1 - see PMC9119680 and PMC6446841). The authors should definitely evaluate how RIDD is activated (or not) in their experimental systems.
      6. The whole study relies on the use of IRE1deltaR to impair IRE1 signaling. The authors should validate their hypothesis with an orthogonal approach, for instance with IRE1 pharmacological inhibitors (eg MKC8866 or KIRA8).

      Significance

      The manuscript is interesting and highligths novel aspects towards the interaction between marcrophages and a pathogen, candida albicans, involving the likely selective activation of IRE1. The data are novel and experimentally sound. Several controls are however missing.

      The strengths of the study are associated with the novelty of the findings, with the links that could potentially derive from this study to connect ER biology, UPR signaling and phagosome maturation

      The main weaknesses are associated i) with the fact that the authors did not evaluate RIDD activity which has already been linked with pathogen infection and with lysosome integrity, ii) with methodological aspects, in particular regarding the demonstration of the IRE1 activation independent on protein misfolding and the sole use of a genetic variant of IRE1 to test their hypotheses

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2024-02555

      __Corresponding author(s): __Maurizio Molinari

      [Please use this template only if the submitted manuscript should be considered by the affiliate journal as a full revision in response to the points raised by the reviewers.

      • *

      If you wish to submit a preliminary revision with a revision plan, please use our "Revision Plan" template. It is important to use the appropriate template to clearly inform the editors of your intentions.]

      1. General Statements [optional]

      We thank the 3 reviewers for the positive and constructive comments to our manuscript.

      Please see below the point-by-point responses to their suggestions.

      2. Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The paper proposes an interesting role for ERp44 in TMX5 retention. The authors identified a list of proposed TMX5 clients which include many Golgi localised proteins but do not discuss the role for TMX for instance in protein folding. In this context it is not absolutely clear whether TMX5 acts as a trafficking chaperone? are clients functionally engaged in the Golgi or ER or both?

      This work focuses on the crosstalk between a member of the PDI superfamily lacking a conventional cytosolic ER retention motif (TMX5), and ERp44, a PDI family member previously reported to retrieve in the ER proteins that lack the ER retention motif (ERp44). To support our conclusions on the involvement of ERp44 in control of TMX5’s intracellular distribution, we have added new data obtained by characterization of a cell line lacking ERp44, where more than 50% of TMX5 escapes ER retention (new Fig. 6).

      __We agree with the referee that the assessment of the biological role of TMX5 is of interest. We mention in this manuscript that there is a follow-up study (ongoing in the lab) on TMX5 clients and TMX5 function. More specifically, we are monitoring the action of TMX5 on the biogenesis and intracellular trafficking of class I HLA molecules, which are, besides PDI, ERp57 and ERp44, major interactors (clients) of TMX5 (please also refer to the initial and final parts of the new discussion). __

      The defining criteria for the client proteins were not included. At last, it might be of interest to evaluate for how long TMX5 clients are retained on the protein, whether it is temporary (as for instance a folding sensor) or more permanent.

      The list of interacting proteins is now available (__Data are available via ProteomeXchange with identifier PXD054716."), their selection for presentation in Figure 3B is now explained more clearly (Results, page 6). Also better explained is that we define as “clients of TMX5” those endogenous proteins that associate with TMX5, covalently, via the catalytic Cys220. The mutation of the TMX5 active site cysteine residue does not impact the covalent association of PDI, ERp57 and ERp44 with TMX5. For this reason, we do not consider these PDI family members clients of TMX5. In this submission, we explore the covalent association of ERp44 and its consequences on TMX5 subcellular distribution. Interacting via non-catalytic Cysteine residues 114 and 124 with the catalytic cysteine 29 of ERp44, we identify TMX5 as a client of the latter.__

      The preparation of figures could be greatly improved and there is some inconsistency among similar gels.


      Please refer to the point-by-point answers below.


      The proposed model of ERp44, ER retention vs ER retrieval, is unclear. Overall, there is more room for improved discussion beyond the conclusions from experiments.

      __We thank the referee for these comments. We have improved the description of the results, and we separated the Discussion (written de novo) from the Results section. __

      The ERp44 interaction is interesting especially since the protein contains an incomplete thioredoxin domain (such as ERp29, PDIA17 and 18), would the interaction between Erp44 and TMX5 be involved in some holdase/competitor role thereby allowing for client selectivity (or kinetics)? In addition, all the experiments were carried out in Hek293T or MEF cells, would the authors anticipate some interactions of TMX5 with PDIA17/18 in cells where those proteins are highly expressed? Testing whether the observation is a general mechanism occurring between TMX5 and PDI family members with incomplete thioredoxin sites would be an asset.

      __We thank the referee for this comment that we implemented in the new discussion. __

      Major comments Fig 2 - avoid labels on the blots that might obscure information and impede clarity and interpretation. o The % of resistant protein can be otherwise placed.


      __This has been modified, thank you. __

      • What does the asterix in 2B signify? This should be included in the legend.

      We have now specified in the legend of figures that asterisks show cross-reacting polypeptide bands.

      • A label for 'deglycosylated' proteins could be included.

      __We added a label for de- and for glycosylated proteins in the EndoH essays in Figs. 2A, 2B and 5B. __

      • Consider treating with PNGase.

      This is now showy in panel 2A, lane 5.

      • There is a change in EndoH resistance of about 3-4% among wt, C220A & C114A, is this significant?

      We do not consider significant these variations. Our data show that the mutation of TMX5 Cys 114 or of Cys124 to alanine substantially reduce (without abolishing) the co-IP with ERp44. This means that the proteins interact less, or that the interaction is more short living. The EndoH experiment shown in Fig. 2B and the CLSM analyses in Figs. 2D-2O fail to reveal significant differences showing that these reduced or more short living associations with ERp44 are sufficient to control TMX5 distribution.

      In the previous submission, the function of ERp44 in retaining TMX5 in the ER was supported by data showing that the co-expression of ERp44 retains TMX5 in the ER, but co-expression of ERp44C29S that cannot bind TMX5 fails to retain TMX5 in the ER. These model is further supported in this new submission by the release of 50% of TMX5 from the ER in cells lacking ERp44, which is substantially inhibited to the levels measured in wild type cells upon back-transfection of ERp44, but not upon the co-transfection of the ERp44C29S mutant (new Figure 6).


      • Equivalent inputs (cell lysates) for the IPs should be included.

      __ These have now been added in Figs. 4, 5 6, and 7.__

      Fig 3A - Indicate the specific bands that were subject to MS. How did the authors correct for non-specific interactors and false positives? Perhaps a more specifically targeted approach could be utilised.

      • How do the authors explain the absence of bands representing the reduced form of interacting TMX5 interactors?

      • What was the inclusion and exclusion criteria used to determine which of the proteins listed were clients?

      The endogenous proteins present in the entire region of the gel labeled with the red and blue rectangles have been sequenced (see methods section and this is now also better explained in the results section, page 6). Only the proteins that disappear from the corresponding region of the gel when the samples have been reduced are listed in the table. This is also better explained in the text (page 6). These experiments have been repeated few times with a series of controls (e.g., mock-transfected cells and cells transfected with other members of the TMX family (shown to capture and to impact on the fate of other endogenous polypeptides in previous publications from our lab)). An in parallel analysis of mock, TMX3, TMX4 and TMX5 interactors has been published in (Kucinska et al Nature Comm 2023), where we focused on the biological function of TMX4. The references referring to the TMX1 study (Brambilla et al 2015) and the TMX4 study (Kucinska et al) are given in the text.

      The Table in Fig. 3B only lists the interacting polypeptides that have a MW __- It might be useful to perform MS on the C220A mutant and compare those results to the WT.

      __To validate few interactions with endogenous proteins detected in MS, and to compare the interactions of TMX5 and TMX5C220A, we have used the specifically targeted approach suggested above by the referee (i.e., co-IP validated by WB, Figs. 3C-3F).

      __

      Fig 4 - Equivalent inputs (cell lysates) for the IPs should be included.

      __This is now shown as panel A in Fig. 4.____

      __

      Fig 5 - Equivalent inputs (cell lysates) for the IPs should be included.

      This is now Figure 7, see new panel 7A

      Fig 6 - A loading control should be included.

      __This is now Fig. 5. Both panels A and B in Fig. 5 show total cell extracts____

      __

      • Blot using anti-HA to identify ERp44 should be included to substantiate claims.

      The ERp44 and TMX5 components of the ERp44-s-s-TMX5 mixed disulfides are detected upon IP:HA followed by WB:V5 (to show the TMX5 component) and upon IP:V5 followed by WB:HA to show the ERp44 component) in Figs. 4B-4E and 7B-7C.

      • How do the authors account for the huge difference in TMX5 associated complexes shown in Fig 6A compared to Fig 3A.

      Fig. 6 is now Fig. 5. As specified in the legends of the figures, Fig. 3A shows a gel, where the complexes are stained with silver, Fig. 5A is a WB, where the complexes are stained with an antibody. The intensities of the signals cannot be compared.

      • Inappropriate marking on the gel area.
      • Inconsistencies in protein standard labeling

      This has carefully been checked and corrected where needed. Please note that we used two different MW standards for our figures (200, 117, 97, 66, 45, 31 kDa and 270, 175, 130, 95, 66, 53, 37 kDa)

      • It might be useful to demonstrate the colocalisation of ERp44 and ERp44C29S with Giantin and with TMX5 considering that ERp44 is known to cycle between the Golgi and ER.

      These data are shown in Fig. 5D-5I.

      Reviewer #1 (Significance (Required)):

      This work provides an additional understanding on how the regulation of Erp44 trafficking might occur (and perhaps additional PDIs), and lead to the characterization of kinetic value that might explain better productive protein folding in the early secretory pathway. This represents a significant advance in the field and may in turn unveil uncharacterized pathophysiological functions in various diseases. This is a serious study well conducted and original by an expert in the field that desserves publication.

      Field of expertise: ER homeostasis control

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Solda et al, investigates TMX5, a poorly understood member of the PDI family that lacks an ER-retrieval motif. They find that it localizes to the ER and the Golgi and that it interacts with ERp44. This interaction requires formation of a mixed disulfide and they identify the cysteine residues in both proteins that mediate this interaction. Overall, this is a well written manuscript that is easy to follow and the story is compact and straight-forward. It provides some new and solid insight into the biology of TMX5 without going into depth of what the cellular role of TMX5 is or might be. I have only very few comments and suggestions:

      1- The authors conclude that ERp44 associates only with ER-localized TMX5. I am not sure that this is a valid conclusion based on the data. EndoH sensitivity just means that the protein has not gone to the medial Golgi. The pool of TMX5 could therefore be an ERGIC-based pool, or it could interact with a TMX5 that is recycled directly from the first Golgi cisterna, where complex glycosylation is unlikely to occur. Can this be validated using another type of experiment? Alternatively, the wording could be changed.

      We thank the reviewer. We agree with this insightful comment that led us to change the wording used in some part of the text.

      2- Is the trafficking of TMX5 dependent on its glycosylation?

      This is another insightful comment that we report in the ____new discussion, where we write, page 14 “____It should be noted that in the case of TMX5 the extensive N-glycosylation could engage____ leguminous L-type lectins located in the ER (VIPL), cycling between the ER and the intermediate compartment (ERGIC) (ERGIC-53) or between the ERGIC and the cis-Golgi (VIP36)_33-36_ and have an impact on the subcellular distribution and activity of TMX5.____”

      3- Figure 6: The data are not really convincing. Just because the color turns yellow, it does not mean that there is colocalization. The green channel is overexposed in this area of the cell, and anything will produce a yellow color, even if there is no genuine colocalization. Maybe the authors could provide a different example and even better would be a quantification of the colocalization.

      __We thank the referee for this comment. We show images of better quality, where the black/white channels clearly show the co-localization (or lack thereof) of TMX5 with the Golgi marker Giantin in cells mock-transfected (co-localization TMX5:Giantin, Fig. 5D), co-transfected with ERp44 (no co-localization TMX5:Giantin, Fig. 5E), or co-transfected with ERp44C29S, co-localization TMX5:Giantin, Fig. 5F). Figs. 5G-5I show the corresponding results for the co-localization or lack thereof between TMX5C220A and Giantin. Importantly, the IF data match the data shown in Fig. 5B, where release from the ER (or arrival in the medial Golgi, see text of the manuscript and comment 1 by the referee) is assessed by monitoring complex glycosylation. __

      Reviewer #2 (Significance (Required)):

      This is a solid story that will be of interest of scientists working on various aspects of the secretory pathway and protein quality control. The advance is rather incremental, because there are no experiments that provide insight into the cellular roles of TMX5.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Solda et al have assembled data on the transmembrane redox enzyme TMX5, on which currently very little information is available. TMX5 does not contain any obvious targeting signal, unlike the other TMX family proteins, which localize to the ER. TMX5 has 5 glycosylation sites, which can be used to determine its intracellular localization biochemically. Indeed, about 20% of TMX5 is found as endoH-resistant, indicating Golgi localization. This is confirmed with beautiful IF imagery and giantin co-localization. The Golgi localization requires two luminal cysteines (C212, 177), which likely form a disulfide bond. TMX5 acts as a natural cysteine-trapping protein, allowing for easy assessment of its interactors. Within its interactome, the authors found multiple members of the thioredoxin family. Many of these interactions occur within the CXXS motif, but notably ERp44 does not require this motif to interact, indicating this and other interactions are not of a catalytic nature. Instead, the authors found this interaction to be essential for ER retention or retrieval and depends on the cysteine within the ERp44 "active" site. The study provides critical first insight about the potential functions and sites of activity of TMX5.

      Specific Points: 1. The results are very convincing and of high quality. 2. The cytosolic tail of TMX5 contains an LI motif, which could act as a post-ER localization signal. Since the protein might play a role in ciliogenesis, this motif could be critical. In this context, I am wondering which mutations are known to lead to the disease spectrum.


      The position of disease-related TMX5 mutations identified so far are given in Xu H, et al (2024) Mol Genet Genomic Med 12: e2340 _https://www.ncbi.nlm.nih.gov/pubmed/38073519_ and in ____Deng T, Xie Y (2024) Mol Genet Genomic Med 12: e2343 https://www.ncbi.nlm.nih.gov/pubmed/38156946____.

      They are all distributed in the luminal part of the protein____.


      Mutation of C114 and C124 abrogates interaction with ERp44. Therefore, I would expect these mutations to increase endoH resistance and Golgi staining. This should be investigated by the authors.

      __The mutations C114 and C124 reduce (or make short-living), without abrogating the covalent association between TMX5 and ERp44. The EndoH experiment shown in Fig. 2B and the IF in Figs. 2D-2O fail to reveal significant differences showing that these reduced or more short living associations with ERp44 are sufficient to control TMX5 distribution. To strengthen our conclusion that ERp44 is involved in regulation of the intracellular TMX5 distribution, we have now added data in ERp44 cell (50% of TMX5 displays complex glycans as symptom of traffic to the medial Golgi compartment), back-transfection of ERp44 (but not of the ERp44C29S mutant that does not associate with TMX5) restores the complex glycan fraction to the level measured in wild type cells (Fig. 6). __

      Minor Points: 1. The position of the % endoH resistance in Figures 1B and 6B is not ideal, as it obstructs a visual inspection of TMX5 resistance to endoH.

      This has been modified, thank you.

      Reviewer #3 (Significance (Required)):

      Given that no information about TMX5 is currently available, the study provides critical first insight that should allow researchers to tackle the disease relevance of TMX5 in the future.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Solda et al have assembled data on the transmembrane redox enzyme TMX5, on which currently very little information is available. TMX5 does not contain any obvious targeting signal, unlike the other TMX family proteins, which localize to the ER. TMX5 has 5 glycosylation sites, which can be used to determine its intracellular localization biochemically. Indeed, about 20% of TMX5 is found as endoH-resistant, indicating Golgi localization. This is confirmed with beautiful IF imagery and giantin co-localization. The Golgi localization requires two luminal cysteines (C212, 177), which likely form a disulfide bond. TMX5 acts as a natural cysteine-trapping protein, allowing for easy assessment of its interactors. Within its interactome, the authors found multiple members of the thioredoxin family. Many of these interactions occur within the CXXS motif, but notably ERp44 does not require this motif to interact, indicating this and other interactions are not of a catalytic nature. Instead, the authors found this interaction to be essential for ER retention or retrieval and depends on the cysteine within the ERp44 "active" site. The study provides critical first insight about the potential functions and sites of activity of TMX5.

      Specific Points:

      1. The results are very convincing and of high quality.
      2. The cytosolic tail of TMX5 contains an LI motif, which could act as a post-ER localization signal. Since the protein might play a role in ciliogenesis, this motif could be critical. In this context, I am wondering which mutations are known to lead to the disease spectrum.
      3. Mutation of C114 and C124 abrogates interaction with ERp44. Therefore, I would expect these mutations to increase endoH resistance and Golgi staining. This should be investigated by the authors.

      Minor Points:

      1. The position of the % endoH resistance in Figures 1B and 6B is not ideal, as it obstructs a visual inspection of TMX5 resistance to endoH.

      Significance

      Given that no information about TMX5 is currently available, the study provides critical first insight that should allow researchers to tackle the disease relevance of TMX5 in the future.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript by Solda et al, investigates TMX5, a poorly understood member of the PDI family that lacks an ER-retrieval motif. They find that it localizes to the ER and the Golgi and that it interacts with ERp44. This interaction requires formation of a mixed disulfide and they identify the cysteine residues in both proteins that mediate this interaction.

      Overall, this is a well written manuscript that is easy to follow and the story is compact and straight-forward. It provides some new and solid insight into the biology of TMX5 without going into depth of what the cellular role of TMX5 is or might be. I have only very few comments and suggestions:

      1. The authors conclude that ERp44 associates only with ER-localized TMX5. I am not sure that this is a valid conclusion based on the data. EndoH sensitivity just means that the protein has not gone to the medial Golgi. The pool of TMX5 could therefore be an ERGIC-based pool, or it could interact with a TMX5 that is recycled directly from the first Golgi cisterna, where complex glycosylation is unlikely to occur. Can this be validated using another type of experiment? Alternatively, the wording could be changed.
      2. Is the trafficking of TMX5 dependent on its glycosylation?
      3. Figure 6: The data are not really convincing. Just because the color turns yellow, it does not mean that there is colocalization. The green channel is overexposed in this area of the cell, and anything will produce a yellow color, even if there is no genuine colocalization. Maybe the authors could provide a different example and even better would be a quantification of the colocalization.

      Significance

      This is a solid story that will be of interest of scientists working on various aspects of the secretory pathway and protein quality control. The advance is rather incremental, because thereare no experiments that provide insight into the cellular roles of TMX5.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The paper proposes an interesting role for ERp44 in TMX5 retention. The authors identified a list of proposed TMX5 clients which include many Golgi localised proteins but do not discuss the role for TMX for instance in protein folding. In this context it is not absolutely clear whether TMX5 acts as a trafficking chaperone? are clients functionally engaged in the Golgi or ER or both? The defining criteria for the client proteins were not included. At last, it might be of interest to evaluate for how long TMX5 clients are retained on the protein, whether it is temporary (as for instance a folding sensor) or more permanent.

      The preparation of figures could be greatly improved and there is some inconsistency among similar gels. The proposed model of ERp44, ER retention vs ER retrieval, is unclear. Overall, there is more room for improved discussion beyond the conclusions from experiments.

      The ERp44 interaction is interesting especially since the protein contains an incomplete thioredoxin domain (such as ERp29, PDIA17 and 18), would the interaction between Erp44 and TMX5 be involved in some holdase/competitor role thereby allowing for client selectivity (or kinetics)? In addition, all the experiments were carried out in Hek293T or MEF cells, would the authors anticipate some interactions of TMX5 with PDIA17/18 in cells where those proteins are highly expressed? Testing whether the observation is a general mechanism occurring between TMX5 and PDI family members with incomplete thioredoxin sites would be an asset.

      Major comments

      Fig 2

      • avoid labels on the blots that might obscure information and impede clarity and interpretation.
      • The % of resistant protein can be otherwise placed.
      • What does the asterix in 2B signify? This should be included in the legend.
      • A label for 'deglycosylated' proteins could be included.
      • Consider treating with PNGase to .....
      • There is a change in EndoH resistance of about 3-4% among wt, C220A & C114A, is this significant?
      • Equivalent inputs (cell lysates) for the IPs should be included.

      Fig 3A

      • Indicate the specific bands that were subject to MS. How did the authors correct for non-specific interactors and false positives? Perhaps a more specifically targeted approach could be utilised.
      • How do the authors explain the absence of bands representing the reduced form of interacting TMX5 interactors?
      • What was the inclusion and exclusion criteria used to determine which of the proteins listed were clients?
      • It might be useful to perform MS on the C220A mutant and compare those results to the WT.

      Fig 4

      • Equivalent inputs (cell lysates) for the IPs should be included.

      Fig 5

      • Equivalent inputs (cell lysates) for the IPs should be included.

      Fig 6

      • A loading control should be included.
      • Blot using anti-HA to identify ERp44 should be included to substantiate claims.
      • How do the authors account for the huge difference in TMX5 associated complexes shown in Fig 6A compared to Fig 3A.
      • Inappropriate marking on the gel area.
      • Inconsistencies in protein standard labeling
      • It might be useful to demonstrate the colocalisation of ERp44 and ERp44C29S with Giantin and with TMX5 considering that ERp44 is known to cycle between the Golgi and ER.

      Significance

      This work provides an additional understanding on how the regulation of Erp44 trafficking might occur (and perhaps additional PDIs), and lead to the characterization of kinetic value that might explain better productive protein folding in the early secretory pathway. This represents a significant advance in the field and may in turn unveil uncharacterized pathophysiological functions in various diseases.

      This is a serious study well conducted and original by an expert in the field that desserves publication.

      Field of expertise: ER homeostasis control

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): Summary: To explore the relationship between histone post-translational modifications (H3K4me3 and H3K27me3) and enhancer activation with gene expression during early embryonic development, the authors used a monolayer differentiation approach to convert mouse embryonic stem cells (ESCs) into Anterior Definitive Endoderm (ADE). They monitored differentiation stages using a dual reporter mESC line (B6), which has fluorescent reporters inserted at the Gsc (GFP) and Hhex (Redstar) loci. Their analyses indicate that the differentiating cells advanced through stages similar to those in the embryo, successfully converting into endoderm and ADE with high efficiency. This is elegant and well performed stem cell biology.

      Their subsequent genome-wide and nascent transcription analyses confirmed that the in vitro gene expression changes correlated with developmental stages and confirmed that transcriptional activation precedes mRNA accumulation. They then focussed on linking active enhancers and histone modifications (H3K4me3 and H3K27me3) were with gene expression dynamics. Finally, the performed PRC2 inhibition and showed that, while it enhanced differentiation efficiency, it also induced ectopic expression of non-lineage specific genes.

      Major comments: In terms of mechanistic advances, they propose that transcriptional up-regulation does not require prior loss of H3K27me3, which they show appears to lag behind gene activation, but critically, on a likely mixed population level. I am sceptical of their interpretation of their data because they are looking at heterogenous populations of cells. To explain, one could imagine a particular H3K27me3 coated gene that gets activated during differentiation. In a population of differentiating cells, while the major sub-population of cells could retain H3K27me3 on this particular gene when it is repressed, a minority sub-population of cells could have no H3K27me3 on the gene when it is actively transcribed. The ChIP and RNA-seq results in this mixed cell scenario would give the wrong impression that the gene is active while retaining H3K27me3, when in reality, it's much more likely that the gene is never expressed when its locus in enriched with the repressive H3K27me3 modification. Therefore, to support their claim, they would have to show that a particular gene is active when its locus is coated with H3K27me3. Personally, I don't feel this approach would be worth pursuing.

      They also report that inhibition of PRC2 using EZH2 inhibitor (EPZ6438) enhanced endoderm differentiation efficiency but led to ectopic expression of pluripotency and non-lineage genes. However, this is not surprising considering the established role of Polycomb proteins as repressors of lineage genes.

      Reviewer #1 (Significance (Required)): I feel that this is a solid and well conducted study in which the authors model early development in vitro. It should be of interest to researchers with an interest in more sophisticated in vitro differentiation systems, perhaps to knockout their gene of interest and study the consequences. However, I don't see any major mechanistic advances in this work.

      *>Author Response *

      *We agree with the point regarding the delayed loss of H3K27me3 relative to gene activation, and indeed this same point has been raised by reviewer 3 (see below). Our cell-population based data does not allow us to directly test if gene up-regulation in a small population of cells from TSSs lacking H3K27me3, accounts for the observed result. Furthermore, there are currently no robust methods to determine cell- or allele-specific expression simultaneously with ChIP/Cut and Run for chromatin marks. However, we provide the following additional evidence that strongly supports our conclusions. *

      • *

      Our FACs isolation strategy used to prepare cell populations for ChIP, microarray expression and 4sU-seq analysis is based on expression (or lack thereof) of a fluorescent GSC-GFP reporter. This means that every cell in the G+ populations express the Gsc fluorescent reporter, at least at the protein level, at the point of isolation. This is despite the presence of appreciable and invariant levels of H3K27me3 at the TSS of the Gsc gene in both G+ and G- populations at day 3 of differentiation. Comparable to our meta-analysis of all upregulated genes shown in the original manuscript (Figure 5 and S5), H3K27me3 levels are then subsequently reduced in the G+ relative to the G- populations at day 4. The transcriptional changes which correspond to the GSG-GFP reporter expression and associated ChIP-seq data are shown in the reviewer figure (Fig R1 A shown in revision plan). To further support our observations, we sought to rule out the possibility that the shift in H3K27me3 and transcription were from mutually exclusive gene sets, from nominal transcription levels or from sites with low level H3K27me3. To do this with a gene set of sufficient size to yield a robust result, we selected upregulated TSSs that had a greater than median value for both transcription (4sU-seq) and H3K27me3 (n=49 of 159 genes; Fig R1 B shown in revision plan). Meta-analysis of these genes showed that, as for all upregulated gene TSS (n=159), transcriptional activation occurred in the presence of substantial and invariant levels of H3K27me3 at day 3 followed by a subsequent reduction by day 4 of differentiation (Fig R1 C shown in revision plan). Importantly, many of these genes yielded high absolute 4sU-seq signal, comparable to that of Gsc, arguing against transcriptional activation being limited to a small subpopulation of cells.

      • *

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): In this paper the authors profile gene expression, including active transcription, and histone modifications (k4 and k27me3) during a complex differentiation protocol from ES cells, which takes advantage of FACS sorting of appropriate fluorescent reporters. The data is of good quality and the experiments are well performed. The main conclusion, that the analyzed histone marks channel differentiation more than they directly allow/block it, is well supported by the data. The paper is interesting and will represent a good addition to an already extensive literature. I have however a few major concerns, described below:

      1/ K4me3 may show more changes than they interpret, at least over the +1 nucl. An alternative quantification to aggregate profiles should be used to more directly address the questions regarding the correlations between histone mods and gene expression.

      *>Author Response *

      *Whilst we state that H3K4me3 levels are somewhat invariant at differentially expressed genes relative to H3K27me3, quantification of individual TSS (+/- 500 bp) did show a direct correlation with gene expression (Figure 5 and S5). To further explore this in response to the reviewer’s comment we will quantify K4me3 signal at the +1 nucleosome to determine if this yields more substantial differences than that observed more broadly across TSSs. *

      2/ Related to the previous point, it appears clear in Fig.4 that the promoters of each gene expression cluster do not belong to a single chromatin configuration. I think it would be important to: 1/ cluster the genes based on promoter histone mods and interrogate gene expression and cluster allocation (basically the reverse to what is presented) 2/ order the genes in the heatmaps identically for K4me3 and K27me3 to more easily understand the respective chromatin composition per cluster

      >Author Response

      We thank the reviewer for these suggestions and will include these analyses in a revised manuscript.

      3/ Also, as it is apparent that not all promoters in every cluster are enriched for the studied marks, could the authors separately analyze these genes? What are they? Do they use alternative promoters?

      >Author Response

      *Indeed, this is the case. Whilst there is significant enrichment of H3K27me3 at the TSS of developmentally regulated genes, not all genes whose expression changes during the differentiation will be polycomb targets. We will further stratify these clusters as suggested and determine what distinguishes the subsets. If informative, this data will be included in a revised manuscript. *

      4/ The use of 4SU-seq to identify active enhancers is welcome; however, I have doubts it is working very efficiently: for instance, in the snapshots shown in Fig.2A, the very active Oct4 enhancers in ES cells are not apparent at all... More validation of the efficiency of the approach seems required.

      >Author Response

      The 4sU-seq data shown in Figure 2A was generated in samples isolated from day 3 and 4 of the ADE differentiation. It is therefore likely that the enhancers have been partly or wholly decommissioned at this point. Indeed, in a separate study we generated 4sU-seq data using the same protocol and conditions as presented here but in ES cells and differentiated NPCs (day 3 to 7) and indeed see transcription at Oct 4 enhancers in ESCs (arrowed in the screenshot shown in revision plan) which are extinguished upon differentiation to neural progenitor cells (NPCs); data from PMID: 31494034).

      5/ The effects of the EZH2 inhibitor are quite minor regarding the efficiency of the differentiation as analyzed by FACS, despite significant gene expression changes. To the knowledge of this referee, this is at odds with results obtained with Ezh2 ko ES cells that display defects in mesoderm and endoderm differentiation. I have issues reconciling these results (uncited PMID: 19026780). Either the authors perform more robust assays (inducible KOs) or they more directly explain the limitations of the study and the controversies with published work.

      >Author Response

      We agree that this result appears to be at odds with the findings in (PMID: 19026780*). This is likely due to the fact that we are acutely reducing H3K27me3 levels for a short period either during or immediately preceding the differentiation rather than removing PRC2 function genetically. This, likely provides a less pronounced defect on the ability to generate endodermal cells. However, we cannot address this without further experimentation which is beyond the scope of this study. We will more fully discuss the results in the context of this and other studies and discuss the limitations of the study in this regard. *

      Minor 1/ please add variance captured to PCA plots 2/ Fig1E add color scales to all heatmaps 3/ Fig4C,D are almost impossible to follow, please find a way to identify better the clusters/samples and make easier to correlate all the variables

      • *

      >Author Response

      *We will address all of these points in a revised manuscript. *

      Reviewer #2 (Significance (Required)):

      The paper is incremental in knowledge, and not by a big margin, as it is known already that histone mods rather channel than drive differentiation. Though, the authors do not clearly address inconsistencies with published work, especially regarding Ezh2 thought to be important to make endoderm. It is however a good addition to current knowledge, provided a better discussion of differences with published work is provided.

      >Author Response

      *As outlined above, we will address this with a more complete discussion about the distinction between the studies and what can and can’t be concluded from our approach. *

      * *

      • *

      Reviewer #3 (Evidence, reproducibility and clarity (Required)): This study investigates the role of chromatin-based regulation during cell fate specification. The authors use an ESC model of differentiation into anterior primitive streak and subsequently definitive endoderm, which they traced via a dual-reporter system that combines GSC-GFP and HHEX-RedStar. The authors mapped changes in (nascent) gene expression and histone modifications (H3K4me3/H3K27me3) at key timepoints and within different populations over six days of differentiation. Finally, the authors test the functional implications of H3K27me3 landscapes via PRC2 inhibition.

      The majority of data chart the descriptive changes in (epi)genomic and transcriptional dynamics coincident with cell differentiation. The use of nascent transcriptomics improves the temporal resolution of expression dynamics, and is an important strategy. By and large the data reinforce established paradigms. For example, that transcription is the dominant mechanism regulating mRNA levels, or that dynamic chromatin states changes occur and largely corelate of gene activity. They also identify putative enhancers with profiling data, albeit these are not validated, and confirm that PRC2 inhibition impacts cell fate processes - in this case promoting endodermal differentiation efficiency. Overall, the study is relatively well-performed and clearly written, with the omics profiling adding more datasets from in vitro cell types that can be difficult to characterise in vivo. Whilst the majority of the study may be considered incremental, the key finding is the authors conclusion that H3K27me3 is subordinate to gene activity rather than an instructive repressor. If borne out, this would mark an important observation with broad implications. However, in my view this conclusion is subject to many confounders and alternative interpretations, and the authors have not ruled out other explanations. Given the centrality of this to the novelty of the study, I would encourage further analysis/stratification of existing data, and potentially further experiments to provide more confidence in this key conclusion.

      Primary issue 1.) The authors show that at the earliest timepoint (d3), nascent gene activation of a handful of genes between G+ and G- populations is not associated with a FC loss of H3K27me3. From this the authors extrapolate their key conclusion that H3K27me3 is subordinate. Causality of chromatin modifications in gene regulation is critical to decipher, and therefore this is an important observation to confirm. Below I go through the possible confounders and issues with the conclusion at this point.

      (i) Single-cell penetrance. A possible (likely?) possibility is that gene activation initially occurs in a relatively small subset of cells at d3. Because these genes are expressed lowly prior to this, they will register as a significant upregulation in bulk analysis. However, in this scenario H3K27me3 would only be lost from a small fraction of cells, which would not be detectable against a backdrop of most cells retaining the mark. In short, the authors have not ruled out heterogeneity driving the effect. Given the different dynamic range of mRNA and chromatin marks, and that a small gain from nothing (RNA) is easier to detect than a small loss from a pre-marked state (chromatin), investigating this further is critical to draw the conclusions the authors have.

      (ii) Initial H3K27me3 levels. The plots in Fig 5 show the intersect FC of H3K27me3 and gene expression. Genes that activate at d3 show no loss of H3K27me3. However, it is important to characterise (and quantitate) whether these genes are significantly marked by H3K27me3 in the first place, which I could not find in the manuscript. Many/several of the genes may not be polycomb marked or may have low levels to begin with. This would obviously confound the analysis, since an absence/low K27 cannot be significantly lost and is unlikely to be functional. Thus, the DEG geneset should be further stratified into H3K27me3+ and K27me3- promoter groups/bins, with significance and conclusions based on the former only (e.g. boxplot in 5F).

      (iii) Sample size. The conclusions are based on a relatively small number of genes that upregulate between G+ and G- (n=55 in figure by my count, text mentions n=52). Irrespective of the other confounders above, this is quite a small subset to make the sweeping general conclusion that "loss of the repressive polycomb mark H3K27me3 is delayed relative to transcriptional activation" in the abstract. Indeed, the small number of DEG suggests the cell types being compared are similar and perhaps therefore have specific genomic features (this could be looked at) that drive .

      >Author Response

      *These are very good points and are also raised by reviewer 1 (see above). We have one example where we can definitively interrogate single cell protein expression, in our current data. Gsc (as monitored by GSC-GFP FACS and the bulk RNA analysis) meets the criteria of being robustly upregulated in all FACs sorted cells in the presence of high levels of H3K27me3 in the D3G+ population. We believe that the additional analysis (Figure R1A shown in revision plan) and the discussion above addresses the reviewer’s concerns about both the levels of expression and magnitude of H3K27me3. With respect to the third point, the numbers are low (although here I present data from the 4SU analysis with approximately three times more data points) however, the point here is not too say this happens in every instance of gene activation but more that it can happen and not just at a small subset of outlier genes. This is important, as the reviewer notes, in our understanding of how polycomb repression is relieved during development. We will also look to see if there are sequence characteristics/ motifs of these genes. In a revised manuscript we would include this data and further analysis as outlined above. The reviewer points out that the numbers vary a little between analyses. This arises due to the annotation of multiple TSSs per genes in some cases. This will be rectified throughout and made clearer in the legends. *

      Other comments: 2.) The authors show that promoter H3K4me3 corelates well with gene expression dynamics in their model. They conclude that "transcription itself is required for H3K4me3 deposition", or in other words is subordinate. This may well be the case but from their correlative data this cannot be inferred. Indeed, several recent and past papers have shown that H3K4me3 itself can directly modulate transcription, for example by triggering RNA II pause-release, by preventing epigenetic silencing and/or by recruiting the PIC. The authors could point out or discuss these alternative possibilities to provide a more balanced discourse.

      >Author Response

      We agree and this will be discussed more thoroughly and both possibilities put forward in the revised manuscript.

      3.) The labelling of some figures is unclear. In Fig 4C and 4D (right) it is impossible to tell what sample each of the lines represents. It is also not clear what the blue zone corresponds to in genome view plots (the whole gene?). Moreover, the replicate numbers are not shown in figure legends.


      >Author Response

      *We agree that the data presented in 4C and D is unclear. We will, as a minimum, collapse profiles into like populations (ESC / G- / G+ / G+H- / G+H+) which makes sense given the similarity of these populations across all analyses (see e.g. PCA analysis in Figure 1). We will also explore alternative ways of presenting the data to better highlight the dynamics and incorporate this with the changes suggested by reviewer 2. The blue shaded area represents the full extent of the key gene being discussed in the screen shot, this is mentioned in the legend but will be made clearer in a revised manuscript. Replication will also be added to the legend throughout (n=2 for ChIP-seq and n=3 for 4sU-seq). *

      4.) It would be nice to provide more discussion to reconcile the conclusions that H3K27me3 in endoderm differentiation is subordinate and the final figure showing inhibiting H3K27me3 has a significant effect on differentiation, since the latter is the functional assessment.

      >Author Response

      *We will build on the points already made that suggests that whilst K27me3 is a passive repressor that serves to act against sub-threshold activating cues, it is nonetheless a critical regulator of developmental fidelity. *

      Reviewer #3 (Significance (Required)): Overall, the study's strengths are in that it characterises epigenomic dynamics within a specific and relevant cell fate model. The nascent transcriptomics adds important resolution, and underpins the core conclusions. The weakness is that data is over-interpreted at this point, and other possibilities are not adequately tested. The conclusions should therefore either be scaled back (which reduces novelty) or further analysis and/or experiments should be performed to support the conclusion. If it proves correct, this would be a significant observation for the community,

      >Author Response

      In a revised manuscript, we will address the reviewer’s concerns with additional data and discussion as indicated above.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This study investigates the role of chromatin-based regulation during cell fate specification. The authors use an ESC model of differentiation into anterior primitive streak and subsequently definitive endoderm, which they traced via a dual-reporter system that combines GSC-GFP and HHEX-RedStar. The authors mapped changes in (nascent) gene expression and histone modifications (H3K4me3/H3K27me3) at key timepoints and within different populations over six days of differentiation. Finally, the authors test the functional implications of H3K27me3 landscapes via PRC2 inhibition.

      The majority of data chart the descriptive changes in (epi)genomic and transcriptional dynamics coincident with cell differentiation. The use of nascent transcriptomics improves the temporal resolution of expression dynamics, and is an important strategy. By and large the data reinforce established paradigms. For example, that transcription is the dominant mechanism regulating mRNA levels, or that dynamic chromatin states changes occur and largely corelate of gene activity. They also identify putative enhancers with profiling data, albeit these are not validated, and confirm that PRC2 inhibition impacts cell fate processes - in this case promoting endodermal differentiation efficiency. Overall, the study is relatively well-performed and clearly written, with the omics profiling adding more datasets from in vitro cell types that can be difficult to characterise in vivo. Whilst the majority of the study may be considered incremental, the key finding is the authors conclusion that H3K27me3 is subordinate to gene activity rather than an instructive repressor. If borne out, this would mark an important observation with broad implications. However, in my view this conclusion is subject to many confounders and alternative interpretations, and the authors have not ruled out other explanations. Given the centrality of this to the novelty of the study, I would encourage further analysis/stratification of existing data, and potentially further experiments to provide more confidence in this key conclusion.

      Primary issue

      1.) The authors show that at the earliest timepoint (d3), nascent gene activation of a handful of genes between G+ and G- populations is not associated with a FC loss of H3K27me3. From this the authors extrapolate their key conclusion that H3K27me3 is subordinate. Causality of chromatin modifications in gene regulation is critical to decipher, and therefore this is an important observation to confirm. Below I go through the possible confounders and issues with the conclusion at this point.

      (i) Single-cell penetrance. A possible (likely?) possibility is that gene activation initially occurs in a relatively small subset of cells at d3. Because these genes are expressed lowly prior to this, they will register as a significant upregulation in bulk analysis. However, in this scenario H3K27me3 would only be lost from a small fraction of cells, which would not be detectable against a backdrop of most cells retaining the mark. In short, the authors have not ruled out heterogeneity driving the effect. Given the different dynamic range of mRNA and chromatin marks, and that a small gain from nothing (RNA) is easier to detect than a small loss from a pre-marked state (chromatin), investigating this further is critical to draw the conclusions the authors have.

      (ii) Initial H3K27me3 levels. The plots in Fig 5 show the intersect FC of H3K27me3 and gene expression. Genes that activate at d3 show no loss of H3K27me3. However, it is important to characterise (and quantitate) whether these genes are significantly marked by H3K27me3 in the first place, which I could not find in the manuscript. Many/several of the genes may not be polycomb marked or may have low levels to begin with. This would obviously confound the analysis, since an absence/low K27 cannot be significantly lost and is unlikely to be functional. Thus, the DEG geneset should be further stratified into H3K27me3+ and K27me3- promoter groups/bins, with significance and conclusions based on the former only (e.g. boxplot in 5F).

      (iii) Sample size. The conclusions are based on a relatively small number of genes that upregulate between G+ and G- (n=55 in figure by my count, text mentions n=52). Irrespective of the other confounders above, this is quite a small subset to make the sweeping general conclusion that "loss of the repressive polycomb mark H3K27me3 is delayed relative to transcriptional activation" in the abstract. Indeed, the small number of DEG suggests the cell types being compared are similar and perhaps therefore have specific genomic features (this could be looked at) that drive .

      Other comments:

      2.) The authors show that promoter H3K4me3 corelates well with gene expression dynamics in their model. They conclude that "transcription itself is required for H3K4me3 deposition", or in other words is subordinate. This may well be the case but from their correlative data this cannot be inferred. Indeed, several recent and past papers have shown that H3K4me3 itself can directly modulate transcription, for example by triggering RNA II pause-release, by preventing epigenetic silencing and/or by recruiting the PIC. The authors could point out or discuss these alternative possibilities to provide a more balanced discourse.

      3.) The labelling of some figures is unclear. In Fig 4C and 4D (right) it is impossible to tell what sample each of the lines represents. It is also not clear what the blue zone corresponds to in genome view plots (the whole gene?). Moreover, the replicate numbers are not shown in figure legends.

      4.) It would be nice to provide more discussion to reconcile the conclusions that H3K27me3 in endoderm differentiation is subordinate and the final figure showing inhibiting H3K27me3 has a significant effect on differentiation, since the latter is the functional assessment.

      Significance

      Overall, the study's strengths are in that it characterises epigenomic dynamics within a specific and relevant cell fate model. The nascent transcriptomics adds important resolution, and underpins the core conclusions. The weakness is that data is over-interpreted at this point, and other possibilities are not adequately tested. The conclusions should therefore either be scaled back (which reduces novelty) or further analysis and/or experiments should be performed to support the conclusion. If it proves correct, this would be a significant observation for the community,

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this paper the authors profile gene expression, including active transcription, and histone modifications (k4 and k27me3) during a complex differentiation protocol from ES cells, which takes advantage of FACS sorting of appropriate fluorescent reporters. The data is of good quality and the experiments are well performed. The main conclusion, that the analyzed histone marks channel differentiation more than they directly allow/block it, is well supported by the data. The paper is interesting and will represent a good addition to an already extensive literature. I have however a few major concerns, described below:

      1. K4me3 may show more changes than they interpret, at least over the +1 nucl. An alternative quantification to aggregate profiles should be used to more directly address the questions regarding the correlations between histone mods and gene expression.
      2. Related to the previous point, it appears clear in Fig.4 that the promoters of each gene expression cluster do not belong to a single chromatin configuration. I think it would be important to:
        • cluster the genes based on promoter histone mods and interrogate gene expression and cluster allocation (basically the reverse to what is presented)
        • order the genes in the heatmaps identically for K4me3 and K27me3 to more easily understand the respective chromatin composition per cluster
      3. Also, as it is apparent that not all promoters in every cluster are enriched for the studied marks, could the authors separately analyze these genes? What are they? Do they use alternative promoters?
      4. The use of 4SU-seq to identify active enhancers is welcome; however, I have doubts it is working very efficiently: for instance, in the snapshots shown in Fig.2A, the very active Oct4 enhancers in ES cells are not apparent at all... More validation of the efficiency of the approach seems required.
      5. The effects of the EZH2 inhibitor are quite minor regarding the efficiency of the differentiation as analyzed by FACS, despite significant gene expression changes. To the knowledge of this referee, this is at odds with results obtained with Ezh2 ko ES cells that display defects in mesoderm and endoderm differentiation. I have issues reconciling these results (uncited PMID: 19026780). Either the authors perform more robust assays (inducible KOs) or they more directly explain the limitations of the study and the controversies with published work.

      Minor

      1. please add variance captured to PCA plots
      2. Fig1E add color scales to all heatmaps
      3. Fig4C,D are almost impossible to follow, please find a way to identify better the clusters/samples and make easier to correlate all the variables

      Significance

      The paper is incremental in knowledge, and not by a big margin, as it is known already that histone mods rather channel than drive differentiation. Though, the authors do not clearly address inconsistencies with published work, especially regarding Ezh2 thought to be important to make endoderm. It is however a good addition to current knowledge, provided a better discussion of differences with published work is provided.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      To explore the relationship between histone post-translational modifications (H3K4me3 and H3K27me3) and enhancer activation with gene expression during early embryonic development, the authors used a monolayer differentiation approach to convert mouse embryonic stem cells (ESCs) into Anterior Definitive Endoderm (ADE). They monitored differentiation stages using a dual reporter mESC line (B6), which has fluorescent reporters inserted at the Gsc (GFP) and Hhex (Redstar) loci. Their analyses indicate that the differentiating cells advanced through stages similar to those in the embryo, successfully converting into endoderm and ADE with high efficiency. This is elegant and well performed stem cell biology.

      Their subsequent genome-wide and nascent transcription analyses confirmed that the in vitro gene expression changes correlated with developmental stages and confirmed that transcriptional activation precedes mRNA accumulation. They then focussed on linking active enhancers and histone modifications (H3K4me3 and H3K27me3) were with gene expression dynamics. Finally, the performed PRC2 inhibition and showed that, while it enhanced differentiation efficiency, it also induced ectopic expression of non-lineage specific genes.

      Major comments:

      In terms of mechanistic advances, they propose that transcriptional up-regulation does not require prior loss of H3K27me3, which they show appears to lag behind gene activation, but critically, on a likely mixed population level. I am sceptical of their interpretation of their data because they are looking at heterogenous populations of cells. To explain, one could imagine a particular H3K27me3 coated gene that gets activated during differentiation. In a population of differentiating cells, while the major sub-population of cells could retain H3K27me3 on this particular gene when it is repressed, a minority sub-population of cells could have no H3K27me3 on the gene when it is actively transcribed. The ChIP and RNA-seq results in this mixed cell scenario would give the wrong impression that the gene is active while retaining H3K27me3, when in reality, it's much more likely that the gene is never expressed when its locus in enriched with the repressive H3K27me3 modification. Therefore, to support their claim, they would have to show that a particular gene is active when its locus is coated with H3K27me3. Personally, I don't feel this approach would be worth pursuing.

      They also report that inhibition of PRC2 using EZH2 inhibitor (EPZ6438) enhanced endoderm differentiation efficiency but led to ectopic expression of pluripotency and non-lineage genes. However, this is not surprising considering the established role of Polycomb proteins as repressors of lineage genes.

      Referee cross-commenting

      I see that Reviewer #3 has the same concern with over interpretation of data in places - most notably their (in my view not supported) suggestion that transcriptional up-regulation does not require prior loss of H3K27me3.

      Significance

      I feel that this is a solid and well conducted study in which the authors model early development in vitro. It should be of interest to researchers with an interest in more sophisticated in vitro differentiation systems, perhaps to knockout their gene of interest and study the consequences. However, I don't see any major mechanistic advances in this work.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for taking the time to read and comprehensively evaluate our manuscript. We are pleased that, overall, they recognize the quality of our data and that it supports our conclusions. We are grateful for their comments, insights and advice and have revised the manuscript accordingly as described in the point-by-point response below. We believe that the revised manuscript is substantially improved by some experimental additions, additional replicates, improved analysis and increased clarity. Some key enhancements are as follows:

      Previously we had found increased expression of the WNT pathway following CHRDL2 treatment, using RNA seq. We have now demonstrated this experimentally using the cellular levels and localisation of β-catenin. Previously we had shown that overexpression of CHRDL2 increased resistance to common chemotherapy treatments, as well as irradiation in colorectal cell lines. We have now shown that cells surviving treatment show a further reduction SMAD1/5/8 phosphorylation indicating a selection for CHRLD2 high cells during the treatment. We have also demonstrated a decrease in chemotherapy sensitivity in intestinal organoids treated with secreted forms of CHRDL2.

      1. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      Reviewer #1

      __Evidence, reproducibility and clarity __

      Clarkson and Lewis present data suggesting that overexpression of Chordin like 2 (CHRDL2) can affect colorectal cancer cell responses to chemotherapy agents, possibly by modulating stem-cell like pathways. I have the following comments:

      1. Fig. 1J-it is standard to show the images of cell migration-this is important here, given the modest effect of CHRDL2 overexpression here.

      We have now included 3 replicate control and CHRDL2 overexpressing cell images in this figure panel to support the quantification in the graph.

      Fig. 2A-the very small error bars for most of the data on the curves suggests these are n=1 experiments with multiple technical replicates to generate the error bars. Please clarify. The legend says n=3 with ANOVA analysis but no significance detected. Please clarify.

      All experiments in this figure were done with 5 technical replicates per experiment, this was replicated at least three times to give n=3 biological replicates. The error bars represent the standard error of the mean of these 3 biological replicates as stated in the legend. Some data points showed very little data variation, hence the small error bars. Raw data is available if requested.

      1. Fig. 2B-given the overlapping error bars here, how can there be a pWe have removed this representation of the data as it combined many different experiments with variable cell types and chemotherapeutics and it was difficult to carry out meaningful statistics. An overview of the data can be better seen in table form as shown in the revised figure 2B.

      Fig. 2G-did the authors try to estimate the concentration of CHRDL2 in the conditioned medium? Which cell line was used to generate this CM?

      Conditioned media was harvested from the matching transgenic cell lines with inducible CHRDL2. eg RKO cells were treated with media collected from doxycycline induced transgenic RKO cells whereas CaCO2 cells were treated with media from CaCO2 cells. The concentration of doxycycline was represented by ++ for 10ug/ml, the same notation we have used for directly induced cells treated with 10ug/ml dox.

      We did not try to quantify the absolute concentration of CHRDL2 but we have shown the relative amount on a Western blot normalised with a ponceau stain (quantification now included in supplementary figure 1).

      We have clarified our description of this experiment, inserting the following statement, "Conditioned media was harvested from corresponding cell lines with the inducible CHRDL2 transgene and the parental control cells. Induction of CHRDL2 to generate conditioned media was carried out using the same concentration and duration of doxycycline treatment as the cells in figure 2A. "

      Fig. 5-what is the potential mechanism for gene expression changes in response to CHRDL2 overexpression? Is it all due to BMP inhibition? More mechanistic detail would be welcome here.

      We have suggested other pathways involved in these functional effects based on our RNA seq data but at the moment it is not possible to say whether any changes are independent of BMP signaling. CHRDL2 is relatively understudied and as yet there is not much literature supporting BMP independent actions of CHRDL2. However, we have added some discussion and reference to an article suggesting interactions between CHRLD2 and YAP (Wang et al., 2022) including the following statement on page 17: "While the changes in BMP and WNT signaling shown in our GSEA analysis suggest that the effects of CHRDL2 in our system work directly through inhibition of BMP, it is not possible to rule out that some pathways are affected by BMP independent actions of CHRLD2. Indeed, Wang et al, suggest that CHRDL2 can directly alter phosphorylation and activity of YAP in gastric cancer cell lines, which merits further exploration (Wang et al., 2022)"

      Significance

      Unclear whether genetically engineered inducible overexpression has any real physiological significance but we all use cell models so this is OK.

      Reviewer #2

      __Evidence, reproducibility and clarity __

      Summary: In the manuscript entitled "BMP antagonist CHRDL2 enhances the cancer stem-cell phenotype and increases chemotherapy resistance in Colorectal Cancer" the authors demonstrated that Chordin-like 2 (CHDRL2), a secreted BMP antagonist, promotes a chemo-resistant colorectal cancer stem cell phenotype through the inhibition of BMP signaling. The authors took advantage of both 2D engineered colorectal cancer (CRC) cells and healthy murine 3D organoid systems. Specifically, the authors showed a decreased proliferation rate and reduced clonogenic capability upon overexpression of CHRDL2 in established human colon cancer cell lines. Subsequently, they identified a chemo-resistant phenotype upon standard therapies (5FU, Oxaliplatin and Irinotecan) in CHDRL2 overexpressing cells by performing MTS assay. The authors showed that this chemo-resistant phenotype is associated with ATM and RAD21 activation, supporting an induction of DNA damage signaling pathway. Of note, the authors assessed that the exposure of 3D murine organoid to CHRDL2 resulted in a stem-like phenotype induction accompanied by a reduction of the differentiated counterpart. From RNA-seq data analysis emerged the upregulation of genes associated to stemness and DNA repair pathways in CHRDL2 overexpressing cells.

      Major comments: 1. In the first paragraph of the result section authors assessed that "Colorectal adenocarcinoma cell lines were deliberately chosen to encompass a range of CHRDL2 expression levels and genetic mutations", without showing qRT-PCR or WB data on the differential expression levels of CHRDL2 in a panel of immortalized CRC cell lines. Authors should include these data to better support their choice.

      *We have now included some qRT-PCR in supplementary figure 1 alongside a table of some of the key driver mutations in each cell line. Western blotting of these cells shows only a very low concentration of CHRDL2 protein. As shown in figure 1B in the control columns, no significant protein expression is observed in any line. *

      In Figure 1F, authors described a reduction of cell proliferation in CRC cell lines expressing high levels of CHRDL2 only under low glucose conditions. Why did the authors perform the assay under these conditions? They should better argue this aspect and validated the role of CHRDL2 in metabolism rewiring by performing additional in vitro assays.

      We have removed this aspect of the paper as it does not add significantly to our overall conclusions and we can clearly see the effects of CHRDL2 overexpression under standard growth conditions (Figure 1G).

      The authors should evaluate the role of CHRDL2 in promoting a stem-like phenotype in human colon cancer stem cells freshly isolated from patients and characterized.

      We would very much like to do experiments such as this but it is beyond the scope of this study and will be included in upcoming grant proposals.

      In order to confirm the data obtained on 3D murine organoids system obtained from normal Intestinal Stem Cells, authors should investigate the stemness induction, driven by CHRDL2, also in human intestinal organoids.

      Experiments using human intestinal organoids are currently planned and ethical approval applications and grant proposals are underway for future experiments of this nature.

      The authors should evaluate the oncogenic role of CHRDL2, through the maintenance of stemness, by performing orthotopic or subcutaneous experiments in vivo model.

      Similarly, this is not possible for this manuscript but is planned for the future alongside a transgenic mouse model of inducible CHRDL2 overexpression in the intestine.

      BMPs proteins are part of a very broad protein family. In the introduction section, authors should indicate the specific BMP protein on which CHRDL2 exerts its inhibitory function. Moreover, they should have assessed BMP protein levels in CACO2, LS180, COLO320 and RKO cell lines.

      We have clarified the interactions between CHRDL2 and specific BMPs in the introduction. We have not specifically assessed the BMP protein levels in our cells however we have now included an analysis of expression data from the Cancer Cell Line Encyclopedia in supplementary figure 1 C.

      In first panel, the authors should quantify the secreted levels of CHRDL2 in the media of overexpressing CHRDL2 cell lines.

      We have done this using the ponceau staining as a loading control and the results are displayed (supplementary figure 1).

      In Figure 2D the authors should use the appropriate controls and describe this with more details in results section.

      In this figure we have used Hoechst staining followed by FACs analysis to identify the cell cycle profile of our CHRDL2 treated cells. We have improved the description of this in the methods section. Appropriate controls for staining, both negative and positive, are used when setting up the analysis for this experiment. The cell cycle profile is calculated using the Novocyte in house software. We have now included the histogram plots in the main figure to clarify these data in figure 2D.

      In Figure 3A, the authors should have performed the assay by choosing IC50.

      *We attempted these experiments with the IC50 levels, however the high amount of cell death and frequency of apoptotic cells meant that clear images were difficult to obtain. We therefore reduced the concentrations and still had very measurable effects. *

      In Supplementary Fig. 4A-B. the results are unclear. The control cell lines are already chemo resistant.

      Again, we used IC25 levels of the drugs so that our cells were damaged but still live throughout the experiment. This has been explained on page 10.

      The authors should review and add statistical analysis in both main and supplementary figures.

      *We have now added additional details about statistical analysis throughout the figures, legend and main text, showing all significance levels as well as non-significance for each data set. * Minor comments: 1. The quality of immunofluorescence and WB images should be implemented, and in the immunofluorescence panels scale bars should be added.

      We have added or improved scale bars on each immunofluorescence image. Western blot images have been improved.

      In the graphical abstract authors reported that CHRDL2 overexpression increase WNT and EMT pathways, without performing any specific assay to demonstrate this. Authors should correct and graphically improve the graphical abstract.

      *This is a good point and we have now carried out Beta-catenin immunofluorescence as a measure of WNT signaling on both our cancer cell lines - showing an increase in nuclear beta-catenin (figure 1J and K), and our organoids - showing an increase in overall levels and cytoplasmic staining (Figure 4 F). In terms of EMT markers we have carried out immunofluorescence on IQGAP1 (Figure 1I). IQGAP1 is significantly upregulated in CHRDL2 cells, reflecting its role in reduced cell adhesion and increased migration. This correlates with our data showing increased cellular migration as well as the increase in EMT related transcription in our RNAseq data. *

      The term "significantly" in the discussion section is inappropriately referred to data showed in the histogram in Figure 1J. Moreover, in Figure 1Jthe authors should delete from the y-axis the term "corrected".

      We have changed significantly to substantially

      The term "significant" in discussion is inappropriately referred to BMI1 expression level if compared to the histogram in Figure 4G.

      We have changed significantly to "a trend to increase"

      In Figure 2C the authors should add the unit of measurement (fold over control) in the table.

      We have done this

      In Figure 4E the authors should add the figure legend reporting OLFM4 protein.

      We have done this

      The authors should include few sentences summarizing the findings at the end of each paragraph.

      We have added short summaries at the start or end of each section to improve the flow of the results section.

      Significance

      General assessment: Overall, the work is aimed to elucidate the role of CHRDL2 already considered a poor prognosis biomarker involved in the promotion of CRC (PMID: 28009989), in promoting stem-like properties. The authors elucidated new additional insights into the molecular mechanisms regulating stemness phenotype induced by the BMP antagonist CHRDL2 in CRC. The authors include in the study a large amount of data, which only partially support their hypothesis. However, this manuscript lacks organization and coherence, making it challenging to follow and read. Numerous concerns need to be addressed, along with some sentences to rephrase in the result and discussion sections.

      Advance: The manuscript reported some functional insights on the role of CHRDL2 in colorectal cancer, but additional data should be added to support authors 'conclusions.

      Audience: The manuscript is suggested for basic research scientists.

      __Reviewer #3 (Evidence, reproducibility and clarity (Required)):____ __ Summary BMP antagonist CHRDL2 enhances the cancer stem-cell phenotype and increases chemotherapy resistance in Colorectal Cancer Eloise Clarkson et al. The manuscript explored the function of CHRDL2, a BMP antagonist, on colorectal cancer (CRC). The authors found that CHRDL2 overexpression can enhance the survival of CRC cells during chemotherapy and irradiation treatment with elevated levels of stem-cell markers and reduced differentiation. Further RNA-seq analysis revealed that CHRDL2 increased the expression of stem-cell markers, WNT signaling and other well-established cancer-associated pathways. Overall, the manuscript is well-written and presented. I have some suggestions:

      Major points:

      1. The authors assert BMP antagonism was demonstrated by assessing levels of phosphorylated SMAD1/5 (Figure 1G). However, the immunoblotting assay only depicted P-SMAD 1/5 levels and B-ACTIN as internal control. It's suggested to include total-SMAD1/5 immunoblotting as an internal control to further support the claim of BMP antagonism.

      The reviewer is correct that this is the best control. Western blotting has now been performed with total SMAD1 protein expression used as an internal control and this is shown in Figure 1D and Supplementary figure 1F

      The authors argue that CHRDL2 overexpression reduced the proliferation of CRC cell lines, as evidenced by cell proliferation assays. However, from Figure 1E, the reduction in proliferation appears insignificant. It would be beneficial to perform one-way ANOVA tests on each time point for CHRDL2+ and CHRDL2++ with Control in Figure 1E to ascertain significance.

      *We now have repeated this experiment to reduce variability and have also provided two-way ANOVA analysis between Control and CHRDL2+ and Control and CHRDL2++. One-way ANOVA at timepoint 96hr also provided with details in the figure legend. *

      The findings indicating that overexpressing CHRDL2 can confer resistance to chemotherapy in CRC cells (Figure 2A-C) are noteworthy. To deepen the understanding of BMP signaling in cancer stemness and the molecular underpinning of CHRDL2 antagonism, additional western blot assays on P-SMAD1/5 with CHRDL2 overexpression and drug treatment are recommended.

      *Western blotting of P-SMAD1/5 upon cells treated with IC50 5FU has now been performed in figure 2C (in the same experiment as the revised panels in figure 1D). The data suggest that CHRDL2 overexpressing cells able to survive chemotherapy have higher levels of P-SMAD1/5 reduction compared to that of untreated cells, strongly suggesting that chemotherapy treatment acts to select the cells with the highest CHRDL2 expression. We thank reviewer 3 for this suggested experiment and have included further discussion on this on page 8. *

      The assertion that extrinsic CHRDL2 addition diminishes differentiation and enhances stem-cell numbers in an intestinal organoid model is intriguing. As BMP signaling inhibition contributes to intestinal cell stemness, incorporating additional layers for BMP antagonism of CHRDL2 on intestinal organoids through immunoblotting or real-time quantitative PCR for treated organoids would augment the conclusions.

      As stated in the response to reviewer 2, we have investigated Beta-catenin in our organoids following CHRDL2 treatment using immunofluorescence and find that the levels are increased with the staining shifting from the membrane to the cytoplasm and nucleus (Figure 4F).

      The authors claim CHRDL2 overexpression can decrease BMP signaling based on GSEA analysis (Figure 5E). However, the GSEA results did not demonstrate the downregulation of BMP signaling. Reanalysis of this GSEA analysis is warranted.

      *We agree with this point and have changed the description of this result since the gene set covers both positive and negative regulators of the BMP pathway. We cannot conclusively say from this RNAseq data set that BMP signaling is "downregulated", however since SMAD phosphorylation is increased and nuclear beta-catenin is increased, overall we suggest that the changes we see are likely to represent the effects of decreased BMP signaling along with increased WNT signaling. *

      Minor Points:

      6.Provide the threshold/cutoff values chosen for differential expressed genes (DEGs) in CHRDL2+ and CHRDL2++ RNA-seq compared with control cells. Explain the minimal overlap between CHRDL2 LOW and CHRDL2 HIGH DEGs. Consider presenting all DEGs in CHRDL2 LOW and CHRDL2 HIGH compared with control cells in one gene expression heatmap for better visualization.

      We have now provided the cutoff values for the DEGs in the legend for figure 5 (PThe minimal overlap of DEGs in the low and high expressing cells is an interesting point. We hypothesize that this may be related to the different effects of intermediate vs high levels of WNT signaling that occurs in colon cancer cells, frequently discussed in the literature as the "Just right hypothesis" (Lamlum et al. 1999, Albuquerque et al., 2002, Lewis et al., 2010). However, we haven't included this in the discussion as it merits further exploration. However, we have mainly focused on specific genes that are modified in both data sets, which are more likely to be the direct result of CHRDL2 modification. *

      After DEGs analysis, perform Gene Ontology (GO) analysis on these DEGs to further investigate possible gene functions rather than selectively discussing some genes, enhancing understanding of CHRDL2 functions in CRC cells.

      We have carried out this analysis using a variety of tools and have now included a Gene Ontology Panther analysis as supplementary figure 7. We have included a comment on this in the text on page 14 saying "Gene ontology analysis supports these findings with enrichment in biological processes such as cellular adhesion, apoptosis and differentiation. "

      Conduct similar experiments in both 2D culture and organoid systems, if feasible, to provide more comprehensive insights into CHRDL2's oncogenic roles in CRC tumor progression.

      *We have now performed chemotherapy treatment on our organoid systems, and have found that organoids with extrinsic CHRDL2 addition have a higher survival rate after chemotherapy compared to a control (Figure 4H and I). *

      Label significance (*, **, ***, and n.s.) for every CRC cell line treated with CHRDL2 in Figure 2D, 2F, 2J, 4G, 5D, and 5F.

      We have done this

      Label the antibodies with different colors used for immunofluorescence in the figure text in Figure 4E.

      We have done this

      * * Include replicate dots for the Control group in the bar plots in Figure 1F and 2B.

      We have done this

      * * Add scale bars in Figure 3A and correct similar issues in other figures if applicable.

      We have done this

      * *13.Correct grammar and punctuation mistakes throughout the manuscript. For example:

      We have done this and further proofread our revised manuscript

      Page 7: "As seen in Figure 1J, CHRDL2 overexpression significantly increased the number of migrated cells (P *We have now added additional details about statistical analysis throughout the figures, legend and main text, showing all significance levels as well as non-significance for each data set. * Reviewer #3 (Significance (Required)):

      The current study presents compelling evidence demonstrating that BMP signaling antagonist CHRDL2 enhances colon stem cell survival in colorectal cancer cell lines and organoid models. Further validation through CRC mouse models could offer invaluable insights into the clinical relevance and therapeutic implications of CHRDL2 in colorectal cancer.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      BMP antagonist CHRDL2 enhances the cancer stem-cell phenotype and increases chemotherapy resistance in Colorectal Cancer Eloise Clarkson et al. The manuscript explored the function of CHRDL2, a BMP antagonist, on colorectal cancer (CRC). The authors found that CHRDL2 overexpression can enhance the survival of CRC cells during chemotherapy and irradiation treatment with elevated levels of stem-cell markers and reduced differentiation. Further RNA-seq analysis revealed that CHRDL2 increased the expression of stem-cell markers, WNT signaling and other well-established cancer-associated pathways. Overall, the manuscript is well-written and presented. I have some suggestions:

      Major points:

      1. The authors assert BMP antagonism was demonstrated by assessing levels of phosphorylated SMAD1/5 (Figure 1G). However, the immunoblotting assay only depicted P-SMAD 1/5 levels and B-ACTIN as internal control. It's suggested to include total-SMAD1/5 immunoblotting as an internal control to further support the claim of BMP antagonism.
      2. The authors argue that CHRDL2 overexpression reduced the proliferation of CRC cell lines, as evidenced by cell proliferation assays. However, from Figure 1E, the reduction in proliferation appears insignificant. It would be beneficial to perform one-way ANOVA tests on each time point for CHRDL2+ and CHRDL2++ with Control in Figure 1E to ascertain significance.
      3. The findings indicating that overexpressing CHRDL2 can confer resistance to chemotherapy in CRC cells (Figure 2A-C) are noteworthy. To deepen the understanding of BMP signaling in cancer stemness and the molecular underpinning of CHRDL2 antagonism, additional western blot assays on P-SMAD1/5 with CHRDL2 overexpression and drug treatment are recommended.
      4. The assertion that extrinsic CHRDL2 addition diminishes differentiation and enhances stem-cell numbers in an intestinal organoid model is intriguing. As BMP signaling inhibition contributes to intestinal cell stemness, incorporating additional layers for BMP antagonism of CHRDL2 on intestinal organoids through immunoblotting or real-time quantitative PCR for treated organoids would augment the conclusions.
      5. The authors claim CHRDL2 overexpression can decrease BMP signaling based on GSEA analysis (Figure 5E). However, the GSEA results did not demonstrate the downregulation of BMP signaling. Reanalysis of this GSEA analysis is warranted.

      Minor Points:

      6.Provide the threshold/cutoff values chosen for differential expressed genes (DEGs) in CHRDL2+ and CHRDL2++ RNA-seq compared with control cells. Explain the minimal overlap between CHRDL2 LOW and CHRDL2 HIGH DEGs. Consider presenting all DEGs in CHRDL2 LOW and CHRDL2 HIGH compared with control cells in one gene expression heatmap for better visualization. 7. After DEGs analysis, perform Gene Ontology (GO) analysis on these DEGs to further investigate possible gene functions rather than selectively discussing some genes, enhancing understanding of CHRDL2 functions in CRC cells. 8. Conduct similar experiments in both 2D culture and organoid systems, if feasible, to provide more comprehensive insights into CHRDL2's oncogenic roles in CRC tumor progression. 9. Label significance (, , **, and n.s.) for every CRC cell line treated with CHRDL2 in Figure 2D, 2F, 2J, 4G, 5D, and 5F. 10. Label the antibodies with different colors used for immunofluorescence in the figure text in Figure 4E. 11. Include replicate dots for the Control group in the bar plots in Figure 1F and 2B. 12. Add scale bars in Figure 3A and correct similar issues in other figures if applicable. 13.Correct grammar and punctuation mistakes throughout the manuscript. For example:

      Page 7: "As seen in Figure 1J, CHRDL2 overexpression significantly increased the number of migrated cells (P

      Page 7: "As seen in Figure 1J, CHRDL2 overexpression significantly increased the number of migrated cells (P

      Page 7: "As seen in Figure 1J, CHRDL2 overexpression significantly increased the number of migrated cells (P < 0.0449)," suggesting increased migratory ability, a hallmark of cancer stem cells."

      Page 8: "CHRDL2 overexpression resulted in an approximate twofold increase in IC50 values compared to control cells (P < 0.001)."

      Page 10: "As seen in Figure 4B, upon the" should be corrected to "Figure 4B."

      1. Specify the statistical methods or estimates used for determining statistical significance.

      Significance

      The current study presents compelling evidence demonstrating that BMP signaling antagonist CHRDL2 enhances colon stem cell survival in colorectal cancer cell lines and organoid models. Further validation through CRC mouse models could offer invaluable insights into the clinical relevance and therapeutic implications of CHRDL2 in colorectal cancer.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In the manuscript entitled "BMP antagonist CHRDL2 enhances the cancer stem-cell phenotype and increases chemotherapy resistance in Colorectal Cancer" the authors demonstrated that Chordin-like 2 (CHDRL2), a secreted BMP antagonist, promotes a chemo-resistant colorectal cancer stem cell phenotype through the inhibition of BMP signaling. The authors took advantage of both 2D engineered colorectal cancer (CRC) cells and healthy murine 3D organoid systems. Specifically, the authors showed a decreased proliferation rate and reduced clonogenic capability upon overexpression of CHRDL2 in established human colon cancer cell lines. Subsequently, they identified a chemo-resistant phenotype upon standard therapies (5FU, Oxaliplatin and Irinotecan) in CHDRL2 overexpressing cells by performing MTS assay. The authors showed that this chemo-resistant phenotype is associated with ATM and RAD21 activation, supporting an induction of DNA damage signaling pathway. Of note, the authors assessed that the exposure of 3D murine organoid to CHRDL2 resulted in a stem-like phenotype induction accompanied by a reduction of the differentiated counterpart. From RNA-seq data analysis emerged the upregulation of genes associated to stemness and DNA repair pathways in CHRDL2 overexpressing cells.

      Major comments:

      1. In the first paragraph of the result section authors assessed that "Colorectal adenocarcinoma cell lines were deliberately chosen to encompass a range of CHRDL2 expression levels and genetic mutations", without showing qRT-PCR or WB data on the differential expression levels of CHRDL2 in a panel of immortalized CRC cell lines. Authors should include these data to better support their choice.
      2. In Figure 1F, authors described a reduction of cell proliferation in CRC cell lines expressing high levels of CHRDL2 only under low glucose conditions. Why did the authors perform the assay under these conditions? They should better argue this aspect and validated the role of CHRDL2 in metabolism rewiring by performing additional in vitro assays.
      3. The authors should evaluate the role of CHRDL2 in promoting a stem-like phenotype in human colon cancer stem cells freshly isolated from patients and characterized.
      4. In order to confirm the data obtained on 3D murine organoids system obtained from normal Intestinal Stem Cells, authors should investigate the stemness induction, driven by CHRDL2, also in human intestinal organoids.
      5. The authors should evaluate the oncogenic role of CHRDL2, through the maintenance of stemness, by performing orthotopic or subcutaneous experiments in vivo model.
      6. BMPs proteins are part of a very broad protein family. In the introduction section, authors should indicate the specific BMP protein on which CHRDL2 exerts its inhibitory function. Moreover, they should have assessed BMP protein levels in CACO2, LS180, COLO320 and RKO cell lines.
      7. In first panel, the authors should quantify the secreted levels of CHRDL2 in the media of overexpressing CHRDL2 cell lines.
      8. In Figure 2D the authors should use the appropriate controls and describe this with more details in results section.
      9. In Figure 3A, the authors should have performed the assay by choosing IC50.
      10. In Supplementary Fig. 4A-B. the results are unclear. The control cell lines are already chemoresistant.
      11. The authors should review and add statistical analysis in both main and supplementary figures.

      Minor comments:

      1. The quality of immunofluorescence and WB images should be implemented, and in the immunofluorescence panels scale bars should be added.
      2. In the graphical abstract authors reported that CHRDL2 overexpression increase WNT and EMT pathways, without performing any specific assay to demonstrate this. Authors should correct and graphically improve the graphical abstract.
      3. The term "significantly" in the discussion section is inappropriately referred to data showed in the histogram in Figure 1J. Moreover, in Figure 1Jthe authors should delete from the y-axis the term "corrected".
      4. The term "significant" in discussion is inappropriately referred to BMI1 expression level if compared to the histogram in Figure 4G.
      5. In Figure 2C the authors should add the unit of measurement (fold over control) in the table.
      6. In Figure 4E the authors should add the figure legend reporting OLFM4 protein.
      7. The authors should include few sentences summarizing the findings at the end of each paragraph.

      Significance

      General assessment:

      Overall, the work is aimed to elucidate the role of CHRDL2 already considered a poor prognosis biomarker involved in the promotion of CRC (PMID: 28009989), in promoting stem-like properties. The authors elucidated new additional insights into the molecular mechanisms regulating stemness phenotype induced by the BMP antagonist CHRDL2 in CRC. The authors include in the study a large amount of data, which only partially support their hypothesis. However, this manuscript lacks organization and coherence, making it challenging to follow and read. Numerous concerns need to be addressed, along with some sentences to rephrase in the result and discussion sections.

      Advance: The manuscript reported some functional insights on the role of CHRDL2 in colorectal cancer, but additional data should be added to support authors 'conclusions.

      Audience: The manuscript is suggested for basic research scientists.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Clarkson and Lewis present data suggesting that overexpression of Chordin like 2 (CHRDL2) can affect colorectal cancer cell responses to chemotherapy agents, possibly by modulating stem-cell like pathways. I have the following comments:

      1. Fig. 1J-it is standard to show the images of cell migration-this is important here, given the modest effect of CHRDL2 overexpression here.
      2. Fig. 2A-the very small error bars for most of the data on the curves suggests these are n=1 experiments with multiple technical replicates to generate the error bars. Please clarify. The legend says n=3 with ANOVA analysis but no significance detected. Please clarify.
      3. Fig. 2B-given the overlapping error bars here, how can there be a p<0.01 between the groups?
      4. Fig. 2G-did the authors try to estimate the concentration of CHRDL2 in the conditioned medium? Which cell line was used to generate this CM?
      5. Fig. 5-what is the potential mechanism for gene expression changes in response to CHRDL2 overexpression? Is it all due to BMP inhibition? More mechanistic detail would be welcome here.

      Significance

      Unclear whether genetically engineered inducible overexpression has any real physiological significance but we all use cell models so this is OK.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We would like to thank the reviewers for their overall positive assessment of our manuscript. We have used their constructive feedback to substantially improve our manuscript as described below.

      Reviewer #1

      Evidence, reproducibility and clarity

      This study by Reyes at al is a well conducted analysis of memory B cell dynamics of Plasmodium falciparum (Pf) -specific B cell populations over the course of reducing Pf prevalence in ten Ugandan adults. The data is presented well and the authors provide compelling evidence that 1. There is an overall loss of Ag specific B cells with reduction in exposure and 2. Different antigens (MSP1/AMA-1 vs CIDRa-1) generate different flavors of long lived responses. However, additional clarity to the reader should be provided on certain topics (listed below).

      Major comments: 1. While the premise of the study (reduced Pf transmission due to the use of indoor residual spraying (IRS)) is an important one, I think the authors must take into consideration that 9/10 subjects had at least one Pf positive episode between Time Points 1 and 2 (Figure 1). Also, it looks from Fig 1 that some samples were collected at a time of Pf positive test (green squares), while in Table S1 none of the subjects have a positive parasite status at TP1.

      We recognize that most individuals had detectable parasitemia before and after time point (TP) 1. In our manuscript, we therefore do not report the time between TP1 and TP2, because we agree that the length of this time interval is not relevant in our study methodology. We only mention the time between the last known P. falciparum infection and collection of blood at the second time point. We use the sample collected at TP1 only as a representative sample obtained during a time with high P. falciparum exposure and do not make any claims based on the time between TP1 and TP2. The occurrence of infections after sample collection at TP1 confirms that parasite transmission was still high at this time. We have added a schematic of the relative levels of parasite transmission to Figure 1 to emphasize this.

      With respect to infection status, none of the donors were blood smear positive at TP1. However, as mentioned in Table S1, parasites were detected in three individuals using the more sensitive LAMP assay. These three individuals are therefore marked as parasite positive in Figure 1. Table S1 has been modified to highlight the parasite status of these three individuals.

      1. Figure S1A: What is trBC? Figure S1B: What is Strep? Are the strep positive cells also CIDR-1 positive and were they gated out? Why is APC used for MZ-1 and one of the MSP1-AMA-1 tetramers? Do these stainings come from multiple panels?

      All abbreviations of B cell populations were defined in the figure legend (for example, trBC stands for transitional B cells). To facilitate the interpretation of Figure S1, we have now included the definitions of these abbreviations in the figure.

      Strep stands for streptavidin, which has now also been clarified in the figure. In our gating strategy, we used the term “strep” to denote cells that bound to both CIDRa1 and MSP1/AMA1 tetramers, which we interpreted as non-specific binding to streptavidin or other components of the antigen tetramers. Only the “non-strep” cells were used to gate on antigen-specific cells. We have added this clarification to the figure legend.

      In panel B, we accidentally used the term MZ (for merozoite) to describe tetramers of the merozoite antigens MSP1 / AMA1. These labels are interchangeable, but to avoid confusion, MZ-1 has been changed to MSP1 / AMA1.

      1. Figure 3A: how many cells does the umap plot represent? Were there a total of 3555 Ag specific B cells that were non-naive (Figure 3E)?

      It is correct that there were a total of 3,555 antigen-specific B cells used for the clustering shown in panel A. This information has been added to Figure 3A.

      1. Could the authors comment on why in Figure 3, Ig isotype expression was not considered for clustering? This would allow for characterization of DN sub populations/ clusters in addition to the CD21-CD27- ABCs? It looks like IgD expression was low across the clusters (Figure 3D). Was this the case for the cells considered in this analysis, or was it excluded? If it was truly low expressed, how were the assessments in Figure 2 made?

      From prior experience, we know that Ig isotype information tends to dominate in the clustering, which would result in major clusters based on IgM, IgD, IgG, and IgA expression, not on expression of other markers. This is illustrated in the example below. The UMAP on the left shows clusters in green and red that consist of IgG+ and IgA+ B cells, respectively. The UMAP on the right shows that switched memory (swM) B cells and DN B cells are found in both IgG and IgA clusters. Because we were mainly interested in identifying different subsets of B cells, irrespective of Ig isotype, we did not include Ig isotype in the clustering. We have clarified in the manuscript that Ig isotypes were excluded from the analysis to prevent these from dominating the clustering:

      “Unsupervised clustering was then performed based on expression of all markers, except for Ig isotypes to prevent these from dominating the clustering.”

      IgD expression among cell clusters shown in Figure 3 was low because only non-naïve B cells were included in the analyis. The majority of non-naïve cells are class-switched memory B cells and DN B cells, which by definition do not express IgD (see gating strategy in Figure S1A). Figure 2 shows all B cell populations, including naïve B cells and non-naïve B cell populations (unswitched memory, switched memory, and DN), that were gated based on IgD and CD27 expression.

      5.Are there differences in these designations / phenotypes of DN populations of atBCs vs CD21-CD27- atBCs?

      In the malaria field, atypical B cells are typically defined as CD21-CD27-. The definition of DN2 B cells comes from the autoimmunity field and is stricter: IgD-CD27-CD21-CD11c+ B cells. In our manuscript, we define atypical B cells in a stricter way than typically done in the malaria field, following published guidelines for the identification of B cell subsets (https://doi.org/10.3389/fimmu.2019.02458). Using these guidelines, atypical B cells and DN2 B cells are phenotypically identical. We have added a reference to these published guidelines in the Results section:

      “Following published guidelines for the identification of B cell populations (21), total CD19+ B cells were divided into naïve B cells (IgD+CD27-), unswitched memory B cells (IgD+CD27+), switched memory B cells (IgD-CD27+), and double negative B cells (IgD-CD27-).”

      1. Lines 258-259: In considering only switched MBCs, what clusters from Figure 3a were included? There seem to be 2588 sw MBCs (Table S3, Figure 4). Do the remaining cells (967 cells) come from clusters 2, 5 and 6 (and excludes the atBC clusters)

      This analysis did not use the clusters presented in Figure 3, but instead used switched memory B cells gated as shown in Figure S1A. The reason for this is that the clusters in Figure 3 were generated using antigen-specific B cells and cannot be reproduced using non-antigen-specific B cells. Thus, it is not possible to separate all other B cells into the same six clusters. The only way to compare expression of certain markers between antigen-specific and non-antigen specific switched memory B cells is to gate on these populations manually. We have now tried to clarify this in the manuscript as follows:

      “we determined the percentages of CD95+ cells and CD11c+ cells among antigen-specific switched memory B cells and the total population of switched memory B cells (gated manually as shown in Figure S1A).”

      Minor comments: 1. Line 178- 179: Was there a specific measure of rate of decline made for these cells?

      We did not calculate a rate of decline of antigen-specific B cells for several reasons: 1) the time between TP1 and TP2 is not the same for all people in the study, 2) the time between last exposure and TP2 is not the same for all people, and 3) the rate of decline is most likely not linear and cannot accurately be estimated with only two data points. We have changed the wording of this sentence such that we do not use the word “rate”:

      “we did not observe a difference in the percentage of B cells with specificity for merozoite antigens or variant surface antigens that were lost.”

      In addition, we included the percentage of reduction in size in the paragraph before this section:

      “we observed that both populations decreased in size by about 50%, although these differences were not statistically significant.”

      Significance

      General assessment: Strengths: The authors provide evidence that the dynamics of antigen specific cells in humans can vary with exposure and with the nature of the antigen. They have nicely discussed the potential causes for these differences (Discussion), although they should include the findings of Ambegaonkar et al that ABCs in malaria may be restricted to responding specifically to membrane bound antigens (PMCID: PMC7380957)

      As suggested by the reviewer, we have added a paragraph to the Discussion section to discuss the results reported by Ambegaonkar et al. and how the difference between soluble vs. membrane-bound antigens may have an effect on how these antigens are perceived by B cells:

      The difference between soluble and membrane-bound antigens may also have a direct effect on how these antigens are perceived by B cells. Atypical B cells have been shown to be restricted to recognition of membrane-bound antigens (41). The interaction of a B cell with membrane-associated antigen allows the formation of an immunological synapse. Inhibitory receptors expressed by atypical B cells are excluded from this synapse, resulting in B cell receptor signaling and differentiation towards antibody-secreting cells (41). This could explain why atypical B cell subset 1 that expresses the highest levels of the inhibitory receptor FcRL5 is enriched for recognition of the CIDRα1 domain of membrane-bound protein PfEMP1. It should however be noted that soluble antigen can also be presented effectively in membrane-context by conventional dendritic cells, follicular dendritic cells, and subcapsular macrophages in secondary lymphoid organs, especially when it is part of an immune complex (reviewed in (42)). This would provide a route for atypical B cells to also respond to soluble merozoite antigens, such as MSP1 and AMA1.

      Limitations: 1. Outlined above, and as the authors also mention, a small sample size and homogenous population. 2. The evidence for reduced transmission is not clear, and the negative parasite tests for donors shown in Table S1 do not match with Figure 1 data. 3. Lack of IgD expression across clusters (Figure 3D- the authors will need to clarify this point) would require re-analysis of Figure 2 data

      1. We have provided clarification in response to the points raised by the reviewer.

      2. We believe there is clear evidence for reduced transmission, from a median of almost 2 infections per person per year prior to the implementation of IRS to a median parasite-free period of 1.7 years prior to sample collection at TP2. To further emphasize this, we have summarized the number of P. falciparum infections among the ten individuals included in this study (now included in Table S3):

      year

      Pf infections

      comment

      2012

      20

      2013

      19

      TP1

      2014

      20

      TP1

      2015

      8

      Start IRS

      2016

      0

      TP2

      This reduced parasite exposure is reflected in a decrease in immune activation as presented in Figure 2. We have clarified that the data in Table S1 did indeed match those shown in Figure 1.

      1. We have clarified that IgD expression is low in the clusters presented in Figure 3 because naïve B cells were excluded from this analysis.

      Advances: This study highlights the importance of studying antigen specific B cells in humans in the context of natural infection and the use of high-parameter tools such as spectral flow cytometry in assessing a large quantity of data from limited clinical samples. These data are important to inform better vaccine design. Studies in inbred animals can be quite limited or different from human B cell responses.

      Audience: This study will be of interest to malariologists and B cell immunologists. Atypical B cells are relevant to many infectious diseases and auto immunity, while the dynamics of memory B cells in malaria all be relevant to those interested in vaccine design against blood stage antigens.


      Reviewer #2

      Evidence, reproducibility and clarity

      Summary: In this study, the authors compared long-lived total and antigen (ag)-specific B-cell levels in a cohort of 10 Ugandan malaria patient samples that were collected before and after local reduction of P. falciparum transmission (pre/post-IRS). The focus is on the novel comparison of the two most common malaria antigens: merozoite antigens (MSP1/AMA1) and variant surface antigens (CIDRα1). Using high-parameter spectral flow cytometry, they also characterized the phenotype of the different populations of cells. Their main findings include 1) a decrease in activated but maintenance of resting ag-specific B-cells in the post-IRS sample and 2) CD95 and CD11c, as the only differentially expressed markers between MSP1/AMA1-specific and CIDRα1-specific long-lived memory B cells. Their further phenotypic characterization suggests functional consequences with MSP1/AMA1-specific B-cells being poised for rapid antibody-secreting cell differentiation while CIDRα1-specific B cells were enriched among a subset of atypical B cells that seem poised for antigen presentation (CD86+CD11chi/ AtBC1). Their findings consolidate and further expand our knowledge of long-lived B-cell levels during P. falciparum malaria and report/compare (for the first time to my knowledge) a differential selection of long-lived B-cell levels between these 2 antigen specificities. Overall, the manuscript is straightforward and well-written and the authors did a good job explaining their methodology, findings, and interpretations. I believe the major gap missing in this study is the reconciliation of long-lived antigen-specific B-cell levels with the serum antigen-specific antibody levels of these patients against the same 2 antigens (MSP1/AMA1 and CIDRα1) in the experiments and the discussion. The antibody data would strengthen their main argument and is the main missing piece for characterizing more completely the long-lived antigen-specific humoral responses. Below are my suggestions that would help improve the manuscript:

      Major comments: 1. Serum Anti-Pf antibodies: Do the authors have access to the serum/plasma of these patients? It would be important to correlate the total and ag-specific B-cell populations with levels of serum IgG antibodies against those specific Pf antigens (MSP1/AMA1 and CIDRα1) and total IgG levels to strengthen their point about long-lived humoral responses.

      To our understanding, the rationale for such an analysis would be that if IgG levels correlated with the size of a certain B cell population, it would suggest that this B cell population is implicated in the production of IgG against a particular antigen. While a correlation between the percentage of memory B cells and IgG titers has been observed for antigens from several viruses and bacteria (1-4), other studies have reported the absence of such a correlation (4-7). Similarly, for P. falciparum antigens, a moderate correlation between memory B cell abundance and IgG titers has been observed for some merozoite antigens, but not for others (8, 9). The lack of a correlation between the magnitude of the memory B cell and the antibody response fits with the prevailing model that memory B cells and plasma cells are two independently controlled arms of the humoral immune system (10, 11). Given the lack of strong evidence that the levels of IgG titers and memory B cells are interconnected, we do not think this analysis will be informative.

      An alternative analysis would be to study the contribution of B cell subsets to the production of IgG after re-exposure, similar to a recent study that identified T-bet+ memory B cells as the main contributors to antibody responses following influenza virus vaccination (12). Unfortunately, we are unable to perform this analysis in this study population, because only four of the individuals included in this study (spanning calendar years 2012 – 2016) were recruited into a follow up cohort (calendar years 2017 – 2019), and none of these four people were infected during this later time frame.

      We have however added this future direction to the Discussion section:

      To determine the contribution of different memory B cell subsets to the recall response against P. falciparum, it would be interesting to analyze IgG responses upon re-infection. However, none of the individuals included in this study experienced a recorded P. falciparum infection post-IRS, preventing us from performing such an analysis.

      References

      1. Crotty et al., J Immunol (2003), https://doi.org/10.4049/jimmunol.171.10.4969
      2. Quinn et al., J Infect Dis (2004), https://doi.org/10.1086/423937
      3. Cohen et al., Cell Rep Med (2021), https://doi.org/10.1016/j.xcrm.2021.100354
      4. Amanna et al., New England J Med (2007), https://doi.org/10.1056/nejmoa066092
      5. Leyendeckers et al., Eur J Immunol (1999), https://doi.org/10.1002/(sici)1521-4141(199904)29:04%3C1406::aid-immu1406%3E3.0.co;2-p
      6. Nanan et al., Vaccine (2001), https://doi.org/10.1016/s0264-410x(01)00328-0
      7. Goel et al., Science Immunol (2021), https://doi.org/10.1126/sciimmunol.abi6950
      8. Rivera-Correa et al., eLife (2019), https://doi.org/10.7554/elife.48309
      9. Jahnmatz et al., Front Immunol (2021), https://doi.org/10.3389/fimmu.2020.619398
      10. Weisel et al., Immunity (2016), https://doi.org/10.1016/j.immuni.2015.12.004
      11. Shinnakasu et al., Nat Immunol (2016), https://doi.org/10.1038/ni.3460
      12. Nellore et al., Immunity (2023), https://doi.org/10.1016/j.immuni.2023.03.00
        1. Correlation between populations and initial parasite load: Are the levels between any of the populations at any time point correlated significantly in any way? If the statistical power/N allows it, please perform a correlation array between all populations using all samples both total and ag-specific and initial parasite load.

      We agree that this analysis could be very interesting. However, in most recorded infection cases, parasitemia was submicroscopic and parasite load was not reported. Information about parasite density in the blood prior to TP1 is available for only half of the individuals in this study. In these people, the last known parasite density was recorded between three months to two years prior to TP1. Given the small number of individuals for whom these data are available and the large variation in time between parasitemia and sampling, we do not have sufficient data to perform this analysis.

      1. Figure 2: Why were total and ag-specific plasmablasts/plasma cells not included in this figure? Please include to compare levels in these two time points.

      We did not include the levels of total and antigen-specific plasmablasts (PBs) in Figure 2 because the percentages of PBs are relatively low, and very few antigen-specific PBs were detected. We have now included the levels of total PBs in Figure 2A and the percentages of antigen-specific PBs in Supplementary Figure 2. The percentage of PBs among total B cells decreased by about 50% between TP1 and TP2, in line with a decrease in immune activation.

      1. Healthy baseline: The study is missing "healthy" controls as a reference. I presume this is because each patient is its uninfected control in the post-IRS sample. In methods, they mentioned they used two naïve-USA B-cells as technical controls. It would be important to include and maybe expand (to match age and gender)on that specific data from those controls as supplementary figures to support their findings:
      2. Show negative Tetramer staining for these samples (to understand the background).
      3. Levels of all the USA controls total B cell populations and compared to the pre/post-IRS samples to understand "baseline" or "non-endemic" control levels.
      1. We have included flow cytometry plots of tetramer staining for the non-P. falciparum exposed donors (pooled B cells from two US donors) to show the level of background for these probes. These plots are shown in Figure S1B.

      2. We have used data from P. falciparum-naive US donors (n = 7) that we generated for a prior study to show the average level of total B cell populations in Figure 2, and the percentage of switched memory B cells that express CD95, CD11c, T-bet, and FcRL5 in Figure 4.

      Minor comments: 1. In the gating strategy (S1), please include the percentage of each population of that representative example.

      We have added the percentages for all gated populations to Figure S1.

      1. For Figure 2, since not every panel has the same N, please include the N for each panel in the figure or a supplementary table.

      All panels in Figure 2 show data for all 10 individuals. However, since some data points are overlapping, it may appear that some panels show data from fewer individuals. Specifically, no antigen-specific DN1 cells were detected pre- and post-IRS for four individuals. These data points therefore overlap and are not visible. To avoid confusion, we had mentioned this in the legend to Figure 2 (see text in orange). We have tried to further clarify this by emphasizing in the figure legend that data from all 10 individuals are shown (see text in red):

      Figure 2: Abundance of total and antigen-specific B cell subsets in the circulation during high parasite transmission and in the absence of P. falciparum exposure. The percentage of B cell subsets among circulating B cells is shown for total B cells (A), MSP1/AMA1-specific B cells (B), and CIDRα1-specific B cells (C). For MSP1/AMA1-specific B cells and CIDRα1-specific B cells, the total percentage among all circulating B cells is also shown (right most graphs in each panel). All panels show data for all 10 individuals. In panels B and C, no antigen-specific DN1 cells were detected pre- and post-IRS for four individuals. These data points therefore overlap and are not clearly visible. Differences between groups were evaluated using a Wilcoxon matched-pairs signed-rank test. P values

      1. Please mention the history of past and chronic co-infections of these 10 patients. Particularly if they had any other active or recent infection when the sample was taken.

      Four individuals had active or recent infections in the three months prior to sample collection, with upper respiratory tract infections being the most common. This information has been included in Table S3, with a reference to these data in the Methods section. We have also included a link to ClinEpiDB where additional information about the cohort participants, including medical history, can be found.

      1. Discussion: further discussion with relevant literature on the following points is needed to consolidate cellular and antibody studies: a. Whether the presence of long-lived ag-specific B-cell responses correlates with sustained levels of IgG against Pf antigens. b. The different types of antibodies (protective/pathogenic) that these different B-cell populations have been reported to produce during malaria.

      a. We have added the following paragraph to the Discussion section:

      To determine how these different long-lived B cell subsets contribute to protection against P. falciparum infection, it would be important to analyze the connection between the cellular repertoire and plasma IgG. For P. falciparum antigens, a moderate correlation between memory B cell abundance and IgG titers has been observed for some merozoite antigens, but not for others (28, 44). This is in line with studies for other pathogens, that showed a correlation between the percentage of memory B cells and IgG titers for antigens from several viruses and bacteria (48-51), while other studies have reported the absence of such a correlation (51-54). The lack of a correlation between the magnitude of the memory B cell and the antibody response fits with the prevailing model that memory B cells and plasma cells are two independently controlled arms of the humoral immune system (55, 56). To determine the contribution of different memory B cell subsets to the recall response against P. falciparum, it would be interesting to analyze IgG responses upon re-infection. However, none of the individuals included in this study experienced a recorded P. falciparum infection post-IRS, preventing us from performing such an analysis.

      b. We have added additional discussion about the types of antigens recognized by atypical B cells to the Discussion section:

      Prior studies have shown that while atypical B cells harbor reactivity against P. falciparum antigens (9,18), they are also enriched for autoreactivity (43). Specifically, atypical B cells produce antibodies against the membrane lipid phosphatidylserine, which can induce the destruction of uninfected erythrocytes and contribute to anemia (44).

      Significance

      General assessment:

      Strengths: - Novelty in contrasting two different types of P. falciparum antigen responses at the B-cell level. - The use of tetramers is a cutting-edge technique to assess this question. - Analyses were thorough and found contrasting differences in antigen-specific B-cell populations (atypical vs classical) between these 2 antigens for the first time (to my knowledge). - Well-written manuscript with clear data, methodology, and conclusions

      Limitations: - Missing serum/plasma antibody data to support their claim about long-lived humoral responses and reconciliation of ag-specific B-cell levels and ag-specific antibody levels in experiments and discussion. - Limited N of 10 patients of the same gender (female), some population analyses had even fewer samples. - Missing baseline levels for non-endemic uninfected control for B-cell populations for comparison.

      • We have included a discussion about the correlation between plasma antibody and memory B cell responses in the Discussion section.

      • We have clarified that some data points overlap in Figure 2, giving the impression that data from fewer than 10 individuals were shown.

      • We have included baseline levels of 1) tetramer reactivity (Figure S1), 2) the size of B cell populations (Figure 2), and 3) expression of select markers (Figure 4).

      Advance: The study consolidates antigen-specific responses with the discovery of recently characterized populations (ex. atypical) and finds novel differences between two types of malaria antigen responses at the B-cell level and between specific populations responding differentially to these antigens. The findings are incremental (role of B-cell population in malaria-specific responses), conceptual (contrasting two types of B-cell antigen responses in the same infection), and clinical (finding significant differences in patients).

      Audience: This study will attract basic B-cell immunology scientists, infectious disease clinicians/scientists, vaccinologists, and both basic malaria immunology and clinical audiences.

      Reviewer expertise: Malaria, immunology, antibodies.

      __Reviewer #3 __

      Evidence, reproducibility and clarity: The authors analysed the antigen specificity and phenotypes of B cells during high P falciparum transmission and after a period of successful malaria control with IRS in Uganda. The gap between the two sampling time points is close to two years.

      They use antigen probes for MSP1/AMA1 and CIDRalpha1, two antigens expressed at different stages of P. falciparum life cycle-merozoites and infected red cells, respectively. While MSP1/AMA1 are involved in the parasite's invasion of red blood cells, CIDRalpha1 is a domain of PFEMP1, a large family of antigenically variant proteins that mediates the sequestration of infected red cells in small blood vessels.

      They found that the percentage of activated antigen-specific memory B cells declined with malaria control. However, detectable frequencies of antigen-specific memory B cells were retained after malaria control, which confirms earlier reports.

      However, they also demonstrate that the phenotypic characteristics of memory B cells are associated with antigen specificity. The retained MSP1/AMA1-specific B cells were mostly CD95+CD11c+ memory B cells and FcRL5-Tbet- atypical B cells. In contrast, the retained CIDRalpha1-specific B cells were enriched among a subpopulation of atypical B cells.

      These findings suggest differences exist in how the MSA1/AMA1 and CIDRalpha1 y are recognised and processed by the human immune system and how the immune response responds to them upon re-infection with P falciparum.

      Major issues affecting the conclusion: The findings and conclusions of this study, whilst positively exciting and informative, are based on the analyses of very few cells (at times). Even the authors themselves acknowledge this. I expect the authors to address this issue by toning down their reporting and conclusions (where appropriate). Ultimately, we need to have the confidence that these results are reproducible.

      We appreciate the reviewer’s concern about the numbers of antigen-specific cells included in our analyses, which is an inherent limitation of this approach. However, we would like to point out that most analyses included a substantial number of antigen-specific B cells:

      Figure 3D: 158 to 2,038 cells per group

      Figure 4: an average of 26 to 184 cells per donor

      Figure 5B: 55 to 508 cells per group

      Figure 5C: 10 to 334 cells per group*

      * The group with 10 cells is an outlier here. All other groups contain at least 104 cells. Because this one condition had such a small number of cells, we specifically mentioned this number in the text.

      The numbers of cells for analyses shown in Figures 3D and 5B were already included in the figures. All the other numbers were mentioned in Table S3. To further clarify the number of cells included in each analysis, we have added the number of cells to Figures 4 and 5C.

      To tone down our reporting, we have rephrased some of our conclusions, and now present our section headers in past tense to make these statements reflect our observation instead of a definitive conclusion. For example:

      Conclusion: “The loss of MSP1/AMA1-specific and CIDRα1-specific B cells in the circulation was similar, but the phenotype of long-lived MSP1/AMA1-specific and CIDRα1-specific B cells appeared to differ.”

      Section header: “Long-lived MSP1/AMA1-specific and CIDRα1-specific B cells differed in phenotype”

      Finally, in the Discussion section, we have added a statement to our paragraph describing the limitations of our study to stress the importance of reproducing our findings:

      All in all, it will be important to perform additional studies of the phenotype and functionality of long-lived B cells with specificity for P. falciparum antigens to reproduce and extend our findings.

      Minor comments: Figure 2D-I found this figure, and its presentation is unclear. Notably, using contour plots doesn't allow the reader to appreciate the density of the cells being presented.

      To facilitate the interpretation of this figure, we have changed the plot type to a contour plot with density color gradient, and added the number of cells shown in each plot. (Please note that this panel has been renumbered to C.)

      Figure 4 - label the y-axis.

      The y-axis was labeled with “%”, which we have expanded to “% of B cells expressing marker of interest”.

      __Significance: __The study design-as outlined-allowed for the analyses of the specificity and phenotypic characteristics of residual P falciparum-specific memory B cells after 1.7 years of little to no P falciparum exposure. The cell phenotyping methods presented are also appropriate. However, antigen-specific cells are rare in blood circulation, and as the authors themselves acknowledge in the discussion, some of the results are based on very few cells. This means we cannot be sure all the results presented are reproducible.

      Previous studies demonstrated that P falciparum memory B cells are maintained long after cessation of antigen exposure. However, few (if any) detailed antigen-specific and phenotypic analyses of the characteristics of P falciparum-specific memory B cells following a long period of no exposure exist. Thus, this study presents an incremental advance in our knowledge. In addition, the association of antigen specificity with cell phenotypes is a new concept in malaria immunology. The research presented will greatly interest infectious disease immunologists and vaccinologists.

      I am an infectious disease immunologist with substantial experience in malaria immunology.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The authors analysed the antigen specificity and phenotypes of B cells during high P falciparum transmission and after a period of successful malaria control with IRS in Uganda. The gap between the two sampling time points is close to two years.

      They use antigen probes for MSP1/AMA1 and CIDRalpha1, two antigens expressed at different stages of P. falciparum life cycle-merozoites and infected red cells, respectively. While MSP1/AMA1 are involved in the parasite's invasion of red blood cells, CIDRalpha1 is a domain of PFEMP1, a large family of antigenically variant proteins that mediates the sequestration of infected red cells in small blood vessels.

      They found that the percentage of activated antigen-specific memory B cells declined with malaria control. However, detectable frequencies of antigen-specific memory B cells were retained after malaria control, which confirms earlier reports.

      However, they also demonstrate that the phenotypic characteristics of memory B cells are associated with antigen specificity. The retained MSP1/AMA1-specific B cells were mostly CD95+CD11c+ memory B cells and FcRL5-Tbet- atypical B cells. In contrast, the retained CIDRalpha1-specific B cells were enriched among a subpopulation of atypical B cells.

      These findings suggest differences exist in how the MSA1/AMA1 and CIDRalpha1 y are recognised and processed by the human immune system and how the immune response responds to them upon re-infection with P falciparum.

      Major issues affecting the conclusion:

      The findings and conclusions of this study, whilst positively exciting and informative, are based on the analyses of very few cells (at times). Even the authors themselves acknowledge this. I expect the authors to address this issue by toning down their reporting and conclusions (where appropriate). Ultimately, we need to have the confidence that these results are reproducible.

      Minor comments:

      Figure 2D-I found this figure, and its presentation is unclear. Notably, using contour plots doesn't allow the reader to appreciate the density of the cells being presented.

      Figure 4 - label the y-axis.

      Significance

      The study design-as outlined-allowed for the analyses of the specificity and phenotypic characteristics of residual P falciparum-specific memory B cells after 1.7 years of little to no P falciparum exposure. The cell phenotyping methods presented are also appropriate. However, antigen-specific cells are rare in blood circulation, and as the authors themselves acknowledge in the discussion, some of the results are based on very few cells. This means we cannot be sure all the results presented are reproducible.

      Previous studies demonstrated that P falciparum memory B cells are maintained long after cessation of antigen exposure. However, few (if any) detailed antigen-specific and phenotypic analyses of the characteristics of P falciparum-specific memory B cells following a long period of no exposure exist. Thus, this study presents an incremental advance in our knowledge. In addition, the association of antigen specificity with cell phenotypes is a new concept in malaria immunology. The research presented will greatly interest infectious disease immunologists and vaccinologists.

      I am an infectious disease immunologist with substantial experience in malaria immunology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this study, the authors compared long-lived total and antigen (ag)-specific B-cell levels in a cohort of 10 Ugandan malaria patient samples that were collected before and after local reduction of P. falciparum transmission (pre/post-IRS). The focus is on the novel comparison of the two most common malaria antigens: merozoite antigens (MSP1/AMA1) and variant surface antigens (CIDRα1). Using high-parameter spectral flow cytometry, they also characterized the phenotype of the different populations of cells. Their main findings include 1) a decrease in activated but maintenance of resting ag-specific B-cells in the post-IRS sample and 2) CD95 and CD11c, as the only differentially expressed markers between MSP1/AMA1-specific and CIDRα1-specific long-lived memory B cells. Their further phenotypic characterization suggests functional consequences with MSP1/AMA1-specific B-cells being poised for rapid antibody-secreting cell differentiation while CIDRα1-specific B cells were enriched among a subset of atypical B cells that seem poised for antigen presentation (CD86+CD11chi/ AtBC1). Their findings consolidate and further expand our knowledge of long-lived B-cell levels during P. falciparum malaria and report/compare (for the first time to my knowledge) a differential selection of long-lived B-cell levels between these 2 antigen specificities. Overall, the manuscript is straightforward and well-written and the authors did a good job explaining their methodology, findings, and interpretations. I believe the major gap missing in this study is the reconciliation of long-lived antigen-specific B-cell levels with the serum antigen-specific antibody levels of these patients against the same 2 antigens (MSP1/AMA1 and CIDRα1) in the experiments and the discussion. The antibody data would strengthen their main argument and is the main missing piece for characterizing more completely the long-lived antigen-specific humoral responses. Below are my suggestions that would help improve the manuscript:

      Major comments:

      1. Serum Anti-Pf antibodies: Do the authors have access to the serum/plasma of these patients? It would be important to correlate the total and ag-specific B-cell populations with levels of serum IgG antibodies against those specific Pf antigens (MSP1/AMA1 and CIDRα1) and total IgG levels to strengthen their point about long-lived humoral responses.
      2. Correlation between populations and initial parasite load: Are the levels between any of the populations at any time point correlated significantly in any way? If the statistical power/N allows it, please perform a correlation array between all populations using all samples both total and ag-specific and initial parasite load.
      3. Figure 2: Why were total and ag-specific plasmablasts/plasma cells not included in this figure? Please include to compare levels in these two time points.
      4. Healthy baseline: The study is missing "healthy" controls as a reference. I presume this is because each patient is its uninfected control in the post-IRS sample. In methods, they mentioned they used two naïve-USA B-cells as technical controls. It would be important to include and maybe expand (to match age and gender)on that specific data from those controls as supplementary figures to support their findings:
      5. Show negative Tetramer staining for these samples (to understand the background).
      6. Levels of all the USA controls total B cell populations and compared to the pre/post-IRS samples to understand "baseline" or "non-endemic" control levels.

      Minor comments:

      1. In the gating strategy (S1), please include the percentage of each population of that representative example.
      2. For Figure 2, since not every panel has the same N, please include the N for each panel in the figure or a supplementary table.
      3. Please mention the history of past and chronic co-infections of these 10 patients. Particularly if they had any other active or recent infection when the sample was taken.
      4. Discussion: further discussion with relevant literature on the following points is needed to consolidate cellular and antibody studies: a. Whether the presence of long-lived ag-specific B-cell responses correlates with sustained levels of IgG against Pf antigens. b. The different types of antibodies (protective/pathogenic) that these different B-cell populations have been reported to produce during malaria.

      Significance

      General assessment:

      Strengths:

      • Novelty in contrasting two different types of P. falciparum antigen responses at the B-cell level.
      • The use of tetramers is a cutting-edge technique to assess this question.
      • Analyses were thorough and found contrasting differences in antigen-specific B-cell populations (atypical vs classical) between these 2 antigens for the first time (to my knowledge).
      • Well-written manuscript with clear data, methodology, and conclusions

      Limitations:

      • Missing serum/plasma antibody data to support their claim about long-lived humoral responses and reconciliation of ag-specific B-cell levels and ag-specific antibody levels in experiments and discussion.
      • Limited N of 10 patients of the same gender (female), some population analyses had even fewer samples.
      • Missing baseline levels for non-endemic uninfected control for B-cell populations for comparison.

      Advance:

      The study consolidates antigen-specific responses with the discovery of recently characterized populations (ex. atypical) and finds novel differences between two types of malaria antigen responses at the B-cell level and between specific populations responding differentially to these antigens. The findings are incremental (role of B-cell population in malaria-specific responses), conceptual (contrasting two types of B-cell antigen responses in the same infection), and clinical (finding significant differences in patients).

      Audience:

      This study will attract basic B-cell immunology scientists, infectious disease clinicians/scientists, vaccinologists, and both basic malaria immunology and clinical audiences.

      Reviewer expertise:

      Malaria, immunology, antibodies.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This study by Reyes at al is a well conducted analysis of memory B cell dynamics of Plasmodium falciparum (Pf) -specific B cell populations over the course of reducing Pf prevalence in ten Ugandan adults. The data is presented well and the authors provide compelling evidence that 1. There is an overall loss of Ag specific B cells with reduction in exposure and 2. Different antigens (MSP1/AMA-1 vs CIDRa-1) generate different flavors of long lived responses. However, additional clarity to the reader should be provided on certain topics (listed below).

      Major comments:

      1. While the premise of the study (reduced Pf transmission due to the use of indoor residual spraying (IRS)) is an important one, I think the authors must take into consideration that 9/10 subjects had at least one Pf positive episode between Time Points 1 and 2 (Figure 1). Also, it looks from Fig 1 that some samples were collected at a time of Pf positive test (green squares), while in Table S1 none of the subjects have a positive parasite status at TP1.
      2. Figure S1A: What is trBC? Figure S1B: What is Strep? Are the strep positive cells also CIDR-1 positive and were they gated out? Why is APC used for MZ-1 and one of the MSP1-AMA-1 tetramers? Do these stainings come from multiple panels?
      3. Figure 3A: how many cells does the umap plot represent? Were there a total of 3555 Ag specific B cells that were non-naive (Figure 3E)?
      4. Could the authors comment on why in Figure 3, Ig isotype expression was not considered for clustering? This would allow for characterization of DN sub populations/ clusters in addition to the CD21-CD27- ABCs? It looks like IgD expression was low across the clusters (Figure 3D). Was this the case for the cells considered in this analysis, or was it excluded? If it was truly low expressed, how were the assessments in Figure 2 made?
      5. Are there differences in these designations / phenotypes of DN populations of atBCs vs CD21-CD27- atBCs?
      6. Lines 258-259: In considering only switched MBCs, what clusters from Figure 3a were included? There seem to be 2588 sw MBCs (Table S3, Figure 4). Do the remaining cells (967 cells) come from clusters 2, 5 and 6 (and excludes the atBC clusters)

      Minor comments:

      1. Line 178- 179: Was there a specfic measure of rate of decline made for these cells?

      Significance

      General assessment:

      Strengths: The authors provide evidence that the dynamics of antigen specific cells in humans can vary with exposure and with the nature of the antigen. They have nicely discussed the potential causes for these differences (Discussion), although they should include the findings of Ambegaonkar et al that ABCs in malaria may be restricted to responding specifically to membrane bound antigens (PMCID: PMC7380957)

      Limitations:

      1. Outlined above, and as the authors also mention, a small sample size and homogenous population.
      2. The evidence for reduced transmission is not clear, and the negative parasite tests for donors shown in Table S1 do not match with Figure 1 data.
      3. Lack of IgD expression across clusters (Figure 3D- the authors will need to clarify this point) would require re-analysis of Figure 2 data

      Advances: This study highlights the importance of studying antigen specific B cells in humans in the context of natural infection and the use of high-parameter tools such as spectral flow cytometry in assessing a large quantity of data from limited clinical samples. These data are important to inform better vaccine design. Studies in inbred animals can be quite limited or different from human B cell responses.

      Audience: This study will be of interest to malariologists and B cell immunologists. Atypical B cells are relevant to many infectious diseases and auto immunity, while the dynamics of memory B cells in malaria all be relevant to those interested in vaccine design against blood stage antigens.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      The manuscript presents IGNITE (Inference of Gene Networks using Inverse kinetic Theory and Experiments), an unsupervised machine learning framework for constructing gene regulatory networks from single-cell RNA sequencing (scRNA-seq) data. IGNITE utilizes a kinetic inverse Ising model to infer gene interactions from binarized expression data and can predict genetic perturbation effects, such as those from knockout experiments. Although the application of inverse Ising models to network reconstruction is not entirely novel, IGNITE's specific implementation and its application to single-cell RNA sequencing data represent a new development. The method is tested on the transition from naive to formative states in murine pluripotent stem cells, a system the authors are highly knowledgeable about, and its performance is compared to state-of-the-art alternative methods.

      Major concerns

      My concern regards the generality of the method, particularly the entire pipeline presented, and the fairness of the performance comparison. These concerns can be easily addressed by the authors by better explaining their choices and their general applicability, and by toning down the conclusions about the comparison with existing inference methods.

      The pre-processing steps are extensive, and their rationale is not always clear, though the results heavily depend on this analysis. Several steps appear to involve arbitrary choices optimized for specific outcomes, potentially introducing biases. The authors should better explain the rationale behind their choices to mitigate these concerns.

      Specifically, part of the pipeline seems to be built to reproduce a specific expression pattern of 24 genes that some of the authors discovered in a previous paper. Although this prior knowledge could be useful and relevant in this specific system, it could limit the generality of the method. For example, the authors selected approximately 2000 genes based on prior knowledge and used a combination of t-SNE and UMAP for dimensionality reduction (although the two techniques have a similar goal). This specific combination seems to reproduce the pseudotime alignment the authors were expecting to find, but such prior information might not be available in general. Therefore, feature selection and the methods used to project data need more justification, especially if the goal is to create a general tool applicable across different biological systems.

      Analogously, the clustering seems manually adjusted to match known expression patterns of 24 relevant genes, rather than being the result of an optimized clustering method. Additionally, the clusters overlap with different time points, raising concerns about potential batch effects. These issues should be addressed to strengthen the validity of the method.

      The claims about the comparison with existing methods should be toned down. While the comparisons are useful and interesting, they might be biased due to the method's fine-tuning for the specific system studied. The claim that the model requires only scRNA-seq data is misleading, as strong prior biological knowledge was used to select, for example, the genes analyzed.

      Significance

      The manuscript is scientifically sound, clearly written, and deserves publication. The proposed method is quantitative, novel, theoretically grounded, and was tested in detail with appropriate null models and statistical methods. Moreover, IGNITE can be applied to various biological systems as the availability of scRNA-seq datasets is continuously growing. The paper will be of interest to a broad community of computational biologists and biology labs interested in gene regulation using scRNA-seq data.

      The limitation, in my opinion, is the method's (particularly the pre-processing pipeline) fine-tuning for the specific biological system tested. Testing IGNITE on another biological system without pre-selected pre-processing steps or detailed biological priors would be more convincing and make the paper's conclusions much stronger. The comparison with other methods also may be slightly biased due to this fine-tuning.

      My background is in statistical physics, with expertise in biological physics, specifically in mathematical modeling and data analysis in molecular biology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Corridori et al introduce IGNITE, a computational framework to infer gene regulatory networks (GRNs) from scRNA-seq data leveraging the kinetic Ising model, which can be used to simulate synthetic gene expression and perform in-silico knockout experiments. Other similar frameworks exist, but none combine these three aspects together. The authors have generated a scRNA-seq of murine ESCs differentiation which they use to compare their method with others. Specifically they show that they can infer known regulatory interactions, that they can generate similar data than the original and that it can potentially predict gene expression changes in transcription factor knock-out perturbations.

      Major comments:

      • Many of the authors' claims are backed by qualitative results and not properly quantified. In Fig2, authors qualitatively compare intra gene correlations between genes for the original data and their prediction. Instead of just visualizing they should compute and report the Spearman correlation between the original expression and the predicted one. The Fraction of Agreement is not a good metric to compare knockout predictions since it is completely dependent on the class imbalance of signs, for example if the selected genes are 75% positive and 25% negative, a naive predictor that only outputs positive predictions will still have a high score. Instead, the authors should quantify this with Spearman correlation or RMSE and compare across methods. In FigS4a-b the authors qualitatively claim that other methods could not predict the expected cell composition, which they should quantify and report the values across methods. When comparing against the ground truth network, the fraction of correctly inferred interactions is technically the same as precision but is ignoring recall. I suggest the authors compute precision, recall and a combined F1 score to compare the evaluated methods. Authors claim that the method is scalable to a larger number of genes but no data is provided, they should show how their method compares to others when using a different number of cells and number of genes at memory usage and running time.
      • The authors need to better describe which tests were performed when talking about significance, which thresholds and which corrections, if any, were employed.
      • To reduce the number of dimensions of scRNA-seq data the authors use t-SNE and then from the obtained result UMAP to project the data into a lower dimensional space. This is fundamentally wrong since distances are not well preserved in t-SNE. Instead the authors should first employ PCA and then UMAP. Additionally, the authors use UMAP distances in the Slingshot pseudotime calculation. Similar to t-SNE, UMAP distances have no real meaning and should only be used for visualization purposes. Instead, the authors should provide Slingshot the obtained PCA embeddings.
      • Dictys (PMID: 37537351) is a known GRN inference method that also can simulate gene expression but is missing in the benchmark, the authors should add it to the method comparison.
      • The current manuscript is not reproducible since it is missing the method's code, the code to reproduce the figures and the generated scRNA-seq data.
      • Authors claim that the method is scalable to a larger number of genes but no data is provided to back this claim. They should show how their method compares to others when using a different number of cells and number of genes.

      Minor points:

      • In the introduction, authors mention multimodal GRN inference methods but do not provide any references.
      • In Table 1, CellOracle is annotated as not being able to do multiple KO which is wrong. Additionally, the authors mention that IGNITE uses no prior knowledge which is not really true since it requires pseudotime ordering. The authors should add a column to Table 1 whether methods require pseudotime.
      • It is unclear what the dashed arrow of Fig1b means. Moreover, plotting gene expression values on top of UMAPs can be misleading, instead authors should plot the gene expression distributions binned by pseudotime.
      • The authors report a p-value of 1.04x10-171 which is below detection limit (see PMID: 30921532). Authors should change it to an interval such as p < 2.2×10-16.
      • To make CellOracle results easier to interpret and more comparable, authors should run it at the atlas level instead of at the cell type level, this way generating only one GRN. This can be achieved by assigning the same cluster label to all cells.
      • Experimental values in FigS3b seem to have been repeated and do not match the previous ones for IGNITE and SCODE.
      • It is unclear what the different circles mean in Fig5b.

      Significance

      This manuscript is an incremental and methodological work for specialized audiences. Its strengths are that the authors employ kinetic Ising model for GRN inference and that they provide a single framework capable of inferring, simulating and perturbing gene expression. The main limitations are that the claims should be better quantified and that the code and data need to be made accessible.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      Corridori and colleagues propose IGNITE, a novel method to recover Gene Regulatory Networks (GRN) from single cell RNA-sequencing (scRNA-seq) data. Their method solves the inverse Ising problem generating a cohort of candidate GRN optimising it to minimise the difference to the input expression matrix. Authors report the IGNITE is able to predict wild type data and simulate both single and multiple gene knockouts. Authors benchmark this method on a in-house data set of differentiating pluripotent stem cells (PSC). They focus on a small set of genes known to be involved in PSC differentiation into formative cells. Authors benchmark IGNITE against state of the art tools (SCODE, MaxEnt and CELLORACLE). They evaluate IGNITE ability to predict wild type gene expression by comparing their data with experimental data and with SCODE. They conclude the tool has generative capacity comparable with SCODE. They also evaluate IGNITE ability to recover known interactions with respect to other tools without finding it to significantly outperform them.

      Major comments

      • Are the key conclusions convincing?

      Conclusions appear convincing although model generalizability could be shown in a more thorough manner. For instance, analysing some other publicly available dataset could help demonstrate hyperparameters effects on GRN predictions and their robustness across different experiments. - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      Claims are well supported by data. - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      I think the work would benefit from an additional benchmark on a different cellular system. This experiment would show how hyperparameters generalise across datasets and would provide potential users insights how to tweak them.

      Also, how does the model scale with the number of genes? A benchmark on computation time and resources required to infer GRN of growing size would be valuable in the adoption of this tool.

      In addition, I think the GRN comparison benchmark presented in section (3.4) would benefit from a quantitative discussion. Authors show inferred GRNs in Figure 4 and S5. For instance, measuring matrix similarity (when appropriate) would help understanding how predicted GRN compare. I understand authors attempt to do so by focusing on validated interactions and computing the fraction of correctly inferred interactions (FCI) but I think a measurement of the overall similarity (eg. Pearson correlation) would add on this.

      Another comment regards the dependency between Correlation Matrices Distance (CMD) and FCI, shown in Figure 5. I understand that IGNITE GRN that maximise FCI are not the same that minimise CMD. However, it looks like GRN that maximise FCI have higher value in terms of biological information. I wonder whether optimization for one or the other metric could be left to the end user as a tunable parameter.

      Authors should discuss why the expression of some genes does not follow the expected trends (Fig 1C vs Fig S1A). Out of the 24 genes they select for their analysis, at least four do not follow the expected trends: Sox2, according to literature, is a Naive gene, however, in Figure 1C its gene expression pattern is more similar to Formative late genes. Other genes with similar "unexpected" patterns are Zic3, Etv4 and Sall4.

      Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      I think suggested experiments are doable as long as authors get publicly available data, i.e. the in-house dataset they generated for this study is enough to show applicability. For example datasets analysed in SCODE paper (https://doi.org/10.1093/bioinformatics/btx194) could be used as second benchmark. The point of applying the tool to another dataset is to show how it generalises across different biological systems, experiments and, potentially, sequencing technologies. - Are the data and the methods presented in such a way that they can be reproduced?

      The methods section is really clear. To enable reproducibility both raw scRNA-seq data, the IGNITE source code and code written to benchmark it should be released in the public domain in appropriate repositories (eg. ENA, GitHub, Binder etc). - Are the experiments adequately replicated and statistical analysis adequate?

      Yes.

      Minor comments

      • Specific experimental issues that are easily addressable.

      Related to the Sox2 expression pattern is the binarization shown in Figure 2D. How is it possible that Sox2 is always marked as active? Could the authors clarify how these outlier behaviours emerge and propose mitigation strategies, if any?

      In section 5.11.2 it is unclear if xi are in log scale or not. Since the model starts from binarized, log transformed expression values, should not generated ones be in the same scale as the input? - Are prior studies referenced appropriately?

      Yes, referencing is clear. - Are the text and figures clear and accurate?

      Yes, figures appear to be clear, readable and well documented both in captions and main text. - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Section 3.3 could be improved by better describing experimental datasets. Only in the methods section it is clearly stated that experimental data for single KO experiments were retrieved from the literature.

      Check typesetting:

      • parenthesis missing in Eq. 1
      • Leftover $ in section 3.1
      • Parenthesis missing in Section 3.3
      • Misplaced comma in section 5.2.1

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      The paper presents a method to infer GRN from scRNA-seq data alone. Applications include GRN prediction and their perturbations. This paper represents a technical advance in the field as it is the first application of the inverse Ising problem GRN inference. - Place the work in the context of the existing literature (provide references, where appropriate).

      The paper itself presents the landscape of GRN inference tools using scRNA-seq data: SCODE, MaxEnt and CELLORACLE. More tools exist, for instance SCENIC (https://doi.org/10.1038/nmeth.4463) mainly relies on co-expression matrices. Other tools exist but require additional data types e.g. GRaNIE and GRaNPA (https://doi.org/10.15252/msb.202311627) leverage on physical interaction data (ATAC-seq, ChIP-seq). Similarly DeepFlyBrain uses deep neural networks to infer eGRN in Drosophila (https://doi.org/10.1038/s41586-021-04262-z). The value of tools like IGNITE and its competitors is that they do not require additional data types, which, in turn, helps in controlling experimental costs. - State what audience might be interested in and influenced by the reported findings.

      The paper might be of interest to biologists interested in regulation of gene expression. The tool might turn out to be useful in planning experimental work by guiding the choice of perturbations to introduce in experimental systems. - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      I am a computational biologist.

      I have no sufficient expertise to evaluate the mathematical details of the method.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      REVIEWER 1

      Reviewer #1 Evidence, reproducibility and clarity: Nornes et al. have generated a cohort of arterial enhancers based on in silico analysis and validation with transgenic lines in both zebrafish and mice. They utilized publicly available datasets for chromatin marks, including ATAC-seq on endothelial cells either from cell culture or isolated from mice, as well as EP300 binding, H3K27Ac, and H3K4Me1. Focusing on eight arterial-expressed genes, they identified a putative enhancer region marked by at least one enhancer feature. After validating the activity of these enhancers in zebrafish and mice, the authors assessed the regulatory pathways upstream of these genes. Using ChIP-seq and Cut&Run for key endothelial transcription factors, they discovered that binding sites for SoXF and ETS factors are shared in arterial enhancers, whereas binding sites for Notch, MEF2, and Fox are present only in the subset of identified enhancers. Together this study provides an arterial enhancer atlas that allows further characterisation of regulatory network behind endothelial cell identity.

      __Reviewer #1 Major Comment 1: __The authors have assessed 15 enhancers for arterial-venous specificity, by assessing the expression in DA, ISV, cardinal and ventral veins at 2 dpf. Interestingly there is a clear difference in the expression patterns of these enhancers in the zebrafish axial vasculature, especially seen at the level of ISV. The co-localization of the enhancer expression in the endothelium was done using endothelial marks expressed in both venous and arterial EC (kdrl). To fully distinguish if the expression is venous or arterial endothelial compartment colocalization with Tg expressed in arterial (flt1) or venous (lyve1) EC would be informative.

      RESPONSE: We agree with the reviewer that a more detailed description of arterial-venous specificity of each enhancer could be included. In the original manuscript, the expression pattern of each enhancer within the vasculature was primarily assessed at 2 days post fertilization (dpf) in Fig 1-2. This identified arteries using direction of blood flow and available descriptive information, as arterial development in 2pdf zebrafish is very stereotypical and already well characterized.

      __REVISION (PLANNED): __The original Figure 3A includes a more detailed assessment of arterial-venous specificity at 3dpf for four arterial enhancers (Cxcr4+135, Cxcr4+151, Gja5-78 and Gja5-7, chosen as enhancers representing the four types of expression patterns seen). We will now extend this more detailed analysis to all arterial enhancer:GFP lines. This analysis uses kdrl-mCherry to mark the entire vasculature, comparative to the expression of the arterial enhancers (GFP). This allows us to clearly identify the intersegmental arteries (as opposed to intersegmental veins) by looking for direct connection to the dorsal aorta, and by assessing the direction of blood flow within these vessels. This analysis is done at 3dpf to give time for the intersegmental arteries to acquire identity and connect definitively with the dorsal aorta, and for the diminishment of any GFP expression originating from the initial sprouting from the dorsal aorta. By extending this analysis to the other arterial enhancer zebrafish lines shown in Figure 2, we will be able to more clearly classify the activity of each enhancer within different vascular beds. This information will also be recorded in a new Table better detailing the timing and specificity of activity of each enhancer.

      We chose not to use arterial or venous "marker lines" (e.g. Flt1:reporter or Lyve1:reporter) for the simple reason that these are also enhancer:GFP transgenes, and therefore are not necessarily definitive of the arterial or venous lineage per se (e.g. Flt1:GFP expression is controlled by the transcription factors binding the Flt1 enhancer in the same way that Cxcr4+135 and the others are, with the added caveat that the transcriptional regulation of the Flt1 and Lyve enhancers are not well defined). We felt that morphological determination based on direct connections and blood flow direction was therefore more accurate.

      __Reviewer #1 Major Comment 2: __In addition, it is striking that cxc4+135 drives the expression in nearly every ISV as cxcl12+269 only every other. Similarly, not all the enhancers are enriched in the DA to the same level. Is there biological significance to this? could authors discuss these results further? The pattern of expression of the unc5b-identified enhancer is also striking, does this reflect the known roles of unc5b in the vascular formation?

      __RESPONSE: __We agree, the diversity of enhancer expression patterns within the arterial compartment is notable, and really very interesting. The variations in enhancer expression pattern must be largely influenced by the transcription factor motifs within each enhancer, as these patterns were seen in both transient and stable transgenic zebrafish and therefore largely independent of chromatin integration location.

      __REVISION (PLANNED): __The extension of Figure 3A to all enhancer lines (see previous comment) will permit us to more clearly classify the activity of each arterial enhancer within different beds and at different time points. Currently there were no clear links between a particular transcription factor motif/binding and expression pattern, something that is discussed briefly in the original Results and Discussion sections. However, the expansion of Figure 3A to all enhancers, and the creation of a Table summarizing this more systematically will make the link (or lack of one) between expression patterns within the arterial tree and TF motifs easier to appreciate and discuss.

      __Reviewer #1 Major Comment 3: __The final part of the paper focuses on defining the presence of "deeply conserved" transcription factor binding sites (TFSB), defined as TFBS that are as conserved as the enhancer sequence surrounding them. In literature, the term 'deep conservation' refers to evolutionary conservation (genomic sequence preservation) in a wide range of species. Therefore, the additional classification presented by the authors based on the surrounding sequence is not clear. As, the KLF motifs in the Ece1in1, which is conserved between mouse and human, are defined as "deeply conserved". However, the FLK motif in the following enhancer, Flk1in10 (one line below), gets classified as non-deeply conserved, despite also being conserved between mouse and human. Thus, in the current form, there is a contradiction in the way the authors use the term 'deeply conserved' and the accepted meaning of this term. To avoid confusion, it would be important to revise this nomenclature.

      RESPONSE: We agree that this nomenclature should be revised. Our aim was to develop a standard approach to transcription factor motif analysis that could be applied to enhancers regardless of conservation levels and size, and easily replicated by others. Because not all functional transcription factor motifs within enhancers are necessarily conserved between species, we were careful to label both conserved and non-conserved motifs for each TF examined. Nonetheless, extra emphasis was placed on motifs with confirmatory TF binding evidence (e.g. ChIP-seq/CUT&RUN), and those conserved at the same depth as the surrounding sequence. This was because our previous work on endothelial enhancers clearly indicates that these motifs are far more likely to play a key role in regulation. However, the reviewer is correct to note that referring to such motifs as "deeply" conserved could be misinterpreted.

      REVISION (COMPLETED): We have altered our nomenclature. This is explained in the relevant Results sections: "Because the level of conservation of motifs can often be an indication of their importance to enhancer activity, we classified each motif into three categories: strongly conserved (motif conserved to the same depth of the surrounding sequence), weakly conserved (motif conserved in orthologous human enhancer but not to the same depth as the surrounding sequence) and not conserved (motif is not conserved within the orthologous human sequence)".

      Two enhancers (Unc5b-57 and Cdh1-1) were only conserved human-mouse, therefore each TF motif within these enhancers could be annotated as both weakly and strongly conserved. As the reviewer noted, this does create confusion. We have now adjusted Figure 5 to use a distinct shape for motifs for which no distinction between weak and strong motif can be made. This does not cover Ece1in1, which is conserved human-mouse-tenrec but was erroneously originally labelled human-mouse only. This error has been corrected.

      __Reviewer #1 Minor Comment 1: __Details on how the corresponding non-coding regions between mice and humans were established are missing, what alignment tool was used?

      RESPONSE AND REVISION (COMPLETED): This information has now been included in the relevant Results section: "Orthologous human enhancer sequences were identified for every enhancer using the Vertebrate Multiz Alignment & Conservation Track on the UCSC genome browser"

      __Reviewer #1 Minor Comment 2: __Not sufficient details are provided for the re-analysis of siRNA data. E.g., which clustering method was used? How the clusters were assigned to cell identities?

      RESPONSE AND REVISION (COMPLETED): The details regarding the re-analysis of scRNA data has been expanded in the Methods sections: "Publicly available E12 and E17.5 scRNA-seq data from EC isolated from BmxCreERT2;RosatdTomato lineage traced murine hearts54 was obtained from GEO (GSE214942) prior to processing FASTQ files with the 10X Genomics CellRanger pipeline (V7.0.0). RNA-seq reads were aligned to the mm10 genome reference downloaded from 10X Genomics with the addition of the TdTomato-WPRE sequence. Exclusion of low quality cells with either a UMI count >100,000, total gene count 10%) was performed using Scater55. Data normalisation was performed using the MultiBatchNormalisation method prior to merging of TdTomato positive and negative datasets from individual timepoints. The top 2000 most highly variable genes (excluding mitochondrial and ribosomal genes) in the merged datasets were identified using the Seurat FindVariableFeatures method and utilised to calculate principal component analysis (PCA). Normalised data was scaled using the ScaleData function. Cell clustering was performed using the standard unsupervised graph-based clustering method implemented within Seurat (V4)56. Clusters were visualised in two dimensions using UMAP based non-linear dimensional reduction following the standard Seurat (V4) workflow49. Identified clusters were assigned identities based on marker genes shown to be differentially expressed between populations previously identified in the original study47. Key markers include Npr3 (endocardial), Fabp4 (coronary vascular endothelial), and Nfatc1 (valvular endothelial). The E12.5 sinus venosus EC cluster was assigned based in Aplnr as previously described54. Arterial and venous EC clusters in the E17.5 datasets were annotated based on their enriched expression of Gja5 and Nr2f2, respectively."

      __Reviewer #1 Minor Comment 3: __Details about the first HOMER analysis (in the assessment of transcription factor motifs and binding patterns at arterial enhancers) seem to be missing from the methods section.

      RESPONSE AND REVISION (COMPLETED): This has been included in the methods: "Analysis of overrepresented motifs within our validated arterial enhancer cohort was performed with HOMER's findMotifsGenome tool using the full validated region of the arterial enhancers. The analysis used the hg38 masked genome and otherwise default settings for all other parameters including randomly selected background regions".

      __Reviewer #1 Minor Comment 4: __Pg 12: "For ETS, 23/23 arterial enhancers contained at least one conserved motif (all "deeply" conserved to the same depth as the surrounding enhancer, see S7)". Is it S8, where conservation is indicated?

      __ ____RESPONSE AND REVISION (COMPLETED):__ We have corrected this error in the text - no figure actually needed to be referenced here as the previous sentence contained the full list of relevant figures to this statement (Table 2 and Figures 5 and S9, previously called S8, are the places to see this information).

      __Reviewer #1 Minor Comment 5: __Figure 1 and 2 for non-zebrafish readers it would be useful to indicate in Figures 1 and 2 the non EC expression that can be observed in the embryos.

      RESPONSE AND REVISION (COMPLETED): In addition to arterial expression, a number of the enhancer:GFP transgenes also showed GFP expression within the neural tube. In addition, some transient transgenic embryos also showed ectopic expression in muscle fibres. These have now been indicated on the images in Figure 1 and 2.

      __Reviewer #1 Minor Comment 6: __Table S1: Please, indicate in the legend what the asterisk in the H DNAseI column stands for

      RESPONSE AND REVISION (COMPLETED): The asterisk indicates where DNaseI hypersensitivity is also seen in multiple non-EC lines. This explanation has been added to the legend.

      __Reviewer #1 Minor Comment 7: __Figure S8: The phrasing "conserved to animal" in Figure S8 is misleading. There is no difference in something being conserved to tenrec or manatee, as both are Afrotherians. Hence, the data show that both Efnb2-141 and Ephb4-2 were present in the common ancestor of Afrotherians and humans, namely the ancestor of all placentals. Instead, it would be good to indicate the phylogenetic group for which the presence of the enhancer can be inferred (in this case, Placentalia).

      __RESPONSE: __Whilst I appreciate the point, it is the exact sequence that is important here - obviously tenrec and manatee are similar species but still contain differences in nucleotide sequences. The information about conservation leads the reader to the exact species with which the comparison is being made. We tried to restrict this to just one species per phylogenetic group (e.g. tenrec, opossum, chicken, zebrafish) but occasionally this was not possible.

      Reviewer #1 Significance

      To date, a systematic approach to identifying the regulatory networks driving endothelial cell identity is missing. This study provides important datasets and validation of enhancers involved in arterial gene expression and the associated transcription factors. Although this is only the tip of the iceberg, this work represents a significant milestone in the systematic understanding of how arterial gene expression is regulated. Overall, this study offers a powerful resource for understanding arterial gene regulation and conducting genome-wide studies of arterial enhancers.

      __RESPONSE: __We thank the reviewer for these kind words. Whilst we agree this is only a very small snapshot of all the arterial enhancers involved in gene regulation, we would like to stress that not only is this a massive increase to what has been known previously, but is also deliberately focused on the genes used to define arterial identity during development and in the adult, therefore these enhancers by themselves form an extremely valuable dataset with which to study the key factors driving arterial differentiation and identity.

      __ __


      REVIEWER 2

      __Reviewer #2 Evidence, reproducibility and clarity: __In this work, Nornes and collaborators have described a cohort of arterial enhancers that drive gene expression in arteries and not in veins. The paper is very well written and it is very informative. The authors have used in silico models to identified the potential artery enhancers and then used different developmental in vivo systems, zebrafish and mice, to validate their findings. Finally, the authors have explored what transcription factors may be binding the identified enhancer sequences and thus, drive arterial gene expression. I would like to congratulate the authors for this work that it has been a pleasure to read and review.

      Reviewer #2 Major Comment 1: In their identification of enhancers, the authors consider a candidate every enhancer that has a putative mark in both mouse and human. Nevertheless, all the human data comes from in vitro analysis. Considering how much cell culture affects endothelial cell identity, inducing effects like EndoMT, would this have any effect on the enhancer selection? Would it be possible to search any human in vivo data? Would this allow for even stronger and more relevant sequences?

      __RESPONSE: __We agree that the use of human endothelial cells in culture raises some potential issues. However, we stress that the mouse EC enhancer marks, which played a key role in defining putative enhancers, come from in vivo analysis (E11 embryos, P6 retina and adult aorta), limiting the potential for significant impact from cell culture-induced issues. Whilst we would have enthusiastically incorporated human in vivo data had it been available, our approach was still indisputably successful at identifying arterial enriched/specific enhancers.

      We consider it unlikely that culture/identity-related problems with human cultured ECs led to a significant undercount of enhancers, in part because comparatively few regions with enhancer marks in mouse in vivo ECs were excluded due to the absence of human enhancer marks. In fact, Cxcr4, Cxcl12, and Gja5 were poorly transcribed in the human cell lines studied here and consequently only enhancer marks in mouse were used to define putative enhancers for these three genes (this is clearly stated in the Results section). If a similar rational had applied to the remaining five genes, only an additional six putative enhancers would have been tested (one for Gja4, two for Nrp1 and three for Unc5b). However, we felt it made sense to include analysis of human enhancer marks for these five genes, as all were expressed in the human ECs used (as indicated by H3K1Me3 and DNaseI hypersensitivity at promoter regions) and orthologous human enhancers were identified for all. Additionally, our retrospective analysis of previously described mammalian in vivo-validated EC enhancers (Table S1 in the original manuscription, including eight arterial enhancers) found that all 32 were marked by at least one enhancer mark in human samples (1/32 did not contain mouse enhancer marks). We also tested eleven regions that did not reach our putative enhancer threshold, including five with only mouse marks. None of these directed expression in transgenic analysis.

      Reviewer #2 Major Comment 2: The human data comes from vein endothelial or microvasculature endothelial cells. Specially because some of the enhancers identified by the authors drive also vein expression, could the authors discriminate whether this is due to the identification coming from vein cells. Is there available data from HAECs? Would this not be conceptually more correct that using vein endothelial cells data? This should be at least discussed in the paper.

      __RESPONSE AND REVISION (COMPLETED): __We have now included a comparison with enhancer marks from HAECs, telo-HAECs and HUAECs as a new Figure S5. The enhancer marks seen in these cells were very similar to those in the HUVEC and microvascular cells already surveyed. Had enhancer marks within HAECs/telo-HAEC/HUAECs been included as a human enhancer mark in our initial survey, it would have been unlikely to have altered our analysis, although we agree it would have made it more conceptually correct. We chose not to go back and engineer this into our original enhancer selection rational however as we felt it would be intellectually dishonest. A paragraph has been added to the Results section about this analysis.

      Reviewer #2 Major Comment 3: Although the authors use the mouse embryo to further validate their finding beyond the zebrafish, the expression are a bit different. While on the fish the enhancers label smaller vessels of arterial identity, in the mouse, only bigger arteries are marked. Is this defined by the time of the analysis?

      __RESPONSE: __This experiment was conducted to demonstrate that these enhancers were arterial enriched in both zebrafish and mouse transgenesis, and feel this is clearly shown by the current data. Whilst I do not really agree that the expression pattern is different (for example, the Gja5 enhancers are more restricted to the major arteries in both zebrafish and mouse, compared to the more widely expressed Efnb2-333), this is challenging to ascertain at a single time-point in a transient transgenic mouse assay. Whilst it would be potentially interesting to better assess the activity of these enhancers over time in mice, we consider this a lengthy experiment (multiple stable lines would need to be established and characterized for each enhancer) which would not add particular benefit to this paper.

      Reviewer #2 Major Comment 4: The analysis of the enhancers is only done during development. Is the activity of these enhancers maintained through live or only important for artery vs vein determination? Is the expression of the different enhancer reporters maintained into adulthood?

      RESPONSE AND REVISION (PLANNED): We agree this would be interesting to ascertain. We plan to examine the activity of enhancer:GFP activity in adult fish fins (which are accessible even without crossing into a casper background, which is beyond the timescale of this project) in the fully revised version of this paper. We have already conducted a feasibility study on four arterial enhancers:GFP lines (Gja5-7:GFP, Gja5-78:GFP, Gja4+40:GFP and Efnb2-333:GFP), which found that all four were still active, and arterial-specific, in the adult.

      Reviewer #2 Significance

      This is a very well done study with potential interest for vascular biologists, in particular to those interested in the determination between artery and veins in a context of development. It advances our knowledge on the field of vascular biology as it not only proposes potential enhancers but also goes on to validation of the enhancers. Nevertheless, it is important to note that some of this enhancers have been identified from in vitro human data. In vitro culture of endothelial cells affects their cellular identity and thus, this study may have underscored many potential enhancers.

      REVIEWER 3

      __Reviewer #3: Evidence, reproducibility and clarity: __This manuscript by Nornes et al analyzed multiple published databases and identified a group of putative enhancers for 8 selected non-Notch arterial genes in mouse and human ECs. These enhancers were cloned and screened in fish embryos to test their effect in driving GFP reporter expression, which narrowed down a cohort of enhancers for further testing of expression activities in mouse embryonic arteries. The authors then analyzed the sequences of these enhancers, and identified binding motifs of ETS, SOX-F, FOX and MEF2 family TFs and Notch transcription regulator RBPJ commonly present in closed proximity in these arterial enhancers, suggesting interaction between these TFs in determination of arterial identity.

      Reviewer #3 Major Comment : This study provides an enormous amount of bioinformatic data analysis and screening results in transgenic fish and mouse models, which led to the discovery of a group of arterial enhancers and TFs binding motifs essential in regulating arterial identity.

      Reviewer #3 Other Comments ____1: Choice of arterial genes is slightly biased. Acvrl1/Alk1 is not enriched in arterial ECs. Sema3G, which is highly expressed in arterial ECs, is missing. UNC5B is enriched in arterial ECs but also expressed by sprouting ECs (PMID: 38866944).

      __RESPONSE: __When we started this project, scRNA-seq datasets in the developing vasculature were less available. Consequently, we initially based our choice of genes on data from Raftrey et al., Circ Res 2021 (available earlier on bioRxiv), which was focused on mouse coronary arterial ECs at the timepoints that arteries differentiate. This found Acvrl1 to be arterial enriched (not a novel observation, many publications treat Acvrl1 as arterial specific or arterial-enriched) and did not list Sema3g. We also considered a wider dataset from mouse and human mid-gestation embryos when available (Hou et al., Cell Research 2022). However, it is important to note that we did not aim to investigate every arterial-enriched gene, rather to use these datasets to help identify loci associated with gene expression patterns which indicated a high likelihood of containing arterial enhancers active during arterial differentiation.

      Sc-RNAseq data from both Raftery et al., and Hou et al., indicated that arterial ECs are subdivided into two groups, reflecting maturity but also potentially slightly different developmental trajectories. The genes studied here were therefore selected to evenly cover both subgroups, with Acvrl1, Cxcl12, Gja5 and Nrp1 primarily restricted to the mature arterial EC subgroup, while Cxcr4, Efnb2, Gja4 and Unc5b were also expressed in the less mature/arterial plexus/pre-arterial EC subgroup. It is notable that genes within the latter subgroup are also associated with angiogenic/sprouting ECs (Dll4 also belongs to this subgroup), which likely indicates biological links between angiogenesis and arterial identity rather than a problem in gene choice and specificity.

      __REVISION (COMPLETED): __This is already discussed in the Results section (angiogenic expression of arterial genes is discussed within the MEF2 and RBPJ sections) and in the Discussion (paragraph 2, referring to different expression patterns within arterial ECs). However, we have now edited the relevant Results section to better explain gene selection: "It is therefore clear that a better understanding of the regulatory pathways directing arterial differentiation requires the identification and characterization of a larger number of arterial enhancers directing the expression of key arterial identity genes. To identify a cohort of such enhancers, we looked in the loci of eight non-Notch genes: Acvrl1(ALK1) Cxcr4, Cxcl12, Efnb2, Gja4(CX37), Gja5 (CX40), Nrp1 and Unc5b. Although not a definitive list of arterial identity genes, single cell transcriptomic analysis indicates these genes are all significantly enriched in arterial ECs4,20, and are commonly used to define arterial EC populations in mouse and human scRNAseq analysis4,5,20,54. Additionally, single-cell transcriptomic data indicates that arterial ECs can be divided into two subgroups4,20. The genes selected here are equally split between subgroups (Acvrl1, Cxcl12, Gja5 and Nrp1 from the mature arterial EC subgroup, Cxcr4, Efnb2, Gja4 and Unc5b from the less mature/arterial plexus/pre-arterial EC subgroup)4,20. We did not exclude genes also implicated in angiogenesis/expressed in sprouting ECs, as these genes formed that vast majority of those associated with the less mature EC subgroup".

      Reviewer #3 Other Comments ____2: Exclusion of Notch genes. Although the reason for choosing non-notch genes and excluding notch genes for screening is addressed in this paper, it would be interesting to examine how the arterial enhancers identified in this study are present in the Notch genes, especially Dll4 (enriched in arterial and sprouting ECs) and Jag1 (enriched in arterial ECs).

      __RESPONSE: __Previous work from our lab and others has already examined arterial enhancers for Notch pathway genes. We already included these enhancers in all our later analysis (Figure 5-6 and relevant supplemental figures), including analysis of TF motifs.

      Reviewer #3 Other Comments ____3: SoxF family TFs. Among the 3 members of SoxF TFs, only Sox17 and Sox7 were assessed. Though not specific, Sox18 is highly expressed in the arteries. On the contrary, Sox7 is highly expressed in the vein and shows weak expression in arterial ECs (PMID: 26630461).

      __RESPONSE AND REVISION (PLANNED): __We agree. We will include assessment of SOX18 binding in our final revised manuscript. An antibody for this analysis has been identified already.

      Reviewer #3 Other Comments ____4: Minor inaccuracy in Intro/paragraph 3: though sox17 is reported as indispensable for arterial specification (PMID: 24153254), losing a single SoxF factor does not seem to completely compromise the arterial program (PMID: 24153254, PMID: 26630461). A combined loss of Sox17/18, or Sox 7/17/18, seems to do the job (PMID: 26630461).

      __RESPONSE: __We have altered this section: "The evidence linking SOXF transcription factors to arterial differentiation is more extensive, with loss of either SOX17 (the SOXF factor most specific to arterial ECs) or SOX7 resulting in arterial defects21-24. Whilst losing a single SOXF factor does not entirely compromise the arterial program, arterial differentiation appears absent after compound Sox17;Sox18 and Sox7;Sox17;Sox18 deletion, although this occurs alongside significantly impaired angiogenesis and severe vascular hyperplasia21-24. PMID 24153254 is reference 23, PMID 26630461 is reference 24.

      Reviewer #3 Other Comments ____5: Fig.4 e14.5 mouse embryos. If the observation aims to assess the dorsal aorta, it would be better to use mouse embryos at mid-gestation (e9.5-10.5), when the paired DAs are formed with arterial identity but haven't been remodelled and fused as one single aorta. The morphological data in this figure would be better to show the colocalization of LacZ expression and an arterial marker (e.g. Sox17) using immulfluorescence staining instead of purely lacZ.

      RESPONSE: This experiment was primarily conducted to demonstrate that our enhancers were arterial enriched in both zebrafish and mouse transgenesis, and feel this is clearly shown with the e14.5 transgenic embryos originally shown. We chose e14.5 because it matched the timepoints used for the single cell transcriptomics first used to select the target arterial identity genes, and feel it is a good match to 2-3 dpf zebrafish in terms of arterial differentiation mechanisms. We agree that E9-10 would have also been an additional useful timepoint, but we do not have the resources to generate this data nor consider it essential for the conclusions of our work here.

      __REVISION (PLANNED): __We are unable to perform immunofluorescence in the e14.5 transgenic embryos due to the fixation and staining solutions used for X-gal staining (which was done by an external company and could not be altered), but agree additional information is needed to demonstrate arterial endothelial specificity. We will therefore expand the analysis of sectioned embryos (currently restricted to just the Efnb2-333:LacZ transgene) to all enhancers shown in Figure 4. This analysis has some limitations due to infiltration of the X-gal solution to deeper tissues, but is anticipated it will clearly show enhancer activity in arterial endothelial cells rather than venous ECs or smooth muscle cells.

      __Reviewer #3 (Significance (Required)): __This novel work establishes an important foundation for future understanding of how TFs may interact to determine arterial specification.

      Other revisions

      In addition to changes suggested by the reviewers, we also made one additional adjustment to the paper to include analysis of two additional putative enhancers (Efnb2-159 and Cxcr4+119). These were initially omitted in error yet both regions reach the standard of testable putative enhancers (noted in small changes to Figure S1 and Table S2). When tested in zebrafish transient transgenic embryos, Cxcr4+119 was inactive whilst Efnb2-159 was active in arterial endothelial cells. The relevant tables and figures have been adjusted to reflect these changes, the most significant of which are the inclusion of Efnb2-159 positive zebrafish in Figure 1 (and the necessity to create an additional supplemental Figure (S3) to accommodate the increased number of images), and analysis of Efnb2-159 transcription factor motifs/binding as part of Figure 5 and 6. No conclusions were altered by the inclusion of this additional data.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This manuscript by Nornes et al analyzed multiple published databases and identified a group of putative enhancers for 8 selected non-Notch arterial genes in mouse and human ECs. These enhancers were cloned and screened in fish embryos to test their effect in driving GFP reporter expression, which narrowed down a cohort of enhancers for further testing of expression activities in mouse embryonic arteries. The authors then analyzed the sequences of these enhancers, and identified binding motifs of ETS, SOX-F, FOX and MEF2 family TFs and Notch transcription regulator RBPJ commonly present in closed proximity in these arterial enhancers, suggesting interaction between these TFs in determination of arterial identity.

      Major comments:

      This study provides an enormous amount of bioinformatic data analysis and screening results in transgenic fish and mouse models, which led to the discovery of a group of arterial enhancers and TFs binding motifs essential in regulating arterial identity.

      Other comments:

      1. Choice of arterial genes is slightly biased. Acvrl1/Alk1 is not enriched in arterial ECs. Sema3G, which is highly expressed in arterial ECs, is missing. UNC5B is enriched in arterial ECs but also expressed by sprouting ECs (PMID: 38866944).
      2. Exclusion of Notch genes. Although the reason for choosing non-notch genes and excluding notch genes for screening is addressed in this paper, it would be interesting to examine how the arterial enhancers identified in this study are present in the Notch genes, especially Dll4 (enriched in arterial and sprouting ECs) and Jag1 (enriched in arterial ECs).
      3. SoxF family TFs. Among the 3 members of SoxF TFs, only Sox17 and Sox7 were assessed. Though not specific, Sox18 is highly expressed in the arteries. On the contrary, Sox7 is highly expressed in the vein and shows weak expression in arterial ECs (PMID: 26630461). Minor inaccuracy in Intro/paragraph 3: though sox17 is reported as indispensable for arterial specification (PMID: 24153254), losing a single SoxF factor does not seem to completely compromise the arterial program (PMID: 24153254, PMID: 26630461). A combined loss of Sox17/18, or Sox 7/17/18, seems to do the job (PMID: 26630461).
      4. Fig.4 e14.5 mouse embryos. If the observation aims to assess the dorsal aorta, it would be better to use mouse embryos at mid-gestation (e9.5-10.5), when the paired DAs are formed with arterial identity but haven't been remodeled and fused as one single aorta. The morphological data in this figure would be better to show the colocalization of LacZ expression and an arterial marker (e.g. Sox17) using immulfluorescence staining instead of purely lacZ.

      Significance

      This novel work establishes an important foundation for future understanding of how TFs may interact to determine arterial specification.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this work, Nornes and collaborators have described a cohort of arterial enhancers that drive gene expression in arteries and not in veins. The paper is very well written and it is very informative. The authors have used in silico models to identified the potential artery enhancers and then used different developmental in vivo systems, zebrafish and mice, to validate their findings. Finally, the authors have explored what transcription factors may be binding the identified enhancer sequences and thus, drive arterial gene expression. I would like to congratulate the authors for this work that it has been a pleasure to read and review.

      Major comments:

      1. In their identification of enhancers, the authors consider a candidate every enhancer that has a putative mark in both mouse and human. Nevertheless, all the human data comes from in vitro analysis. Considering how much cell culture affects endothelial cell identity, inducing effects like EndoMT, would this have any effect on the enhancer selection? Would it be possible to search any human in vivo data? Would this allow for even stronger and more relevant sequences?
      2. The human data comes from vein endothelial or microvasculature endothelial cells. Specially because some of the enhancers identified by the authors drive also vein expression, could the authors discriminate whether this is due to the identification coming from vein cells. Is there available data from HAECs? Would this not be conceptually more correct that using vein endothelial cells data? This should be at least discussed in the paper.
      3. Although the authors use the mouse embryo to further validate their finding beyond the zebrafish, the expression are a bit different. While on the fish the enhancers label smaller vessels of arterial identity, in the mouse, only bigger arteries are marked. Is this defined by the time of the analysis?
      4. The analysis of the enhancers is only done during development. Is the activity of these enhancers maintained through live or only important for artery vs vein determination? Is the expression of the different enhancer reporters maintained into adulthood?

      Significance

      This is a very well done study with potential interest for vascular biologists, in particular to those interested in the determination between artery and veins in a context of development. It advances our knowledge on the field of vascular biology as it not only proposes potential enhancers but also goes on to validation of the enhancers. Nevertheless, it is important to note that some of this enhancers have been identified from in vitro human data. In vitro culture of endothelial cells affects their cellular identity and thus, this study may have underscored many potential enhancers.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Nornes et al. have generated a cohort of arterial enhancers based on in silico analysis and validation with transgenic lines in both zebrafish and mice. They utilized publicly available datasets for chromatin marks, including ATAC-seq on endothelial cells either from cell culture or isolated from mice, as well as EP300 binding, H3K27Ac, and H3K4Me1. Focusing on eight arterial-expressed genes, they identified a putative enhancer region marked by at least one enhancer feature. After validating the activity of these enhancers in zebrafish and mice, the authors assessed the regulatory pathways upstream of these genes. Using ChIP-seq and Cut&Run for key endothelial transcription factors, they discovered that binding sites for SoXF and ETS factors are shared in arterial enhancers, whereas binding sites for Notch, MEF2, and Fox are present only in the subset of identified enhancers. Together this study provides an arterial enhancer atlas that allows further characterisation of regulatory network behind endothelial cell identity.

      Major comments:

      The authors have assessed 15 enhancers for arterial-venous specificity, by assessing the expression in DA, ISV, cardinal and ventral veins at 2 dpf. Interestingly there is a clear difference in the expression patterns of these enhancers in the zebrafish axial vasculature, especially seen at the level of ISV. The co-localization of the enhancer expression in the endothelium was done using endothelial marks expressed in both venous and arterial EC (kdrl). To fully distinguish if the expression is venous or arterial endothelial compartment colocalization with Tg expressed in arterial (flt1) or venous (lyve1) EC would be informative. In addition, it is striking that cxc4+135 drives the expression in nearly every ISV as cxcl12+269 only every other. Similarly, not all the enhancers are enriched in the DA to the same level. Is there biological significance to this? could authors discuss these results further? The pattern of expression of the unc5b-identified enhancer is also striking, does this reflect the known roles of unc5b in the vascular formation? The final part of the paper focuses on defining the presence of "deeply conserved" transcription factor binding sites (TFSB), defined as TFBS that are as conserved as the enhancer sequence surrounding them. In literature, the term 'deep conservation' refers to evolutionary conservation (genomic sequence preservation) in a wide range of species. Therefore, the additional classification presented by the authors based on the surrounding sequence is not clear. As, the KLF motifs in the Ece1in1, which is conserved between mouse and human, are defined as "deeply conserved". However, the FLK motif in the following enhancer, Flk1in10 (one line below), gets classified as non-deeply conserved, despite also being conserved between mouse and human. Thus, in the current form, there is a contradiction in the way the authors use the term 'deeply conserved' and the accepted meaning of this term. To avoid confusion, it would be important to revise this nomenclature.

      Minor:

      Details on how the corresponding non-coding regions between mice and humans were established are missing, what alignment tool was used?

      Not sufficient details are provided for the re-analysis of siRNA data. E.g., which clustering method was used? How the clusters were assigned to cell identities?

      Details about the first HOMER analysis (in the assessment of transcription factor motifs and binding patterns at arterial enhancers) seem to be missing from the methods section.

      Pg 12: "For ETS, 23/23 arterial enhancers contained at least one conserved motif (all "deeply" conserved to the same depth as the surrounding enhancer, see S7)". Is it S8, where conservation is indicated?

      Figure 1 and 2 for non-zebrafish readers it would be useful to indicate in Figures 1 and 2 the non EC expression that can be observed in the embryos.

      Table S1: Please, indicate in the legend what the asterisk in the H DNAseI column stands for

      Figure S8: The phrasing "conserved to animal" in Figure S8 is misleading. There is no difference in something being conserved to tenrec or manatee, as both are Afrotherians. Hence, the data show that both Efnb2-141 and Ephb4-2 were present in the common ancestor of Afrotherians and humans, namely the ancestor of all placentals. Instead, it would be good to indicate the phylogenetic group for which the presence of the enhancer can be inferred (in this case, Placentalia).

      Significance

      To date, a systematic approach to identifying the regulatory networks driving endothelial cell identity is missing. This study provides important datasets and validation of enhancers involved in arterial gene expression and the associated transcription factors. Although this is only the tip of the iceberg, this work represents a significant milestone in the systematic understanding of how arterial gene expression is regulated. Overall, this study offers a powerful resource for understanding arterial gene regulation and conducting genome-wide studies of arterial enhancers.

    5. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      REVIEWER 1

      Reviewer #1 Evidence, reproducibility and clarity: Nornes et al. have generated a cohort of arterial enhancers based on in silico analysis and validation with transgenic lines in both zebrafish and mice. They utilized publicly available datasets for chromatin marks, including ATAC-seq on endothelial cells either from cell culture or isolated from mice, as well as EP300 binding, H3K27Ac, and H3K4Me1. Focusing on eight arterial-expressed genes, they identified a putative enhancer region marked by at least one enhancer feature. After validating the activity of these enhancers in zebrafish and mice, the authors assessed the regulatory pathways upstream of these genes. Using ChIP-seq and Cut&Run for key endothelial transcription factors, they discovered that binding sites for SoXF and ETS factors are shared in arterial enhancers, whereas binding sites for Notch, MEF2, and Fox are present only in the subset of identified enhancers. Together this study provides an arterial enhancer atlas that allows further characterisation of regulatory network behind endothelial cell identity.

      __Reviewer #1 Major Comment 1: __The authors have assessed 15 enhancers for arterial-venous specificity, by assessing the expression in DA, ISV, cardinal and ventral veins at 2 dpf. Interestingly there is a clear difference in the expression patterns of these enhancers in the zebrafish axial vasculature, especially seen at the level of ISV. The co-localization of the enhancer expression in the endothelium was done using endothelial marks expressed in both venous and arterial EC (kdrl). To fully distinguish if the expression is venous or arterial endothelial compartment colocalization with Tg expressed in arterial (flt1) or venous (lyve1) EC would be informative.

      RESPONSE: We agree with the reviewer that a more detailed description of arterial-venous specificity of each enhancer could be included. In the original manuscript, the expression pattern of each enhancer within the vasculature was primarily assessed at 2 days post fertilization (dpf) in Fig 1-2. This identified arteries using direction of blood flow and available descriptive information, as arterial development in 2pdf zebrafish is very stereotypical and already well characterized.

      __REVISION (PLANNED): __The original Figure 3A includes a more detailed assessment of arterial-venous specificity at 3dpf for four arterial enhancers (Cxcr4+135, Cxcr4+151, Gja5-78 and Gja5-7, chosen as enhancers representing the four types of expression patterns seen). We will now extend this more detailed analysis to all arterial enhancer:GFP lines. This analysis uses kdrl-mCherry to mark the entire vasculature, comparative to the expression of the arterial enhancers (GFP). This allows us to clearly identify the intersegmental arteries (as opposed to intersegmental veins) by looking for direct connection to the dorsal aorta, and by assessing the direction of blood flow within these vessels. This analysis is done at 3dpf to give time for the intersegmental arteries to acquire identity and connect definitively with the dorsal aorta, and for the diminishment of any GFP expression originating from the initial sprouting from the dorsal aorta. By extending this analysis to the other arterial enhancer zebrafish lines shown in Figure 2, we will be able to more clearly classify the activity of each enhancer within different vascular beds. This information will also be recorded in a new Table better detailing the timing and specificity of activity of each enhancer.

      We chose not to use arterial or venous “marker lines” (e.g. Flt1:reporter or Lyve1:reporter) for the simple reason that these are also enhancer:GFP transgenes, and therefore are not necessarily definitive of the arterial or venous lineage per se (e.g. Flt1:GFP expression is controlled by the transcription factors binding the Flt1 enhancer in the same way that Cxcr4+135 and the others are, with the added caveat that the transcriptional regulation of the Flt1 and Lyve enhancers are not well defined). We felt that morphological determination based on direct connections and blood flow direction was therefore more accurate.

      __Reviewer #1 Major Comment 2: __In addition, it is striking that cxc4+135 drives the expression in nearly every ISV as cxcl12+269 only every other. Similarly, not all the enhancers are enriched in the DA to the same level. Is there biological significance to this? could authors discuss these results further? The pattern of expression of the unc5b-identified enhancer is also striking, does this reflect the known roles of unc5b in the vascular formation?

      __RESPONSE: __We agree, the diversity of enhancer expression patterns within the arterial compartment is notable, and really very interesting. The variations in enhancer expression pattern must be largely influenced by the transcription factor motifs within each enhancer, as these patterns were seen in both transient and stable transgenic zebrafish and therefore largely independent of chromatin integration location.

      __REVISION (PLANNED): __The extension of Figure 3A to all enhancer lines (see previous comment) will permit us to more clearly classify the activity of each arterial enhancer within different beds and at different time points. Currently there were no clear links between a particular transcription factor motif/binding and expression pattern, something that is discussed briefly in the original Results and Discussion sections. However, the expansion of Figure 3A to all enhancers, and the creation of a Table summarizing this more systematically will make the link (or lack of one) between expression patterns within the arterial tree and TF motifs easier to appreciate and discuss.

      __Reviewer #1 Major Comment 3: __The final part of the paper focuses on defining the presence of "deeply conserved" transcription factor binding sites (TFSB), defined as TFBS that are as conserved as the enhancer sequence surrounding them. In literature, the term 'deep conservation' refers to evolutionary conservation (genomic sequence preservation) in a wide range of species. Therefore, the additional classification presented by the authors based on the surrounding sequence is not clear. As, the KLF motifs in the Ece1in1, which is conserved between mouse and human, are defined as "deeply conserved". However, the FLK motif in the following enhancer, Flk1in10 (one line below), gets classified as non-deeply conserved, despite also being conserved between mouse and human. Thus, in the current form, there is a contradiction in the way the authors use the term 'deeply conserved' and the accepted meaning of this term. To avoid confusion, it would be important to revise this nomenclature.

      RESPONSE: We agree that this nomenclature should be revised. Our aim was to develop a standard approach to transcription factor motif analysis that could be applied to enhancers regardless of conservation levels and size, and easily replicated by others. Because not all functional transcription factor motifs within enhancers are necessarily conserved between species, we were careful to label both conserved and non-conserved motifs for each TF examined. Nonetheless, extra emphasis was placed on motifs with confirmatory TF binding evidence (e.g. ChIP-seq/CUT&RUN), and those conserved at the same depth as the surrounding sequence. This was because our previous work on endothelial enhancers clearly indicates that these motifs are far more likely to play a key role in regulation. However, the reviewer is correct to note that referring to such motifs as “deeply” conserved could be misinterpreted.

      REVISION (COMPLETED): We have altered our nomenclature. This is explained in the relevant Results sections: “Because the level of conservation of motifs can often be an indication of their importance to enhancer activity, we classified each motif into three categories: strongly conserved (motif conserved to the same depth of the surrounding sequence), weakly conserved (motif conserved in orthologous human enhancer but not to the same depth as the surrounding sequence) and not conserved (motif is not conserved within the orthologous human sequence)”.

      Two enhancers (Unc5b-57 and Cdh1-1) were only conserved human-mouse, therefore each TF motif within these enhancers could be annotated as both weakly and strongly conserved. As the reviewer noted, this does create confusion. We have now adjusted Figure 5 to use a distinct shape for motifs for which no distinction between weak and strong motif can be made. This does not cover Ece1in1, which is conserved human-mouse-tenrec but was erroneously originally labelled human-mouse only. This error has been corrected.

      __Reviewer #1 Minor Comment 1: __Details on how the corresponding non-coding regions between mice and humans were established are missing, what alignment tool was used?

      RESPONSE AND REVISION (COMPLETED): This information has now been included in the relevant Results section: “Orthologous human enhancer sequences were identified for every enhancer using the Vertebrate Multiz Alignment & Conservation Track on the UCSC genome browser”

      __Reviewer #1 Minor Comment 2: __Not sufficient details are provided for the re-analysis of siRNA data. E.g., which clustering method was used? How the clusters were assigned to cell identities?

      RESPONSE AND REVISION (COMPLETED): The details regarding the re-analysis of scRNA data has been expanded in the Methods sections: “Publicly available E12 and E17.5 scRNA-seq data from EC isolated from BmxCreERT2;RosatdTomato lineage traced murine hearts54 was obtained from GEO (GSE214942) prior to processing FASTQ files with the 10X Genomics CellRanger pipeline (V7.0.0). RNA-seq reads were aligned to the mm10 genome reference downloaded from 10X Genomics with the addition of the TdTomato-WPRE sequence. Exclusion of low quality cells with either a UMI count >100,000, total gene count 10%) was performed using Scater55. Data normalisation was performed using the MultiBatchNormalisation method prior to merging of TdTomato positive and negative datasets from individual timepoints. The top 2000 most highly variable genes (excluding mitochondrial and ribosomal genes) in the merged datasets were identified using the Seurat FindVariableFeatures method and utilised to calculate principal component analysis (PCA). Normalised data was scaled using the ScaleData function. Cell clustering was performed using the standard unsupervised graph-based clustering method implemented within Seurat (V4)56. Clusters were visualised in two dimensions using UMAP based non-linear dimensional reduction following the standard Seurat (V4) workflow49. Identified clusters were assigned identities based on marker genes shown to be differentially expressed between populations previously identified in the original study47. Key markers include Npr3 (endocardial), Fabp4 (coronary vascular endothelial), and Nfatc1 (valvular endothelial). The E12.5 sinus venosus EC cluster was assigned based in Aplnr as previously described54. Arterial and venous EC clusters in the E17.5 datasets were annotated based on their enriched expression of Gja5 and Nr2f2, respectively.”

      __Reviewer #1 Minor Comment 3: __Details about the first HOMER analysis (in the assessment of transcription factor motifs and binding patterns at arterial enhancers) seem to be missing from the methods section.

      RESPONSE AND REVISION (COMPLETED): This has been included in the methods: “Analysis of overrepresented motifs within our validated arterial enhancer cohort was performed with HOMER’s findMotifsGenome tool using the full validated region of the arterial enhancers. The analysis used the hg38 masked genome and otherwise default settings for all other parameters including randomly selected background regions”.

      __Reviewer #1 Minor Comment 4: __Pg 12: "For ETS, 23/23 arterial enhancers contained at least one conserved motif (all "deeply" conserved to the same depth as the surrounding enhancer, see S7)". Is it S8, where conservation is indicated?

      __ ____RESPONSE AND REVISION (COMPLETED):__ We have corrected this error in the text – no figure actually needed to be referenced here as the previous sentence contained the full list of relevant figures to this statement (Table 2 and Figures 5 and S9, previously called S8, are the places to see this information).

      __Reviewer #1 Minor Comment 5: __Figure 1 and 2 for non-zebrafish readers it would be useful to indicate in Figures 1 and 2 the non EC expression that can be observed in the embryos.

      RESPONSE AND REVISION (COMPLETED): In addition to arterial expression, a number of the enhancer:GFP transgenes also showed GFP expression within the neural tube. In addition, some transient transgenic embryos also showed ectopic expression in muscle fibres. These have now been indicated on the images in Figure 1 and 2.

      __Reviewer #1 Minor Comment 6: __Table S1: Please, indicate in the legend what the asterisk in the H DNAseI column stands for

      RESPONSE AND REVISION (COMPLETED): The asterisk indicates where DNaseI hypersensitivity is also seen in multiple non-EC lines. This explanation has been added to the legend.

      __Reviewer #1 Minor Comment 7: __Figure S8: The phrasing "conserved to animal" in Figure S8 is misleading. There is no difference in something being conserved to tenrec or manatee, as both are Afrotherians. Hence, the data show that both Efnb2-141 and Ephb4-2 were present in the common ancestor of Afrotherians and humans, namely the ancestor of all placentals. Instead, it would be good to indicate the phylogenetic group for which the presence of the enhancer can be inferred (in this case, Placentalia).

      __RESPONSE: __Whilst I appreciate the point, it is the exact sequence that is important here – obviously tenrec and manatee are similar species but still contain differences in nucleotide sequences. The information about conservation leads the reader to the exact species with which the comparison is being made. We tried to restrict this to just one species per phylogenetic group (e.g. tenrec, opossum, chicken, zebrafish) but occasionally this was not possible.

      Reviewer #1 Significance

      To date, a systematic approach to identifying the regulatory networks driving endothelial cell identity is missing. This study provides important datasets and validation of enhancers involved in arterial gene expression and the associated transcription factors. Although this is only the tip of the iceberg, this work represents a significant milestone in the systematic understanding of how arterial gene expression is regulated. Overall, this study offers a powerful resource for understanding arterial gene regulation and conducting genome-wide studies of arterial enhancers.

      __RESPONSE: __We thank the reviewer for these kind words. Whilst we agree this is only a very small snapshot of all the arterial enhancers involved in gene regulation, we would like to stress that not only is this a massive increase to what has been known previously, but is also deliberately focused on the genes used to define arterial identity during development and in the adult, therefore these enhancers by themselves form an extremely valuable dataset with which to study the key factors driving arterial differentiation and identity.

      __ __


      REVIEWER 2

      __Reviewer #2 Evidence, reproducibility and clarity: __In this work, Nornes and collaborators have described a cohort of arterial enhancers that drive gene expression in arteries and not in veins. The paper is very well written and it is very informative. The authors have used in silico models to identified the potential artery enhancers and then used different developmental in vivo systems, zebrafish and mice, to validate their findings. Finally, the authors have explored what transcription factors may be binding the identified enhancer sequences and thus, drive arterial gene expression. I would like to congratulate the authors for this work that it has been a pleasure to read and review.

      Reviewer #2 Major Comment 1: In their identification of enhancers, the authors consider a candidate every enhancer that has a putative mark in both mouse and human. Nevertheless, all the human data comes from in vitro analysis. Considering how much cell culture affects endothelial cell identity, inducing effects like EndoMT, would this have any effect on the enhancer selection? Would it be possible to search any human in vivo data? Would this allow for even stronger and more relevant sequences?

      __RESPONSE: __We agree that the use of human endothelial cells in culture raises some potential issues. However, we stress that the mouse EC enhancer marks, which played a key role in defining putative enhancers, come from in vivo analysis (E11 embryos, P6 retina and adult aorta), limiting the potential for significant impact from cell culture-induced issues. Whilst we would have enthusiastically incorporated human in vivo data had it been available, our approach was still indisputably successful at identifying arterial enriched/specific enhancers.

      We consider it unlikely that culture/identity-related problems with human cultured ECs led to a significant undercount of enhancers, in part because comparatively few regions with enhancer marks in mouse in vivo ECs were excluded due to the absence of human enhancer marks. In fact, Cxcr4, Cxcl12, and Gja5 were poorly transcribed in the human cell lines studied here and consequently only enhancer marks in mouse were used to define putative enhancers for these three genes (this is clearly stated in the Results section). If a similar rational had applied to the remaining five genes, only an additional six putative enhancers would have been tested (one for Gja4, two for Nrp1 and three for Unc5b). However, we felt it made sense to include analysis of human enhancer marks for these five genes, as all were expressed in the human ECs used (as indicated by H3K1Me3 and DNaseI hypersensitivity at promoter regions) and orthologous human enhancers were identified for all. Additionally, our retrospective analysis of previously described mammalian in vivo-validated EC enhancers (Table S1 in the original manuscription, including eight arterial enhancers) found that all 32 were marked by at least one enhancer mark in human samples (1/32 did not contain mouse enhancer marks). We also tested eleven regions that did not reach our putative enhancer threshold, including five with only mouse marks. None of these directed expression in transgenic analysis.

      Reviewer #2 Major Comment 2: The human data comes from vein endothelial or microvasculature endothelial cells. Specially because some of the enhancers identified by the authors drive also vein expression, could the authors discriminate whether this is due to the identification coming from vein cells. Is there available data from HAECs? Would this not be conceptually more correct that using vein endothelial cells data? This should be at least discussed in the paper.

      __RESPONSE AND REVISION (COMPLETED): __We have now included a comparison with enhancer marks from HAECs, telo-HAECs and HUAECs as a new Figure S5. The enhancer marks seen in these cells were very similar to those in the HUVEC and microvascular cells already surveyed. Had enhancer marks within HAECs/telo-HAEC/HUAECs been included as a human enhancer mark in our initial survey, it would have been unlikely to have altered our analysis, although we agree it would have made it more conceptually correct. We chose not to go back and engineer this into our original enhancer selection rational however as we felt it would be intellectually dishonest. A paragraph has been added to the Results section about this analysis.

      Reviewer #2 Major Comment 3: Although the authors use the mouse embryo to further validate their finding beyond the zebrafish, the expression are a bit different. While on the fish the enhancers label smaller vessels of arterial identity, in the mouse, only bigger arteries are marked. Is this defined by the time of the analysis?

      __RESPONSE: __This experiment was conducted to demonstrate that these enhancers were arterial enriched in both zebrafish and mouse transgenesis, and feel this is clearly shown by the current data. Whilst I do not really agree that the expression pattern is different (for example, the Gja5 enhancers are more restricted to the major arteries in both zebrafish and mouse, compared to the more widely expressed Efnb2-333), this is challenging to ascertain at a single time-point in a transient transgenic mouse assay. Whilst it would be potentially interesting to better assess the activity of these enhancers over time in mice, we consider this a lengthy experiment (multiple stable lines would need to be established and characterized for each enhancer) which would not add particular benefit to this paper.

      Reviewer #2 Major Comment 4: The analysis of the enhancers is only done during development. Is the activity of these enhancers maintained through live or only important for artery vs vein determination? Is the expression of the different enhancer reporters maintained into adulthood?

      RESPONSE AND REVISION (PLANNED): We agree this would be interesting to ascertain. We plan to examine the activity of enhancer:GFP activity in adult fish fins (which are accessible even without crossing into a casper background, which is beyond the timescale of this project) in the fully revised version of this paper. We have already conducted a feasibility study on four arterial enhancers:GFP lines (Gja5-7:GFP, Gja5-78:GFP, Gja4+40:GFP and Efnb2-333:GFP), which found that all four were still active, and arterial-specific, in the adult.

      Reviewer #2 Significance

      This is a very well done study with potential interest for vascular biologists, in particular to those interested in the determination between artery and veins in a context of development. It advances our knowledge on the field of vascular biology as it not only proposes potential enhancers but also goes on to validation of the enhancers. Nevertheless, it is important to note that some of this enhancers have been identified from in vitro human data. In vitro culture of endothelial cells affects their cellular identity and thus, this study may have underscored many potential enhancers.

      REVIEWER 3

      __Reviewer #3: Evidence, reproducibility and clarity: __This manuscript by Nornes et al analyzed multiple published databases and identified a group of putative enhancers for 8 selected non-Notch arterial genes in mouse and human ECs. These enhancers were cloned and screened in fish embryos to test their effect in driving GFP reporter expression, which narrowed down a cohort of enhancers for further testing of expression activities in mouse embryonic arteries. The authors then analyzed the sequences of these enhancers, and identified binding motifs of ETS, SOX-F, FOX and MEF2 family TFs and Notch transcription regulator RBPJ commonly present in closed proximity in these arterial enhancers, suggesting interaction between these TFs in determination of arterial identity.

      Reviewer #3 Major Comment : This study provides an enormous amount of bioinformatic data analysis and screening results in transgenic fish and mouse models, which led to the discovery of a group of arterial enhancers and TFs binding motifs essential in regulating arterial identity.

      Reviewer #3 Other Comments ____1: Choice of arterial genes is slightly biased. Acvrl1/Alk1 is not enriched in arterial ECs. Sema3G, which is highly expressed in arterial ECs, is missing. UNC5B is enriched in arterial ECs but also expressed by sprouting ECs (PMID: 38866944).

      __RESPONSE: __When we started this project, scRNA-seq datasets in the developing vasculature were less available. Consequently, we initially based our choice of genes on data from Raftrey et al., Circ Res 2021 (available earlier on bioRxiv), which was focused on mouse coronary arterial ECs at the timepoints that arteries differentiate. This found Acvrl1 to be arterial enriched (not a novel observation, many publications treat Acvrl1 as arterial specific or arterial-enriched) and did not list Sema3g. We also considered a wider dataset from mouse and human mid-gestation embryos when available (Hou et al., Cell Research 2022). However, it is important to note that we did not aim to investigate every arterial-enriched gene, rather to use these datasets to help identify loci associated with gene expression patterns which indicated a high likelihood of containing arterial enhancers active during arterial differentiation.

      Sc-RNAseq data from both Raftery et al., and Hou et al., indicated that arterial ECs are subdivided into two groups, reflecting maturity but also potentially slightly different developmental trajectories. The genes studied here were therefore selected to evenly cover both subgroups, with Acvrl1, Cxcl12, Gja5 and Nrp1 primarily restricted to the mature arterial EC subgroup, while Cxcr4, Efnb2, Gja4 and Unc5b were also expressed in the less mature/arterial plexus/pre-arterial EC subgroup. It is notable that genes within the latter subgroup are also associated with angiogenic/sprouting ECs (Dll4 also belongs to this subgroup), which likely indicates biological links between angiogenesis and arterial identity rather than a problem in gene choice and specificity.

      __REVISION (COMPLETED): __This is already discussed in the Results section (angiogenic expression of arterial genes is discussed within the MEF2 and RBPJ sections) and in the Discussion (paragraph 2, referring to different expression patterns within arterial ECs). However, we have now edited the relevant Results section to better explain gene selection: “It is therefore clear that a better understanding of the regulatory pathways directing arterial differentiation requires the identification and characterization of a larger number of arterial enhancers directing the expression of key arterial identity genes. To identify a cohort of such enhancers, we looked in the loci of eight non-Notch genes: Acvrl1(ALK1) Cxcr4, Cxcl12, Efnb2, Gja4(CX37), Gja5 (CX40), Nrp1 and Unc5b. Although not a definitive list of arterial identity genes, single cell transcriptomic analysis indicates these genes are all significantly enriched in arterial ECs4,20, and are commonly used to define arterial EC populations in mouse and human scRNAseq analysis4,5,20,54. Additionally, single-cell transcriptomic data indicates that arterial ECs can be divided into two subgroups4,20. The genes selected here are equally split between subgroups (Acvrl1, Cxcl12, Gja5 and Nrp1 from the mature arterial EC subgroup, Cxcr4, Efnb2, Gja4 and Unc5b from the less mature/arterial plexus/pre-arterial EC subgroup)4,20. We did not exclude genes also implicated in angiogenesis/expressed in sprouting ECs, as these genes formed that vast majority of those associated with the less mature EC subgroup”.

      Reviewer #3 Other Comments ____2: Exclusion of Notch genes. Although the reason for choosing non-notch genes and excluding notch genes for screening is addressed in this paper, it would be interesting to examine how the arterial enhancers identified in this study are present in the Notch genes, especially Dll4 (enriched in arterial and sprouting ECs) and Jag1 (enriched in arterial ECs).

      __RESPONSE: __Previous work from our lab and others has already examined arterial enhancers for Notch pathway genes. We already included these enhancers in all our later analysis (Figure 5-6 and relevant supplemental figures), including analysis of TF motifs.

      Reviewer #3 Other Comments ____3: SoxF family TFs. Among the 3 members of SoxF TFs, only Sox17 and Sox7 were assessed. Though not specific, Sox18 is highly expressed in the arteries. On the contrary, Sox7 is highly expressed in the vein and shows weak expression in arterial ECs (PMID: 26630461).

      __RESPONSE AND REVISION (PLANNED): __We agree. We will include assessment of SOX18 binding in our final revised manuscript. An antibody for this analysis has been identified already.

      Reviewer #3 Other Comments ____4: Minor inaccuracy in Intro/paragraph 3: though sox17 is reported as indispensable for arterial specification (PMID: 24153254), losing a single SoxF factor does not seem to completely compromise the arterial program (PMID: 24153254, PMID: 26630461). A combined loss of Sox17/18, or Sox 7/17/18, seems to do the job (PMID: 26630461).

      __RESPONSE: __We have altered this section: “The evidence linking SOXF transcription factors to arterial differentiation is more extensive, with loss of either SOX17 (the SOXF factor most specific to arterial ECs) or SOX7 resulting in arterial defects21–24. Whilst losing a single SOXF factor does not entirely compromise the arterial program, arterial differentiation appears absent after compound Sox17;Sox18 and Sox7;Sox17;Sox18 deletion, although this occurs alongside significantly impaired angiogenesis and severe vascular hyperplasia21–24. PMID 24153254 is reference 23, PMID 26630461 is reference 24.

      Reviewer #3 Other Comments ____5: Fig.4 e14.5 mouse embryos. If the observation aims to assess the dorsal aorta, it would be better to use mouse embryos at mid-gestation (e9.5-10.5), when the paired DAs are formed with arterial identity but haven't been remodelled and fused as one single aorta. The morphological data in this figure would be better to show the colocalization of LacZ expression and an arterial marker (e.g. Sox17) using immulfluorescence staining instead of purely lacZ.

      RESPONSE: This experiment was primarily conducted to demonstrate that our enhancers were arterial enriched in both zebrafish and mouse transgenesis, and feel this is clearly shown with the e14.5 transgenic embryos originally shown. We chose e14.5 because it matched the timepoints used for the single cell transcriptomics first used to select the target arterial identity genes, and feel it is a good match to 2-3 dpf zebrafish in terms of arterial differentiation mechanisms. We agree that E9-10 would have also been an additional useful timepoint, but we do not have the resources to generate this data nor consider it essential for the conclusions of our work here.

      __REVISION (PLANNED): __We are unable to perform immunofluorescence in the e14.5 transgenic embryos due to the fixation and staining solutions used for X-gal staining (which was done by an external company and could not be altered), but agree additional information is needed to demonstrate arterial endothelial specificity. We will therefore expand the analysis of sectioned embryos (currently restricted to just the Efnb2-333:LacZ transgene) to all enhancers shown in Figure 4. This analysis has some limitations due to infiltration of the X-gal solution to deeper tissues, but is anticipated it will clearly show enhancer activity in arterial endothelial cells rather than venous ECs or smooth muscle cells.

      __Reviewer #3 (Significance (Required)): __This novel work establishes an important foundation for future understanding of how TFs may interact to determine arterial specification.

      Other revisions

      In addition to changes suggested by the reviewers, we also made one additional adjustment to the paper to include analysis of two additional putative enhancers (Efnb2-159 and Cxcr4+119). These were initially omitted in error yet both regions reach the standard of testable putative enhancers (noted in small changes to Figure S1 and Table S2). When tested in zebrafish transient transgenic embryos, Cxcr4+119 was inactive whilst Efnb2-159 was active in arterial endothelial cells. The relevant tables and figures have been adjusted to reflect these changes, the most significant of which are the inclusion of Efnb2-159 positive zebrafish in Figure 1 (and the necessity to create an additional supplemental Figure (S3) to accommodate the increased number of images), and analysis of Efnb2-159 transcription factor motifs/binding as part of Figure 5 and 6. No conclusions were altered by the inclusion of this additional data.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for going through our manuscript and providing valuable feedback. We are grateful to all 3 reviewers for describing our findings as important and valuable, well-designed and robust, and of value to the Parkinson's and Crohn's disease communities studying LRRK2. Below we detail a point-by-point response to the reviewers.

      __Reviewer #1 (Evidence, reproducibility and clarity (Required)): __

      The paper by Dikovskaya and collaborators investigated the activitiy and expression of LRRK2 in different subtypes of splenic and intestinal immune cells, taking advantage of a novel GFP-Lrrk2 knockin mouse. Interestingly, they found that T-cell-released IL-4 stimulates Lrrk2 expression in B cells. I have a few comments and suggestions for the authors. 1) Figure 1C. LRRK2 KO cells display residual Rab10 phosphorylation. Do the authors have any idea of which kinase other than LRRK2 could be involved in this phosphorylation?

      As far as we are aware no other kinase is known to phosphorylate Rab10 at T73 in vivo. In vitro, recombinant Rab10 can be phosphorylated by MST3 at this site (Knebel A. et al, protocols.io https://dx.doi.org/10.17504/protocols.io.bvjxn4pn), but its relevance in vivo or in cells has not been shown. It is possible that the residual band recognised by anti-pT73 Rab10 ab in splenocytes is unspecific background, as it is mainly seen in LRRK2 KO spleen cells and not in other tissues. But to be certain that our assay assesses LRRK2-dependent Rab10 phosphorylation, we have always compared with the MLi-2 control.

      2) Since there are no good antibodies for IF/IHC as pointed by the authors, the GFP-Lrrk2 mouse gives the opportunity to check endogenous LRRK2 localization, i.e. in cells untreated or treated with IL-4 or other cytokines. Also, does endogenous GFP-LRRK2 accumulate into filaments/puncta upon MLi2 inhibition? The relocalization into filaments of inhibited LRRK2 has been observed in overexpression but not under endogenous expression. This analysis would be interesting also in light of the observed side effect of type-I inhibitors.

      We thank the reviewer for this suggestion. We will attempt a super-resolution microscopy using Airyscan with isolated B-cells treated with cytokine and/or LRRK2 inhibitor to address this question.

      3) Figure 5. The authors need to label more clearly the graphs referring to wt mice versus GFP-Lrrk2 KI mice.

      We have now labelled the panels referring to the WT mice only with "WT mice", to distinguish them from the other panels that incorporate data from both EGFP-Lrrk2 mice and their WT littermates used as a background.

      They should also replace GFP-LRRK2 with GFP-Lrrk2 since they edited the endogenous murine gene.

      Thank you, we have corrected it, and also the other mouse genotypes.

      4) In the material and methods MLi-2 administration in mice is indicated at 60 mg/kg for 2 hr whereas in suppl. figure 5 the indicated dose is 30 mg/kg. Please correct with the actual dose used.

      Thank you, we have corrected the mistake.

      5) The discovery of IL-4 as a Lrrk2 activator in B cells is a very interesting and novel finding. The authors could take advantage of the GFP tag to investigate LRRK2 interactome upon IL-4 stimulation (optional). Also, is the signaling downstream of IL-4 attenuated in Lrrk2 KO cells?

      We thank the reviewer for these interesting suggestions. The role of LRRK2 in IL-4 activated B-cells is currently under active research in the lab.

      Reviewer #1 (Significance (Required)):

      The manuscript is well designed and organized, and the experimental approaches are robust. These results are significant for the field as they add additional layers in the complex regulation and regulatory roles of LRRK2 in immunity, with implication for inflammatory disorders and Parkinson's disease.

      We thank the reviewer for their positive comments and for recognising our efforts to provide some clarity to a complex field.

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)): __

      The authors present a flow cytometry methodology to assess LRRK2 expression and pathway markers in mouse models and explore LRRK2 in splenic and intestinal immune cells. This is a highly valuable study given the emerging understanding that LRRK2 pathway activity in peripheral tissues may be of crucial importance to Parkinson's disease and Crohn's disease. P8 : the authors state that their results indicate 'that the effects of LRRK2-R1441C mutation and inflammation on LRRK2 activity represent two different parallel pathways'. This seems like an overinterpretation as pathway suggests the presence of additional partners in the pathway while R1441C is a LRRK2 intrinsic modification. The results can equally be explained by synergistic effects between both activation mechanisms (mutant and inflammation).

      We agree with the reviewer, and have added this into the text. The sentence now reads "suggesting that the LRRK2-R1441C mutation and inflammation have different impacts on LRRK2 activity, either in parallel or in synergy."

      Methods and experiment descriptions in results : the authors appear to use the terms anti-CD3 stimulation and CD3 stimulation interchangeably, although it is not always clear in the text that these are synonymous. This should be clarified.

      We thank reviewer for pointing out this error on our part. We have made the necessary changes to always refer to the stimulation as anti-CD3.

      One major observation in this paper is that LRRK2 is not detected in gut epithelial cells as previously has been reported. It would be useful to comment on any differences between the presented protocol and the previous reports, in particular relating to the antigen retrieval step. In order to reinforce the finding, it would be useful to include in situ hybridization data that could further strengthen the observations of which cellular subtypes express LRRK2 and which do not. Indeed, while the KO control shows that there is an unacceptable high non-specific staining, it does not prove absence of expression. Also, can any conclusions be made about expression of LRRK2 in neural cells of the gut? This important information on LRRK2 detection in gut should be mentioned in the abstract and highlighted in the discussion.

      We thank the reviewer for pointing this out. In fact, we think the observation that LRRK2 is not detected in epithelial cells is so important that we have a separate manuscript exploring this point. Please see 1. Tasegian, A. et al.https://doi.org/10.1101/2024.03.07.582590 (2024). In this manuscript we have explored the expression of LRRK2 in human and murine intestinal epithelial cells using qPCR. Although we do not have in situ hybridization data, we believe that using both the EGFP-LRRK2 and the pRab10 flow cytometry, as well as qPCR and proteomics on selected cell types, corroborates our findings on the cell types that express LRRK2. We did not analyse LRRK2 expression in the neural cells of the gut, as the focus was on the immune cells, however we hope that others will use the tools developed here to explore this further.

      The authors mention in the discussion that they 'show for the first time that eosinophils also express active LRRK2 at levels comparable to B-cells and DCs.' The relevance of this finding should be further developed. Why is this important?

      We thank the reviewer for this point. We don't know how LRRK2 is important in these cells. However, as the role of LRRK2 in eosinophils and neutrophils has not yet been explored and both cell types play important roles in IBD, we think it is important to point out. We have now added a sentence to the discussion highlighting the importance of eosinophils in IBD. "Since eosinophils have recently been implicated as key player in intestinal defense and colitis(Gurtner et al, 2022), it will be interesting to evaluate LRRK2 functions in these cells."

      In the isolation of lamina propria cells, what efforts were made to characterize the degree of purification of the lamina propria cells compared to cells of other gut wall layers such as epithelium, muscularis mucosa, or deeper layers? Please specify.

      Isolation of lamina propria cells is a very well-established process (LeFrancois and Lycke, 'Isolation of Mouse Small Intestinal Intraepithelial Lymphocytes, Peyer's Patch, and Lamina Propria Cells.' Curr. Protocols in Immunology 2001), where we extensively wash off the epithelial layer before digesting the tissue for the LP. After the digestion the muscle and wall of the gut are still intact, so we do not get any contamination with other deeper layers. The subsets of cells we find in the LP are in line with isolations from other labs.

      Minor comments Figure 5G, for the graphs indicating LRRK2 activity and LRRK2 phosphorylation, the specific measures should be specified in the graph titles to avoid any ambiguity (pT73-Rab10, pS935-LRRK2).

      We have added the specifications to the new version of the figure.

      Suppl figure 1 : please specify the figure label and abbreviation AF568 in the legend. Suppl figure 2 : please specify the figure label and abbreviation anti-rb in the legend

      Thank you, we added the abbreviations to the legends. The Figure labels for both figures have been already included at the top of figure legends.

      Reviewer #2 (Significance (Required)):

      The authors present a flow cytometry methodology to assess LRRK2 expression and pathway markers in mouse models and explore LRRK2 in splenic and intestinal immune cells. This is a highly valuable study given the emerging understanding that LRRK2 pathway activity in peripheral tissues may be of crucial importance to Parkinson's disease and Crohn's disease.

      We thank the reviewer for recognising the value of this study.

      Reviewer #3

      Evidence, reproducibility and clarity

      The paper describes a set of experiments to analyse LRRK2 activity in tissues and despite it has very important findings and technical developments is largely descriptive. It does look like a collection of experiments more than a defined hypothesis and experiments to address that.

      We thank the reviewer for recognising the importance of our findings and the technical developments. We agree that the paper's focus is to describe where LRRK2 is expressed in immune cells, and in which cells is it active or activated after inflammation in a hypothesis-free unbiased manner. We believe this is important data to share as a resource for the wider LRRK2 community and we will submit the manuscript as a Resource.

      The flow cytometry assay of the first part is a great technical challenge and represents the establishment of a potentially very useful tool for the field. It would have been important to test other organs, either as controls or for example because of their relevance e.g. lungs. This first part is disconnected from the second part below.

      We thank the reviewer for pointing out that the pRab10 assay would be useful to apply to other organs too. Since we are interested in the role of LRRK2 in IBD, we had focused on applying the pRab10 assay on intestinal tissue, with spleens also analysed as major lymphoid organ and a source of immune cells that can translocate to the gut in inflammation. We hope that the publication of this method would allow other researchers to analyse other tissues in the future.

      The authors generated a new mouse KI mouse expressing EGFP-LRRK2 and show data the levels of LRRK2 expression are reduced in tissues at different degrees and established a flow cytometry assay to measure LRRK2 expression by monitoring the GFP signal. Interestingly they found that expression does not correlate with activity (as measured by phospho-Rabs). I suggest taking this part out as it breaks the flow of the paper. If data using this mouse is included, then microscopy should be included to complement the flow cytometry data. I understand the mice were used later with the anti-CD3 treatment, but it is very confusing that some experiments are done with EGFP-LRRK2 mice and others not. It does look in general like the mice do not behave as wild types and this is an important caveat. Without microscopy of the tissues or even cells (Figure 4) is hard to conclude much about these experiments.

      We thank the reviewer for this point and would like to explain. It is true that in Suppl Figure 5, we show reduction of LRRK2 signal in the EGFP-Lrrk2-KI mice. However, based on immunoblotting, a significant reduction in EGFP-LRRK2 expression levels was seen only in the brain, but not in the tissues we analysed, that is the spleen and the intestine. Further, we have shown clearly using proteomics (Fig. 3D and 5E), that the GFP signal in immune cells correlates very well with the WT LRRK2 expression. Therefore, we think that the GFP signal in these mice reflects WT LRRK2 expression pattern. Further, despite the limitations of reduced kinase activity that we thoroughly describe, we think this model is very useful since no antibodies work to stain for LRRK2 in mice. We therefore respectfully disagree with this reviewer that the EGFP-LRRK2 data should be taken out, as it has proven to be an invaluable tool to measure and track changes in endogenous LRRK2 expression. Moreover, we think the fact that LRRK2 expression does not correlate with levels of activity, that is, LRRK2 is more active in some immune cells than in others, is a very important finding that evidences the cell-specific regulation of LRRK2 activity beyond its expression level.

      We tried but failed to visualize the EGFP-LRRK2 signal using fluorescence microscopy in the tissue. This is most likely due to the low expression of LRRK2 (proteomics data suggests that even neutrophils express less than 9000 copies), confounded further by the high background autofluorescence in tissues, especially in the gut. We now explain the lack of tissue images from the EGFP-LRRK2 mice in the text. However, we can visualize the EGFP-LRRK2 in B cells, and we will provide these images in a revised version of the manuscript.

      We have also added the following paragraph to the discussion:

      "We complemented the pRab10 assay with the development of the EGFP-Lrrk2-KI reporter mouse. Although the reporter was initially designed as a fluorescent tracker for imaging LRRK2 localisation in cells and tissues, the low expression of LRRK2, combined with high and variable autofluorescence in tissues precluded its use for microscopy. Even in neutrophils, which express highest level of LRRK2 among immune cells, there are less than 9000 copies of LRRK2 per cell (Sollberger et al, 2024), making it difficult to identify localization. However, the EGFP signal was sufficient for flow cytometry-based measurements, where background autofluorescence of each cell type was taken into account and subtracted."

      Then the authors show that LRRK2 expression and activity is different in different cell types and depends on inflammation. The anti-CD3 strategy to induce inflammation is very different from physiological inflammation such as sepsis and LPS stimulation, so experiments with other stimuli could be important here to contribute to the message of inflammatory trigger of LRRK2 activation and decoupling of cell type.

      We thank the reviewer for this suggestion. We used the anti-CD3 model as it also causes intestinal inflammation, and mimics T-cell cytokine storms that happens in many diseases. However, for the revisions we will also test another model of inflammation as suggested, such as LPS stimulation, to measure how inflammation affects LRRK2 expression and activity.

      The IL-4 data is intriguing but too preliminary. The lack of strong effect of IFN-gamma is expected as the promoter of LRRK2 in mice and humans is different and human cells responds much better with regards to LRRK2 expression after IFN-gamma stimulation.

      We are confused by what the reviewer means by saying the IL-4 data is preliminary. We have shown by flow cytometry, immunoblotting, qPCR and proteomics that IL-4 induced LRRK2 expression in B-cells. So we are uncertain as to how else this can be shown. As to the effect of IFNγ on LRRK2 expression, it may indeed be that human cells respond better than murine cells. Importantly, the IL-4 ability to induce LRRK2 in B-cells is a novel and important finding, regardless of the effects of IFNγ.

      Reviewer #3 (Significance (Required))

      The paper describes a set of experiments to analyse LRRK2 activity in tissues and despite it has very important findings and technical developments is largely descriptive. It does look like a collection of experiments more than a defined hypothesis and experiments to address that.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The authors present a flow cytometry methodology to asses LRRK2 epxression and pathway markers in mouse models and explore LRRK2 in splenic and intestinal immune cells. This is a highly valuable study given the emerging understanding that LRRK2 pathway activity in peripheral tissues may be of crucial importance to Parkinson's disease and Crohn's disease.

      P8 : the authors state that their results indicate 'that the effects of LRRK2-R1441C mutation and inflammation on LRRK2 activity represent two different parallel pathways'. This seems like an overinterpretation as pathway suggests the presence of additional partners in the pathway while R1441C is a LRRK2 intrinsic modification. The results can equally be explained by synergistic effects between both activation mechanisms (mutant and inflammation).

      Methods and experiment descriptions in results : the authors appear to use the terms anti-CD3 stimulation and CD3 stimulation interchangeably, although it is not always clear in the text that these are synonymous. This should be clarified.

      One major observation in this paper is that LRRK2 is not detected in gut epithelial cells as previously has been reported. It would be useful to comment on any differences between the presented protocol and the previous reports, in particular relating to the antigen retrieval step. In order to reinforce the finding, it would be useful to include in situ hybridization data that could further strengthen the observations of which cellular subtypes express LRRK2 and which do not. Indeed, while the KO control shows that there is an unacceptable high non-specific staining, it does not prove absence of expression. Also, can any conclusions be made about expression of LRRK2 in neural cells of the gut? This important information on LRRK2 detection in gut should be mentioned in the abstract and highlighted in the discussion. The authors mention in the discussion that they 'show for the first time that eosinophils also express active LRRK2 at levels comparable to B-cells and DCs.' The relevance of this finding should be further developed. Why is this important ?

      In the isolation of lamina propria cells, what efforts were made to characterize the degree of purification of the lamina propria cells compared to cells of other gut wall layers such as epithelium, muscularis mucosa, or deeper layers? Please specify.

      Minor comments

      Figure 5G, for the graphs indicating LRRK2 activity and LRRK2 phosphorylation, the specific measures should be specified in the graph titles to avoid any ambiguity (pT73-Rab10, pS935-LRRK2).

      Suppl figure 1 : please specify the figure label and abbreviation AF568 in the legend.

      Suppl figure 2 : please specify the figure label and abbreviation anti-rb in the legend

      Significance

      The authors present a flow cytometry methodology to asses LRRK2 epxression and pathway markers in mouse models and explore LRRK2 in splenic and intestinal immune cells. This is a highly valuable study given the emerging understanding that LRRK2 pathway activity in peripheral tissues may be of crucial importance to Parkinson's disease and Crohn's disease.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The paper by Dikovskaya and collaborators investigated the activitiy and expression of LRRK2 in different subtypes of splenic and intestinal immune cells, taking advantage of a novel GFP-Lrrk2 knockin mouse. Interestingly, they found that T-cell-released IL-4 stimulates Lrrk2 expression in B cells.

      I have a few comments and suggestions for the authors.

      1. Figure 1C. LRRK2 KO cells display residual Rab10 phosphorylation. Do the authors have any idea of which kinase other than LRRK2 could be involved in this phosphorylation?
      2. Since there are no good antibodies for IF/IHC as pointed by the authors, the GFP-Lrrk2 mouse gives the opportunity to check endogenous LRRK2 localization, i.e. in cells untreated or treated with IL-4 or other cytokines. Also, does endogenous GFP-LRRK2 accumulate into filaments/puncta upon MLi2 inhibition? The relocalizaiton into filaments of inhibited LRRK2 has been observed in overexpression but not under endogenous expression. This analysis would be interesting also in light of the observed side effect of type-I inhibitors.
      3. Figure 5. The authors need to label more clearly the graphs referring to wt mice versus GFP-Lrrk2 KI mice. They should also replace GFP-LRRK2 with GFP-Lrrk2 since they edited the endogenous murine gene.
      4. In the material and methods MLi-2 administration in mice is indicated at 60 mg/kg for 2 hr whereas in suppl. figure 5 the indicated dose is 30 mg/kg. Please correct with the actual dose used.
      5. The discovery of IL-4 as a Lrrk2 activator in B cells is a very interesting and novel finding. The authors could take advantage of the GFP tag to investigate LRRK2 interactome upon IL-4 stimulation (optional). Also, is the signaling downstream of IL-4 attenuated in Lrrk2 KO cells?

      Significance

      The manuscript is well designed and organized, and the experimental approaches are robust. These results are significant for the field as they add additional layers in the complex regulation and regulatory roles of LRRK2 in immunity, with implication for inflammatory disorders and Parkinson's disease.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The paper describes a set of experiments to analyse LRRK2 activity in tissues and despite it has very important findings and technical developments is largely descriptive. It does look like a collection of experiments more than a defined hypothesis and experiments to address that.

      The flow cytometry assay of the first part is a great technical challenge and represents the establishment of a potentially very useful tool for the field. It would have been important to test other organs, either as controls or for example because of their relevance e.g. lungs. This first part is disconnected from the second part below.

      The authors generated a new mouse KI mouse expressing EGFP-LRRK2 and show data the levels of LRRK2 expression are reduced in tissues at different degrees and established a flow cytometry assay to measure LRRK2 expression by monitoring the GFP sugnal. Interestengly they found that expression does not correlate with activity (as measured by phospho-Rabs). I suggest taking this part out as it breaks the flow of the paper. If data using this mouse is included, then microscopy should be included to complement the flow cytometry data. I understand the mice were used later with the anti-CD3 treatment, but it is very confusing that some experiments are done with EGFP-LRRK2 mice and others not. It does look in general like the mice do not behave as wild types and this is an important caveat. Without microscopy of the tissues or even cells (Figure 4) is hard to conclude much about these experiments.

      Then the authors show that LRRK2 expression and activity is different in different cell types and depends on inflammation. The anti-CD3 strategy to induce inflammation is very different from physiological inflammation such as sepsis and LPS stimulation, so experiments with other stimuli could be important here to contribute to the message of inflammatory trigger of LRRK2 activation and decoupling of cell type.

      The IL-4 data is intriguing but too preliminary. The lack of strong effect of IFN-gamma is expected as the promoter of LRRK2 in mice and humans is different and human cells responds much better with regards to LRRK2 expression after IFN-gamma stimulation.

      Significance

      The paper describes a set of experiments to analyse LRRK2 activity in tissues and despite it has very important findings and technical developments is largely descriptive. It does look like a collection of experiments more than a defined hypothesis and experiments to address that.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Evidence, reproducibility and clarity:

      In this work, Anandi et al. propose an ex vivo model that can be used to recapitulate the in vivo structure of the tumor microenvironment, which allows the observation of morphological and functional changes in tumor cells in a 3D context. Due to the ability of cancer cells to induce hypoxic condition within the TME, authors propose this model to tackle the study of metastasis initiation in vitro. The proposed system successfully displays an ischemic gradient with cells accessing nutrients at different rates, similarly to what happens in solid tumors in vivo. Moreover, in line with the literature, tumor cell migration and invasiveness were promoted by hypoxic conditions. Authors also show that the system could be used to study cell-cell interaction, as co-cultures of macrophages and cancer cells were successfully cultured in the system and studied in the context of tumor hypoxia.

      The study proposed is interesting and timely, as cancer cell invasion remains an important area of tumor biology that needs further exploration. The methodology is well explained and proposed in a linear flow. However, the work could benefit from some improvement and changes, as well as from additional experiments. On an important note, authors do not properly refer to the current literature, as several studies on 3D culture systems/chambers have already been studied and developed to investigate the tumor microenvironment, but they are not cited nor referred to in the manuscript. Authors should refer to such literature and explain how this system is different and adds to it.

      Major comments:

      1. Authors propose this method to study the TME in 3D. When culturing cells with different ECM (Collagen vs. matrigel+collagen I) authors should take into consideration the effect of these materials on different cell types. It is known how collagen and matrigel can differently influence the polarization and phenotype of stromal cells (particularly in regards of fibroblasts - major components of solid cancers - e.g., PMID 21029367), therefore these points should be addressed at least in the discussion.

      We completely agree with the reviewer so we added this point (and reference) to our manuscript's introduction (lines 45-46) and discussion (lines 442-445).

      1. In addition to the previous comment, matrigel and collagen are also known to alter cancer cell phenotype (e.g., PMID 21029367) and this point should be taken into account.

      We completely agree with the reviewer so we added this point (and reference) to our discussion in the main text (lines 442-445).

      1. The need for novel 3D systems to study different aspects of the TME in vitro/ex vivo are certainly needed, however they are not inexistent. Authors should address this in the text, as the current literature already started to propose 3D models (including models involving matrigel/collagen in combination with other materials). 3D chambers (of different materials, and with different aims) are being used and designed and can be found in the literature. These works are not cited in the current study at all. For instance, Anguiano 2017; Cavo et al. 2018; Anguiano et al., 2020; Sodek et al. 2008, etc.

      We agree so we have now added those references to the main text (line 56-57).

      1. Even though the focus is on hypoxia and the achievement of an ischemic gradient in the chamber to allow resemblance of an in vivo tumor, the authors write in line 123 (and also in other parts of the text) that: "these results show that consumer cells in the 3MIC form ischemic gradients that can influence the local metabolic microenvironment experienced by neighboring tumor spheroids". The addition of the use of the PMDS membrane partly supports the claim, however it would be interesting to check whether this is indeed true, by measuring for example the levels of certain metabolites (e.g., glucose, glutamine, glutamate, lactate, aspartate) reached with the system, or pH levels, etc., in presence or absence of the hypoxic gradient/consumer cells.

      This is an insightful question and defining the exact composition of this complex ischemic microenvironment is a major ambition of our lab, so we completely agree with the reviewer's comment. However, as the 3MIC was designed specifically for microscopy, measuring specific metabolites it is unfortunately outside its capabilities.

      Having said that, and following the spirit of the reviewer's comments, we used microscopy to measure additional signs of metabolic stress. Specifically, we used fluorescent probes to detect changes in intracellular pH (pHrodo, Molecular Probes) and in Redox status (CellROX, Molecular Probes) and glucose (2-NBDG - a fluorescent D-glucose analog). As we explain below, we found exciting results from our pH measurements which led us to additional functional experiments. We are very excited about these new results, and we thank the reviewer for encouraging these experiments. These new results also provide evidence that other parameters in ischemia - and not just hypoxia - change along the 3MIC and can have an impact on tumor cells.

      1. When looking at the references presented in the manuscript, authors quote too many review articles, rather than scientific articles. Given the extremely wide literature on cancer metastasis, more of these works should be quoted in this context. For example: in the introduction - text lines 27-38 - only 4 references are research articles, out of 14 references presented in that paragraph.

      The reviewer is correct in pointing this out. Our intention was to use reviews on topics that are well established where citing primary research could be unfair to other contributions. But again, we agree with the reviewer, so we replaced reviews with primary research articles in multiple locations along the manuscript.

      1. As authors showed successfully how macrophages and cancer cells can interact in the chamber, recapitulating cell interactions in an in vivo context, it would be very interesting to see whether different consumer cells would induce similar or different changes to the spheroids and the ischemic gradient (for instance using stromal cells or non-tumor cell lines as consumers, instead of cancer cells only), as we know how tumors are a multitude of cell subsets, each contributing to nutrient production, oxygen consumption, etc.

      This is a great point. We thought about that very same point and conducted several experiments to test the combinatorial effects of different consumer cells. In broad terms, we did not observe major differences when using different consumer cells. However, we agree that this system may provide compelling opportunities to test the effect of different cell types on each other. Still, for consistency and ease, we conducted most of our experiments using the same cells in both consumers and in spheroids.

      In the resubmitted version, we added an experiment where we looked at the sprouting of SVEC endothelial cells using the same cells or Lung KPs as consumers (Fig. S6A).

      Minor comments:

      1. Studying the early metastatic development/seeding remains a timely quest, however authors should refer to several new studies in which various mouse models are used to study metastasis from different points of view (e.g., PMID 25822788; PMID 36991128; PMID 25171411; PMID 25633981; PMID 34632412; PMID 35921474; etc). Or line 41, three reviews are quoted (refs 27-29), whilst there are several works that could be quoted on metabolism in solid tumors also in the context of metastasis (e.g., PMID 36522548; PMID: 26719539, PMID 34303764). This comment applies to the rest of the text.

      We thank the reviewer for their help in processing this vast literature. We were aware of most of those works but some were new to us so thanks again! We have now added these references.

      1. The order of the references is not properly presented. In the introduction, the first reference is n. 4 (text line 22), instead of it being reference 1. Moreover, the subsequent literature ref. is number 12 and not number 2. Please revise the order of the references, and position them within the bibliography from first cited to last cited in the text.

      We apologize for this confusion. We have now revised all the references and we hope they are correctly formatted and numbered. The origin of this confusion may have been that we had references in the abstract thus their numbering started there rather than from the introduction. To avoid further confusions, we removed all references from the abstract.

      1. Lines 98-104. It would be helpful to the reader to define here what these consumer cells are. Even though it is explained in the methods that the consumer cells are cancer cells, it is important to make it clear in the text, as it could be misleading at times.

      We agree with the reviewer although we did not mean to be misleading. As mentioned above, we chose to use the same cells for both: consumers and spheroids and we have now added a new figure to illustrate this point (Fig S6A). Following the advice, we are also including additional text to make the message clearer (lines 107-109).

      1. The English grammar and spelling should be revised in some parts, as well as typos and missing words throughout the text (e.g., Line 38, the word "interraction" is misspelled and should be corrected with "interaction". Line 49, the first sentence seems incomplete. Lines 68-69 should be revised as the sentences do not flow well together, probably due to a missing word. In line 77 it should be "presents". Line 341 should be "cannot be explained").

      We apologize for these typos and mistakes. We have tried our best to avoid these type of errors in the new manuscript version.

      Referees cross-commenting

      I find the comments from the other reviewers to be in line with one another as well as with my general assessment. The major and comments of all reviewers should be addressed. The minor comments should be taken into account as well, as they would render the text and the figures more precise. I suggest that 3-6 months to complete the revision process is an appropriate time frame for the authors.

      Finally, I strongly encourage the authors to add in the discussion the points and questions raised by all reviewers, as well as to improve the bibliography in terms of organisation, linearity, and state of the art.

      Significance:

      General assessment:

      The work by Anandi et al. offers an additional tool to tackle the issue of studying the tumor microenvironment, in a 3D culture system.

      The authors show a model that can be used to study tumor hypoxia in 3D, offering the possibility to study the TME in a more in vivo-like manner without turning to mice models. The development of new tools to study the TME avoiding the excessive use of animals is definitely a timely quest. In addition, the system has the potential to be applied to tackle different biological questions, as the methodology is well explained and could be suitable to many other fields of cancer biology (e.g., drug resistance or uptake). The work is overall presented in a clear way and the methodology is explained thoroughly and it has the potential to be a useful tool for the study of cancer hypoxia.

      However, authors should address how their method could differently impact other cells when applied to other systems. As one major claim is the potential use of this methodology to study the TME, it should be taken into consideration how stromal cells are strongly affected by the ECM, and how certain settings or features of the system may impact such cell populations. In addition, the work does not properly refer to the current state of the art. As other studies started to propose 3D systems for the study of TME and cell-cell interactions - besides organoids - the authors should cite these works and frame their own study in a more appropriate context, pointing out differences with the current 3D chambers available, the advantages of one vs the other, and so on.

      Advance: the study adds to the current literature as the study of tumor hypoxia in 3D remains a complicated issue. The interesting co-culture settings with macrophages suggests potential uses of this model to study cell-cell interactions.

      Audience: the study is very methodological and offers a tool that could be used by cancer biologists - and maybe by other biology fields.

      Reviewer #2

      Evidence, reproducibility and clarity:

      Summary

      Anandi and colleagues present a manuscript describing a nice assay for exploring the progressive effect of metabolic depletion of the nutrients and oxygen on the invasion of cancer cells. This builds upon and extends a device that they previously described - MEMIC - and now enables 3D analysis of small numbers of cells. The key to their method is the inclusion of a layer of consumer cells that deplete oxygen and nutrients. Using this tool, they demonstrate that depleted environments promote invasive behavior and lower cell-cell adhesion. This is related to the nutrient-deprived and hypoxic environments found in the center of many tumors. Cellular Potts Modelling is used to explore ideas around the cooperation between reduced cell-cell adhesion and increase ECM adhesion in promoting invasion. Overall, this is a well-constructed manuscript that will be of interest to cell biologists and cancer biologists.

      Major comments

      I realize this work is submitted to review commons and this complicates the recommendation regarding publication. My view is that the 'more prestigious' journals would require greater mechanistic insight, but that the work could find a suitable place in other members of the review commons stable. My comments are divided into those essential for any journal and those that might be journal dependent.

      We hope that the mechanistic experiments added to our new manuscript version will appeal the reviewer and merit publication in any of the review commons journals.

      Essential regardless of journal

      1. Many of the figures lack information about the number of spheroids analyzed and from how many biological repeats they are derived.

      We have now added this information to all our experiments. This information can be found in the figures and on the figure legends.

      1. The authors need to provide citations for their assertion that only gases can cross the PDMS, but not other small metabolites. They should also comment on whether the build-up of CO2 might be relevant.

      We have now added the original reference where they describe PDMS's properties (Cox and Dunn, 1986).

      The point raised about CO2 is very interesting, but we do not expect a buildup of this gas. When using PDMS, CO2 would not accumulate as PDMS membranes are permeable to gases - including CO2. When using glass covers, the lack of oxygen should minimize CO2 production as hypoxic cells will not be able to conduct oxidative phosphorylation and produce lactic acid instead.

      1. The data on the directionality of migration when consumers are present are not significant and doesn't warrant the speculation in lines 186-189.

      Following the reviewer's advice we have removed this speculation.

      1. The ECM degradation in Figure 3 should be quantified.

      We agree. We added additional quantifications for the gelatin degradation assay. We also highlight the quantification we already had of the ECM degradation assessed via DQ collagen. Those data can be found in the new figures 4 and S4, respectively.

      1. Do the authors have evidence that the hypoxia-exposed cells are more adhesive to ECM. This is central to their Potts model and I could not locate the supporting experimental data. If not, then the Potts model should include matrix proteolysis, which they do have data about.

      Again, this is a very insightful observation, and we completely understand this confusion. We think that this may part of the inherent challenge of trying to condense biological problems into analogies or "metaphors" when using physical/mathematical models.

      The algorithm in a Cellular Potts model (CPM) tries to minimize the energy of the system (the entire group of cells/ECM that we are modelling). This global energy reduction is achieved by minimizing local energies in the cell-cell and cell-ECM interactions. The way the algorithm executes this minimization, is by always (probability p=1) accepting a configuration that decrease the energy while restricting the configurations that lead to higher energies (with a probability of p = e-DHT) where DH is the difference between the current and previous energy.

      So, the only thing the model is really doing is to increase the likelihood that cells are in a more "comfortable" environment - i.e. that the energy from the interactions with their neighboring cells and ECM is as low as possible. For example, if cell 1 and cell 2 adhere strongly but not to cell 3, in a CPM this is modelled as a low DH between cell 1 and 2 and a higher DH with cell 3. Conversely, when people model cells better at "invading" into a new "territory" they choose a lower energy between that cell type and that type of substratum.

      In other words, our CPM does not "care" whether ischemic cells invade the ECM because they create space through increased proteolysis or because they are more adherent to the ECM. These two scenarios are the same in a CPM and it is consistent with previous CPM models of similar scenarios (e.g.: PMID: 18835895, 33933478, 26436883, 23596570).

      We have now reworded the description of the model on the main text, and we added an illustration hoping to make this aspect of the model clearer (Fig. S4F).

      1. Is the down-regulation of E-cadherin transcriptional - i.e. is the mRNA level reduced?

      This is a great question. After the reviewer posed this question, we looked at out data and we concluded E-cad's downregulation is transcriptional. Assessing local mRNA levels in the 3MIC is challenging. However, our E-Cad reporter (pHAGE-E-cadherin-RFP, addgene #79603) is a red fluorescent protein driven by the CDH1 (E-Cad) reporter. RFP levels decrease with ischemia indicating that this regulation occurs at the promoter/transcriptional level. We now added this point to the revised manuscript (lines 259-261). We thank the reviewer for this insight!

      1. The title of figure 6 is misleading. The authors do not demonstrate chemoresistance in terms of cell survival or cell proliferation, which is how the term is normally used. The authors should measure cell number, proliferation, and cell viability. The data presented in the Supplementary Figure are inadequate with no quantification. The FUCCI reporter cells would be a good tool for this. Also, why use 150nM paclitaxel when the IC50 is 817nM? This seems bizarre. Lastly, there is a typo in the figure that suggest 150mM drug was used.

      We apologize if these experiments caused confusion. Our intention was to look at the anti-migratory effects of Taxol-related drugs. As such, we first determined the concentrations at which the drug was lethal to our cells (this is the LD50 of ~800nM). Then, we tested if lower concentrations - which we knew where not lethal - would affect cell migration, protrusions, etc. Hence the 30-150nM range we used in our experiments.

      We have now completely rewritten this section hoping that our approach is now clearer. We have also changed the title of the section and the figure legend to clarify that we are studying the effects of Taxol as anti-motility drug rather than its effects on cell survival and proliferation (now Fig. 7). Finally, we have now fixed the 150mM/150nM typo in the figure legend.

      Journal dependent

      1. The authors have not excluded that either changes in nutrients, or even a pro-invasive factor, produced by consumer cells are necessary for the increased invasion. They have only shown that they are not sufficient. The authors should perform a series of experiments comparing hypoxic conditions with normal media and normoxic conditions with nutrient depleted/condition media by prior culturing of KP cancer cells.

      This is a great point. We actually do not want or propose to exclude this possibility. So, we have now added text to clarify this issue (lines 431-435).

      In fact, we would be thrilled if there is a pro-invasive factor. If that would be the case, our results indicate this factor is only effective under ischemia. Because the same consumer cells do not have an effect on the same type of tumor spheroids under well-nurtured environments. In addition, our new pH measurements and perturbations experiments agree with this reviewer's intuition about additional factors being key in the increased invasion (see new Figure 2). We are very excited about these new results, and we hope this reviewer will be excited too.

      1. What is the oxygen sensor for increased invasion? PHD1-3 would be a good place to start looking. Is the PHD2-HIF axis important? Do VHL mutant cells still show responses to the consumer cells?

      Following the reviewer's feedback, we generated isogenic HIF1A KO cell lines to study whether HIF1A was directly needed in the invasion of tumor spheroids within the 3MIC. We complemented these loss-of-function experiments with For HIF1A gain-of-function using pharmacological interventions that stabilize HIF1A under normal oxygen levels (CoCl2 and DMOG).

      As shown in the new figure 2, these experiments mirrored our hypoxia experiments: HIF1A activity was not sufficient but it was required to drive the invasion of ischemic spheroids. We think that these new results are particularly interesting when taken together with our new pH-perturbation experiments. Briefly, our new experiments results show that in addition to the requirement of hypoxia/HIF1A, media acidity also has a strong effect on spheroid invasion. More excitingly, a drop in pH is sufficient to dramatically increase invasion - even in control well-nurtured spheroids. We think that the effects of pH and hypoxia are linked. HIF1A activation and hypoxia the increase glycolysis and thus lactic acid secretion. We speculate that this glycolytic switch is where hypoxia is important, but it is not sufficient because under well-perfused conditions (e.g. healthy tissue or large culture media volume) lactic acid levels may not buildup enough to significantly lower the extracellular pH. In contrast, under poor perfused conditions (3MIC and solid tumors) or if we flood cell cultures with lactic acid, the media's pH drops dramatically (Fig. 2).

      1. If they include both spheroids of endothelial cells and cancer cells, will the resulting protrusions in hypoxia grow towards each other? Would macrophages enhance this process?

      We agree with the reviewer this is an interesting question and we have anecdotally observed this effect. In the manuscript, we used these chimeric endothelial/tumor spheroids rather than separate ones (Fig. 6E). We do not find strong evidence that their protrusions grew towards each other, but this is something that we would like to explore in the future with more detail.

      Significance:

      The main advance is technical, as many previous studies have related hypoxia to increased cancer cell invasion, which the authors correctly acknowledge and cite. It is scholarly study, which will be of interest to many readers, and the method reported is likely to be adopted by several groups.

      Reviewer #3

      Evidence, reproducibility and clarity:

      In this work, Anandi et al., developed a cell culture system to live image the initial transformation of cells in deprivation of oxygen and nutrients in a 3D context. Using this system, 3MIC, they were able to create oxygen and nutrient gradients to simulate ischemic conditions that arise deep within tumors and that typically precede metastasis. With the 3MIC system they validated that ischemia triggers cell migration and invasion of tumor cells. In addition, 3MIC also allowed them to study the interaction of tumor spheroids with stromal cells such as macrophages and endothelial cells. Interestingly, the authors showed that co-culturing tumor spheroids with stromal cells increased the pro-metastatic features induced by ischemia conditions. Lastly, using 3MIC allowed the authors to discern that a poor paclitaxel response in ischemic-like cells is driven by intrinsic cellular resistance rather than due to lower drug concentration.

      Overall, the work is very well-written, and the results are consisting, convincing and support the conclusions. The methods are clear and complete and allow the reproducibility of the experiments. The experiments are adequately replicated and statistical analyses are well described. However, I have few suggestions to improve the impact of the manuscript:

      1. The authors conclude that 3MIC results in the accumulation of lactic acid and nutrient deprivation in an increasing manner when moving far from the opening site. Is there a way to actually show this? So far, the authors employ a hypoxia sensor only. A sensor for internal pH or other method for nutrient deprivation would help to support the conclusion and further validate the model.

      This is an excellent point. Following the reviewer's feedback, we tested additional sensors including for extra- and intra-cellular pH. As mentioned above, we observed dramatic changes in extracellular pH levels. We followed up these observations with a series of experiments that showed a key functional role for media acidification in driving invasion (Figure 2).

      1. According to figure S3E, the main cell line used by the authors is already quite mesenchymal. It would be good to know if the results showed here are consistent in cells with a more basal epithelial phenotype. Do epithelial cells need stronger ischemic conditions to undergo phenotypic changes?

      This is a great catch. To explore this further, we run a Western Blot analysis to compare epithelial and mesenchymal markers expressed by the main cells we used here (Lung KPs) and to compare them to levels in a stereotypical epithelial (MCF-7) and a mesenchymal (MDA-MB-231) cell line (new Fig. S4D). As the reviewer correctly points out, we do see that E-Cad and Vimentin are co-expressed in KP cells.

      So far, our observations in a range of cell lines are a consistent decrease in E-Cad levels with no significant effects in vimentin levels - regardless of the basal levels of this protein.

      Interestingly, a recent study[1] demonstrated in triple-negative breast cancer models, that an EMT hybrid phenotype - including the presence of Vimentin - is required for metastasis. A compelling hypothesis then is that ischemia in the tumor microenvironment may favor these hybrid phenotypes. We briefly discuss this topic in the revised version of this manuscript.

      1. The number of replicates should be included in each figure legend and not only in the methods section. From data presented it is not clearly stated what do points mean in boxplots (e.g, Fig1H, 2B,G...). How many cells/spheroids did the authors count in each experiment?

      We have now added this information to all our experiments. This information can be found in the figures and on the figure legends.

      1. Figure 3B is not mentioned in the main text.

      We apologize for this error, and we thank the reviewer for catching this issue, which have now corrected.

      1. Line 295: "In the absence of macrophages, clusters of endothelial cells remained mostly rounded, even in the presence of consumer cells and regardless of their location along the ischemic gradient (Fig. 5A; Video S6)." However, in Video S6, both images show endothelial cells co-cultured with macrophages. I consider that Video S6 should be not referenced here.

      The reviewer is correct so have removed that reference.

      1. References style should be homogeneous (e.g, in Ref 13 appears "Nature Reviews Cancer" whereas in Ref 14 "Nat Rev Cancer"). Also, in Ref 25, the journal is missing.

      We apologize for this oversight, and we have not tried to be more consistent in our references.

      1. In plots where distance to open chamber site is not especify (e.g. 6B), at what distance were the data recorded? Please, indicate in the figure legend.

      We have now added this information to our figures.

      1. In the experiment showed in Fig 4, the sorting strategy would include stromal cells such as fibroblasts and endothelial cells in the GFP- population (as only CD45+ cells are removed). These cells will likely also grow in the 3MIC system and have an effect in migration. Can the authors rule out this confounding effect?

      The reviewer is correct. We still think that the possibility of fibroblast contamination is low. First, the fluorescence of HRE-GFP cells under normoxic, is still higher than the autofluorescence of cells not expressing this constructs (such as fibroblasts). This is quite normal as most sensors/reporter have some leakage and thus there is a small amount of transcription. Second, intradermal and subcutaneous tumors are quite poor in fibroblasts. In fact, to study the role of fibroblasts in these tumors, they are usually co-injected with tumor cells (PMID: 20138012). Third, in the process of tumor dissociation and in vitroestablishment, non-transformed cells tend to die more. Since these are more technical points, we moved the cell sorting details to the material and methods section.

      1. In Fig 5C the panel of proximal + macrophages is missing

      We apologize for this mistake, and we have corrected in the new version of the manuscript.

      1. In Fig. 5, Linifanib is used to study the effect of blocking VEGF. Linifanib can also interact with RTKs and PDGF. This fact should be acknowledged.

      We agree with this point. Following the reviewer's advice, we now acknowledged the potential off-target effects of these inhibitors (lines 354-355).

      Significance

      This is a very interesting work with the development of a simple and cost-effective system that allows to continuously monitor biological processes in 3D cultures under nutrient-modified conditions. In general, these data would be broadly interesting to cancer community in general, as 3MIC is a very versatile system, where several aspects can be studied and precisely discerned.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this work, Anandi et al., developed a cell culture system to live image the initial transformation of cells in deprivation of oxygen and nutrients in a 3D context. Using this system, 3MIC, they were able to create oxygen and nutrient gradients to simulate ischemic conditions that arise deep within tumors and that typically precede metastasis. With the 3MIC system they validated that ischemia triggers cell migration and invasion of tumor cells. In addition, 3MIC also allowed them to study the interaction of tumor spheroids with stromal cells such as macrophages and endothelial cells. Interestingly, the authors showed that co-culturing tumor spheroids with stromal cells increased the pro-metastatic features induced by ischemia conditions. Lastly, using 3MIC allowed the authors to discern that a poor paclitaxel response in ischemic-like cells is driven by intrinsic cellular resistance rather than due to lower drug concentration.

      Overall, the work is very well-written, and the results are consisting, convincing and support the conclusions. The methods are clear and complete and allow the reproducibility of the experiments. The experiments are adequately replicated and statistical analyses are well described. However, I have few suggestions to improve the impact of the manuscript:

      1. The authors conclude that 3MIC results in the accumulation of lactic acid and nutrient deprivation in an increasing manner when moving far from the opening site. Is there a way to actually show this? So far, the authors employ a hypoxia sensor only. A sensor for internal pH or other method for nutrient deprivation would help to support the conclusion and further validate the model.
      2. According to figure S3E, the main cell line used by the authors is already quite mesenchymal. It would be good to know if the results showed here are consistent in cells with a more basal epithelial phenotype. Do epithelial cells need stronger ischemic conditions to undergo phenotypic changes?
      3. The number of replicates should be included in each figure legend and not only in the methods section. From data presented it is not clearly stated what do points mean in boxplots (e.g, Fig1H, 2B,G...). How many cells/spheroids did the authors count in each experiment?
      4. Figure 3B is not mentioned in the main text.
      5. Line 295: "In the absence of macrophages, clusters of endothelial cells remained mostly rounded, even in the presence of consumer cells and regardless of their location along the ischemic gradient (Fig. 5A; Video S6)." However, in Video S6, both images show endothelial cells co-cultured with macrophages. I consider that Video S6 should be not referenced here.
      6. References style should be homogeneous (e.g, in Ref 13 appears "Nature Reviews Cancer" whereas in Ref 14 "Nat Rev Cancer"). Also, in Ref 25, the journal is missing.
      7. In plots where distance to open chamber site is not especify (e.g. 6B), at what distance were the data recorded? Please, indicate in the figure legend.
      8. In the experiment showed in Fig 4, the sorting strategy would include stromal cells such as fibroblasts and endothelial cells in the GFP- population (as only CD45+ cells are removed). These cells will likely also grow in the 3MIC system and have an effect in migration. Can the authors rule out this confounding effect?
      9. In Fig 5C the panel of proximal + macrophages is missing
      10. In Fig. 5, Linifanib is used to study the effect of blocking VEGF. Linifanib can also interact with RTKs and PDGF. This fact should be acknowledged.

      Significance

      This is a very interesting work with the development of a simple and cost-effective system that allows to continuously monitor biological processes in 3D cultures under nutrient-modified conditions. In general, these data would be broadly interesting to cancer community in general, as 3MIC is a very versatile system, where several aspects can be studied and precisely discerned.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      Anandi and colleagues present a manuscript describing a nice assay for exploring the progressive effect of metabolic depletion of the nutrients and oxygen on the invasion of cancer cells. This builds upon and extends a device that they previously described - MEMIC - and now enables 3D analysis of small numbers of cells. The key to their method is the inclusion of a layer of consumer cells that deplete oxygen and nutrients. Using this tool, they demonstrate that depleted environments promote invasive behavior and lower cell-cell adhesion. This is related to the nutrient-deprived and hypoxic environments found in the center of many tumors. Cellular Potts Modelling is used to explore ideas around the cooperation between reduced cell-cell adhesion and increase ECM adhesion in promoting invasion. Overall, this is a well-constructed manuscript that will be of interest to cell biologists and cancer biologists.

      Major comments

      I realize this work is submitted to review commons and this complicates the recommendation regarding publication. My view is that the 'more prestigious' journals would require greater mechanistic insight, but that the work could find a suitable place in other members of the review commons stable. My comments are divided into those essential for any journal and those that might be journal dependent. Essential regardless of journal

      • Many of the figures lack information about the number of spheroids analyzed and from how many biological repeats they are derived.
      • The authors need to provide citations for their assertion that only gases can cross the PDMS, but not other small metabolites. They should also comment on whether the build-up of CO2 might be relevant.
      • The data on the directionality of migration when consumers are present are not significant and doesn't warrant the speculation in lines 186-189.
      • The ECM degradation in Figure 3 should be quantified.
      • Do the authors have evidence that the hypoxia-exposed cells are more adhesive to ECM. This is central to their Potts model and I could not locate the supporting experimental data. If not, then the Potts model should include matrix proteolysis, which they do have data about.
      • Is the down-regulation of E-cadherin transcriptional - i.e. is the mRNA level reduced?
      • The title of figure 6 is misleading. The authors do not demonstrate chemoresistance in terms of cell survival or cell proliferation, which is how the term is normally used. The authors should measure cell number, proliferation, and cell viability. The data presented in the Supplementary Figure are inadequate with no quantification. The FUCCI reporter cells would be a good tool for this. Also, why use 150nM paclitaxel when the IC50 is 817nM? This seems bizarre. Lastly, there is a typo in the figure that suggest 150mM drug was used.

      Journal dependent

      • The authors have not excluded that either changes in nutrients, or even a pro-invasive factor, produced by consumer cells are necessary for the increased invasion. They have only shown that they are not sufficient. The authors should perform a series of experiments comparing hypoxic conditions with normal media and normoxic conditions with nutrient depleted/condition media by prior culturing of KP cancer cells.
      • What is the oxygen sensor for increased invasion? PHD1-3 would be a good place to start looking. Is the PHD2-HIF axis important? Do VHL mutant cells still show responses to the consumer cells?
      • If they include both spheroids of endothelial cells and cancer cells, will the resulting protrusions in hypoxia grow towards each other? Would macrophages enhance this process?

      Significance

      The main advance is technical, as many previous studies have related hypoxia to increased cancer cell invasion, which the authors correctly acknowledge and cite. It is scholarly study, which will be of interest to many readers, and the method reported is likely to be adopted by several groups.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      In this work, Anandi et al. propose an ex vivo model that can be used to recapitulate the in vivo structure of the tumor microenvironment, which allows the observation of morphological and functional changes in tumor cells in a 3D context. Due to the ability of cancer cells to induce hypoxic condition within the TME, authors propose this model to tackle the study of metastasis initiation in vitro. The proposed system successfully displays an ischemic gradient with cells accessing nutrients at different rates, similarly to what happens in solid tumors in vivo. Moreover, in line with the literature, tumor cell migration and invasiveness were promoted by hypoxic conditions. Authors also show that the system could be used to study cell-cell interaction, as co-cultures of macrophages and cancer cells were successfully cultured in the system and studied in the context of tumor hypoxia. The study proposed is interesting and timely, as cancer cell invasion remains an important area of tumor biology that needs further exploration. The methodology is well explained and proposed in a linear flow. However, the work could benefit from some improvement and changes, as well as from additional experiments. On an important note, authors do not properly refer to the current literature, as several studies on 3D culture systems/chambers have already been studied and developed to investigate the tumor microenvironment, but they are not cited nor referred to in the manuscript. Authors should refer to such literature and explain how this system is different and adds to it.

      Major comments:

      • Authors propose this method to study the TME in 3D. When culturing cells with different ECM (Collagen vs. matrigel+collagenI) authors should take into consideration the effect of these materials on different cell types. It is known how collagen and matrigel can differently influence the polarization and phenotype of stromal cells (particularly in regards of fibroblasts - major components of solid cancers - e.g., PMID 21029367), therefore these points should be addressed at least in the discussion.
      • In addition to the previous comment, matrigel and collagen are also known to alter cancer cell phenotype (e.g., PMID 21029367) and this point should be taken into account.
      • The need for novel 3D systems to study different aspects of the TME in vitro/ex vivo are certainly needed, however they are not inexistent. Authors should address this in the text, as the current literature already started to propose 3D models (including models involving matrigel/collagen in combination with other materials). 3D chambers (of different materials, and with different aims) are being used and designed and can be found in the literature. These works are not cited in the current study at all. For instance, Anguiano 2017; Cavo et al. 2018; Anguiano et al., 2020; Sodek et al. 2008, etc.
      • Even though the focus is on hypoxia and the achievement of an ischemic gradient in the chamber to allow resemblance of an in vivo tumor, the authors write in line 123 (and also in other parts of the text) that: "these results show that consumer cells in the 3MIC form ischemic gradients that can influence the local metabolic microenvironment experienced by neighboring tumor spheroids". The addition of the use of the PMDS membrane partly supports the claim, however it would be interesting to check whether this is indeed true, by measuring for example the levels of certain metabolites (e.g., glucose, glutamine, glutamate, lactate, aspartate) reached with the system, or pH levels, etc., in presence or absence of the hypoxic gradient/consumer cells.
      • When looking at the references presented in the manuscript, authors quote too many review articles, rather than scientific articles. Given the extremely wide literature on cancer metastasis, more of these works should be quoted in this context. For example: in the introduction - text lines 27-38 - only 4 references are research articles, out of 14 references presented in that paragraph.
      • As authors showed successfully how macrophages and cancer cells can interact in the chamber, recapitulating cell interactions in an in vivo context, it would be very interesting to see whether different consumer cells would induce similar or different changes to the spheroids and the ischemic gradient (for instance using stromal cells or non-tumor cell lines as consumers, instead of cancer cells only), as we know how tumors are a multitude of cell subsets, each contributing to nutrient production, oxygen consumption, etc.

      Minor comments:

      • Studying the early metastatic development/seeding remains a timely quest, however authors should refer to several new studies in which various mouse models are used to study metastasis from different points of view (e.g., PMID 25822788; PMID 36991128; PMID 25171411; PMID 25633981; PMID 34632412; PMID 35921474; etc). Or line 41, three reviews are quoted (refs 27-29), whilst there are several works that could be quoted on metabolism in solid tumors also in the context of metastasis (e.g., PMID 36522548; PMID: 26719539, PMID 34303764). This comment applies to the rest of the text.
      • The order of the references is not properly presented. In the introduction, the first reference is n. 4 (text line 22), instead of it being reference 1. Moreover, the subsequent literature ref. is number 12 and not number 2. Please revise the order of the references, and position them within the bibliography from first cited to last cited in the text.
      • Lines 98-104. It would be helpful to the reader to define here what these consumer cells are. Even though it is explained in the methods that the consumer cells are cancer cells, it is important to make it clear in the text, as it could be misleading at times.
      • The English grammar and spelling should be revised in some parts, as well as typos and missing words throughout the text (e.g., Line 38, the word "interraction" is misspelled and should be corrected with "interaction". Line 49, the first sentence seems incomplete. Lines 68-69 should be revised as the sentences do not flow well together, probably due to a missing word. In line 77 it should be "presents". Line 341 should be "cannot be explained").

      Referees cross-commenting

      I find the comments from the other reviewers to be in line with one another as well as with my general assessment. The major and comments of all reviewers should be addressed. The minor comments should be taken into account as well, as they would render the text and the figures more precise. I suggest that 3-6 months to complete the revision process is an appropriate time frame for the authors. Finally, I strongly encourage the authors to add in the discussion the points and questions raised by all reviewers, as well as to improve the bibliography in terms of organisation, linearity, and state of the art.

      Significance

      General assessment:

      The work by Anandi et al. offers an additional tool to tackle the issue of studying the tumor microenvironment, in a 3D culture system. The authors show a model that can be used to study tumor hypoxia in 3D, offering the possibility to study the TME in a more in vivo-like manner without turning to mice models. The development of new tools to study the TME avoiding the excessive use of animals is definitely a timely quest. In addition, the system has the potential to be applied to tackle different biological questions, as the methodology is well explained and could be suitable to many other fields of cancer biology (e.g., drug resistance or uptake). The work is overall presented in a clear way and the methodology is explained thoroughly and it has the potential to be a useful tool for the study of cancer hypoxia.

      However, authors should address how their method could differently impact other cells when applied to other systems. As one major claim is the potential use of this methodology to study the TME, it should be taken into consideration how stromal cells are strongly affected by the ECM, and how certain settings or features of the system may impact such cell populations. In addition, the work does not properly refer to the current state of the art. As other studies started to propose 3D systems for the study of TME and cell-cell interactions - besides organoids - the authors should cite these works and frame their own study in a more appropriate context, pointing out differences with the current 3D chambers available, the advantages of one vs the other, and so on.

      Advance: the study adds to the current literature as the study of tumor hypoxia in 3D remains a complicated issue. The interesting co-culture settings with macrophages suggests potential uses of this model to study cell-cell interactions.

      Audience: the study is very methodological and offers a tool that could be used by cancer biologists - and maybe by other biology fields.

    1. Note: This response was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We would first like to thank the reviewers for their careful reading and thoughtful feedback.

      We have substantially revised the manuscript and included additional experimental evidence on O-GlcNAc and OGT/OGA protein levels in the placenta of embryos bearing the OGT-Y851A hypomorphic mutation.

      Overall, we believe our improved manuscript provides compelling evidence that the glycosyltransferase activity of OGT, and thus the O-GlcNAc modification itself, plays a sexually dimorphic function in placental development and the developmental repression of retrotransposons in the developing embryo.

      We have addressed each of the reviewers' comments below. The original comments (C) are in italic, our responses (R) in Roman font.

      Reviewer #1

      Evidence, reproducibility and clarity

      C1: Formichetti at el. developed mice with OGT catalytic dead mutations and then studied their function during early embryogenesis. Not surprisingly, dramatic reduction in OGT activity failed to produce embryos; however, mild reduction in OGT did produce animals. The authors then use the T931 animals that have a mild reduction in activity to further characterize the function in the early embryo. Not surprisingly, male mice showed changes in gene expression, implantation sub-lethality, and an uptick in loss of retrotransposon silencing. The authors also show that an even milder reduction in OGT activity (Y851A) effects male placenta function and chromatin remodeling. Finally, the authors make a less stable OGT transgene within the mouse and again found embryogenesis issues in the males and alterations in numerous gene families including mTOR signaling and p53 function. All in all, this is an interesting study that track functions of OGT in early embryonic development. The studies are well-controlled and rigorous.

      R1: We thank the reviewer for their clear understanding and their appreciation of the rigor and impact of this work.

      Significance

      C2: This is a good study and novel. Not only is it of interest to reproductive biologist, but it echos themes found in O-GlcNAc biology.

      R1: We are pleased that the reviewer underlined the novelty of the study and its impact across fields.

      Reviewer #2

      Evidence, reproducibility and clarity

      Comments to authors

      C3: To investigate the function of OGT at specific developmental stages, the authors perturbed OGT's function in vivo by creating a murine allelic series featuring four single amino acid substitutions that variably reduced OGT's catalytic activity. The goal was to identify the direct effect of O-GlcNAcylation, using a sophisticated collection of genetic mutants to evaluate in vivo the role of this modification at early stages of development. Overall, the severity of embryonic lethality correlated with the extent of catalytic impairment of OGT, demonstrating that the O-GlcNAc modification is essential for early development.The study represents a substantial advance in our understanding of OGT and O-GlcNAcylation in mammalian development. The creation of novel murine models and inducible systems is an important contribution, providing powerful tools for future research in this field. The insights into the role of OGT's catalytic activity and its involvement in epigenetic regulation during embryonic development are noteworthy, opening new avenues for research.

      R3: We thank the reviewer for their insightful comments. We are grateful for the supporting statements. Please find below detailed response to all your comments.

      However, there are a few considerations and concerns:

      Major:

      C4: 1. An assumption of the study is that different mutations cause different levels of O-GlcNAcylation rather than alterations in substrate specificity. It might be important to test, at least in cultured cells, that the different mutations do not change the preference of OGT to modify certain proteins rather than others, which can provide alternative explanations for their findings.

      R4: Thanks for asking this question, it helped us to better explain the rationale behind the choice of the Ogt amino-acid substitutions.

      This is a critical point that we carefully considered in the design of the single amino-acid substitutions. Two lines of evidence support that the precise mutations created impact the catalytic rate without modifying the substrate specificity:

      First, as explained in the text, the choice of the single amino-acid substitutions was driven by previous structural and enzymology knowledge. The impact of the four point mutations selected on OGT protein stability and on the Michaelis-Menten kinetic values had previously been determined experimentally (Fig. 1A legend and Martinez-Fleites, C. et al. Nature Structure Molecular Biology 2008; https://doi.org/10.1038/nsmb.1443).

      There is a second important rationale that we added in the revised manuscript: the four point mutations selected are all located in the catalytic domain (specifically, H568A in the N-Cat domain and Y851A, T931A and Q849A in the C-Cat domain), while the substrate recognition is operated via two other domains namely the intervening domain (Int-D) https://doi.org/10.1038/s41589-023-01422-2) and the tetratricopeptide Repeat (TPR) superhelix (10.1021/jacs.7b13546; https://doi.org/10.1073/pnas.2303690120). Therefore, for both these reasons, it is extremely unlikely that these mutations could influence the substrate specificity.

      C5.1: 2. In Fig 1D and 1H, the thresholds to define a gene or TE as differentially expressed are not strong. According to the figure legends, "any" change in terms of log2Fc was considered as DE and colored. I think the figures should illustrate better that the changes are subtle, by for example adding a dotted line (at least) in the value 0.5 of the y-axis. These subtle transcriptional changes should be reflected better in certain paragraphs where the expression of TEs are presented/and discussed as a hallmark of the absence of O-GlcNAcylation in the OGT-mutants. The same happens with Suppl Fig 3C (changes are very minor). {. Applying a stronger threshold, among the upregulated genes, only Xist will be significantly overexpressed. If a gentle threshold needs to be applied to this data, authors should at least justify the reasons behind doing so. Same for Fig2D.

      R5.1: The reviewer means Figure 2D for MA plot of gene expression and Figure 2H for retrotransposons expression. These figures now include a dash line to indicate Log2FC = 0.5 (as all MA plots).

      The text is explicit on the subtle changes in transcription, it reads "with 2/3 of the genes downregulated and 90% of the significant changes below 1 log__2__FC"; "most of the Ogt__T931del/Y embryos showed a low magnitude upregulation of retrotransposons".

      The revised text states "Notably, most of the OgtT931__del/Y embryos showed a low magnitude (log2FC < 1) upregulation of retrotransposons".

      We expand on this topic in the next response (R5.2) noting that changes in gene expression upon O-GlcNAc perturbation in different systems were previously characterized as subtle and widespread. We suggest that this phenotype may arise from the scarcely understood pleiotropic function of O-GlcNAc in fine-tuning gene expression; this phenotype could have a biological significance.

      C5.2: If a gentle threshold needs to be applied to this data, authors should at least justify the reasons behind doing so. Same for Fig2D.

      R5.2: Previous studies in different systems reported that O-GlcNAc perturbation causes a widespread change in gene expression of low magnitude (https://doi.org/10.1101/2024.01.22.576677, https://www.pnas.org/doi/10.1073/pnas.2218332120). We use the same thresholds as a recent functional Ogt study in ES cells to call differentially expressed genes, specifically: p<0.05 (Wald test), any FC (Li et al. PNAS 2023, https://www.pnas.org/doi/10.1073/pnas.2218332120). The p value threshold is standard; the absence of FC threshold is dictated by the insufficient knowledge of the significance of the low magnitude changes observed across many transcripts.

      C6: 3. In Figure 2B, the T931del allele was recovered in the blastocyst population with a very high frequency, even higher than the male WT group (T931del: 10; WT: 3). This observation suggests that the T931del allele did not significantly affect blastocyst survival. Further clarification or additional experiments might be necessary to understand the implications of this finding on early developmental stages.

      R6: This is only a hint as the numbers of blastocysts recovered were too small to perform statistics on Mendelian distribution. Thus, more experiments are needed to perform these statistical tests. These experiments are onerous because the low frequency of germline transmission is incompatible with maintaining this mutation by breeding heterozygous animals. Because of this, a new mouse line needs to be created by CRISPR-HDR targeting in the zygote in order to compute statistics on Mandelian ratios. Importantly, this question - does T931del affect blastocyst survival? - is peripheral, and the results of these experiments would not affect our conclusions in any way.

      C7: 4. Similarly, in Figure 2G, there is an apparent higher expression of TE expression in the T931A/Y embryos group than in the T931del/Y group, which combined with the higher frequency of blastocyst generated in this latest group it may indicate a deeper molecular consequence after the deletion of the T931. A comparison of the transcriptome between these two cell lines help to address this possibility. Also, the authors should compare the O-GlcNAc levels of WT, T931A, and T931del mutant blastocysts by immunostaining, similar to what was done in Figure S5F.

      R7: We agree that a direct comparison between the two mutations of the T931 residue would be interesting; however, this comment is very difficult to address experimentally for the reasons outlined below:

      Firstly, it is not possible to perform a statistical comparison of the transcriptome T931A/Y VS. T931del/Y with the data generated because the number of hemizygous T931A/Y (n=2) is too small. Hence, it cannot be ruled out that the seemingly milder retrotransposon reactivation in one of the T931A/Y embryos could have occurred by chance.

      Secondly, considering the low magnitude effect on gene expression changes upon O-GlcNAc genetic perturbation, to statistically assess the penetrance of the molecular phenotype and perform the differential expression analysis, numerous (>>3) hemizygous blastocysts of each genotype would be needed. Because females heterozygous for the T931 mutations transmit the mutant allele at very low frequency, these experiments require numerous de novo CRISPR injection sessions.

      Thirdly, for the immunostaining of O-GlcNAc to be semi-quantitative, a large number of hemizygous blastocysts for each genotype would be required (note that in Figure S5F, 29 morulae per condition were imaged), thus requiring numerous CRISPR injection experiments as discussed above. Moreover, O-GlcNAc changes could be subtler than what expected based on the strong reduction of OGT activity, since as a compensatory mechanism Ogt expression is upregulated in the Ogt__T931A/del blastocysts (Fig. S2D), making a quantification even more challenging despite a high number of stained embryos.

      In sum, these in vivo experiments are difficult and require sacrificing many animals (about 20 females per CRISPR injection experiment). Because the results would bring refinement to the study but would not change our conclusions, we suggest that the cost/benefit is too high.

      C8: 5. In Boulard et al. 2019 O-GlcNAcylation was shown to be sufficient to modulate expression of DNA methylation-dependent TEs. It would be interesting to know (or at least discuss) if the changes in TE expression observed in OGT-mutant embryos in this study involve changes in DNA methylation. Ideally, some DNA methylation measurement optimized for low input numbers of cells would be useful.

      R8: Thank you for making the link with our previous study. In the PNAS paper, we report that targeted removal of O-GlcNAc at proteins bound to specific TEs (e.g. IAPez) causes their full-blown reactivation without detectable changes in DNA methylation, thus suggesting a role of the O-GlcNAc modification for the silencing of methylated TEs downstream or independent of DNA methylation. We agree that it would be informative to quantify DNA methylation in the T931-mutant blastocysts to test if the in vitro result is the same in vivo, but this would require performing onerous microinjection sessions as explained above.

      C9: 6. The data related with the OGT-degron system in MEs seem disconnected with the rest of the manuscript. While the developmental models (blastocyst, etc) elegantly assess the contribution of O-GlcNAcylation to the control of cell survival and gene expression through the use of different OGT mutants, the degron system is a system of graded depletion that unfortunately was only possible to be used in MEFs (instead of embryos). Thus, the results obtained with the degron system in MEFs are difficult to intersect with the data from the use of OGT-mutants in embryos. Even though there are obvious interesting questions that one may want to know about this OGT degron MEF system, none of them would demonstrate a direct role for O-GlcNAcylation in cellular function, the major point addressed in the developmental system. Using the degron system in embryonic stem cells might have provided a more parallel comparison. The authors should discuss this point in more detail and either use ESC instead of MEFs or provide a stronger justification for the use of MEFs over ESC.

      R9: We thank the reviewer for their clear understanding of the system. The choice of primary MEF as an in vitro model was imposed by technical limitations we encountered during the study. We fully agree that ES cells is the model of choice for preimplantation embryos; thus we initially derived ES cells and obtained only one male clone bearing the AID degron system. Upon auxin addition to the culture media, OGT's level remained unchanged in ES cells. Thus, the ES cells model was not usable. To test the AID degron in a different cell type, we then derived MEFs and showed its effectiveness (Figures 4C and S4C-E), which also allowed to collect functional data on OGT's cellular function (Figures 4D-F). We took the comment on board and clarified the rationale of studying MEFs in the revised manuscript. We agree that it remains to be verified that the OGT-dependent pathways uncovered in MEFs are relevant in the preimplantation embryo. Despite this caveat, we feel the mouse model for endogenous OGT-degron, as well as the negative results in vivo and conclusions in MEFs should be shared with the community, which could take advantage of our results to refine the system.

      Minor:C10: 7. In Fig 2C the color and shape codes are confusing to understand - there are some colors/shapes that are not represented in the PCA plot. The same in Fig 3H, where in the PCA plot there are pink triangles that do not match with the code legends.

      R10: We apologize for the confusion with the legends of Figures 2C and 3H, that we have made unambiguous in the revised version (as well as Figures S2B,C and S3C).

      C11: 8. In the figure legends of Figures 2D, 2E, 2F, and 2H, the notation should be corrected from "OgtT931A/Y" to "OgtT931del/Y".

      R11: This has been corrected; many thanks for bringing it to our attention.

      Significance

      C12: To investigate the function of OGT at specific developmental stages, the authors perturbed OGT's function in vivo by creating a murine allelic series featuring four single amino acid substitutions that variably reduced OGT's catalytic activity. The goal was to identify the direct effect of O-GlcNAcylation, using a sophisticated collection of genetic mutants to evaluate in vivo the role of this modification at early stages of development. Overall, the severity of embryonic lethality correlated with the extent of catalytic impairment of OGT, demonstrating that the O-GlcNAc modification is essential for early development.

      R12: We thank the reviewer for their clear understanding of our work and their appreciation of the biological importance of the findings.

      Reviewer #3

      Evidence, reproducibility and clarity

      C13: This is a conceptually interesting paper that attempts to leverage the knowledge of OGT catalysis to begin to dissect OGT function. The evidence is presented I a straightforward fashion and is in general well documented. The breeding strategies are well informed and the paper draws heavily on previous work carried out in the mouse.

      R13: We greatly appreciate the overall supporting review. However, we fail to understand what they mean with "the paper draws heavily on previous work carried out in the mouse". This comment may stem from a misunderstanding because this work is not based on any previously published study. Specifically, neither the seven murine alleles presented and analyzed nor the single embryo-transcriptomic data sets on which our conclusions are based have been published elsewhere.

      To put this work into context, before our study there were two seminal studies published two decades ago that reported the essential role of Ogt for mouse development, but no molecular profiling was performed (10.1073/pnas.100471497, 10.1128/mcb.24.4.1680-1690.2004). The two Ogt loss-of-function alleles studied in these papers were deemed as not suitable for interrogating molecular phenotypes because they caused cell death that confounds molecular profiling and embryonic lethality at implantation, thus preventing study of the sexually-dimorphic role of Ogt placenta. To overcome this long-standing problem, we created new seven murine alleles, which allowed us to tease apart molecular phenotypes at key stages of mouse embryonic development, focusing on the blastocyst and the placenta.

      Significance

      C14: The paper describes tools which will help dissect the many potential roles of O-GlcNAc addition in early development. As it stands, this is a descriptive manuscript that will lead to hypothesis generation and testing and this should not be undervalued. The biological reagents produced and characterized will be of general interest to the field. Most of the findings presented represented a verification of existing ideas in the field but this is not meant as a criticism since part of the motivation for the approach was to generate a reproducible system for analyzing the biological phenomena.

      R14: We thank the reviewer for their appreciation of the importance of experimentally testing ideas shared in the field without direct evidence.

      However, we must respectfully disagree with the qualification of "descriptive manuscript". This qualification may stem from the particularly difficult challenge to accessing the molecular details on how the O-GlcNAc modification exerts the biological functions we report. We are fully cognizant of the limitations of the study that we discussed in the discussion section and in R20.2. However, we feel that the adjective "descriptive" is not a fair qualification because we provide numerous novel functional evidence. Specifically, we introduce two novel orthogonal in vivo perturbations for endogenous Ogt that allowed us to interrogate for the first time its function in the developing mouse embryo. These perturbations allow us to draw causative conclusions (not descriptive) on the essential role of the O-GlcNAc modification itself for preimplantation development, its sexually-dimorphic role in the placenta and its requirement in vivo for the stable repression of retrotransposons.

      C15: There are perhaps some bioinformatic shortcuts taken that may need to be corrected upon thorough review. These do not lessen the overall impact of the contribution.

      R15: All the code written for the bioinformatic analyses performed in this study is publicly available: https://github.com/boulardlab/Ogt_mouse_models_Formichetti2024. The reviewer needs to specify which bioinformatic analysis they suggest could be improved.

      Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      Summary

      C16: O-GlcNAcylation is the fundamental post-translational modification of numerous nuclear and cytosolic proteins. OGT is the sole enzyme catalyzing O-GlcNAc addition onto the proteins. The essentiality of OGT for early development and cellular viability has been established by using OGT-KO mice and cell lines. However, it remains to be elucidated whether the catalytic activity of OGT is required for the early development, and if the catalytic activity of OGT is required what are the functions of OGT or O-GlcNAcylation in early development due to a lack of appropriate mouse models. To overcome the technical difficulty of manipulating the levels of O-GlcNAcylation in early embryos, Formichetti et al. created the series of four mouse models (OgtY851A, OgtT931A, OgtQ849N, and OgtH568A) with different OGT activity by introducing single amino acid substitution in the catalytic domain. By analyzing the inheritance of the hypomorphic OGT alleles and the lethality of mouse embryos, they discovered OGT activity is a critical factor for early development. Subsequently, RNA-seq analyses with two mouse models showing the maternal inheritance of the hypomorphic OGT alleles indicated that sever hypo-OGT activity altered transcription and silencing of retrotransposon in preimplantation development while mild reduction of OGT's activity affected placental development in a sexually dimorphic manner rather than preimplantation development. Furthermore, to study the function of OGT at specific developmental stages, they developed a mouse model bearing endogenously AID-tagged OGT for acute degradation of OGT. Although the degron system wasn't efficient in preimplantation embryos, they discovered quick transcriptional changes upon OGT deletion in MEFs. The quality of the manuscript is good because the question to be solved was appropriately set, the approach was well designed, and their findings were interesting, although their writing was sometimes hard to understand as I raised in my following comments. Nevertheless, there are several points to be fixed before being published.

      R16: We thank the reviewer for their clear understanding of our work and their appreciation of the biological importance of the findings. Your comprehensive review of the manuscript and the questions you raised were extremely helpful in improving the manuscript and fully addressing its limitations. Below, we respond to comments in full, have revised the manuscript to improve clarity and have included novel results.

      Major Comments

      C17: 1. Although the authors showed in vitro activity of each mutant of OGT used in this manuscript by referencing the previous literature, they never showed the levels of global O-GlcNAcylation (and OGT itself) in their established mouse embryos. Although it could be impossible to determine O-GlcNAc levels in OgtQ849N and OgtH568A embryos because of the lack of germline transmission and founder line, respectively, they could do that in OgtY851A and OgtT931A embryos. Given that Y851A and T931A mutants had similar VMAX/KM with different VMAX, it is possible that their activity is comparable or Y851A has even lower activity in vivo depending on the concentration of UDP-GlcNAc in embryos. Therefore, it is critical to assess whether in vivo OGT activity is correlated with that in vitro as expected to conclude that severity of sub-Mendelian inheritance is proportional to the reduction of activity of OGT in vivo. Moreover, since the authors developed the elegant system to deplete OGT, the activity of Q849N and H568A mutant OGT can be examined at least in cells by expressing them in MEFs with OGT-degron system. Thus, I propose determination of global O-GlcNAc levels compensated by OGT levels by western blotting in OgtY851A, and OgtT931A embryos or MEFs with the OGT degron system re-expressing the individual four mutant OGTs. If the protein amount is insufficient for western blotting in the embryos because of the sizes of the earlier stages of embryos, I believe the author could address this by utilizing immunofluorescence as shown in Figure S5.

      R17: We fully agree that this is an important point that requires revision. The only mutation for which the level of O-GlcNAc and OGT can be assessed by western blot in vivo is Y851A, the other mutations resulting in embryonic lethality before the blastocyst stage.

      We have included in the revised manuscript western blot analyses of protein expression for OGT, OGA and O-GlcNAc levels in the placenta of the OgtY851A mutants (new Figures 3C,D). The new data show that OGT is upregulated at the protein level in homozygous females, in good agreement with our transcriptomic analysis. Furthermore, O-GlcNAc levels were slightly reduced in homozygous and hemizygous placentae thus showing the impact of the point mutation on global O-GlcNAc levels in the placentae. Moreover, the analysis of OGA protein level unexpectedly revealed the enrichment of a previously uncharacterized OGA fast migrating isoform in hemizygous and homozygous placentae.

      We agree that it would be informative to compare O-GlcNAc levels in OgtT931A versus OgtY851A embryos. A comparison implies performing the experiment at the same developmental stage, which has to be the blastocyst stage or prior because T931A/Y embryos die around implantation. The blastocyst being made of approximately 140 cells, it would require to pool many single blastocysts to obtain the necessary protein input for western blot. We are not aware of another study performing western blot with pooled blastocysts. An additional great challenge for this experiment is the necessity to genotype and sex the blastocysts before pooling. Thus, the feasibility of this experiment is uncertain.

      As an alternative, the reviewer suggests measuring O-GlcNAc levels in the degron MEFs after introduction of OGT transgenes bearing the mutation studied. This experiment would not be conclusive because of residual O-GlcNAc after OGT degradation (Figure S4E). Furthermore, the O-GlcNAc proteome is dynamic during development (as shown in the developing brain by Liu et al. https://doi.org/10.1371/journal.pone.0043724), therefore the MEFs results would have limited value to explain our results in the early embryo.

      In sum, available technologies to quantify O-GlcNAc (e.g. western bot, mass spectrometry) are inadequate for low input samples as the early embryo. However, our series of hypomorphic alleles backed up with in vitro enzymology measurements brings indirect evidence to this question. Specifically, the qualitative correlation between the measured OGT activity in vitro and the developmental phenotype indicates that the resulting relative levels of O-GlcNAc are consistent with in vitro measurements.

      C18.1 : 2. I didn't understand why the authors couldn't find any founder lines of the OgtH568A mutant. Was that because mosaic mice with OgtH568A mutation are lethal?

      R18.1: To answer to this question, it is important to recall two key features of the biological system:

      1) The mutation H568A was reported to disrupt the glycosyltransferase activity completely (10.1038/nsmb.1443). Hence, OGT-H588A is catalytic dead.

      2) We performed the CRISPR-HDR targeting in the 1-cell embryo.

      Based on these premises, the absence of F0 with the OgtH568A mutation (0/31) suggests that introducing this mutation causes embryonic lethality in both males and females. This hypothesis is consistent with the previously reported lethality around implementation of Ogt-null alleles (10.1128/mcb.24.4.1680-1690.2004). It is possible that the sgRNA is very efficient and results in homozygous mutations in all female zygotes injected (as we have not obtained heterozygous females bearing these mutations). High efficiency of the targeted mutagenesis in the zygote results in mutants where all or the majority of cells bear the mutation (no or low mosaicism). The high number of microinjections performed (416 embryos over the 3 injection sessions) allows us to make these claims.

      C18.2 : Also, I believe there was no explanation why the OgtQ849N allele showed no maternal inheritance. Was that because Q849N possesses enough activity for sustaining mosaic embryos, but not oocytes? The authors should better explain these points in the manuscript text.

      R18.2: Thanks for this comment, we agree that this maternal effect phenotype demands further explanation.

      The phenotype observed suggests two possibilities: either that the oocyte cannot maturate or that the cleavage-stage embryo cannot develop with the resulting lower levels of O-GlcNAc. The cleavage-stage embryo does not transcribe a catalytically active OGT before the 8-cell stage and thus relies on the OGT protein inherited from the oocyte until this stage (https://doi.org/10.1101/2024.01.22.576677).

      Thank you for this comment, we added this interpretation of the result in the text:<br /> "The lack of maternal transmission of the Q849N allele from seemingly mosaic founder females is likely explained by the reliance of the cleavage stage embryo onto the oocyte payload of OGT and O-GlcNAc modified proteins. Specifically, Ogt's exons encoding for the catalytic domains are not detectable before the 8-cell stage, while OGT full-length protein is present and thus maternally inherited (Formichetti et al, 2024)."

      C19: 3. The authors serendipitously found a T931del-allele in the "WT" allele of the OgtT931A line, and suggested that T931del had milder activity loss, although the lethality of embryos was greatly mitigated. Nevertheless, transcriptome analyses in male blastocysts revealed that 120 genes' expression was changed in T931del/Y males. This raised the question about which mutant OGT has higher activity, Y851A or T931del. I think comparing the activity of Y851A and T931del mutants in MEFs with OGT-degron system is important to confirm the proportional relationship between activity and phenotypic severity.

      R19: We agree that it is a limitation that the effect of the T931del mutation on OGT activity has not been biochemically characterized. However, the important point here is that our assessment of phenotypic severity based on maternal inheritance of the mutant allele and embryonic lethality is based on the point mutations for which the catalytic activity has been determined, namely Y851A, T931A, Q849N and H568A, but not T931del.

      We studied the serendipitously discovered T931del mutation to obtain transcriptional insights in the blastocyst. Because the deleted residue T931 is key for the binding to the donor substrate, we can reasonably assume that this mutation affects the catalytic activity, albeit to an undetermined level.

      Hence, our conclusions regarding the requirement of O-GlcNAcylation for development are unaffected by the lack of biochemical knowledge on T931del.

      C20.1: 4. Regarding transcriptomes of T931del/Y, the authors found the upregulation of proteasomal activity and stress granules along with the downregulation of amino acid metabolism, mitochondrial respiration, and so on. To validate the results, the authors should perform qPCR on several up- or down-regulated genes.

      R20.1 : We agree that, in principle, qPCR validation is suitable. However, this validation experiment is particularly expensive in this case because of the requirement of numerous CRISPR zygote pronuclear injection sessions.

      The conclusions of the RNA-seq analysis are strongly supported by a high number of biological replicates (n=10). This high number of biological replicates was essential to obtain sufficient statistical power to quantify with a high level of confidence transcriptional changes of low magnitudes (below 2-fold change, see R5.1 and R5.2).

      Therefore, the qPCR validation experiment would require to repeat the CRISPR zygote pronuclear injection sessions with the same high number of animals. This represents a major investment in experimental work and the sacrificing of about 40 animals. Importantly, the RNA-seq results presented are authoritative because of a high number of biological replicates and high number of sequencing reads per sample. Thus, we argue that qPCR validation is not essential and thus the high cost of this experiment is difficult to justify.

      C20.2: In addition, according to Figure S2E, the authors pointed out that at least for genes upregulated in OgtT931A embryos, the changes were not explained by a developmentally delayed transcriptome, suggesting that upregulation of these genes was the cause of developmental delay. Therefore, I strongly encourage them to discuss in the manuscript text how up-regulated genes could contribute to developmental delay.

      R20.2: Throughout the manuscript, we have been cautious to avoid establishing causal relationships between the differentially expressed genes uncovered and the developmental phenotypes (e.g. delayed development). There are two main obstacles which we believe prevent us from establishing causality with the data available. Firstly, it is not possible to disentangle differentially expressed genes and developmental delay (in other words, we have no way to tell which is the cause and which is the consequence). Secondly, O-GlcNAc modifies over 5000 proteins and the developing embryo is a particularly dynamic system; thus we cannot know whether the differentially expressed promoters are direct targets of O-GlcNAc modified proteins (or alternatively secondary effect of another molecular alteration, for example of the proteome). We discuss this limitation of the study in the discussion section.

      C21: 5. Regarding the transcriptome in OgtY851A mice, Y851A/Y male mice had huge transcriptomic differences, while Y851A/Y851A female mice barely had any. Although it seems to agree with the number of Ogt alleles, I wonder whether other X-linked genes expressed higher in female placenta as shown in Figure 3C could attenuate the effects of decreased OGT activity. I don't think this possibility can be excluded, unless the authors further decrease OGT activity in Y851A/Y851A female placenta and obtain the similar results as for male placenta. Or if they compared the levels of global O-GlcNAcylation between Y851A/Y and Y851A/Y851A mouse placentas and discovered they had similar levels of O-GlcNAcylation, then the authors could conclude that the number of Ogt alleles was not the reason of sexual-dimorphism. The authors should determine the levels of O-GlcNAcylation in Y851A/Y and Y851A/Y851A mouse placentas and/or at least discuss the above possibilities in the manuscript text.

      R21: Thank you for the thoughtful feedback. We agree that the most likely explanation for the higher sensitivity of males placenta as compared to females to OGT reduced activity is the difference in Ogt copy number, especially because Ogt escapes X-chromosome inactivation in the placenta (new Figure S3A).

      Western blot quantification of global O-GlcNAc levels was now performed (new Figures 3C,D). We measured similar level of O-GlcNAc in Y851A/Y and Y851A/Y851A placentas (lowered than WT males in both cases), but we cannot exclude that the WB does not have the dynamic range required to detect a subtle difference. In fact, female homozygous were expected to have an intermediate level between WT males and hemizygous males, and the difference between the two male genotypes (also considering sample-to-sample variability) is already small when quantified from the blot (new Figure 3D). It is possible that a X-linked modifier attenuates the impact of hypo-O_GlcNAcylation in female mutant placenta in the case of identical O-GlcNAc levels in homozygous females and hemizygous males. Thank you for the idea that we included in the revised manuscript:

      "Of note, the lower sensitivity of the homozygous females' transcriptome to Ogt disruption (Fig. 3F,I and S3B) seems difficult to reconcile with their lower O-GlcNAc level comparable (lower) O-GlcNAc level to the hemizygous males (Fig. 3C). It is possible that the western blot technique is not sensitive enough to detect subtle differences in O-GlcNAcylation. An alternative hypothesis, if O-GlcNAc levels were truly identical between Y851A/Y and Y851A/Y851A, could be the existence of a modifier in female that could be a XCI-escapee."

      C22: 6. In terms of the transcriptome in OgtY851A mice, similar to comment 4, the authors should confirm their transcriptomics data shown as Figure 3D by qPCR. In addition, the authors should describe the potential mechanisms by which the differentiation of precursor cells of LaTPs and JZPs were disrupted. Were master regulators of the differentiation known to be O-GlcNAcylated and loss of O-GlcNAcylation perturbed the function?

      R22: As for the whole embryo discussed in R20.2, we also interpret cautiously the gene expression phenotype observed in the placenta. Specifically, we state in the manuscript that it could either be caused by an impact of lower O-GlcNAcylation on placental differentiation or by a general delay in placentation or in the development of the embryo as a whole. The hypothesis of a general delay (of the whole embryo and/or of placental formation specifically) is supported by the downregulation of essentially all markers of more differentiated cell types and the upregulation of the precursor marker. We favor this hypothesis because it is consistent with what observed with the T931 mutants and also with the enzymatic removal of O-GlcNAc in the zygote (Formichetti et al., 2024 BioRxiv). Because of the thousands of O-GlcNAcylated proteins present in the cell, it is impossible to know which is the responsible molecular mechanism, which could even start at much earlier stages.

      Minor Comments

      C23: 1. Regarding DFP461-463 mutant, I couldn't understand the point of this figure because the results had no difference, and the meaning of the mutation was quite different from the others. Thus, the figure was awkward and a little confusing to me. If the authors still want to include the figures, I would suggest that they should reorganize the position of the figure (maybe after figure 3 is better to show you had tried to investigate the effects of nuclear localization of OGT on the changes of transcriptomes) and add some results. Since WT OGT seems to be localized mainly in the cytosol at steady state (Figure S1B and S1C), the effect of mutation on its nuclear localization should not be obvious. Therefore, it is difficult to conclude the mutation had no effect on the nuclear localization unless the ratio of nuclear and cytosol localization is quantified. Also, I wonder whether the O-GlcNAc levels of nuclear and cytosolic proteins in the mutant cells were comparable to those in WT cells. If this is the case, the results would also support the authors' conclusion.

      R23: We took the comments on board and made it clearer that the rationale for the DFP461-463 mutant was an attempt to separate OGT's nuclear and cytosolic functions. We fully agree that these results are peripheral, and thus we presented these results in Supplementary Figure 1 (not in the main figure).

      The biochemical evidence presented in Fig S1C shows that the genetic substitution of DFP to AAA on endogenous OGT has no detectable impact on its nuclear localization in primary MEFs. This result is far more authoritative than the evidence provided by Seo et al. 2016 (doi: 10.1038/srep34614), which is based on the overexpression of OGT transgenes in HeLa cells. Importantly, Seo et al. 2016 did not assess the impact of their mutations on endogenous OGT.

      We believe that the negative results we obtained with the DFP461-463 mouse model shall be extremely valuable for the field. Firstly, science can move forward only if both negative and positive results are shared. In this specific case, we found that mutation of endogenous OGT in MEFs yielded to a different result than previously reported overexpression of the same mutant construct in HeLa cells. Secondly, we want to make the Ogt-NLS- mouse model available for further investigations.

      C24: 2. Since OGT or O-GlcNAcylation regulates chromatin status, the authors analyzed the gene expression profiles of retrotransposons in T931del/Y or T931A/Y mice. Is it possible to investigate if the release of gene silencing is also seen in non-retrotransposon genes? I assumed retrotransposons might be a well-established system to analyze gene silencing status, however, if the authors could find similar effects on genes other than retrotransposons, that would be highly valuable.

      R24: This is an interesting idea. This notion refers to the activation of promoters that are normally epigenetically repressed (e.g. silent despite the presence of all trans-active factors required for their expression). Epigenetically repressed promoters include retrotransposons, imprinted genes and germline specific genes that are normally expressed in germ cells and maintained in a repressed state in somatic cells (10.1038/s41580-019-0159-6). Testing of mono-allelic expression of imprinted genes required F1-hybrid. Thus, we assessed whether well-studied germline specific genes could be realized from silencing in T931del/Y or T931A/Y blastocyst and found no evidence for it (see dot plot below). The unbiased transcriptomic analysis presented in the manuscript shows that the product of upregulated genes are enriched in mRNA processing (Figure 2E), but these genes are not normally epigenetically repressed. Thus, contrary to retrotransposons, the role of O-GlcNAc at cellular gene promoters appears not to be linked to epigenetic silencing. This could be explained by the many different protein substrates for O-GlcNAc.

      C25: 3. OgtY851A mice with milder OGT activity loss didn't exhibit impaired preimplantation development, but did display postimplantation development such as placental development, suggesting that O-GlcNAcylation of proteins required for preimplantation and postimplantation development relies on different degrees of OGT activity. I wonder whether global O-GlcNAc levels in embryos in preimplantation and postimplantation developmental stages are different or not. This might include both the pattern of blotting and intensities. The results would give the authors an explanation why the dependency on OGT activity was different in two developmental stages. Can the authors provide data? If not, then the authors should at least describe hypotheses in the manuscript to address these questions.

      R25: We recently reported that the subcellular patterns of O-GlcNAc are highly dynamic during preimplantation development (Formichetti et al. 2024, BioRxiv). The most striking O-GlcNAc remodeling we observed is the enrichment of nuclear O-GlcNAc as compared to cytoplasmic O-GlcNAc that is concomitant to embryonic genome activation (Formichetti et al. 2024, BioRxiv). We quantified the ratio of the nuclear/cytoplasmic signal by immunofluorescence, but absolute quantification is not possible with this method. Due to the limited number of cells of the preimplantation embryo, this analysis cannot be performed by western blot. Hence, there is no appropriate method to quantitatively compare O-GlcNAc levels between preimplantation and postimplantation embryos.

      C26: 4. The authors' AID-degron system elegantly worked in MEFs but was inefficient in preimplantation embryos. I wonder if this was because of the high expression of the shorter isoform of OGT detected as OGTp78 in the author's western blot. Is it possible to examine this possibility in the embryos? Either way, the authors should describe a potential explanation for why the efficiency in the embryos was low. In addition, the authors should describe why they inserted the AID tag only into the longest OGT isoform.

      R26: This is a good point. The smallest isoform OGTp78 bears the catalytic domain and thus can partially compensate for the degradation of OGTp110. Note that the level of OGTp78 is low and does not increase upon OGTp110 degradation; thus a compensation can only be partial (Figures S4A and S4D). Alternative hypotheses for the ineffectiveness of the degron system in ex vivo grown embryos include: i) the expression level of OsTIR that may be too low in the early embryo (Rosa26 promoter not being activated at EGA), ii) a possible steric hindrance of the N-ter AID tag in these cells, iii) the lower concentration of Auxin imposed by toxicity on the embryo is likely suboptimal. Testing these possibilities is very difficult in preimplantation embryos.

      It is unclear how the OGTp78 isoform is produced; it was hypothesized to originate from an alternative transcription start site (https://doi.org/10.1007/s00335-001-2108-9). We initially attempted to target both isoforms by inserting the AID tag at the C-terminus, but we were unsuccessful in producing this mouse model. It is possible that the C-terminus that is near the catalytic site cannot tolerate the AID knock-in.

      C27: 5. In Figure S1C, is the band detected right below OGTp78 in nuclei fractions non-specific or do both bands correspond to OGTp78 ?

      R27: To answer this question, a knockout control would be needed. OGTp78 being not targeted by our AID-degron, we cannot test the specificity of these bands using our perturbation tool kit.

      C28: 6. Figure 1D top row third column: hemizgous -> hemizygous

      R28: Many thanks; the embarrassing typo has been corrected.

      C29: 7. Figure 1D second row third column: hemyzygous -> hemizygous

      R29: Thanks for bringing this other typo to our attention, it is now corrected.

      Reviewer #4 (Significance (Required)):

      General assessment: strengths and limitations

      C30: Strength: This manuscript elegantly revealed the requirement of OGT in mammalian development by taking advantage knock-in mouse models with different OGT activity. In addition, the manuscript provided the interesting and important transcriptomics data in both pre- and post-implantation embryos of OGT mutant mice. These data sets could explain detailed mechanisms how OGT or O-GlcNAcylation regulates mammalian development in the future. Furthermore, development of AID-tagged OGT system would be a useful tool for other researchers studying OGT function.

      Limitation: Although they found interesting changes in terms transcriptomes in developing mice with different OGT activity, they lack the data showing how these changes caused the observed phenotypes. In other words, there are less mechanistic insights behind the developmental problems seen in mice with different OGT activity.

      In addition, although I agree the question about whether OGT activity itself is crucial for the early development of mammals has not been completely solved for a long time, I assume people thought OGT activity is actually important for the mammalian development thorough the observation of OGT-linked congenital disorders of glycosylation.

      Therefore, I would say the novelty of the manuscript is a little less impactful. Furthermore, although AID-tagged OGT system revealed fundamental questions regarding the transcriptional changes upon acute depletion of OGT in cellular levels, the system was inefficient in mouse embryos. So, they showed nothing about developmental-stage specific requirements of OGT.

      Advance: The manuscript can fill a current gap regarding requirement of OGT in mammalian development. Also, the manuscript developed a series of mutant mice with different OGT activity and an AID-tagged OGT mouse line. These mice provide technical advances.

      Audience: The manuscript will be interested in researchers in specific fields such as glycobiology, developmental biology, and clinical fields.

      Describe your expertise: Biochemistry, Glycobiology, Cell biology

      R30: We are thankful for the constructive and supportive review.

      We fully agree with the limitations of the study and discussed them in the manuscript. Our in vivo approach revealed the most phenotypically relevant transcriptional phenotypes resulting from OGT catalytic impairment during embryonic development. We make the mouse models created for this study available to the community to facilitate follow-up studies aiming at exploring the underlying molecular details.

      As pointed out in the comments, the requirement of OGT glycosyltransferase activity for mammalian development was widely assumed by the field, but this belief was without direct experimental evidence. This study provides the first in vivo evidence for this important conclusion.

      Conclusion: The reviewers' comments were tremendously useful to improving the clarity of the manuscript and adding important new in vivo evidence. We note that none of the reviewers provided any reason to doubt our important conclusions:

      • The demonstration that the enzymatic activity of Ogt, thus the O-GlcNAc modification itself, is essential for preimplantation development.
      • The finding that a mild reduction of OGT's activity is sufficient to perturb the silencing of multiple families of retrotransposons in the growing embryo.
      • The indication, from transcriptomes of hypo-O-GlcNAcylated embryos, of a developmental retardation upon a mild O-GlcNAc perturbation.

      • The discovery that OGT's rapid depletion in vitro downregulates basal cellular function, including translation. This result provides mechanistic support to the embryonic growth delay resulting from decreasing O-GlcNAc in vivo.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      Summary

      O-GlcNAcylation is the fundamental post-translational modification of numerous nuclear and cytosolic proteins. OGT is the sole enzyme catalyzing O-GlcNAc addition onto the proteins. The essentiality of OGT for early development and cellular viability has been established by using OGT-KO mice and cell lines. However, it remains to be elucidated whether the catalytic activity of OGT is required for the early development, and if the catalytic activity of OGT is required what are the functions of OGT or O-GlcNAcylation in early development due to a lack of appropriate mouse models. To overcome the technical difficulty of manipulating the levels of O-GlcNAcylation in early embryos, Formichetti et al. created the series of four mouse models (OgtY851A, OgtT931A, OgtQ849N, and OgtH568A) with different OGT activity by introducing single amino acid substitution in the catalytic domain. By analyzing the inheritance of the hypomorphic OGT alleles and the lethality of mouse embryos, they discovered OGT activity is a critical factor for early development. Subsequently, RNA-seq analyses with two mouse models showing the maternal inheritance of the hypomorphic OGT alleles indicated that sever hypo-OGT activity altered transcription and silencing of retrotransposon in preimplantation development while mild reduction of OGT's activity affected placental development in a sexually dimorphic manner rather than preimplantation development. Furthermore, to study the function of OGT at specific developmental stages, they developed a mouse model bearing endogenously AID-tagged OGT for acute degradation of OGT. Although the degron system wasn't efficient in preimplantation embryos, they discovered quick transcriptional changes upon OGT deletion in MEFs. The quality of the manuscript is good because the question to be solved was appropriately set, the approach was well designed, and their findings were interesting, although their writing was sometimes hard to understand as I raised in my following comments. Nevertheless, there are several points to be fixed before being published.

      Major Comments

      1. Although the authors showed in vitro activity of each mutant of OGT used in this manuscript by referencing the previous literature, they never showed the levels of global O-GlcNAcylation (and OGT itself) in their established mouse embryos. Although it could be impossible to determine O-GlcNAc levels in OgtQ849N and OgtH568A embryos because of the lack of germline transmission and founder line, respectively, they could do that in OgtY851A and OgtT931A embryos. Given that Y851A and T931A mutants had similar VMAX/KM with different VMAX, it is possible that their activity is comparable or Y851A has even lower activity in vivo depending on the concentration of UDP-GlcNAc in embryos. Therefore, it is critical to assess whether in vivo OGT activity is correlated with that in vitro as expected to conclude that severity of sub-Mendelian inheritance is proportional to the reduction of activity of OGT in vivo. Moreover, since the authors developed the elegant system to deplete OGT, the activity of Q849N and H568A mutant OGT can be examined at least in cells by expressing them in MEFs with OGT-degron system. Thus, I propose determination of global O-GlcNAc levels compensated by OGT levels by western blotting in OgtY851A, and OgtT931A embryos or MEFs with the OGT degron system re-expressing the individual four mutant OGTs. If the protein amount is insufficient for western blotting in the embryos because of the sizes of the earlier stages of embryos, I believe the author could address this by utilizing immunofluorescence as shown in Figure S5.
      2. I didn't understand why the authors couldn't find any founder lines of the OgtH568A mutant. Was that because mosaic mice with OgtH568A mutation are lethal? Also, I believe there was no explanation why the OgtQ849N allele showed no maternal inheritance. Was that because Q849N possesses enough activity for sustaining mosaic embryos, but not oocytes? The authors should better explain these points in the manuscript text.
      3. The authors serendipitously found a T931del-allele in the "WT" allele of the OgtT931A line, and suggested that T931del had milder activity loss, although the lethality of embryos was greatly mitigated. Nevertheless, transcriptome analyses in male blastocysts revealed that 120 genes' expression was changed in T931del/Y males. This raised the question about which mutant OGT has higher activity, Y851A or T931del. I think comparing the activity of Y851A and T931del mutants in MEFs with OGT-degron system is important to confirm the proportional relationship between activity and phenotypic severity.
      4. Regarding transcriptomes of T931del/Y, the authors found the upregulation of proteasomal activity and stress granules along with the downregulation of amino acid metabolism, mitochondrial respiration, and so on. To validate the results, the authors should perform qPCR on several up- or down-regulated genes. In addition, according to Figure S2E, the authors pointed out that at least for genes upregulated in OgtT931A embryos, the changes were not explained by a developmentally delayed transcriptome, suggesting that upregulation of these genes was the cause of developmental delay. Therefore, I strongly encourage them to discuss in the manuscript text how up-regulated genes could contribute to developmental delay.
      5. Regarding the transcriptome in OgtY851A mice, Y851A/Y male mice had huge transcriptomic differences, while Y851A/Y851A female mice barely had any. Although it seems to agree with the number of Ogt alleles, I wonder whether other X-linked genes expressed higher in female placenta as shown in Figure 3C could attenuate the effects of decreased OGT activity. I don't think this possibility can be excluded, unless the authors further decrease OGT activity in Y851A/Y851A female placenta and obtain the similar results as for male placenta. Or if they compared the levels of global O-GlcNAcylation between Y851A/Y and Y851A/Y851A mouse placentas and discovered they had similar levels of O-GlcNAcylation, then the authors could conclude that the number of Ogt alleles was not the reason of sexual-dimorphism. The authors should determine the levels of O-GlcNAcylation in Y851A/Y and Y851A/Y851A mouse placentas and/or at least discuss the above possibilities in the manuscript text.
      6. In terms of the transcriptome in OgtY851A mice, similar to comment 4, the authors should confirm their transcriptomics data shown as Figure 3D by qPCR. In addition, the authors should describe the potential mechanisms by which the differentiation of precursor cells of LaTPs and JZPs were disrupted. Were master regulators of the differentiation known to be O-GlcNAcylated and loss of O-GlcNAcylation perturbed the function?

      Minor Comments

      1. Regarding DFP461-463 mutant, I couldn't understand the point of this figure because the results had no difference, and the meaning of the mutation was quite different from the others. Thus, the figure was awkward and a little confusing to me. If the authors still want to include the figures, I would suggest that they should reorganize the position of the figure (maybe after figure 3 is better to show you had tried to investigate the effects of nuclear localization of OGT on the changes of transcriptomes) and add some results. Since WT OGT seems to be localized mainly in the cytosol at steady state (Figure S1B and S1C), the effect of mutation on its nuclear localization should not be obvious. Therefore, it is difficult to conclude the mutation had no effect on the nuclear localization unless the ratio of nuclear and cytosol localization is quantified. Also, I wonder whether the O-GlcNAc levels of nuclear and cytosolic proteins in the mutant cells were comparable to those in WT cells. If this is the case, the results would also support the authors' conclusion.
      2. Since OGT or O-GlcNAcylation regulates chromatin status, the authors analyzed the gene expression profiles of retrotransposons in T931del/Y or T931A/Y mice. Is it possible to investigate if the release of gene silencing is also seen in non-retrotransposon genes? I assumed retrotransposons might be a well-established system to analyze gene silencing status, however, if the authors could find similar effects on genes other than retrotransposons, that would be highly valuable.
      3. OgtY851A mice with milder OGT activity loss didn't exhibit impaired preimplantation development, but did display postimplantation development such as placental development, suggesting that O-GlcNAcylation of proteins required for preimplantation and postimplantation development relies on different degrees of OGT activity. I wonder whether global O-GlcNAc levels in embryos in preimplantation and postimplantation developmental stages are different or not. This might include both the pattern of blotting and intensities. The results would give the authors an explanation why the dependency on OGT activity was different in two developmental stages. Can the authors provide data? If not, then the authors should at least describe hypotheses in the manuscript to address these questions.
      4. The authors' AID-degron system elegantly worked in MEFs but was inefficient in preimplantation embryos. I wonder if this was because of the high expression of the shorter isoform of OGT detected as OGTp78 in the author's western blot. Is it possible to examine this possibility in the embryos? Either way, the authors should describe a potential explanation for why the efficiency in the embryos was low. In addition, the authors should describe why they inserted the AID tag only into the longest OGT isoform.
      5. In Figure S1C, is the band detected right below OGTp78 in nuclei fractions non-specific or do both bands correspond to OGTp78 ?
      6. Figure 1D top row third column: hemizgous -> hemizygous
      7. Figure 1D second row third column: hemyzygous -> hemizygous

      Significance

      General assessment: strengths and limitations

      Strength: This manuscript elegantly revealed the requirement of OGT in mammalian development by taking advantage knock-in mouse models with different OGT activity. In addition, the manuscript provided the interesting and important transcriptomics data in both pre- and post-implantation embryos of OGT mutant mice. These data sets could explain detailed mechanisms how OGT or O-GlcNAcylation regulates mammalian development in the future. Furthermore, development of AID-tagged OGT system would be a useful tool for other researchers studying OGT function.

      Limitation: Although they found interesting changes in terms transcriptomes in developing mice with different OGT activity, they lack the data showing how these changes caused the observed phenotypes. In other words, there are less mechanistic insights behind the developmental problems seen in mice with different OGT activity. In addition, although I agree the question about whether OGT activity itself is crucial for the early development of mammals has not been completely solved for a long time, I assume people thought OGT activity is actually important for the mammalian development thorough the observation of OGT-linked congenital disorders of glycosylation. Therefore, I would say the novelty of the manuscript is a little less impactful. Furthermore, although AID-tagged OGT system revealed fundamental questions regarding the transcriptional changes upon acute depletion of OGT in cellular levels, the system was inefficient in mouse embryos. So, they showed nothing about developmental-stage specific requirements of OGT.

      Advance: The manuscript can fill a current gap regarding requirement of OGT in mammalian development. Also, the manuscript developed a series of mutant mice with different OGT activity and an AID-tagged OGT mouse line. These mice provide technical advances.

      Audience: The manuscript will be interested in researchers in specific fields such as glycobiology, developmental biology, and clinical fields.

      Describe your expertise: Biochemistry, Glycobiology, Cell biology

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This is a conceptually interesting paper that attempts to leverage the knowledge of OGT catalysis to begin to dissect OGT function. The evidence is presented I a straightforward fashion and is in general well documented. The breeding strategies are well informed and the paper draws heavily on previous work carried out in the mouse.

      Significance

      The paper describes tools which will help dissect the many potential roles of O-GlcNAc addition in early development. As it stands, this is a descriptive manuscript that will lead to hypothesis generation and testing and this should not be undervalued. The biological reagents produced and characterized will be of general interest to the field. Most of the findings presented represented a verification of existing ideas in the field but this is not meant as a criticism since part of the motivation for the approach was to generate a reproducible system for analyzing the biological phenomena.

      There are perhaps some bioinformatic shortcuts taken that may need to be corrected upon thorough review. These do not lessen the overall impact of the contribution.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Comments to authors

      To investigate the function of OGT at specific developmental stages, the authors perturbed OGT's function in vivo by creating a murine allelic series featuring four single amino acid substitutions that variably reduced OGT's catalytic activity. The goal was to identify the direct effect of O-GlcNAcylation, using a sophisticated collection of genetic mutants to evaluate in vivo the role of this modification at early stages of development. Overall, the severity of embryonic lethality correlated with the extent of catalytic impairment of OGT, demonstrating that the O-GlcNAc modification is essential for early development.<br /> The study represents a substantial advance in our understanding of OGT and O-GlcNAcylation in mammalian development. The creation of novel murine models and inducible systems is an important contribution, providing powerful tools for future research in this field. The insights into the role of OGT's catalytic activity and its involvement in epigenetic regulation during embryonic development are noteworthy, opening new avenues for research. However, there are a few considerations and concerns:

      Major:

      1. An assumption of the study is that different mutations cause different levels of O-GlcNAcylation rather than alterations in substrate specificity. It might be important to test, at least in cultured cells, that the different mutations do not change the preference of OGT to modify certain proteins rather than others, which can provide alternative explanations for their findings.
      2. In Fig 1D and 1H, the thresholds to define a gene or TE as differentially expressed are not strong. According to the figure legends, "any" change in terms of log2Fc was considered as DE and colored. I think the figures should illustrate better that the changes are subtle, by for example adding a dotted line (at least) in the value 0.5 of the y-axis. These subtle transcriptional changes should be reflected better in certain paragraphs where the expression of TEs are presented/and discussed as a hallmark of the absence of O-GlcNAcylation in the OGT-mutants. The same happens with Suppl Fig 3C (changes are very minor). Similarly, in Fig2C, the changes in gene expression are lower than log2FC 1 (which represent the double in absolute expression). Applying a stronger threshold, among the upregulated genes, only Xist will be significantly overexpressed. If a gentle threshold needs to be applied to this data, authors should at least justify the reasons behind doing so. Same for Fig2D.
      3. In Figure 2B, the T931del allele was recovered in the blastocyst population with a very high frequency, even higher than the male WT group (T931del: 10; WT: 3). This observation suggests that the T931del allele did not significantly affect blastocyst survival. Further clarification or additional experiments might be necessary to understand the implications of this finding on early developmental stages.
      4. Similarly, in Figure 2G, there is an apparent higher expression of TE expression in the T931A/Y embryos group than in the T931del/Y group, which combined with the higher frequency of blastocyst generated in this latest group it may indicate a deeper molecular consequence after the deletion of the T931. A comparison of the transcriptome between these two cell lines help to address this possibility. Also, the authors should compare the O-GlcNAc levels of WT, T931A, and T931del mutant blastocysts by immunostaining, similar to what was done in Figure S5F.
      5. In Boulard et al. 2019 O-GlcNAcylation was shown to be sufficient to modulate expression of DNA methylation-dependent TEs. It would be interesting to know (or at least discuss) if the changes in TE expression observed in OGT-mutant embryos in this study involve changes in DNA methylation. Ideally, some DNA methylation measurement optimized for low input numbers of cells would be useful.
      6. The data related with the OGT-degron system in MEs seem disconnected with the rest of the manuscript. While the developmental models (blastocyst, etc) elegantly assess the contribution of O-GlcNAcylation to the control of cell survival and gene expression through the use of different OGT mutants, the degron system is a system of graded depletion that unfortunately was only possible to be used in MEFs (instead of embryos). Thus, the results obtained with the degron system in MEFs are difficult to intersect with the data from the use of OGT-mutants in embryos. Even though there are obvious interesting questions that one may want to know about this OGT degron MEF system, none of them would demonstrate a direct role for O-GlcNAcylation in cellular function, the major point addressed in the developmental system. Using the degron system in embryonic stem cells might have provided a more parallel comparison. The authors should discuss this point in more detail and either use ESC instead of MEFs or provide a stronger justification for the use of MEFs over ESC.

      Minor:

      1. In Fig 2C the color and shape codes are confusing to understand - there are some colors/shapes that are not represented in the PCA plot. The same in Fig 3H, where in the PCA plot there are pink triangles that do not match with the code legends.
      2. In the figure legends of Figures 2D, 2E, 2F, and 2H, the notation should be corrected from "OgtT931A/Y" to "OgtT931del/Y".

      Significance

      To investigate the function of OGT at specific developmental stages, the authors perturbed OGT's function in vivo by creating a murine allelic series featuring four single amino acid substitutions that variably reduced OGT's catalytic activity. The goal was to identify the direct effect of O-GlcNAcylation, using a sophisticated collection of genetic mutants to evaluate in vivo the role of this modification at early stages of development. Overall, the severity of embryonic lethality correlated with the extent of catalytic impairment of OGT, demonstrating that the O-GlcNAc modification is essential for early development.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Formichetti at el. developed mice with OGT catalytic dead mutations and then studied their function during early embryogenesis. Not surprisingly, dramatic reduction in OGT activity failed to produce embryos; however, mild reduction in OGT did produce animals. The authors then use the T931 animals that have a mild reduction in activity to further characterize the function in the early embryo. Not surprisingly, male mice showed changes in gene expression, implantation sub-lethality, and an uptick in loss of retrotransposon silencing. The authors also show that an even milder reduction in OGT activity (Y851A) effects male placenta function and chromatin remodeling. Finally, the authors make a less stable OGT transgene within the mouse and again found embryogenesis issues in the males and alterations in numerous gene families including mTOR signaling and p53 function. All in all, this is an interesting study that track functions of OGT in early embryonic development. The studies are well-controlled and rigorous.

      Significance

      This is a good study and novel. Not only is it of interest to reproductive biologist, but it echos themes found in O-GlcNAc biology.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Point-to-point answer to reviewers comments

      Reviewer #1

      Evidence, reproducibility and clarity

      *Summary: *

      *The study by Cottignies-Calamarte et al. describes that AMP-activated protein kinase (AMPK) regulates cell energy balance by suppressing energy-consuming pathways like lipid and protein synthesis and promoting nutrient availability through autophagy. These pathways contribute to SARS-CoV-2 infection by hijacking autophagy and accumulating lipid droplets for viral replication. The antiviral activity of MK-8722, a direct pan-AMPK allosteric activator, was evaluated in vitro. MK-8722 effectively inhibited Alpha and Omicron SARS-CoV-2 variants in Vero76 and human bronchial epithelial Calu-3 cells at micromolar concentrations. This inhibition restored autophagic flux, degrading newly synthesized viral proteins, and reduced lipid metabolism, affecting viral factories. Additionally, MK-8722 treatment increased the type I interferon (IFN-I) response. Post-infection treatment with MK-8722 efficiently suppressed viral replication and restored the IFN-I response without altering the SARS-CoV-2-specific CD8+ T cell response elicited by Spike vaccination. The authors concluded that, MK-8722 acts as an effective antiviral against SARS-CoV-2 infection, even when applied post-exposure, suggesting potential for preclinical tests to inhibit viral replication and alleviate patient symptoms. *

      __Major comments: __

        • Are the key conclusions convincing?** Partially. See comments below! *

      - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      From my perspective, the title "Direct pharmacological AMPK activation inhibits mucosal SARS-CoV-2 infection by reducing lipid metabolism, restoring autophagy flux and the type I IFN response" is a clear overstatement. In no way can the authors make statements about the autophagic flux, as it simply was not measured. The study would greatly benefit from conducting an autophagy/autophagic flux assay. See specific comments below!

      Answer: As suggested by the author, we have now investigated the autophagic flux by staining the cells for LC3b expression and colocalization with the lysosomal marker LAMP1. Results are shown in the new Fig4.D and E and detailed in the results section to read lines 468-475 page 23-24:

      “To assess directly the impact of MK-8722 on the autophagic flux, Calu3 cells were infected by SARS-CoV-2 without and with MK-8722 (5uM), double labelled with LC3b and LAMP1, and co-localisation of the two marker quantified (Fig.4D). MK-8722 treatment, compared with no treatment, increased LC3b colocalization in the LAMP1 compartment as shown by the increase in MOCs in treated versus non treated infected cells (LAMP1 signal in LC3b signal 0.078±0.014 vs 0.01±0.006, Mann-Whitney pand lines 480-481 page 24:

      “This result indicates that MK-8722 restores the autophagic flux to address viral components to the lysosome, where they are degraded.”

      - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      See below specific comments regarding cell line consistency and autophagy measurements.

      - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      This question depends on various factors such as access to relevant biosafety labs, availability of required reagents, etc. In my estimation, experiments involving WT viruses and autophagy measurements could be conducted within 3-4 months. The proposed experiments with the delta-N-SARS-CoV-2 mutants, of course, depend on access to such viruses. Overall, I believe all experiments could be completed within 6 months. The costs of those assays are not very high.

      - Are the data and the methods presented in such a way that they can be reproduced?

      In the present study, a total of 3 cell lines and human PBMCs were utilized for various experiments. Please indicate why each cellular model was chosen and highlight the differences between these models considering what is known for SARS-CoV-2 infection and autophagy! Furthermore, the study would greatly benefit if the key findings were consistently demonstrated in a single cell line.

      Answer: We agree with the reviewer that three cell lines and PBMC from patients were used but it was necessary for the experimental design of our experiments as detailed below.

      In Figures 1 and 2, experiments use both Vero76 and Calu3 cells for the following reasons. We first used the simian Vero76 cells in order to validate the inhibitory effect of MK-8722 in widely a used cell line in virology, but which lacks the interferon response. This lack of the IFN response renders Vero cells a poor model for the pathophysiology of SARS-CoV-2. We therefore then used the IFN-competent human lung Calu-3 cell line as a more relevant cell model.

      In Figures 3 both cells were again and western blots show a similar pattern of activation downstream AMPK after activation by MK-8722 and similar antiviral activity of MK-8722 in Vero76 and Calu3 cells. In Figure.4, we then mostly focused on deciphering the antiviral mechanism of MK-8722 and therefore focused on Calu3, which is more relevant from a pathophysiological point of view.

      In Figure 5, we then investigated the T-cell response and therefore used primary human, namely PBMC/purified CD8 T-cells from healthy donors. As the reviewer knows, T-cell activation is restricted by HLA-TCR interaction, or in other words, only matched HLA cells can activate CD8 T-cells. As HLA-A2 is the main HLA expressed in human and only a HLA-A2 antibody is available on the market, for these experiments we used only CD8 T cells from HLA-A2 patients and had to find an additional HLA-A2-expressing epithelial cell susceptible to SARS-CoV-2 infection. It is the case of Caco2 cells, which are HLA-A2+ susceptible to infection and competent for IFN responses. We could not use simian Vero cells that are IFN-deficient and do not express human MHC, nor Calu3 cells being HLA-A2 negative. Altogether, in experiments in Figure 5 addressing HLA-A2 restricted antigen presentation, the use of Caco-2 cells was appropriate in contrast to that of Vero or Calu3.

      Furthermore, for sake of clarification, we have now added the following p. 19 lines 390-392:

      “The trends in modification of all markers by MK-8722 treatment by being conserved between cell lines indicates a common antiviral mechanism. We therefore focused our study on Calu-3 cells since they are more relevant to SARS-CoV-2 infection.”

      The authors conclude that selective activation of AMPK has a pro-autophagic effect which in turn leads to a reduced SARS-CoV-2 replication. I would generally agree with this statement, but throughout the entire manuscript, no real autophagy assays are shown. This should definitely be rectified. It is important to demonstrate that (1) MK-8722 is capable of increasing autophagy and particularly autophagic flux in the cell models used, (2) that in the cell models employed, SARS-CoV-2 infection leads to modulation of autophagy, and (3) that SARS-CoV-2 infected cells, when co-treated with MK-8722, lead to a re-established autophagy. The autophagy assays should be performed according to the expert-curated guidelines by Klionsky et al. This is extremely important so that the results can be compared with the now vast number of existing autophagy-SARS-CoV-2 studies.

      We fully agree with the reviewer that it was important for our study to formally address the impact of MK-8722 on the autophagic flux. Following reviewer recommendation and reading of the guidelines for autophagy study, we therefore have evaluated whether MK-8722 affected localization of LC3b, which is essential for autophagosome biogenesis/maturation and also functions as an adaptor protein for selective autophagy, in lysosomal compartment (labelled, by LAMP1). Therefore we labelled cells infected in the presence or absence of MK-8722 for LC3b expression and colocalization with the lysosomal marker LAMP1. Results are shown in the new Fig4.D, E. Please see our above answer (p1) for results description. These results indicate that MK-8722 restored the autophagic flux that had been interrupted by SARS-CoV-2 infection in Calu3 cells.

      - Are the experiments adequately replicated and statistical analysis adequate?

      Minor comments:

      - Specific experimental issues that are easily addressable.

      Typo: Line 288: It should be "MK-8722" instead of "MK-7288".

      In general, a space between value and unit is not consistently used.

      Please indicate always the phosphosite of substrate proteins when phosphorylation is described. E.g. line 288 and throughout the manuscript: regarding ACC phosphorylation.

      Answer: We apologize for these typos, which have now been corrected in the revised MS.

      • Are prior studies referenced appropriately?

      See comment below. Fundmental work that describes the virus-autophagy relationship, such as the work by the Beth Levine lab would be important to add. Also the work bei Konstantin Sparrer and colleagues is important and leads to the current work presented here.

      Answer: Konstantin Sparrer’s work is already cited as by ref 17. However, as suggested by the reviewer, the work by the Beth Livine lab has now been added in the revised MS in p29-30 lines 610-613, to read:

      .“Convergence of Beclin-Atg14 and P62 activation could stimulated the selective clearance of viral components, in a process called virophagy64–66. We thus propose that AMPK pharmacological activation induce virophagy and is responsible, at least partially, for its antiviral effect.”

      - Are the text and figures clear and accurate?

      Introduction: In general, for some of the statements claimed in the introduction, which is in sum nicely written, informative and well structured, citations are required. For example - line 59 "...autophagy is sequentially activated and inhibited."

      Answer: Citation has been added, namely: Koepke 2021 Autophagy ref 17

      Lines 59-65. In the introduction, the interaction between autophagy and SARS-CoV-2 is primarily described in a one-sided manner. There are now several studies demonstrating that both inhibition of autophagy and also induction of autophagy in context of coronaviral infection. Both aspects should be illuminated and introduced here.

      Answer: The reviewer points towards a crucial point of autophagy during ß-coronaviruses infection. Indeed, we fully agree that the autophagy is both inhibited and activated, at different levels, by different proteins as exemplified by, Sparrer’s group (ref 17). As we have mentioned in the initial version of our MS: “Throughout the SARS-CoV-2 viral cycle, autophagy is sequentially activated and inhibited 17–20”. As it may have not been clear enough, we have reformulated this paragraph as follows to read line 68-75 p5:

      “Throughout the SARS-CoV-2 viral cycle, autophagy is sequentially activated and inhibited 17–20. Indeed, autophagy is initiated by the early expressed nsp6, resulting in the formation of autophagosomes that are essential for the establishment of viral factories 17,21 and subsequent viral proteins expression 17,21. In turn, viral proteins OFR3a and ORF7a expressed at a latter time post-infection, prevent the fusion between autophagosomes and lysosomes, thereby blocking completion of autophagy, as evidenced by increased LC3-B expression, activation of the ULK1 kinase and increase in the autophagy cargo receptor sequestosome-1/p62. Overall, this disruption of autophagy protects newly formed virus from degradation in the LAMP1+ lysosome 19,22–25”

      Discussion: The selectivity of the compound should be discussed.

      Answer: Manuscript have been revised as follows to discuss both specificity in regards of AMPK and tissue accessibility:

      • lines 557-560, p27: “We show here that the blockade of AMPK activation upon infection can be reversed by MK-8722, the pharmacological allosteric pan-activator of AMPK, which blocks infection at a µM concentration, in agreement with the predicted role of AMPK activity on SARS-CoV-2 infection48 and with MK-8722 action on infection by other viruses49”

      • Line 576-581, p28: “In contrast, MK-8722, as systemic drug, may reach these tissues with minimal side effects, as a daily treatment in diabetic Non-human primates (NHPs) with MK-8722 (10 mg/kg) for a month induced only a limited and reversible cardiac hypertrophy 36”

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      The presentation of the data is understandable. For reader with a less mechanistic background it would be helpful to present a schematic figure like a graphical abstract.

      Answer: As suggested by the reviewer, we have now introduced a graphical abstract in the revised MS.

      SECTION B – Significance

      ======================== - Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      • Even though there is now a plethora of studies on autophagy and SARS-CoV-2, this study is important and of great interest to a broad readership. Not only virologists, immunologists, and autophagy researchers will eagerly anticipate this study, but especially researchers focusing on pharmacology around AMPK and autophagy will recognize the importance of the data presented here.*

      Answer: We thank the reviewer for this positive appreciation of our work.

      - Place the work in the context of the existing literature (provide references, where appropriate).

      The work builds upon a variety of studies on the interplay between coronaviruses and the mechanism of autophagy. Foundational contributions to this research stem from groundbreaking preliminary work conducted in the laboratories of Gassen and Müller (Gassen et al. 2019 and Gassen et al. 2021), as well as general virus-autophagy studies from the laboratory of Beth Levine. Additionally, recent work by Konstantin Sparrer and colleagues would be important to cite, as it underscores the insights gained in this manuscript.

      Answer: As suggested and previously mentioned, these references have now been added to the discussion p27 lines 557-560: “We show here that the blockade of AMPK activation upon infection can be reversed by MK-8722, the pharmacological allosteric pan-activator of AMPK, which blocks infection at a µM concentration, in agreement with the predicted role of AMPK activity on SARS-CoV-2 infection48 and with MK-8722 action on infection by other viruses49”.

      In addition, Gassen et al. 2019 is already cited as ref 18

      - State what audience might be interested in and influenced by the reported findings. See comment above!__ __

      Reviewer #2

      Evidence, reproducibility and clarity

      In the current manuscript, Cottignes-Calamarte et al. have shown tha pharmacological activation of AMPK can be a strategy for overcoming SARS-CoV-2 infection induced reprogramming of host degradation pathways and innate immune response, without hindering the efficacy of spike expression from vaccine agents. Though the suggestion of selective activity of MK-8722 in degrading viral proteins in infection but not the ectopic spike peptides expression is interesting, the evaluation of the mechanism and providing therapeutic index for the drug will overall improve the study.

      • Majorly, I have these suggestions;*

        • The manuscript did not clarify the mechanism clearly but correlated the reduction in viral proteins and their association to LAMP1 as the mechanisim of activity of MK-8722. In this way, the authors did not separate whether the reduction in SARS-CoV-2 infection could lead to the potentiation of these effects. There is mentioning of potential mechanism of the drug by inhibiting the activity of SARS-CoV-2 proteins that reduce the autophagic flux without directly showing this. SARS-CoV-2 Orf3a is a known inhibitor of autophagosome maturation and hence the authors should directly probe the activity of MK-8722 in overcoming the suppression of autophagic flux in cells by ectopically expressing Orf3a.* Answer: We fully agree with the reviewer that our work correlated the reduction in viral proteins and their association to LAMP1 to explain the mechanism of MK-8722 activity and did not focus on particular viral protein(s) that could induced autophagic flux.

      Indeed, our approach has been to describe the MK-8722 antiviral activity against full primary SARS-CoV-2 viruses (not ectopically expressed viral proteins or recombinant viruses) in a pathologically relevant cell model including primary CD8+ T cells to mimic at best the initial steps of SARS-CoV-2 infection. Although the ectopical expression of Orf3a is recognized as a potent tool to study the principle of autophagy in the guidelines of autophagy measurement methods, when applied to infection, this ectopical expression of Orf3a will modify the endogenous level of Orf3a (as compared to infection level) and probably affect by itself the autophagy kinetics. To address the question of autophagic flux as suggested by the reviewer, we therefore preferred to use another recommended technique from the guidelines of autophagy measurement methods directly applicable to infected cells. We now evaluated the colocalization of LC3b with LAMP1 during treatment and infection of Calu-3 cells by primary viruses as now described in figure 4.D and E and discussed lines 468-475 pages 23-24 :

      “To assess directly the impact of MK-8722 on the autophagic flux, Calu3 cells were infected by SARS-CoV-2 without and with MK-8722 (5uM), double labelled with LC3b and LAMP1, and co-localisation of the two marker quantified (Fig.4D, E). MK-8722 treatment, compared with no treatment, increased LC3b colocalization in the LAMP1 compartment as shown by the increase in MOCs in treated versus non treated infected cells (LAMP1 signal in LC3b signal 0.078±0.014 vs 0.01±0.006, Mann-Whitney p * As the authors propose MK-8722 as a preclinical candidate, they should present therapeutic measures and indexes. All across the data presented, there was no mentioning or measurement of drug toxicity for extended uses up to 36h pi.*

      Answer: Our study aimed to describe the antiviral activity of MK-8722 in a cellular model mimicking only the initial steps of infection up to 3 dpi. Complementary experiments were conducted as required and show that treatment with MK-8722 up to 10mM did not result in an increase in cell death at any time post treatment as shown in the new Fig S2I and corresponding description in line 328-332 p17, which read__: __

      __“__Furthermore, we also investigated MK-8722 toxicity at 4h, 24h and 96h (Fig. S2I) and found that the drug was not toxic up to 10mM. The 50% toxicity dose was calculated to be 57mM at 96h in Calu3 cells. This allows us to determine a therapeutic index of 76 against the SARS-CoV-2 Alpha variant and 36 against the Omicron variant, thus, placing MK-8722 as an attractive antiviral candidate.”

      • Also, it was not clarified if the drug reduced the replication or exclusively worked by restoration of autophagic flux. For the earlier, the authors can consider two assays, i.e., a direct assay of looking into the replicon of SARS-CoV-2 (Bigotti et al 2024; PMID 38387750), or looking at earlier time points of infection (up to 8h pi; Twu et al. 2021; PMID 34788596) with intracellular sgRNA specific RNA probes.In this regards, 24h or 32h (Fig 1F-G) is too long to only measure single-round of infection.*

      Answer: We thank the reviewer for this interesting question. The drug likely acts on both steps. Concerning replication, as shown in figure 1 GH at 24 and 32hrs, there is a block in replication but not in virus entry into the cell, since there is no difference in viral N cellular content after 1hr chase. Concerning the autophagic flux, the new Fig4 D and E shows that MK-8722 restores the autophagic flux in infected cells, which had been interrupted by the infection in the absence of the drug.

      The viral cycle of ß-coronaviruses is highly dependent on hijacking cellular autophagy and lipid biosynthesis, which are both necessary for the assembly of DMV, essential for viral replication. Since MK-8722 treatment reduces cellular lipid content and increases autophagic flux, the use of viral replicons deficient in structural proteins or early post-infection timepoints will most likely show a decrease in viral genome replication under the effect of MK-8722 due to the inability of the replicons to establish such favourable environments (indicated in the corresponding Fig3A, B and C). However, the effect of MK-8722 on the replicon system will not distinguish between whether MK-8722 affects the viral replication complex and whether it is unable to induce viral factory formation.

      • Are the viral components degraded by restored lysosomal activity include replicase or replication organelle components? This needs to be shown with intracellular nsp3/nsp4 levels in infection or ectopic expression.*

      Answer: The reviewer raises a question highly relevant to the biology of coronaviruses, which we have addressed in Figure 4. Indeed, the Replication-Transcription Complex (RTC) is a multifactorial complex, in which N protein bound to the viral RNA is believed to help recruiting RdRp (Cong 2020 J Virol 10.1128/jvi.01925-19, Scherer 2022 Sci Adv DOI: 10.1126/sciadv.abl4895). The increased colocalisation of in N staining in LAMP-1+ compartments observed after MK-8722 treatment indicates that MK-8722 addresses RTC to the lysosomes and therefore our results indicate that replication-transcription organelles could be degraded in lysosomes.

      • Previous studies suggested the decrease in AMPK phospharhorylation in SARS-CoV-2 infection What is inconsistent to previous studies should be commented by the authors. Eg, why is AMPK suppression not see in SARS-CoV- 2 inefction as reported before (Parthasarathy et al 2023; PMID 36417940)*

      Answer: We agree with the reviewer and had already mentioned in the discussion that SARS-CoV-2 infection can have different outcome on AMPK activation as reported in ref 45 and 46 in the original version. Indeed, Gassen and colleagues reports that activation of AMPK and resulting phosphorylation are decreased in cells infected by SARS-CoV-2, as shown in (Ref 18), whereas Parthasarathy reports an increase in AMPK phosphorylation which is clear only at 96 h p.i. (Ref 47). Interestingly in this later study at earlier time point (24hr p.i.), authors report a tendency of AMPK phosphorylation to decrease although not in a statistically significant manner but for only n=3. In our present study, no statistically significant differences were found in AMPK phosphorylation although a small increase tendency is observed. Furthermore, cells used in all these studies differ: Gassen et al. used Vero-FM cells whereas Parthasarathy et al. used intestinal Caco2 cells, and in our study Vero 76 and lung Calu3 cells. Thus the differences reported on the effect of SARS-CoV-2 infection on AMPK activation are likely due to a question of kinetics and/or cellular background. Of note, the level of AMPK phosphorylation observed after SARS-CoV-2 infection is not to compare with the AMPK phosphoryalton induced by drug activation such as MK-8722.

      This heterogenity in AMPK activation during SARS-CoV-2 infection might contribute to the various effect of AMPK activator as antiviral reported in the literature. Indeed, as mentioned p27 lines 560-563 “metformin, a drug approved by FDA since 1994, and the adenosine analogue AICAR that activates AMPK indirectly can inhibit replication of SARS-CoV-2 as well as Flaviviruses, but at a concentrations of 10 and 1 mM, respectively 35,47 ”. The reported discrepancy on the antiviral effect of AMPK activation might rely then on an activation threshold, as 10mM metformin and 1mM AICAR blocks infection, while 25__m__M AICAR does not (ref 18, 47). This later concentration was not evaluated in Gassen et al. on AMPK activation. Furthermore; the time frame evaluated in each study is different and could also be a confusing factor. Finally, as shown by Myers and colleagues (Myers 2017, Science ref 36), MK-8722 activates AMPK more efficiently than AICAR, most probably due to its direct activation of AMPK. We believe that direct AMPK activation by MK-8722 and the higher activation level of AMPK is responsible for the antiviral effect reported in our study.

      As suggested by the reviewer, we have now clarified this section in the revised MS to read p27-28, lines 557-568:

      “We show here that the blockade of AMPK activation upon infection can be reversed by MK-8722, the pharmacological allosteric pan-activator of AMPK, which, at a µM concentration, blocks infection, in agreement with the predicted role of AMPK activity on SARS-CoV-2 infection48 and with MK-8722 action on infection by other viruses49” .In line, metformin, a drug approved by FDA since 1994, and the adenosine analogue AICAR that activates AMPK indirectly can inhibit replication of SARS-CoV-2 as well as Flaviviruses, but at concentrations of 10 and 1 mM, respectively 35,47. These AMPK-sensitive viruses replicate all in viral factories, disturbing lipid synthesis and escaping autophagy 50,51. AMPK activation by metformin at high concentration (10mM) inhibits SARS-CoV-2 replication in vitro 47. However, AMPK activation with 5-amino-imidazolecarboxamide riboside (AICAR), a non-metabolised analogue of AMP able to activate AMPK, used at 25mM is unable to inhibit SARS-CoV-2 infection 18 while at 1mM has been proven to reduce by 10-fold viral production 47, suggesting that AMPK needs to reach an activation threshold to inhibit viral replication. “

      • *

      • Although AMPK activation is shown to be antiviral for SARS-CoV-2 before, e.g., with Metformin (Parthasarathy et al 2023; PMID 36417940), the role of AMPK-related kinases are shown to be pro-viral (e.g., NUAK2; Prasad et al. 2023; PMID 37421942). The authors should discuss these points to provide the reader a context in this sub-field.*

      Answer: As suggested by the reviewer, we have now included in the discussion the following p28 lines 568-571:

      “Conversely, NUAK2, an AMPK-related kinase, was reported to stimulate viral replication in A549 and Calu3 cells50. Altogether our results, in agreement with the literature, indicate that AMPK-dependent antiviral activity is restricted to AMPK-members only, confirming their distance with AMPK-related kinases such as NUAK2 53”

      Minor

        • Was there an increase of lipid droplets in SARS-CoV-2 infected cells to non-infected condition? It is not mentioned here.* Answer: Yes indeed, lipid droplets content was increased in Infected non treated cells as indicated in Figure Suppl. 4B (p21 lines 409-411) and as stated in the Result section as follows:

      ” Infection increased the overall Nile Red staining by 40±10% compared to non-infected cells (Fig. S4B, ANOVA: p * There are several typos and grammatical errors.*

      Answer: We have now carefully checked the text for typographical and grammatical errors, which have been corrected.

      There is no Fig S2I but is mentioend in the legend.

      Answer: We apologize for this mistake in labelling figure S2 panel and have corrected their labelling as follows:

      “____G-H: __ACE2 expression and viability were evaluated in Vero76 cells after 24h or in Calu-3 cells after 4days of MK-8722 continuous treatment (1 µM or 5 µM respectively) by flow cytometry. ACE2 expression is expressed as MFI (__G) and viability as frequency of cells non-stained by the amine-reactive dye Viobility (H). n=3 independent experiments. “

      • Fig 4A - degradation is not shown, only association.*

      Answer: We agree with the reviewer and have modified the text accordingly to read p37 lines 873-874:

      Figure 4: MK-8722 treatment increases the ____autophagic flux directing viral components ____to ____lysosomes and restores type I interferon response.”


      • Line 333 - there is no 3d pi post exposure treatment with MK-8722 in Fig 1l.*

      Answer: We meant that treatment initiated at 1dpi lasted until harvest at 3dpi. This is clarified now to read, p18 line 351

      “Post-exposure treatment at 1dpi and until harvest at 3dpi, …”

      Significance

      General assessment*: *

      Strengths and limitations: The study provides evidence supporting previous results that the activation AMPK can be harnessed as a strategy to combat SARS-CoV2- infection. Whereas the results suggesting that this is targeted in infection by restoration of autopahgic flux and hence is selective is interesting, but the authors did not directly investigated the SARS-CoV-2 protein that is implicated in this effect. The study also does not discuss the context of AMPK and related kinases in discussion.*

      *

      Advance -* The study makes a pre-clinical advance, and has potential to make it fundamentally or conceptually sound. *

      Audience - Virologists and Clinicians.

      Expertise - SARS-CoV-2 infection biology, Cell biology an Molecular virology__ __

      __Reviewer #3 __

      Evidence, reproducibility and clarity (Required):

      • Summary.** The authors address a topic of great importance to the development of antivirals, particularly in light of the possibility of future pandemics due to hitherto understudied viruses: the evaluation of interfering with host pathways (host-directed antivirals), potentially leading to compounds that can be used against a broad spectrum of viruses which require the same host cell functions for their life cycle. The authors studied the effects of activating AMPK with a small molecule (MK-8722) on various aspects of infectivity of SARS-CoV-2. They find that the compound indeed markedly reduced viral infectivity in two cell lines (Vero and Calu3), which correlated with the ability of the compound to activate signaling downstream of AMPK and was associated with increased lysosomal genesis/function and reduced density of "lipid viral factories". *

      • *

      General assessment. The study provides important proof of concept in reductionist cellular models, which may lead to pharmacologically more conclusive in vivo studies later on. It is limited by the use of cell lines, instead of primary human cells, for viral infectivity. The caption "MK-8722 restores the IFN I pathway" is an overstatement, as the observed increase in ISG expression is quite modest.

      Answer: We agree with the reviewer that we did not use primary cells in our study, which was designed to evaluate the antiviral activity of MK-8722 in a simple epithelial cell model still relevant for the pathophysiology. However, we believe that MK-8722 antiviral effect we observed in the present study will be conserved using primary cells given the breadth of cell lines we used such as Vero, pulmonary Calu-3 and intestinal Caco2 cells. Furthermore, concerning the magnitude of the IFN I response we report, we remind the reviewer that Calu-3 cells are infected with an MOI of 0.05, which does not result in 100% cells infected after 24h. Consequently, the IFN-I response as limited to the sole infected cells in a limited amount in this setup and thus, bulk RT-qPCR of Type I IFN and targeted ISG mRNA is expected to remain modest.

      In future studies, as suggested by the reviewer, we plan to include few ALI culture treated primary cells. We have clarified this limitation of our study in the Discussion p. 31 line 636-638 to read:

      “The antiviral effect of AMPK activation in primary lung reconstruction and preclinical models such as hamster remains to be tested.”

      __ Significance (Required): __

      Taken as an early proof-of-concept study, the findings are quite important to investigators aiming to develop host-directed antivirals. However, the overall interest and impact of the manuscript would greatly benefit from verifying key findings in a primary cell model. Also, checking at least one other viral species (the authors mention flaviviruses) would be helpful to test how broadly applicable the current compound (or others to be developed in the future) would be as host-directed antivirals.

      Answer: We thank the reviewer for the positive evaluation of our work. We fully agree with the reviewer in that preparedness is the key to fighting future pandemics. However to our knowledge, the literature about Flaviviruses is already stating that AMPK activation by metformin is an antiviral strategy with regards to its lipid metabolism normalization and autophagy activation (Farfan-Morales 2021 Sci Rep doi: 10.1038/s41598-021-87707-9. Ref 50). Furthermore in a previous review, we discussed the role of AMPK activation on viral infection that could be either pro- or antiviral depending on the viral family and even within a viral family such as HSV-1 which first inhibits AMPK in early infection while activating AMPK in the latter stage (Moreira, D. et al. Curr Drug Targets 17, 942–953 (2016), ref 34). Hence, adding more viruses to our study will not add novelty to the study, even though the molecule is much stronger compared to metformin.

      • *
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary. The authors address a topic of great importance to the development of antivirals, particularly in light of the possibility of future pandemics due to hitherto understudied viruses: the evaluation of interfering with host pathways (host-directed antivirals), potentially leading to compounds that can be used against a broad spectrum of viruses which require the same host cell functions for their life cycle. The authors studied the effects of activating AMPK with a small molecule (MK-8722) on various aspects of infectivity of SARS-CoV-2. They find that the compound indeed markedly reduced viral infectivity in two cell lines (Vero and Calu3), which correlated with the ability of the compound to activate signaling downstream of AMPK and was associated with increased lysosomal genesis/function and reduced density of "lipid viral factories".

      General assessment. The study provides important proof of concept in reductionist cellular models, which may lead to pharmacologically more conclusive in vivo studies later on. It is limited by the use of cell lines, instead of primary human cells, for viral infectivity. The caption "MK-8722 restores the IFN I pathway" is an overstatement, as the observed increase in ISG expression is quite modest.

      Significance

      Taken as an early proof-of-concept study, the findings are quite important to investigators aiming to develop host-directed antivirals. However, the overall interest and impact of the manuscript would greatly benefit from verifying key findings in a primary cell model. Also, checking at least one other viral species (the authors mention flaviviruses) would be helpful to test how broadly applicable the current compound (or others to be developed in the future) would be as host-directed antivirals.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In the current manuscript, Cottignes-Calamarte et al. have shown tha pharmacological activation of AMPK can be a strategy for overcoming SARS-CoV-2 infection induced reprogramming of host degradation pathways and innate immune response, without hindering the efficacy of spike expression from vaccine agents. Though the suggestion of selective activity of MK-8722 in degrading viral proteins in infection but not the ectopic spike peptides expression is interesting, the evaluation of the mechanism and providing therapeutic index for the drug will overall improve the study.

      Majorly, I have these suggestions;

      1. The manuscript did not clarify the mechanism clearly but correlated the reduction in viral proteins and their association to LAMP1 as the mechanisim of activity of MK-8722. In this way, the authors did not separate whether the reduction in SARS-CoV-2 infection could lead to the potentiation of these effects. There is mentioning of potential mechanism of the drug by inhibiting the activity of SARS-CoV-2 proteins that reduce the autophagic flux without directly showing this. SARS-CoV-2 Orf3a is a known inhibitor of autophagosome maturation and hence the authors should directly probe the activity of MK-8722 in overcoming the suppression of autophagic flux in cells by ectopically expressing Orf3a.
      2. As the authors propose MK-8722 as a preclinical candidate, they should present therapeutic measures and indexes. All across the data presented, there was no mentioning or measurement of drug toxicity for extended uses up to 36h pi.
      3. Also, it was not clarified if the drug reduced the replication or exclusively worked by restoration of autophagic flux. For the earlier, the authors can consider two assays, i.e., a direct assay of looking into the replicon of SARS-CoV-2 (Bigotti et al 2024; PMID 38387750), or looking at earlier time points of infection (up to 8h pi; Twu et al. 2021; PMID 34788596) with intracellular sgRNA specific RNA probes.In this regards, 24h or 32h (Fig 1F-G) is too long to only measure single-round of infection.
      4. Are the viral components degraded by restored lysosomal activity include replicase or replication organelle components? This needs to be shown with intracellular nsp3/nsp4 levels in infection or ectopic expression.
      5. Previous studies suggested the decrease in AMPK phospharhorylation in SARS-CoV-2 infection What is inconsistent to previous studies should be commented by the authors. Eg, why is AMPK suppression not see in SARS-CoV- 2 inefction as reported before (Parthasarathy et al 2023; PMID 36417940)
      6. Although AMPK activation is shown to be antiviral for SARS-CoV-2 before, e.g., with Metformin (Parthasarathy et al 2023; PMID 36417940), the role of AMPK-related kinases are shown to be pro-viral (e.g., NUAK2; Prasad et al. 2023; PMID 37421942). The authors should discuss these points to provide the reader a context in this sub-field.

      Minor

      1. Was there an increase of lipid droplets in SARS-CoV-2 infected cells to non-infected condition? It is not mentioned here.
      2. There are several typos and grammatical errors.
      3. There is no Fig S2I but is mentioend in the legend.
      4. Fig 4A - degradation is not shown, only association.
      5. Line 333 - there is no 3d pi post exposure treatment with MK-8722 in Fig 1l.

      Significance

      General assessment

      Strengths and limitations: The study provides evidence supporting previous results that the activation AMPK can be harnessed as a strategy to combat SARS-CoV2- infection. Whereas the results suggesting that this is targeted in infection by restoration of autopahgic flux and hence is selective is interesting, but the authors did not directly investigated the SARS-CoV-2 protein that is implicated in this effect. The study also does not discuss the context of AMPK and related kinases in discussion.

      Advance - The study makes a pre-clinical advance, and has potential to make it fundamentally or conceptually sound.

      Audience - Virologists and Clinicians.

      Expertise - SARS-CoV-2 infection biology, Cell biology an Molecular virology

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate).

      The study by Cottignies-Calamarte et al. describes that AMP-activated protein kinase (AMPK) regulates cell energy balance by suppressing energy-consuming pathways like lipid and protein synthesis and promoting nutrient availability through autophagy. These pathways contribute to SARS-CoV-2 infection by hijacking autophagy and accumulating lipid droplets for viral replication. The antiviral activity of MK-8722, a direct pan-AMPK allosteric activator, was evaluated in vitro. MK-8722 effectively inhibited Alpha and Omicron SARS-CoV-2 variants in Vero76 and human bronchial epithelial Calu-3 cells at micromolar concentrations. This inhibition restored autophagic flux, degrading newly synthesized viral proteins, and reduced lipid metabolism, affecting viral factories. Additionally, MK-8722 treatment increased the type I interferon (IFN-I) response. Post-infection treatment with MK-8722 efficiently suppressed viral replication and restored the IFN-I response without altering the SARS-CoV-2-specific CD8+ T cell response elicited by Spike vaccination. The authors concluded that, MK-8722 acts as an effective antiviral against SARS-CoV-2 infection, even when applied post-exposure, suggesting potential for preclinical tests to inhibit viral replication and alleviate patient symptoms.

      Major comments:

      • Are the key conclusions convincing?

      Partially. See comments below! - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      From my perspective, the title "Direct pharmacological AMPK activation inhibits mucosal SARS-CoV-2 infection by reducing lipid metabolism, restoring autophagy flux and the type I IFN response" is a clear overstatement. In no way can the authors make statements about the autophagic flux, as it simply was not measured. The study would greatly benefit from conducting an autophagy/autophagic flux assay. See specific comments below! - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      See below specific comments regarding cell line consistency and autophagy measurements. - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      This question depends on various factors such as access to relevant biosafety labs, availability of required reagents, etc. In my estimation, experiments involving WT viruses and autophagy measurements could be conducted within 3-4 months. The proposed experiments with the delta-N-SARS-CoV-2 mutants, of course, depend on access to such viruses. Overall, I believe all experiments could be completed within 6 months. The costs of those assays are not very high. - Are the data and the methods presented in such a way that they can be reproduced?

      In the present study, a total of 3 cell lines and human PBMCs were utilized for various experiments. Please indicate why each cellular model was chosen and highlight the differences between these models considering what is known for SARS-CoV-2 infection and autophagy! Furthermore, the study would greatly benefit if the key findings were consistently demonstrated in a single cell line.

      The authors conclude that selective activation of AMPK has a pro-autophagic effect which in turn leads to a reduced SARS-CoV-2 replication. I would generally agree with this statement, but throughout the entire manuscript, no real autophagy assays are shown. This should definitely be rectified. It is important to demonstrate that (1) MK-8722 is capable of increasing autophagy and particularly autophagic flux in the cell models used, (2) that in the cell models employed, SARS-CoV-2 infection leads to modulation of autophagy, and (3) that SARS-CoV-2 infected cells, when co-treated with MK-8722, lead to a re-established autophagy. The autophagy assays should be performed according to the expert-curated guidelines by Klionsky et al. This is extremely important so that the results can be compared with the now vast number of existing autophagy-SARS-CoV-2 studies. - Are the experiments adequately replicated and statistical analysis adequate?

      Minor comments: - Specific experimental issues that are easily addressable.

      Typo: Line 288: It should be "MK-8722" instead of "MK-7288". In general, a space between value and unit is not consistently used. Please indicate always the phosphosite of substrate proteins when phosphorylation is described. E.g. line 288 and throughout the manuscript: regarding ACC phosphorylation. - Are prior studies referenced appropriately?

      See comment below. Fundmental work that describes the virus-autophagy relationship, such as the work by the Beth Levine lab would be important to add. Also the work bei Konstantin Sparrer and colleagues is important and leads to the current work presented here. - Are the text and figures clear and accurate?

      Introduction:

      In general, for some of the statements claimed in the introduction, which is in sum nicely written, informative and well structured, citations are required. For example - line 59 "...autophagy is sequentially activated and inhibited." Lines 59-65. In the introduction, the interaction between autophagy and SARS-CoV-2 is primarily described in a one-sided manner. There are now several studies demonstrating that both inhibition of autophagy and also induction of autophagy in context of coronaviral infection. Both aspects should be illuminated and introduced here.

      Discussion: The selectivity of the compound should be discussed. - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      The presentation of the data is understandable. For reader with a less mechanistic background it would be helpful to present a schematic figure like a graphical abstract.

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      Even though there is now a plethora of studies on autophagy and SARS-CoV-2, this study is important and of great interest to a broad readership. Not only virologists, immunologists, and autophagy researchers will eagerly anticipate this study, but especially researchers focusing on pharmacology around AMPK and autophagy will recognize the importance of the data presented here. - Place the work in the context of the existing literature (provide references, where appropriate).

      The work builds upon a variety of studies on the interplay between coronaviruses and the mechanism of autophagy. Foundational contributions to this research stem from groundbreaking preliminary work conducted in the laboratories of Gassen and Müller (Gassen et al. 2019 and Gassen et al. 2021), as well as general virus-autophagy studies from the laboratory of Beth Levine. Additionally, recent work by Konstantin Sparrer and colleagues would be important to cite, as it underscores the insights gained in this manuscript. - State what audience might be interested in and influenced by the reported findings.

      See comment above! - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      My expertise is limited to the mechanistic aspects of autophagy, metabolism, and signaling cascades related to autophagy and endosomal-exosomal mechanisms. In some studies, I have been able to investigate the interplay between CoV and autophagy in collaborations with coronavirus experts. I can only provide a superficial assessment of the purely virological (methodological) aspects of the present study.